The Speex coder uses the Ogg bitstream format, and the Speex designers see their project as complementary to the Vorbis general-purpose audio compression project.
Application designers looking for a general-purpose speech codec should consider Speex.
Unlike many other speech codecs, Speex is not targeted at cell phones but rather at voice over IP (VoIP) and file-based compression. The design goal was a codec that allows both very good speech quality and low bit-rates, which led to the development of a codec with multiple bit-rates. Very good quality also meant supporting wideband (16 kHz sampling rate) in addition to narrowband (telephone quality, 8 kHz sampling rate). Designing for VoIP instead of cell phone use means that Speex must be robust to lost packets, but not to corrupted ones, since packets either arrive unaltered or don't arrive at all. All this led to the choice of CELP as the encoding technique for Speex. One of the main reasons is that CELP has long proven that it can do the job and scale well to both low bit-rates (think DoD CELP at 4.8 kbit/s) and high bit-rates (think G.728 at 16 kbit/s). The main characteristics can be summarized as follows:
Speex is mainly designed for three different sampling rates: 8 kHz, 16 kHz, and 32 kHz. These are respectively referred to as narrowband, wideband and ultra-wideband.
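As a minimal sketch, the three designations can be treated as a lookup from sampling rate to band name. The helper names `speex_band` and `samples_per_frame` are hypothetical, and the 20 ms frame duration used to derive a frame size is an assumption for illustration rather than something stated above.

```python
# Sampling rates (in Hz) that Speex is mainly designed for, with their names.
BANDS = {
    8000: "narrowband",
    16000: "wideband",
    32000: "ultra-wideband",
}


def speex_band(sample_rate_hz: int) -> str:
    """Return the band name for a supported sampling rate (hypothetical helper)."""
    if sample_rate_hz not in BANDS:
        raise ValueError(f"unsupported sampling rate: {sample_rate_hz} Hz")
    return BANDS[sample_rate_hz]


def samples_per_frame(sample_rate_hz: int, frame_ms: int = 20) -> int:
    """Number of samples in one frame.

    The 20 ms default is an assumption made for this sketch.
    """
    speex_band(sample_rate_hz)  # reject rates outside the three main ones
    return sample_rate_hz * frame_ms // 1000


if __name__ == "__main__":
    for rate in sorted(BANDS):
        print(rate, speex_band(rate), samples_per_frame(rate))
```

Under the assumed frame duration, doubling the sampling rate doubles the number of samples per frame, which is why the wider bands cost more to process.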
Speex encoding is controlled most of the time by a quality parameter that ranges from 0 to 10. In constant bit-rate (CBR) operation, the quality parameter is an integer, while for variable bit-rate (VBR), the parameter is a float.
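The distinction between the two modes can be illustrated with a small validation sketch. The function `validate_quality` is hypothetical and is not part of the Speex library; it only encodes the rule described here: an integer from 0 to 10 for CBR, and a float in the same range for VBR.

```python
def validate_quality(quality, vbr=False):
    """Check a quality setting against the 0-10 range (hypothetical helper).

    CBR requires an integer; VBR accepts any real number and returns a float.
    """
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(quality, bool):
        raise TypeError("quality must be a number, not a bool")

    if vbr:
        if not isinstance(quality, (int, float)):
            raise TypeError("VBR quality must be a number")
        value = float(quality)
    else:
        if not isinstance(quality, int):
            raise TypeError("CBR quality must be an integer")
        value = quality

    # This comparison is also False for NaN, so NaN is rejected here.
    if not 0 <= value <= 10:
        raise ValueError("quality must be between 0 and 10")
    return value


if __name__ == "__main__":
    print(validate_quality(8))              # CBR: integer setting
    print(validate_quality(7.5, vbr=True))  # VBR: fractional setting allowed
```

Keeping the two cases in one function makes it explicit that a fractional value such as 7.5 is only meaningful in VBR mode.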
With Speex, it is possible to vary the complexity allowed for the encoder. This is done by controlling how the search is performed with an integer ranging from 1 to 10, in a way that's similar to the -1 to -9 options of the gzip and bzip2 compression utilities. For normal use, the noise level at complexity 1 is between 1 and 2 dB higher than at complexity 10, but the CPU requirements for complexity 10 are about 5 times higher than for complexity 1. In practice, the best trade-off is between complexity 2 and 4, though higher settings are often useful when encoding non-speech sounds like DTMF tones.
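To put the 1 to 2 dB figure in perspective, the standard decibel definition for power ratios, 10^(dB/10), can be applied directly. The sketch below is only that arithmetic; the function name `db_to_power_ratio` is chosen for illustration, and reading the quoted noise figures as power ratios is an assumption.

```python
def db_to_power_ratio(db: float) -> float:
    """Convert a level difference in decibels to a power ratio."""
    return 10 ** (db / 10)


# Noise penalty quoted for complexity 1 relative to complexity 10.
low_penalty = db_to_power_ratio(1.0)   # roughly 1.26
high_penalty = db_to_power_ratio(2.0)  # roughly 1.58

# CPU cost factor quoted for complexity 10 relative to complexity 1.
cpu_factor = 5.0

if __name__ == "__main__":
    print(f"noise power at complexity 1: {low_penalty:.2f}x to {high_penalty:.2f}x")
    print(f"CPU cost at complexity 10: about {cpu_factor:.0f}x")
```

Read this way, the highest complexity setting spends about five times the CPU to reduce noise power by a factor of roughly 1.26 to 1.58, which is consistent with the observation that the best trade-off lies at the lower settings.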