Revision as of 17:24, 28 April 2011 edit Krkwrgr (talk \| contribs) 56 edits No edit summary ← Previous edit		Revision as of 19:47, 27 October 2011 edit undo Lmatt (talk \| contribs) Extended confirmed users 8,802 edits m clean up using AWB Next edit →
Line 1: '''Code-excited linear prediction''' ('''CELP''') is a [[speech coding]] algorithm originally proposed by M.R. Schroeder and B.S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algorithms, such as [[Residual-excited linear prediction\|RELP]] and [[Linear predictive coding\|LPC]] [[vocoders]] (e.g., [[FS-1015]]). Along with its variants, such as [[ACELP]], [[RCELP]], [[LD-CELP]] and [[VSELP]], it is currently the most widely used speech coding algorithm. It is also used in [[MPEG-4 Audio]] speech coding. CELP is commonly used as a generic term for a class of algorithms and not for a particular codec. == Introduction == The CELP algorithm is based on four main ideas: * Using the [[source-filter model of speech production]] through [[linear prediction]] (LP)(see the textbook "speech coding algorithm"); * Using an adaptive and a fixed codebook as the input (excitation) of the LP model; * Performing a search in closed-loop in a “perceptually weighted ___domain”. * Applying [[vector quantization~~\|vector quantization~~]] (VQ)]] The original algorithm as simulated in 1983 by Schroeder and Atal required 150 seconds to encode 1 second of speech when run on a [[Cray-1]] supercomputer. Since then, more efficient ways of implementing the codebooks and improvements in computing capabilities have made it possible to run the algorithm in embedded devices, such as mobile phones. == CELP decoder == [[File:Celp decoder.svg\|300px\|thumb\|Figure 1: CELP decoder]] Line 21: The filter that shapes the excitation has an all-pole model of the form <math>1/A(z)</math>, where <math>A(z)</math> is called the prediction filter and is obtained using linear prediction ([[Levinson recursion\|Levinson–Durbin algorithm]]). An all-pole filter is used because it is a good representation of the human vocal tract and because it is easy to compute. == CELP encoder == The main principle behind CELP is called [[Analysis-by-Synthesis\|Analysis-by-Synthesis (AbS)]] and means that the encoding (analysis) is performed by perceptually optimizing the decoded (synthesis) signal in a closed loop. In theory, the best CELP stream would be produced by trying all possible bit combinations and selecting the one that produces the best-sounding decoded signal. This is obviously not possible in practice for two reasons: the required complexity is beyond any currently available hardware and the “best sounding” selection criterion implies a human listener. Line 29: * The fixed (innovation) codebook is searched === Noise weighting === Most (if not all) modern audio codecs attempt to [[Psychoacoustics\|shape the coding noise]] so that it appears mostly in the frequency regions where the ear cannot detect it. For example, the ear is more tolerant to noise in parts of the spectrum that are louder and vice versa. That's why instead of minimizing the simple quadratic error, CELP minimizes the error for the ''perceptually weighted'' ___domain. The weighting filter W(z) is typically derived from the LPC filter by the use of [[bandwidth expansion]]: Line 36: where <math>\gamma_1 > \gamma_2</math>. == See also == * [[MPEG-4 Part 3]] (CELP as an MPEG-4 Audio Object Type) * [[G.728]] - Coding of speech at 16 kbit/s using low-delay code excited linear prediction * [[G.718]] - uses CELP for the lower two layers for the band (~~50-6400~~ 50–6400 Hz) in a two stage coding structure * [[G.729.1]] - uses CELP coding for the lower band (~~50-4000~~ 50–4000 Hz) in a three-stage coding structure * [[Comparison of audio codecs]] == External links == * This is based on a [http://people.xiph.org/~jm/papers/speex_lca2006.pdf paper] presented at [http://linux.conf.au/ Linux.Conf.Au] * Some parts based on the [[Speex]] codec [http://www.speex.org/docs/ manual] * [http://www.speech.cs.cmu.edu/comp.speech/Section3/Software/celp-3.2a.html reference implementations] of CELP 1016A (CELP 3.2a) and LPC 10e. * [http://www.otolith.com/otolith/olt/lpc.html Linear Predictive Coding (LPC)] == References == * B.S. Atal, "The History of Linear Prediction," ''IEEE Signal Processing Magazine'', vol. 23, no. 2, March 2006, pp.  154–161. * M. R. Schroeder and B. S. Atal, "Code-excited linear prediction (CELP): high-quality speech at very low bit rates," in ''Proceedings of the IEEE [[International Conference on Acoustics, Speech, and Signal Processing]]'' (ICASSP), vol. 10, pp.  937–940, 1985.▼ ▲* M. R. Schroeder and B. S. Atal, "Code-excited linear prediction (CELP): high-quality speech at very low bit rates," in ''Proceedings of the IEEE [[International Conference on Acoustics, Speech, and Signal Processing]]'' (ICASSP), vol. 10, pp. 937–940, 1985. {{Compression Methods}}

Code-excited linear prediction: Difference between revisions