Content deleted Content added
KamikazeBot (talk | contribs) m robot Modifying: ru:CELP |
IznoRepeat (talk | contribs) m add WP:TEMPLATECAT to remove from template; genfixes |
||
(41 intermediate revisions by 29 users not shown) | |||
Line 1:
{{Short description|Speech coding algorithm}}
'''Code-excited linear prediction''' ('''CELP''') is a [[speech coding]] algorithm originally proposed by M.R. Schroeder and B.S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algorithms, such as [[Residual-excited linear prediction|RELP]] and [[Linear predictive coding|LPC]] [[vocoders]] (e.g., [[FS-1015]]). Along with its variants, such as [[ACELP]], [[RCELP]], [[LD-CELP]] and [[VSELP]], it is currently the most widely used speech coding algorithm. It is also used in [[MPEG-4 Audio]] speech coding. CELP is commonly used as a generic term for a class of algorithms and not for a particular codec.▼
{{No footnotes|date=May 2022}}
▲'''Code-excited linear prediction''' ('''CELP''') is a [[linear predictive coding|linear predictive]] [[speech coding]] algorithm originally proposed by
==
The CELP algorithm is based on four main ideas:
* Using the [[source-filter model of speech production]] through [[linear prediction]] (LP) (see the textbook "speech coding algorithm");▼
▲* Using the [[source-filter model of speech production]] through [[linear prediction]] (LP)(see the textbook "speech coding algorithm");
* Using an adaptive and a fixed codebook as the input (excitation) of the LP model;
* Performing a search in closed-loop in a
* Applying [[vector quantization
The original algorithm as simulated in 1983 by Schroeder and Atal required 150 seconds to encode 1 second of speech when run on a [[Cray-1]] supercomputer. Since then, more efficient ways of implementing the codebooks and improvements in computing capabilities have made it possible to run the algorithm in embedded devices, such as mobile phones.
==
[[File:
Before exploring the complex encoding process of CELP we introduce the decoder here. Figure 1 describes a generic CELP decoder. The excitation is produced by summing the contributions from
:<math>e[n]=
where <math>e_{
The filter that shapes the excitation has an all-pole model of the form <math>1/A(z)</math>, where <math>A(z)</math> is called the prediction filter and is obtained using linear prediction ([[Levinson recursion|
==
The main principle behind CELP is called
In order to achieve real-time encoding using limited computing resources, the CELP search is broken down into smaller, more manageable, sequential searches using a simple perceptual weighting function. Typically, the encoding is performed in the following order:
* Linear Prediction Coefficients (LPC) are computed and quantized, usually as [[Line spectral pairs|LSPs]]▼
* The adaptive (pitch) codebook is searched and its contribution removed▼
* The fixed (innovation) codebook is searched▼
▲* [[Linear
=== Noise weighting ===▼
▲* The adaptive (pitch) codebook is searched and its contribution removed.
▲* The fixed (innovation) codebook is searched.
Most (if not all) modern audio codecs attempt to [[Psychoacoustics|shape the coding noise]] so that it appears mostly in the frequency regions where the ear cannot detect it. For example, the ear is more tolerant to noise in parts of the spectrum that are louder and vice versa. That's why instead of minimizing the simple quadratic error, CELP minimizes the error for the ''perceptually weighted'' ___domain. The weighting filter W(z) is typically derived from the LPC filter by the use of [[bandwidth expansion]]:
Line 37 ⟶ 39:
where <math>\gamma_1 > \gamma_2</math>.
==
* [[MPEG-4 Part 3]] (CELP as an MPEG-4 Audio Object Type)
* [[G.728]]
* [[G.718]]
* [[G.729.1]]
* [[Comparison of audio
* [[CELT]] is a related audio codec that borrows some ideas from CELP.
==References==
* B.S. Atal, "The History of Linear Prediction," ''IEEE Signal Processing Magazine'', vol. 23, no. 2, March 2006, pp.
* This is based on a [http://people.xiph.org/~jm/papers/speex_lca2006.pdf paper] presented at [http://linux.conf.au/ Linux.Conf.Au] ▼
* M. R. Schroeder and B. S. Atal, "Code-excited linear prediction (CELP): high-quality speech at very low bit rates," in ''Proceedings of the IEEE [[International Conference on Acoustics, Speech, and Signal Processing]]'' (ICASSP), vol. 10, pp.
* Some parts based on the [[Speex]] codec [http://www.speex.org/docs/ manual]▼
* [http://www.speech.cs.cmu.edu/comp.speech/Section3/Software/celp-3.2a.html reference implementations] of CELP 1016A (CELP 3.2a) and LPC 10e.▼
* [http://www.otolith.com/otolith/olt/lpc.html Linear Predictive Coding (LPC)]▼
==External
▲* This article is based on a [http://people.xiph.org/~jm/papers/speex_lca2006.pdf paper] presented at [http://linux.conf.au/ Linux.Conf.Au]
▲* B.S. Atal, "The History of Linear Prediction," ''IEEE Signal Processing Magazine'', vol. 23, no. 2, March 2006, pp. 154–161.
▲* [http://www.speech.cs.cmu.edu/comp.speech/Section3/Software/celp-3.2a.html reference implementations] of CELP 1016A (CELP 3.2a) and LPC 10e. {{Webarchive|url=https://web.archive.org/web/20161212000335/http://www.speech.cs.cmu.edu/comp.speech/Section3/Software/celp-3.2a.html |date=2016-12-12 }}
▲* [https://web.archive.org/web/20090602220112/http://www.otolith.com/otolith/olt/lpc.html Linear Predictive Coding (LPC)]
=== Selected readings ===
▲* M. R. Schroeder and B. S. Atal, "Code-excited linear prediction (CELP): high-quality speech at very low bit rates," in ''Proceedings of the IEEE [[International Conference on Acoustics, Speech, and Signal Processing]]'' (ICASSP), vol. 10, pp. 937–940, 1985.
* [https://www.speex.org/docs/manual/speex-manual/node9.html Introduction to CELP Coding]
* [https://cnx.org/content/m10482/latest/ Speech Processing: Theory of LPC Analysis and Synthesis] {{Webarchive|url=https://web.archive.org/web/20140615041652/http://cnx.org/content/m10482/latest/ |date=2014-06-15 }}
{{Compression Methods}}
{{Compression
[[Category:Speech codecs]]
[[Category:Data compression]]
|