Content deleted Content added
No edit summary |
Adding short description: "Lossy audio coding technique" |
||
(58 intermediate revisions by 36 users not shown) | |||
Line 1:
{{Short description|Lossy audio coding technique}}
{{about|the signal coding technique|the Bluetooth audio codec|SBC (codec)}}
{{no footnotes|date = October 2011}}
[[File:SubBandCoding.svg|thumb|500px|Sub-band coding and decoding signal flow diagram]]
==Basic Principles==▼
SBC depends on a phenomenon of the human hearing system called masking. Normal human ears are sensitive to a wide range of frequencies. However, when a lot of signal energy is present at one frequency, the ear cannot hear lower energy at nearby frequencies. We say that the louder frequency masks the softer frequencies. The louder frequency is called the masker.▼
In [[signal processing]], '''sub-band coding''' ('''SBC''') is any form of [[transform coding]] that breaks a signal into a number of different [[frequency band]]s, typically by using a [[fast Fourier transform]], and encodes each one independently. This decomposition is often the first step in data compression for audio and video signals.
SBC is the core technique used in many popular [[lossy audio compression]] algorithms including [[MP3]].
The basic idea of SBC is to save signal bandwidth by throwing away information about frequencies which are masked. The result won't be the same as the original signal, but if the computation is done right, human ears can't hear the difference. ▼
==Encoding audio signals==
The simplest way to digitally encode audio signals is
The more bits used to represent each sample, the finer the granularity in the digital representation, and thus the smaller the quantization error. Such ''quantization errors'' may be thought of as a type of noise, because they are effectively the difference between the original source and its binary representation. With PCM, the audible effects of these errors can be mitigated with [[dither]] and by using enough bits to ensure that the noise is low enough to be masked either by the signal itself or by other sources of noise. A high quality signal is possible, but at the cost of a high [[bitrate]] (e.g., over 700 [[kbit/s]] for one channel of CD audio). In effect, many bits are wasted in encoding masked portions of the signal because PCM makes no assumptions about how the human ear hears.
Coding techniques reduce bitrate by exploiting known characteristics of the auditory system. A classic method is nonlinear PCM, such as the [[μ-law algorithm]]. Small signals are digitized with finer granularity than are large ones; the effect is to add noise that is proportional to the signal strength. Sun's [[Au file format]] for sound is a popular example of mu-law encoding. Using 8-bit mu-law encoding would cut the per-channel bitrate of CD audio down to about 350 kbit/s, half the standard rate. Because this simple method only minimally exploits masking effects, it produces results that are often audibly inferior compared to the original.
▲The utility of SBC
▲The basic idea of SBC is to
Decoding is much easier than encoding, since
==Applications==
Beginning in the late 1980s, a standardization body, the [[Moving Picture Experts Group]] (MPEG), developed standards for coding of both audio and video. Subband coding resides at the heart of the popular MP3 format (more properly known as [[MPEG-1 Audio Layer III]]), for example.
Sub-band coding is used in the [[G.722]] codec which uses sub-band adaptive differential pulse code modulation (SB-[[ADPCM]]) within a bit rate of 64 kbit/s. In the SB-ADPCM technique, the frequency band is split into two sub-bands (higher and lower) and the signals in each sub-band are encoded using ADPCM.
==
* [https://web.archive.org/web/20070613152917/http://www.otolith.com/otolith/olt/sbc.html Sub-Band Coding Tutorial]
{{Compression Methods}}
[[Category:Data compression]]
[[Category:Audio engineering]]
[[Category:Signal processing]]
|