Quantization (signal processing)

In digital signal processing, quantization is the process of approximating a continuous range of values (or a very large set of possible discrete values) by a relatively-small set of discrete symbols or integer values. More specifically, a signal can be multi-dimensional and quantization need not be applied to all dimensions. A discrete signal need not necessarily be quantized (a pedantic point, but true nonetheless and can be a point of confusion). See ideal sampler.

File:FloorQuantizer.png

Quantization of x using Q(x) = floor(Lx)/L.

A common use of quantization is in the conversion of a continuous signal into a discrete signal by sampling and then quantizing. Both of these steps are performed in analog-to-digital converters with the quantization level specified by a number of bits. A specific example would be compact disc (CD) audio which is sampled at 44,100 Hz and quantized with 16 bits (2 bytes) which can be one of 65,536 ( $2^{1}6$ ) possible values per sample.

The simplest and best-known form of quantization is referred to as scalar quantization, since it operates on scalar (as opposed to multi-dimensional vector) input data. In general, a scalar quantization operator can be represented as

Q(x)=g(\operatorname {round} (f(x)))

where $x$ is a real number, $i=\operatorname {round} (f(x))$ is an integer, and $f(x)$ and $g(i)$ are arbitrary real-valued functions. The integer value $i=\operatorname {round} (f(x))$ is the representation that is typically stored or transmitted, and then the final interpretation is constructed using $g(i)$ when the data is later interpreted. The integer value $i$ is sometimes referred to as the quantization index.

In computer audio and most other applications, a method known as uniform quantization is the most common. If $x$ is a real valued number between -1 and 1, a uniform quantization operator that uses M bits of precision to represent each quantization index can be expressed as

Q(x)={\frac {\operatorname {round} (2^{M-1}x)}{2^{M-1}}}

.

In this case the $f(x)$ and $g(i)$ operators are just multiplying scale factors (one multiplier being the inverse of the other). The value $2^{-(M-1)}$ is often referred to as the quantization step size. Using this quantization law and assuming that quantization noise is approximately uniformly distributed over the quantization step size (an assumption typically accurate for rapidly varying $x$ or high $M$ ) and assuming that the input signal $x$ to be quantized is approximately uniformly distributed over the entire interval from -1 to 1, the signal to noise ratio (SNR) of the quantization can be computed as

{\frac {S}{N_{q}}}\approx 20\operatorname {log} _{10}(2^{M})=6.0206M\operatorname {dB}

.

From this equation, it is often said that the SNR is approximately 6 dB per bit.

In digital telephony, two popular quantization schemes are the 'A-law' (dominant in Europe) and 'µ-law' (dominant in North America and Japan). These schemes map discrete analog values to an 8-bit scale that is nearly linear for small values and then increases logarithmically as amplitude grows. Because the human ear's perception of loudness is roughly logarithmic, this provides a higher signal to noise ratio over the range of audible sound intensities for a given number of bits.

Compression

Quantization also plays a part in lossy data compression. One such lossy compression scheme is JPEG. During compression, the coefficients of the discrete cosine transform are quantized to facilitate the entropy encoding step. So by reducing the set of values (the post-quantized step of JPEG typically yields many zero values which be exploited to reduce the number of bits needed) by quantization, higher compression ratios can be achieved.

External Links

Paper on mathematical theory and analysis of quantization

Quantization (signal processing)

Compression

See also

External Links