Linear code

This is an old revision of this page, as edited by Reetep (talk | contribs) at 11:39, 13 May 2006 (Uses of linear codes). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In mathematics and information theory, a linear code is an important type of block code used in error correction and detection schemes. Linear codes allow for more efficient encoding and decoding algorithms than other codes (cf. syndrome decoding).

Formal definition

A linear code of length   and rank   is a linear subspace   with dimension   of the vector space   where   is the finite field with   elements. Such a code with parameter   is called a q-ary code (eg when  , the code is a 5-ary code). Notable exceptions to this are the cases   and  , when the code is described as binary and ternary (respectively).

Properties

By virtue of the fact that the code is a subspace of  , the sum   of two codewords in   is also a codeword (ie an element of the subspace  ). Thus the entire code (which may be very large) may be represented as the span of a minimal set of codewords (known as a basis in linear algebra). These basis codewords are often collated in the rows of a matrix known as a generating matrix for the code  .

The subspace definition also gives rise to the important property that the minimum Hamming distance between codewords is simply the minimum Hamming weight of all codewords since:

 

implying:

 

Codes in general are often denoted by the letter  . A linear code of length  , rank   (ie having   codewords in its basis and   rows in its generating matrix) and minimum Hamming weight   is referred to as an   code.

Remark. This is not to be confused with the notation   to denote a non-linear code of length  , size   (ie having   codewords) and minimum Hamming distance  .

Examples

Some examples of linear codes include:

Uses

Binary linear codes (refer to formal definition above) are ubiquitous in electronic devices and digital storage media. For example the Reed-Solomon code is used to store digital data on a compact disc.


Sphere packings and lattices

Block codes are tied to the sphere packing problem which has received some attention over the years. In two dimensions, it is easy to visualize. Take a bunch of pennies flat on the table and push them together. The result is a hexagon pattern like a bee's nest. But block codes rely on more dimensions which cannot easily be visualized. The powerful Golay code used in deep space communications uses 24 dimensions. If used as a binary code (which it usually is,) the dimensions refer to the length of the codeword as defined above.

The theory of coding uses the N-dimensional sphere model. For example, how many pennies can be packed into a circle on a tabletop or in 3 dimensions, how many marbles can be packed into a globe. Other considerations enter the choice of a code. For example, hexagon packing into the constraint of a rectangular box will leave empty space at the corners. As the dimensions get larger, the percentage of empty space grows smaller. But at certain dimensions, the packing uses all the space and these codes are the so called perfect codes. There are very few of these codes.

Another item which is often overlooked is the number of neighbors a single codeword may have. Again, lets use pennies as an example. First we pack the pennies in a rectangular grid. Each penny will have 4 near neighbors (and 4 at the corners which are farther away). In a hexagon, each penny will have 6 near neighbors. When we increase the dimensions, the number of near neighbors increases very rapidly.

The result is the number of ways for noise to make the receiver choose a neighbor (hence an error) grows as well. This is a fundamental limitation of block codes, and indeed all codes. It may be harder to cause an error to a single neighbor, but the number of neighbors can be large enough so the total error probability actually suffers.