Unary coding

This is an old revision of this page, as edited by DavidCBryant (talk | contribs) at 12:29, 21 February 2007 (Forced P to be an operatorname in <math> expressions.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Unary coding is an entropy encoding that represents a natural number, n, with n − 1 ones followed by a zero. For example 5 is represented as 11110. Some representations use n ones followed by a zero. The ones and zeros are interchangeable without loss of generality.

Unary coding is easily shown to be an optimally efficient encoding for the following discrete probability distribution

for . It is in fact optimal for any geometric distribution

for which k ≥ φ = 1.61803398879…, the golden ratio, or, more generally, for any distribution for which

for .

A modified unary encoding is used in UTF-8.