Content deleted Content added
m Reverted edits by 2604:3D08:E87F:FA76:2D3C:91BD:AE71:8E3A (talk) (HG) (3.4.3) |
No edit summary Tags: Mobile edit Mobile web edit |
||
Line 1:
==General structure==
Since the aim of a multibyte encoding system is to minimise changes to existing application software, some characters must retain their pre-existing single-unit codes, even while other characters have multiple units in their codes. The result is that there are three sorts of units in a variable-width encoding: '''singletons''', which consist of a single unit, '''lead units''', which come first in a multiunit sequence, and '''trail units''', which come afterwards in a multiunit sequence. Input and display software obviously needs to know about the structure of the multibyte encoding scheme but other software generally doesn't need to know if a pair of bytes represent two separate characters or just one character.
|