Unicode equivalence: Difference between revisions

Content deleted Content added
Tree4rest (talk | contribs)
Broke off last sentence of intro as it summarizes both proceeding paragraphs, and directed readers to the 2.1 Normal forms section for clarification of how this results in for total forms.
Tree4rest (talk | contribs)
No edit summary
Line 8:
The standard also defines a [[text normalization]] procedure, called '''Unicode normalization''', that replaces equivalent sequences of characters so that any two texts that are equivalent will be reduced to the same sequence of code points, called the '''normalization form''' or '''normal form''' of the original text. For each of the two equivalence notions, Unicode defines two normal forms, one '''fully composed''' (where multiple code points are replaced by single points whenever possible), and one '''fully decomposed''' (where single points are split into multiple ones).
 
These traits can beget combined intoin the four normal forms, as explained below, any of which can be used in text processing.
 
==Sources of equivalence==