Revision as of 09:26, 21 January 2022 edit Alexander Davronov (talk \| contribs) Extended confirmed users 10,942 edits →Normalization: "software implementing"; subsection Tags: Reverted Visual edit ← Previous edit		Revision as of 06:28, 24 January 2022 edit undo Vfnn (talk \| contribs) 29 edits Undid revision 1067025660 by Alexander Davronov (talk) "software" is not a countable noun Tags: Undo Reverted Next edit →
Line 50: ==Normalization== AThe ~~text~~implementation ~~processing software implementating the~~of Unicode string ~~search~~searches and ~~comparison~~comparisons ~~functionality~~in text processing software must take into account the presence of equivalent code points. In the absence of this feature, users searching for a particular code point sequence would be unable to find other visually indistinguishable glyphs that have a different, but canonically equivalent, code point representation. ~~=== Algorithms ===~~ Unicode provides standard normalization algorithms that produce a unique (normal) code point sequence for all sequences that are equivalent; the equivalence criteria can be either canonical (NF) or compatibility (NFK). Since one can arbitrarily choose the [[representative (mathematics)\|representative]] element of an [[equivalence class]], multiple canonical forms are possible for each equivalence criterion. Unicode provides two normal forms that are semantically meaningful for each of the two compatibility criteria: the composed forms NFC and NFKC, and the decomposed forms NFD and NFKD. Both the composed and decomposed forms impose a '''canonical ordering''' on the code point sequence, which is necessary for the normal forms to be unique.

Unicode equivalence: Difference between revisions