Content deleted Content added
→Semantically distinct characters: remove some clear and present OR, want citation for what should be a fact |
m →Glyph substitution and composition: fixed citation template(s) to remove page from Category:CS1 maint: Extra text & general fixes using AWB (11754) |
||
Line 23:
* '''[[typographic ligature|Ligatures]]'''. Ligatures such as ‘ffi’ in the Latin script were often encoded as a separate character in legacy character sets. Unicode’s approach to ligatures is to treat them as rich text and, if turned on, handled through glyph substitution.
* '''Precomposed Roman numerals'''. For example, Roman numeral twelve (‘Ⅻ’: U+216B) can be decomposed into a Roman numeral ten (‘Ⅹ’: U+2169) and two Roman numeral ones (‘Ⅰ’: U+2160).
* '''Precomposed [[vulgar fraction|fractions]]'''. These decomposition have the keyword <fraction>. A fully conforming text handler should<ref>{{cite web|author=The Unicode Consortium|authorlink=Unicode Consortium|year=2010|title=The Unicode Standard, Version 6.0.0|publisher=Addison-Wesley Professional|isbn=978-0321480910|pages=
* '''Contextual glyphs or forms'''. These arise primarily in the Arabic script. Using fonts with glyph substitution capabilities such as [[OpenType]] and [[Apple Advanced Typography|TrueTypeGX]], Unicode conforming software can substitute the proper glyphs for the same character depending on whether that character appears at the beginning, end, middle of a word or in isolation. Such glyph substitution is also necessary for vertical (top to bottom) text layout for some East Asian languages. In this case glyphs must be substituted or synthesized for wide, narrow, small and square glyph forms. Non-conforming software or software using other character sets instead use multiple separate character for the same letter depending on its position: further complicating text processing.
|