Unicode control characters: Difference between revisions

Content deleted Content added
Use {{cite iso-ir}} for ISO-IR references.
Update unichar template to match character name from definitive source https://unicode.org/Public/UNIDATA/UnicodeData.txt
Line 26:
== Language tags ==
{{main|Tags (Unicode block)}}
Unicode previously included 128 characters, now deprecated, for language tags. These characters essentially mirrored the 128 ASCII characters but were used to identify the subsequent text as belonging to a particular language according to [[BCP 47]]. For example, to indicate subsequent text as the variant of English as written in the United States, the sequence {{unichar|E0001|LANGUAGE TAG}}, {{unichar|E0065|Tag Latin Small Letter e}}, {{unichar|E006E|Tag Latin Small Letter n}}, {{unichar|E002D|Tag Hyphen-minus}}, {{unichar|E0075|Tag Latin Small Letter u}} and {{unichar|E0073|Tag Latin Small Letter s}} would have been used.
 
These language tag characters would not be displayed themselves. However, they would provide information for text processing or even for the display of other characters. For example, the display of Unihan ideographs might have substituted different glyphs if the language tags indicated Korean than if the tags indicated Japanese. Another example, might have influenced the display of decimal digits 0 through 9 differently depending on the language they appeared in.
Line 34:
 
== Interlinear annotation ==
Three formatting characters provide support for [[Ruby text|interlinear annotation]] ({{unichar|FFF9|INTERLINEAR ANNOTATION ANCHOR}}, {{unichar|FFFA|INTERLINEAR ANNOTATION SEPARATOR}}, {{unichar|FFFB|INTERLINEAR ANNOTATION TERMINATOR}}). This may be used for providing notes that would typically be displayed between the lines of other text. Unicode considers such annotation to be rich text and recommends using other protocols for such annotation. The W3C [[Ruby character#Ruby markup|Ruby markup]] recommendation is an example of an alternate protocol supporting more advanced interlinear annotation.
 
== Bidirectional text control ==