m Task 18 (cosmetic): eval 4 templates: del empty params (6×); |
Line 2:
{{about|writing systems found in Unicode|the "Script" style of Latin letters in Unicode|Mathematical Alphanumeric Symbols|and|Script typeface}}<blockquote></blockquote>[[File:Armenian language in the Armenian alphabet.svg|thumb|[[Armenian script]]]]
In [[Unicode]], a '''script''' is a collection of [[Letter (alphabet)|letter]]s and other written signs used to represent textual information in one or more [[writing system]]s.<ref>{{cite web|url=http://unicode.org/glossary/|title=Glossary
The unified [[Combining Diacritical Marks for Symbols|diacritical character]]s and unified [[General Punctuation|punctuation characters]] frequently have the "common" or "inherited" script property. However, the individual scripts often have their own [[punctuation]] and [[diacritic]]s, so that many scripts include not only letters, but also diacritic and other marks, punctuation, numerals and even their own idiosyncratic symbols and [[Space (punctuation)|space]] characters.
Unicode 13.0 defines 154 separate scripts, including 91 modern scripts and 63 ancient or historic scripts.<ref>{{cite web|url=https://www.unicode.org/Public/UNIDATA/Scripts.txt|title=Unicode Character Database: Scripts
== Definition and classification ==
Line 21:
=== {{anchor|Common and inherited scripts}}{{anchor|Special script property values}}Special script property values ===
In addition to explicit or specific script properties, Unicode uses three special values:<ref name=Unicode_script_property>{{cite web|url=https://www.unicode.org/reports/tr24/|title=UAX #24: Unicode Script Property
;Common: Unicode can assign a character in the [[Universal Character Set|UCS]] to a single script only. However, many characters (those that are not part of a formal natural-language writing system, or that are unified across many writing systems) may be used in more than one script, for example currency signs, symbols, numerals and punctuation marks. In these cases Unicode defines them as belonging to the "common" script ([[ISO 15924]] code "Zyyy").
;Inherited: Many diacritics and non-spacing combining characters may be applied to characters from more than one script. In these cases Unicode assigns them to the "inherited" script (ISO 15924 code "Zinh"), which means that they take the script class of the base character with which they combine, and so in different contexts they may be treated as belonging to different scripts. For example, {{unichar|0308|Combining Diaeresis|cwith=}} may combine with either {{unichar|0065|Latin Small Letter E}} to create a Latin "ë", or with {{unichar|0435|Cyrillic Small Letter IE}} for the Cyrillic "ё". In the former case it inherits the Latin script of the base character, whereas in the latter case it inherits the Cyrillic script of the base character.
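The resolution of "common" and "inherited" values described above can be illustrated with a minimal Python sketch. It is not taken from the Unicode Standard or from any library: the table <code>SCRIPT_EXCERPT</code> and the function <code>resolve_scripts</code> are hypothetical names, and the table is a hand-written excerpt covering only the characters used in the example (the full assignments are published in the Unicode Character Database). The sketch also assumes the simplification that a combining mark always takes the script of the nearest preceding non-inherited character.

```python
# Hypothetical hand-written excerpt of the Script property, for illustration only.
SCRIPT_EXCERPT = {
    0x0021: "Common",     # EXCLAMATION MARK
    0x0065: "Latin",      # LATIN SMALL LETTER E
    0x0435: "Cyrillic",   # CYRILLIC SMALL LETTER IE
    0x0308: "Inherited",  # COMBINING DIAERESIS
}


def resolve_scripts(text):
    """Return (character, script) pairs for text.

    Characters with the "Inherited" value are reported with the script of the
    preceding non-inherited character. Characters missing from the excerpt
    are reported as "Unknown" (an assumption of this sketch).
    """
    resolved = []
    base_script = "Unknown"  # no base character seen yet
    for ch in text:
        script = SCRIPT_EXCERPT.get(ord(ch), "Unknown")
        if script == "Inherited":
            script = base_script   # take over the base character's script
        else:
            base_script = script   # this character becomes the new base
        resolved.append((ch, script))
    return resolved


latin = resolve_scripts("e\u0308")          # Latin e + combining diaeresis
cyrillic = resolve_scripts("\u0435\u0308")  # Cyrillic ie + combining diaeresis
```

In this sketch the same combining diaeresis is reported as Latin in the first sequence and as Cyrillic in the second, while the exclamation mark keeps its "Common" value wherever it appears.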