Unicode alias names and abbreviations: Difference between revisions

Unicode 16.0 release
 
(9 intermediate revisions by 5 users not shown)
Line 4:
==Background==
The formal, primary Unicode name is unique among all names, uses only a restricted set of characters and a fixed format, and is guaranteed never to change. The formal name consists of characters A–Z (uppercase), 0–9, " " (space), and "-" (hyphen).
In addition to this name, a character can have one or more formal (normative) '''alias names'''. Such an alias name also follows the rules of a name: characters used (A-Z, -, 0-9, &lt;space>) and not used (a-z, %, $, etc.). Alias names are also unique in the full name set (that is, names and alias names are all unique in their combined set). Alias names are formally described in the Unicode Standard.<ref name="NameAliases">{{cite web|url=https://www.unicode.org/Public/16.0.0/ucd/NameAliases.txt |title=NameAliases-14.0.0.txt|access-date=2024-09-11|date=2024-04-24|publisher=The Unicode Consortium}}</ref><ref>{{cite web|url=https://www.unicode.org/versions/Unicode16.0.0/core-spec/chapter-24|title=The Unicode Standard|version=14.0.0|publisher=The Unicode Consortium|isbn=978-1-936213-29-0|date=2021}}</ref> In this sense, an abbreviation is also considered a Unicode ''name''.
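
The character rule and the uniqueness rule described above can be sketched in a few lines of Python. This is only an illustration: the names <code>NAME_CHARS</code>, <code>uses_name_characters</code> and <code>all_unique</code> are hypothetical, and the check covers only the permitted character set, not every naming rule of the Standard.

```python
import re

# Hypothetical helper names; this checks the character set only
# (uppercase A-Z, digits 0-9, space, hyphen), not the full naming rules.
NAME_CHARS = re.compile(r"[A-Z0-9 -]+")


def uses_name_characters(name: str) -> bool:
    """Return True if the string uses only characters allowed in a name or alias."""
    return NAME_CHARS.fullmatch(name) is not None


def all_unique(names, aliases) -> bool:
    """Return True if names and aliases are unique in their combined set."""
    combined = list(names) + list(aliases)
    return len(set(combined)) == len(combined)


names = ["NO-BREAK SPACE", "ZERO WIDTH NO-BREAK SPACE"]
aliases = ["NBSP", "BYTE ORDER MARK"]

print(all(uses_name_characters(n) for n in names + aliases))
print(uses_name_characters("no-break space"))  # lowercase letters are not allowed
print(all_unique(names, aliases))
```

A label such as <code>&lt;control-0000></code> fails this check, which is consistent with it being a label rather than a name.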
 
==Reason to add an alias==
Line 12:
;1. Abbreviation
:Commonly occurring abbreviations (or acronyms) for control codes, format characters, spaces, and variation selectors.
:There are 354 such aliases, including 256 aliases for variation selectors (VS1 ... VS256).
:For example, {{unichar|00A0|no-break space}} has alias {{smallcaps|NBSP}}.
:Presentation: in the code charts, the abbreviation is shown in a dashed box: {{Unicode alias/abbrbox|abbr=NBSP|title=no-break space}}.
Line 22:
;3. Correction
:This is a correction for a "serious problem" in the primary character name, usually an error.
:There are 35 such aliases.
:For example, {{unichar|2118|SCRIPT CAPITAL P}} is actually a ''lowercase'' p, and so is given alias name {{smallcaps2|1=WEIERSTRASS ELLIPTIC FUNCTION}}: "actually this has the form of a lowercase calligraphic p, despite its name, and through the alias the correct spelling is added."
:Presentation: A corrected name is preceded by symbol ※ (the [[reference mark]]<!-- {{unichar|203B|REFERENCE MARK}} -->).
;4. Alternate
:A widely used alternate name for a character.
:There is 1 such alias.
:Example: {{unichar|FEFF|ZERO WIDTH NO-BREAK SPACE}} has alternate {{smallcaps2|1=BYTE ORDER MARK}}.
Line 36:
:Presentation: These figment abbreviations are not published in the Standard; the chart shows "XXX" for each informally, that is, not as a unique or identifying abbreviation.
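
The alias types listed above correspond to the third field of each record in <code>NameAliases.txt</code>, which stores one alias per line as <code>code point;alias;type</code>. The following Python sketch parses a small sample in that layout and counts aliases per type. The sample lines are taken from the examples in this article, not from the full data file, and <code>SAMPLE</code> and <code>parse_aliases</code> are hypothetical names.

```python
from collections import Counter

# Sample records in the semicolon-separated layout of NameAliases.txt.
# Only examples mentioned in this article are included (assumption: the
# real file has many more lines and a longer comment header).
SAMPLE = """\
# code point;alias;type
0000;NULL;control
0000;NUL;abbreviation
00A0;NBSP;abbreviation
2118;WEIERSTRASS ELLIPTIC FUNCTION;correction
FEFF;BYTE ORDER MARK;alternate
"""


def parse_aliases(text):
    """Parse alias records into (code point, alias, type) tuples."""
    records = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        code, alias, kind = (field.strip() for field in line.split(";"))
        records.append((int(code, 16), alias, kind))
    return records


records = parse_aliases(SAMPLE)
counts = Counter(kind for _, _, kind in records)

for kind, n in sorted(counts.items()):
    print(f"{kind}: {n}")
```

Run against the complete data file instead of the sample, the same counting would give the per-type totals quoted in the list above.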
 
==List of aliases==
{{Unicode alias/tableheader|version=15.0}}
{{Unicode alias/row|U+=0000 |rows=1 |rownr=1 |range_text= |dec=0 |name=<control-0000> |namelabel=<control-0000> |wl=Null character |abbr=NUL |control=NULL |alternate= |correction= |figment= |informal= |chartid=0000 |charttitle=C0 Controls and Basic Latin |note=}}
Line 150:
{{Unicode alias/row|U+=180E |rows=1 |rownr=1 |range_text= |dec=6158 |name=MONGOLIAN VOWEL SEPARATOR |namelabel= |wl=Mongolian vowel separator |abbr=MVS |control= |alternate= |correction= |figment= |informal= |chartid=1800 |charttitle=Mongolian |note=}}
{{Unicode alias/row|U+=180F |rows=1 |rownr=1 |range_text= |dec=6159 |name=MONGOLIAN FREE VARIATION SELECTOR FOUR |namelabel= |wl=Variation selector |abbr=FVS4 |control= |alternate= |correction= |figment= |informal= |chartid=1800 |charttitle=Mongolian |note=}}
{{Unicode alias/row|U+=1BBD |rows=1 |rownr=1 |range_text= |dec=7101 |name=SUNDANESE LETTER BHA |namelabel= |wl= |abbr= |control= |alternate= |correction=SUNDANESE LETTER ARCHAIC I |figment= |informal= |chartid=1B80 |charttitle=Sundanese |note={{Unicode version|prefix=added in version|version=15.0}}}}
{{Unicode alias/row|U+=200B |rows=1 |rownr=1 |range_text= |dec=8203 |name=ZERO WIDTH SPACE |namelabel= |wl=Zero-width space |abbr=ZWSP |control= |alternate= |correction= |figment= |informal= |chartid=2000 |charttitle=General Punctuation |note=}}
{{Unicode alias/row|U+=200C |rows=1 |rownr=1 |range_text= |dec=8204 |name=ZERO WIDTH NON-JOINER |namelabel= |wl=Zero-width non-joiner |abbr=ZWNJ |control= |alternate= |correction= |figment= |informal= |chartid=2000 |charttitle=General Punctuation |note=}}
Line 184:
{{Unicode alias/row|U+=122D4 |rows=1 |rownr=1 |range_text= |dec=74452 |name=CUNEIFORM SIGN SHIR TENU |namelabel= |wl=Cuneiform |abbr= |control= |alternate= |correction=CUNEIFORM SIGN NU11 TENU |figment= |informal= |chartid=12000 |charttitle=Cuneiform |note= }}
{{Unicode alias/row|U+=122D5 |rows=1 |rownr=1 |range_text= |dec=74453 |name=CUNEIFORM SIGN SHIR OVER SHIR BUR OVER BUR |namelabel= |wl=Cuneiform |abbr= |control= |alternate= |correction=CUNEIFORM SIGN NU11 OVER NU11 BUR OVER BUR |figment= |informal= |chartid=12000 |charttitle=Cuneiform |note= }}
{{Unicode alias/row|U+=12327 |rows=1 |rownr=1 |range_text= |dec=74535 |name=CUNEIFORM SIGN UN GUNU |namelabel= |wl= |abbr= |control= |alternate= |correction=CUNEIFORM SIGN KALAM |figment= |informal= |chartid=12000 |charttitle=Cuneiform |note=}}
{{Unicode alias/row|U+=1680B |rows=1 |rownr=1 |range_text= |dec=92171 |name=BAMUM LETTER PHASE-A MAEMBGBIEE |namelabel= |wl= |abbr= |control= |alternate= |correction=BAMUM LETTER PHASE-A MAEMGBIEE |figment= |informal= |chartid=16800 |charttitle=Bamum Supplement |note=}}
{{Unicode alias/row|U+=16E56 |rows=1 |rownr=1 |range_text= |dec=93782 |name=MEDEFAIDRIN CAPITAL LETTER HP |namelabel= |wl=Medefaidrin |abbr= |control= |alternate= |correction=MEDEFAIDRIN CAPITAL LETTER H |figment= |informal= |chartid=16E40 |charttitle=Medefaidrin |note= }}
{{Unicode alias/row|U+=16E57 |rows=1 |rownr=1 |range_text= |dec=93783 |name=MEDEFAIDRIN CAPITAL LETTER NY |namelabel= |wl=Medefaidrin |abbr= |control= |alternate= |correction=MEDEFAIDRIN CAPITAL LETTER NG |figment= |informal= |chartid=16E40 |charttitle=Medefaidrin |note= }}
Line 190 ⟶ 192:
{{Unicode alias/row|U+=1B001 |rows=1 |rownr=1 |range_text= |dec=110593 |name=HIRAGANA LETTER ARCHAIC YE |namelabel= |wl=Hentaigana |abbr= |control= |alternate= |correction=HENTAIGANA LETTER E-1 |figment= |informal= |chartid=1B000 |charttitle=Kana Supplement |note= }}
{{Unicode alias/row|U+=1D0C5 |rows=1 |rownr=1 |range_text= |dec=118981 |name=BYZANTINE MUSICAL SYMBOL FHTORA SKLIRON CHROMA VASIS |namelabel= |wl=Byzantine Musical Symbols |abbr= |control= |alternate= |correction=BYZANTINE MUSICAL SYMBOL FTHORA SKLIRON CHROMA VASIS |figment= |informal= |chartid=1D000 |charttitle=Byzantine Musical Symbols |note= }}
{{Unicode alias/row|U+=1E899 |rows=1 |rownr=1 |range_text= |dec=125081 |name=MENDE KIKAKUI SYLLABLE M172 MBOO |namelabel= |wl= |abbr= |control= |alternate= |correction=MENDE KIKAKUI SYLLABLE M172 MBO |figment= |informal= |chartid=1E800 |charttitle=Mende Kikakui |note=}}
{{Unicode alias/row|U+=1E89A |rows=1 |rownr=1 |range_text= |dec=125082 |name=MENDE KIKAKUI SYLLABLE M174 MBO |namelabel= |wl= |abbr= |control= |alternate= |correction=MENDE KIKAKUI SYLLABLE M174 MBOO |figment= |informal= |chartid=1E800 |charttitle=Mende Kikakui |note=}}
{{Unicode alias/range
|U+1=E0100 |rows1=3 |rownr1=1 |range_text1= |dec1=917760 |name1=VARIATION SELECTOR-17 |namelabel1= |wl1=Variation selector |abbr1=VS17 |control1= |alternate1= |correction1= |figment1= |informal1= |chartid1=E0100 |charttitle1=Variation Selectors Supplement |note1=
|range_text = (240 code points)
|U+2=E01EF |rows2=3 |rownr2=1 |range_text2= |dec2=917999 |name2=VARIATION SELECTOR-256 |namelabel2= |wl2= |abbr2=VS256 |control2= |alternate2= |correction2= |figment2= |informal2= |chartid2=E0100 |charttitle2=Variation Selectors Supplement |note2= |endrow=yes}}
{{Unicode alias/bottom}}
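
The last table entry collapses the range U+E0100 to U+E01EF into a single row. As a cross-check of the figures shown there (240 code points, abbreviations VS17 to VS256, decimal values 917760 and 917999), the arithmetic can be sketched in Python. The function name <code>vs_abbreviation</code> and the constants are hypothetical and cover only this supplementary range.

```python
# Range of the Variation Selectors Supplement row in the table above.
FIRST, LAST = 0xE0100, 0xE01EF
FIRST_NUMBER = 17  # U+E0100 is VARIATION SELECTOR-17


def vs_abbreviation(code_point: int) -> str:
    """Return the VSn abbreviation for a code point in U+E0100..U+E01EF."""
    if not FIRST <= code_point <= LAST:
        raise ValueError(f"U+{code_point:04X} is outside the supplement range")
    return f"VS{code_point - FIRST + FIRST_NUMBER}"


count = LAST - FIRST + 1

print(count)                      # number of code points in the range
print(FIRST, LAST)                # decimal values of the two end points
print(vs_abbreviation(FIRST), vs_abbreviation(LAST))
```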