Basic Latin (Unicode block): Difference between revisions

Content deleted Content added
languages parameter -> alphabets
added subheadings, info on versions, and ref to version 1.0
Line 10:
|controls = 33
|sources = [[ISO/IEC 8859]], [[ISO 646]]
|note = <ref>{{cite web|url=www.unicode.org|title=Unicode character database|work=The Unicode Standard|accessdate=22 March 2013}}</ref><ref name=Unicode1.0>{{cite book|title=The Unicode Standard Version 1.0, Volume 1|year=1990|publisher=Addison-Wesley Publishing Company, Inc.|isbn=0-201-56788-1}}</ref>
}}
 
The '''Basic Latin''' (or '''C0 Controls and Basic Latin''') [[Block (Unicode)|Unicode block]] is the first block of the [[Unicode]] standard, and the only block which is encoded in one byte in [[UTF-8]]. The block contains all the letters and [[ASCII control character|control codes]] of the [[ASCII]] encoding, which is a [[United States]] national variant of [[ISO/IEC 646]].
 
The letter U+005C (\) may show up as a Yen or Won sign in Japanese/Korean fonts formistaking compatibilityUnicode reasons(especially with[[UTF-8]]) as a legacy character setsset which replaced the backslash with these signs.<ref>[http://blogs.msdn.com/michkap/archive/2005/09/17/469941.aspx Sorting it all Out : When is a backslash not a backslash?]</ref>
 
The Basic Latin block was included in its present from version 1.0.0 of the Unicode Standard, without addition or alteration of the character repertoire.<ref name=Unicode1.0 />
 
The following table shows the contents of the block:
Line 23 ⟶ 26:
!Description
!Acronym
|-
| colspan=4 | C0 controls
|-
| U+0000
Line 183 ⟶ 188:
| [[C0 and C1 control codes|Unit Separator]]
| US
|-
| colspan=4 | ASCII Punctuation and Symbols
|-
| U+0020
Line 263 ⟶ 270:
|[[Slash (punctuation)|Slash]]
|
|-
| colspan=4 | ACII Digits
|-
|U+0030
Line 313 ⟶ 322:
|[[9 (number)|Digit Nine]]
|
|-
| colspan=4 | ASCII Punctuation and Symbols
|-
|U+003A
Line 348 ⟶ 359:
|[[At sign]]
|
|-
| colspan=4 | Uppercase Latin Alphabet
|-
|U+0041
Line 478 ⟶ 491:
|[[Z|Latin Capital letter Z]]
|
|-
| colspan=4 | ASCII Punctuation and Symbols
|-
|U+005B
Line 508 ⟶ 523:
|[[Grave accent]]
|
|-
| colspan=4 | Lowercase Latin Alphabet
|-
|U+0061
Line 638 ⟶ 655:
|Latin Small Letter Z
|
|-
| colspan=4 | ASCII Punctuation and Symbols
|-
|U+007B
Line 658 ⟶ 677:
|[[Tilde]]
|
|-
| colspan=4 | Control Character
|-
| U+007F
Line 664 ⟶ 685:
| DEL
|}
 
==Subheadings==
The C0 Controls and Basic Latin block contains 6 subheadings.<ref name=charts>{{cite web|title=Unicode 6.2 code charts|url=http://www.unicode.org/Public/6.2.0/charts/CodeCharts.pdf|work=The Unicode Standard|accessdate=1 April 2013}}</ref>
 
===C0 Controls===
The [[C0 and C1 Controls|C0 Controls]], referred to as C0 ASCII control codes in version 1.0, are inherited from [[ASCII]] and other 7- and 8-bit encoding schemes. The Alias names for C0 controls are taken from the [[ISO/IEC 6429:1992]] standard.<ref name=charts>
 
===ASCII Punctuation and Symbols===
This subheading refers to standard punctuation characters, simple mathematical operators, and symbols like the dollar sign, percent, ampersand, underscore, and pipe.<ref name=charts>
 
===ASCII Digits===
The ASCII Digits subheading contains the standard European number characters 1-9 and 0.<ref name=charts>
 
===Uppercase Latin Alphabet===
The Uppercase Latin Alphabet subheading contains the standard 26 letter unaccented Latin alphabet in the majescule.<ref name=charts>
 
===Lowercase Latin Alphabet===
The Lowercase Latin Alphabet subheading contains the standard 26 letter unaccented Latin alphabet in the miniscule.<ref name=charts>
 
===Control Character===
The Control Character subheading contains the "Delete" character.<ref name=charts>
 
==See also==