Unicode block

This is an old revision of this page, as edited by DePiep (talk | contribs) at 11:39, 17 May 2010 (add sortkey (hidden) to Block range. rm Order now.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

A Unicode block is defined as one continuous range of codepoints. The block may be defined with starting and ending codepoints, and can be named. The block explicitly can include codepoints that are unassigned and non-characters. A block may be subdevided into smaller blocks.[1]

Unicode Blocks [2] and contained Scripts [3] [4]
Block range Block name Number of codepoints[* 1] Plane Scripts

[* 2] [* 3]

Remark
000000 U+0000..U+007F Basic Latin 128 0 BMP Latin, Common
000080 U+0080..U+00FF Latin-1 Supplement 128 0 BMP Latin, Common
000100 U+0100..U+017F Latin Extended-A 128 0 BMP
000180 U+0180..U+024F Latin Extended-B 208 0 BMP Latin
000250 U+0250..U+02AF IPA Extensions 96 0 BMP Latin
0002B0 U+02B0..U+02FF Spacing Modifier Letters 80 0 BMP Latin, Common
000300 U+0300..U+036F Combining Diacritical Marks 112 0 BMP Inherited
000370 U+0370..U+03FF Greek and Coptic 144 0 BMP Greek, Coptic, Common
000400 U+0400..U+04FF Cyrillic 256 0 BMP Cyrillic, Inherited
000500 U+0500..U+052F Cyrillic Supplement 48 0 BMP Cyrillic
000530 U+0530..U+058F Armenian 96 0 BMP Armenian, Common
000590 U+0590..U+05FF Hebrew 112 0 BMP Hebrew
000600 U+0600..U+06FF Arabic 256 0 BMP Arabic, Common, Inherited
000700 U+0700..U+074F Syriac 80 0 BMP Syriac
000750 U+0750..U+077F Arabic Supplement 48 0 BMP Arabic
000780 U+0780..U+07BF Thaana 64 0 BMP Thaana
0007C0 U+07C0..U+07FF NKo 64 0 BMP Nko
000800 U+0800..U+083F Samaritan 64 0 BMP Samaritan
000900 U+0900..U+097F Devanagari 128 0 BMP Devanagari, Common, Inherited
000980 U+0980..U+09FF Bengali 128 0 BMP Bengali
000A00 U+0A00..U+0A7F Gurmukhi 128 0 BMP Gurmukhi
000A80 U+0A80..U+0AFF Gujarati 128 0 BMP Gujarati
000B00 U+0B00..U+0B7F Oriya 128 0 BMP Oriya
000B80 U+0B80..U+0BFF Tamil 128 0 BMP Tamil
000C00 U+0C00..U+0C7F Telugu 128 0 BMP Telugu
000C80 U+0C80..U+0CFF Kannada 128 0 BMP Kannada, Common
000D00 U+0D00..U+0D7F Malayalam 128 0 BMP Malayalam
000D80 U+0D80..U+0DFF Sinhala 128 0 BMP Sinhala
000E00 U+0E00..U+0E7F Thai 128 0 BMP Thai, Common
000E80 U+0E80..U+0EFF Lao 128 0 BMP Lao
000F00 U+0F00..U+0FFF Tibetan 256 0 BMP Tibetan, Common
001000 U+1000..U+109F Myanmar 160 0 BMP Myanmar
0010A0 U+10A0..U+10FF Georgian 96 0 BMP Georgian, Common
001100 U+1100..U+11FF Hangul Jamo 256 0 BMP Hangul
001200 U+1200..U+137F Ethiopic 384 0 BMP Ethiopic
001380 U+1380..U+139F Ethiopic Supplement 32 0 BMP Ethiopic
0013A0 U+13A0..U+13FF Cherokee 96 0 BMP Cherokee
001400 U+1400..U+167F Unified Canadian Aboriginal Syllabics 640 0 BMP Canadian_Aboriginal
001680 U+1680..U+169F Ogham 32 0 BMP Ogham
0016A0 U+16A0..U+16FF Runic 96 0 BMP Runic, Common
001700 U+1700..U+171F Tagalog 32 0 BMP Tagalog
001720 U+1720..U+173F Hanunoo 32 0 BMP Hanunoo, Common
001740 U+1740..U+175F Buhid 32 0 BMP Buhid
001760 U+1760..U+177F Tagbanwa 32 0 BMP Tagbanwa
001780 U+1780..U+17FF Khmer 128 0 BMP Khmer
001800 U+1800..U+18AF Mongolian 176 0 BMP Mongolian, Common
0018B0 U+18B0..U+18FF Unified Canadian Aboriginal Syllabics Extended 80 0 BMP Canadian_Aboriginal
001900 U+1900..U+194F Limbu 80 0 BMP Limbu
001950 U+1950..U+197F Tai Le 48 0 BMP Tai_Le
001980 U+1980..U+19DF New Tai Lue 96 0 BMP New_Tai_Lue
0019E0 U+19E0..U+19FF Khmer Symbols 32 0 BMP Khmer
001A00 U+1A00..U+1A1F Buginese 32 0 BMP Buginese
001A20 U+1A20..U+1AAF Tai Tham 144 0 BMP Tai_Tham
001B00 U+1B00..U+1B7F Balinese 128 0 BMP Balinese
001B80 U+1B80..U+1BBF Sundanese 64 0 BMP Sundanese
001C00 U+1C00..U+1C4F Lepcha 80 0 BMP Lepcha
001C50 U+1C50..U+1C7F Ol Chiki 48 0 BMP Ol_Chiki
001CD0 U+1CD0..U+1CFF Vedic Extensions 48 0 BMP Common, Inherited
001D00 U+1D00..U+1D7F Phonetic Extensions 128 0 BMP Cyrillic, Greek, Latin
001D80 U+1D80..U+1DBF Phonetic Extensions Supplement 64 0 BMP Latin, Greek
001DC0 U+1DC0..U+1DFF Combining Diacritical Marks Supplement 64 0 BMP Inherited
001E00 U+1E00..U+1EFF Latin Extended Additional 256 0 BMP Latin
001F00 U+1F00..U+1FFF Greek Extended 256 0 BMP Greek
002000 U+2000..U+206F General Punctuation 112 0 BMP Common, Inherited
002070 U+2070..U+209F Superscripts and Subscripts 48 0 BMP Latin, Common
0020A0 U+20A0..U+20CF Currency Symbols 48 0 BMP Common
0020D0 U+20D0..U+20FF Combining Diacritical Marks for Symbols 48 0 BMP Inherited
002100 U+2100..U+214F Letterlike Symbols 80 0 BMP Latin, Greek, Common
002150 U+2150..U+218F Number Forms 64 0 BMP Latin, Common
002190 U+2190..U+21FF Arrows 112 0 BMP Common
002200 U+2200..U+22FF Mathematical Operators 256 0 BMP Common
002300 U+2300..U+23FF Miscellaneous Technical 256 0 BMP Common
002400 U+2400..U+243F Control Pictures 64 0 BMP Common
002440 U+2440..U+245F Optical Character Recognition 32 0 BMP Common
002460 U+2460..U+24FF Enclosed Alphanumerics 160 0 BMP Common
002500 U+2500..U+257F Box Drawing 128 0 BMP Common
002580 U+2580..U+259F Block Elements 32 0 BMP
0025A0 U+25A0..U+25FF Geometric Shapes 96 0 BMP Common
002600 U+2600..U+26FF Miscellaneous Symbols 256 0 BMP Common
002700 U+2700..U+27BF Dingbats 192 0 BMP Common
0027C0 U+27C0..U+27EF Miscellaneous Mathematical Symbols-A 48 0 BMP Common
0027F0 U+27F0..U+27FF Supplemental Arrows-A 16 0 BMP Common
002800 U+2800..U+28FF Braille Patterns 256 0 BMP Braille
002900 U+2900..U+297F Supplemental Arrows-B 128 0 BMP Common
002980 U+2980..U+29FF Miscellaneous Mathematical Symbols-B 128 0 BMP Common
002A00 U+2A00..U+2AFF Supplemental Mathematical Operators 256 0 BMP Common
002B00 U+2B00..U+2BFF Miscellaneous Symbols and Arrows 256 0 BMP Common
002C00 U+2C00..U+2C5F Glagolitic 96 0 BMP Glagolitic
002C60 U+2C60..U+2C7F Latin Extended-C 32 0 BMP Latin
002C80 U+2C80..U+2CFF Coptic 128 0 BMP Coptic
002D00 U+2D00..U+2D2F Georgian Supplement 48 0 BMP Georgian
002D30 U+2D30..U+2D7F Tifinagh 80 0 BMP Tifinagh
002D80 U+2D80..U+2DDF Ethiopic Extended 96 0 BMP Ethiopic
002DE0 U+2DE0..U+2DFF Cyrillic Extended-A 32 0 BMP Cyrillic
002E00 U+2E00..U+2E7F Supplemental Punctuation 128 0 BMP Common
002E80 U+2E80..U+2EFF CJK Radicals Supplement 128 0 BMP Han
002F00 U+2F00..U+2FDF Kangxi Radicals 224 0 BMP Han
002FF0 U+2FF0..U+2FFF Ideographic Description Characters 16 0 BMP Common
003000 U+3000..U+303F CJK Symbols and Punctuation 64 0 BMP Han, Common, Inherited
003040 U+3040..U+309F Hiragana 96 0 BMP Hiragana, Common, Inherited
0030A0 U+30A0..U+30FF Katakana 96 0 BMP Katakana, Common
003100 U+3100..U+312F Bopomofo 48 0 BMP Bopomofo
003130 U+3130..U+318F Hangul Compatibility Jamo 96 0 BMP Hangul
003190 U+3190..U+319F Kanbun 16 0 BMP Common
0031A0 U+31A0..U+31BF Bopomofo Extended 32 0 BMP Bopomofo
0031C0 U+31C0..U+31EF CJK Strokes 48 0 BMP Common
0031F0 U+31F0..U+31FF Katakana Phonetic Extensions 16 0 BMP Katakana
003200 U+3200..U+32FF Enclosed CJK Letters and Months 256 0 BMP Katakana, Hangul, Common
003300 U+3300..U+33FF CJK Compatibility 256 0 BMP Katakana, Common
003400 U+3400..U+4DBF CJK Unified Ideographs Extension A 6592 0 BMP Han
004DC0 U+4DC0..U+4DFF Yijing Hexagram Symbols 64 0 BMP Common
004E00 U+4E00..U+9FFF CJK Unified Ideographs 20992 0 BMP Han
00A000 U+A000..U+A48F Yi Syllables 1168 0 BMP Yi
00A490 U+A490..U+A4CF Yi Radicals 64 0 BMP Yi
00A4D0 U+A4D0..U+A4FF Lisu 48 0 BMP Lisu
00A500 U+A500..U+A63F Vai 320 0 BMP Vai
00A640 U+A640..U+A69F Cyrillic Extended-B 96 0 BMP Cyrillic
00A6A0 U+A6A0..U+A6FF Bamum 96 0 BMP Bamum
00A700 U+A700..U+A71F Modifier Tone Letters 32 0 BMP Common
00A720 U+A720..U+A7FF Latin Extended-D 224 0 BMP Latin, Common
00A800 U+A800..U+A82F Syloti Nagri 48 0 BMP Syloti_Nagri
00A830 U+A830..U+A83F Common Indic Number Forms 16 0 BMP Common
00A840 U+A840..U+A87F Phags-pa 64 0 BMP Phags_Pa
00A880 U+A880..U+A8DF Saurashtra 96 0 BMP Saurashtra
00A8E0 U+A8E0..U+A8FF Devanagari Extended 32 0 BMP Devanagari
00A900 U+A900..U+A92F Kayah Li 48 0 BMP Kayah_Li
00A930 U+A930..U+A95F Rejang 48 0 BMP Rejang
00A960 U+A960..U+A97F Hangul Jamo Extended-A 32 0 BMP Hangul
00A980 U+A980..U+A9DF Javanese 96 0 BMP Javanese
00AA00 U+AA00..U+AA5F Cham 96 0 BMP Cham
00AA60 U+AA60..U+AA7F Myanmar Extended-A 32 0 BMP Myanmar
00AA80 U+AA80..U+AADF Tai Viet 96 0 BMP Tai_Viet
00ABC0 U+ABC0..U+ABFF Meetei Mayek 64 0 BMP Meetei_Mayek
00AC00 U+AC00..U+D7AF Hangul Syllables 11184 0 BMP Hangul
00D7B0 U+D7B0..U+D7FF Hangul Jamo Extended-B 80 0 BMP Hangul
00D800 U+D800..U+DB7F High Surrogates 896 0 BMP
00DB80 U+DB80..U+DBFF High Private Use Surrogates 128 0 BMP
00DC00 U+DC00..U+DFFF Low Surrogates 1024 0 BMP
00E000 U+E000..U+F8FF Private Use Area 6400 0 BMP
00F900 U+F900..U+FAFF CJK Compatibility Ideographs 512 0 BMP Han
00FB00 U+FB00..U+FB4F Alphabetic Presentation Forms 80 0 BMP Latin, Hebrew, Armenian
00FB50 U+FB50..U+FDFF Arabic Presentation Forms-A 688 0 BMP Arabic, Common
00FE00 U+FE00..U+FE0F Variation Selectors 16 0 BMP Inherited
00FE10 U+FE10..U+FE1F Vertical Forms 16 0 BMP Common
00FE20 U+FE20..U+FE2F Combining Half Marks 16 0 BMP Inherited
00FE30 U+FE30..U+FE4F CJK Compatibility Forms 32 0 BMP Common
00FE50 U+FE50..U+FE6F Small Form Variants 32 0 BMP Common
00FE70 U+FE70..U+FEFF Arabic Presentation Forms-B 144 0 BMP Arabic, Common
00FF00 U+FF00..U+FFEF Halfwidth and Fullwidth Forms 240 0 BMP Latin, Katakana, Hangul, Common
00FFF0 U+FFF0..U+FFFF Specials 16 0 BMP Common
010000 U+10000..U+1007F Linear B Syllabary 128 1 SMP Linear_B
010080 U+10080..U+100FF Linear B Ideograms 128 1 SMP Linear_B
010100 U+10100..U+1013F Aegean Numbers 64 1 SMP Common
010140 U+10140..U+1018F Ancient Greek Numbers 80 1 SMP Greek
010190 U+10190..U+101CF Ancient Symbols 64 1 SMP Common
0101D0 U+101D0..U+101FF Phaistos Disc 48 1 SMP Common, Inherited
010280 U+10280..U+1029F Lycian 32 1 SMP Lycian
0102A0 U+102A0..U+102DF Carian 64 1 SMP Carian
010300 U+10300..U+1032F Old Italic 48 1 SMP Old_Italic
010330 U+10330..U+1034F Gothic 32 1 SMP Gothic
010380 U+10380..U+1039F Ugaritic 32 1 SMP Ugaritic
0103A0 U+103A0..U+103DF Old Persian 64 1 SMP Old_Persian
010400 U+10400..U+1044F Deseret 80 1 SMP Deseret
010450 U+10450..U+1047F Shavian 48 1 SMP Shavian
010480 U+10480..U+104AF Osmanya 48 1 SMP Osmanya
010800 U+10800..U+1083F Cypriot Syllabary 64 1 SMP Cypriot
010840 U+10840..U+1085F Imperial Aramaic 32 1 SMP Imperial_Aramaic
010900 U+10900..U+1091F Phoenician 32 1 SMP Phoenician
010920 U+10920..U+1093F Lydian 32 1 SMP Lydian
010A00 U+10A00..U+10A5F Kharoshthi 96 1 SMP Kharoshthi
010A60 U+10A60..U+10A7F Old South Arabian 32 1 SMP Old_South_Arabian
010B00 U+10B00..U+10B3F Avestan 64 1 SMP Avestan
010B40 U+10B40..U+10B5F Inscriptional Parthian 32 1 SMP Inscriptional_Parthian
010B60 U+10B60..U+10B7F Inscriptional Pahlavi 32 1 SMP Inscriptional_Pahlavi
010C00 U+10C00..U+10C4F Old Turkic 80 1 SMP Old_Turkic
010E60 U+10E60..U+10E7F Rumi Numeral Symbols 32 1 SMP Arabic
011080 U+11080..U+110CF Kaithi 80 1 SMP Kaithi
012000 U+12000..U+123FF Cuneiform 1024 1 SMP Cuneiform
012400 U+12400..U+1247F Cuneiform Numbers and Punctuation 128 1 SMP Cuneiform
013000 U+13000..U+1342F Egyptian Hieroglyphs 1072 1 SMP Egyptian_Hieroglyphs
01D000 U+1D000..U+1D0FF Byzantine Musical Symbols 256 1 SMP Common
01D100 U+1D100..U+1D1FF Musical Symbols 256 1 SMP Common, Inherited
01D200 U+1D200..U+1D24F Ancient Greek Musical Notation 80 1 SMP Greek
01D300 U+1D300..U+1D35F Tai Xuan Jing Symbols 96 1 SMP Common
01D360 U+1D360..U+1D37F Counting Rod Numerals 32 1 SMP Common
01D400 U+1D400..U+1D7FF Mathematical Alphanumeric Symbols 1024 1 SMP Common
01F000 U+1F000..U+1F02F Mahjong Tiles 48 1 SMP Common
01F030 U+1F030..U+1F09F Domino Tiles 112 1 SMP Common
01F100 U+1F100..U+1F1FF Enclosed Alphanumeric Supplement 256 1 SMP Common
01F200 U+1F200..U+1F2FF Enclosed Ideographic Supplement 256 1 SMP Hiragana, Common
020000 U+20000..U+2A6DF CJK Unified Ideographs Extension B 42720 2 SIP Han
02A700 U+2A700..U+2B73F CJK Unified Ideographs Extension C 4160 2 SIP Han
02F800 U+2F800..U+2FA1F CJK Compatibility Ideographs Supplement 544 2 SIP Han
0E0000 U+E0000..U+E007F Tags 128 14 SSP Common
0E0100 U+E0100..U+E01EF Variation Selectors Supplement 240 14 SSP Inherited
0F0000 U+F0000..U+FFFFF Supplementary Private Use Area-A 65536 15 PUA
100000 U+100000..U+10FFFF Supplementary Private Use Area-B 65536 16 PUA
  1. ^ Includes unassigned and non-character codepoints
  2. ^ The script has one or multiple characters in the block, as defined by the Script Property. This is independent of the Block-name.
  3. ^ "Common" (Zyyy) and "Inherited" (Qaai) are Scripts in ISO 15924

See also

References