List of Unicode characters: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 21:21, 6 January 2025 edit 2806:109f:a:90ce:f93c:3216:40a8:ec14 (talk) No edit summary Tags: Reverted Mobile edit Mobile web edit ← Previous edit		Latest revision as of 13:51, 26 August 2025 edit undo Vanisaac (talk \| contribs) Autopatrolled, Extended confirmed users 94,172 edits rv vandalism Tag: Undo
(36 intermediate revisions by 21 users not shown)
Line 1: {{Short description\|None}} {{for\|a higher-level list of entire blocks rather than ~~individuals~~individual characters\|Unicode block}} [[File:New Unicode logo.svg\|thumb\|upright\|Unicode logo]] {{SpecialChars}} As of [[Unicode]] version 16.0, there are ~~155~~292,~~063~~531 assigned [[character (computing)\|characters]] with [[code point]]s,<ref>{{cite web\|url=https://www.unicode.org/versions/stats/charcountv16_0.html \|title=Unicode Version 16.0 Character Counts}}</ref> covering 168 modern and historical [[Script (Unicode)\|scripts]], as well as multiple symbol sets. As it is [[WP:CHOKING\|not technically possible]] to list all of these characters in a single Wikipedia page, this list is limited to a subset of the most important characters for English-language readers, with links to other pages which list the supplementary characters. This article includes the 1,062 characters in the Multilingual European Character Set 2 ([[MES-2]]) subset, and some additional related characters. == Character reference overview == Line 12: \| colheaderstyle=padding:0.15em 0.15em 0.25em;font-weight:normal; \|colstyle=white-space:nowrap; \| col1header = [[Alphabet]]ic \| col1headercolspan = 2 \| col1 = {{legend\|#A9A9C0\|[[#Latin script\|Latin]]}} {{legend\|#008899\|[[#Cyrillic\|Cyrillic]]}} {{legend\|#2175D8\|[[#Greek and Coptic\|Greek]]}} {{legend\|#003A66\|[[#Armenian\|Armenian]]}} {{legend\|#0000FF\|[[#Georgian\|Georgian]]}} {{legend\|#00FFFF\|[[#Mongolian\|Mongolian]]}} {{legend\|#0090DA\|[[#Other African scripts\|Neo-Tifinagh]]}} {{legend\|#97CBFF\|[[#Other American scripts\|Osage]]}} {{legend\|#FF69B4\|[[#East Asian writing systems\|Hangul]]{{sup\|a}}}} ~~\| col2header = {{longitem\|[[Logogram\|Logographic]]<br />and [[Syllabary\|Syllabic]]}}~~ \| col2 = {{legend\|#~~EE1C25~~00FFFF\|[[#~~East Asian writing systems~~Mongolian\|~~Hanzi~~Mongolian]] ~~{{smaller\|[L]}}~~}} {{legend\|#~~C71585~~0090DA\|[[#~~East~~Other ~~Asian~~African ~~writing systems~~scripts\|~~Kana~~Neo-Tifinagh]]}} {{~~smaller~~legend\|#97CBFF\|~~[S]}}{{\}}~~[[#~~East~~Other ~~Asian~~American ~~writing systems~~scripts\|~~Kanji~~Osage]~~] {{smaller\|[L~~]~~}}{{nbsp\|2}}~~}} {{legend\|#FF69B4\|[[#East Asian writing systems\|~~Hanja~~Hangul]]{{sup\|~~b}} {{smaller\|[L]}}}} {{legend\|#8B008B\|[[#Other American scripts\|Cherokee]] {{smaller\|[S]~~a}}}} \| col3header = {{longitem\|[[~~Abjad~~Logogram\|Logographic]]<br />and [[Syllabary\|Syllabic]]}} \| col3 = {{legend\|#EE1C25\|[[#East Asian writing systems\|Hanzi]] {{smaller\|[L]}}}} {{legend\|#C71585\|[[#East Asian writing systems\|Kana]] {{smaller\|[S]}}{{\}}[[#East Asian writing systems\|Kanji]] {{smaller\|[L]}}{{nbsp\|2}}}} {{legend\|#FF69B4\|[[#East Asian writing systems\|Hanja]]{{sup\|b}} {{smaller\|[L]}}}} {{legend\|#8B008B\|[[#Other American scripts\|Cherokee]] {{smaller\|[S]}}}} ~~\| col3 = {{legend\|#006624\|[[#Semitic languages\|Arabic]]}} {{legend\|#38B800\|[[#Semitic languages\|Hebrew]]}}~~ \| col4header = [[~~Abugida~~Abjad]] \| col4 = {{legend\|#~~F4C430~~006624\|[[#~~Brahmic~~Semitic ~~(Indic) scripts~~languages\|~~North Indic~~Arabic]]}} {{legend\|#~~FF671F~~38B800\|[[#~~Brahmic~~Semitic ~~(Indic) scripts~~languages\|~~South Indic~~Hebrew]]}} ~~{{legend\|#A0522D~~ \| col5header = [[Abugida]] \| col5 = {{legend\|#F4C430\|[[#Brahmic (Indic) scripts\|North Indic]]}} {{legend\|#FF671F\|[[#Brahmic (Indic) scripts\|South Indic]]}} {{legend\|#A0522D \|[[#African scripts\|Ethiopic]]}} {{legend\|#B8860B\|[[#Thaana\|Thaana]]}} {{legend\|#F0E68C\|[[#American scripts\|Canadian syllabics]]}} \| below = {{nowrap\|{{sup\|a}} [[Featural alphabet\|Featural-alphabetic]].{{nbsp\|3}}{{sup\|b}} Limited.}} Line 38 ⟶ 40: where ''name'' is the case-sensitive name of the entity. The semicolon is required. Because numbers are harder for humans to remember than names, character entity references are most often written by humans, while numeric character references are most often produced by computer programs.<ref>{{Cite book\|last=Carey\|first=Patrick~~\|url=https://www.worldcat.org/oclc/904969019~~\|title=New perspectives on XML : comprehensive\|date=2015\|others=Sasha Vodnik\|isbn=978-1-285-07582-2\|edition=3rd\|___location=\|pages=36\|publisher=Cengage Learning \|oclc=904969019}}</ref> == Control codes == Line 451 ⟶ 453: Footnotes: :<sup>1</sup> Control-C has typically been used as a "break" or "interrupt" key. :<sup>2</sup> Control-D has been used to signal "end of file" for text typed in at the terminal on Unix / Linux systems. Windows, ~~DOS~~MsDOS, and older minicomputers used Control-Z for this purpose. :<sup>3</sup> Control-G is an artifact of the days when [[teletype]]s were in use. Important messages could be signalled by striking the bell on the teletype. This was carried over on PCs by generating a buzz sound. :<sup>4</sup> Line feed is used for "end of line" in text files on Unix / Linux systems. :<sup>5</sup> Carriage Return (accompanied by line feed, and thus usually written as 'CRLF') is used as "end of line" character by Windows, ~~DOS~~MsDOS, and most minicomputers other than Unix- / Linux-based systems. [[Classic Mac OS]] and other vintage OS used CR only. :<sup>6</sup> Control-O has been the "discard output" key. Output is not sent to the terminal, but discarded, until another Control-o is typed. :<sup>7</sup> Control-Q has been used to tell a host computer to resume sending output after it was stopped by Control-S. Line 460 ⟶ 462: :<sup>9</sup> Control-U was originally used by [[Digital Equipment Corporation]] computers to cancel the current line of typed-in text. Other manufacturers used Control-X for this purpose. :<sup>10</sup> Control-X was commonly used to cancel a line of input typed in at the terminal. :<sup>11</sup> Control-Z has commonly been used on minicomputers, Windows and ~~DOS~~MsDOS systems to indicate "end of file" either on a terminal or in a text file. Unix / Linux systems use Control-D to indicate end-of-file at a terminal. == Latin script == Line 1,221 ⟶ 1,223: \| <small>0302 0246</small> \| &brvbar; \|[[~~Vertical bar\|~~Broken bar]] \| 0102 \|- Line 1,253 ⟶ 1,255: \| <small>0302 0252</small> \| &ordf; \|style="background: #ffebeb;"\|[[Ordinal indicator#~~Italian, Portuguese, and Spanish~~Usage\|Feminine Ordinal Indicator]] \| 0106 \|- Line 1,381 ⟶ 1,383: \| <small>0302 0272</small> \| &ordm; \|style="background: #ffebeb;"\|[[Ordinal indicator#~~Italian, Portuguese, and Spanish~~Usage\|Masculine ordinal indicator]] \| 0122 \|- Line 8,301 ⟶ 8,303: == Armenian == {{Main\|Armenian (Unicode block)}} {{Unicode chart Armenian~~\|fs=fgd\|x=xf\|dg=dd~~}} == Semitic languages == Line 9,231 ⟶ 9,233: == Dingbats == {{Main\|~~Dingbat\|l1=Dingbat~~Dingbats (Unicode block)}} {\| class="wikitable sortable" \|- Line 10,185 ⟶ 10,187: === Chess Symbols === {{Main\|Chess Symbols (Unicode block)}} {{Unicode chart ~~Chess Symbols~~chess}} == Special areas and format characters ==