Code page: Difference between revisions

Content deleted Content added
Removing link(s) to deleted page Code page 778
 
(13 intermediate revisions by 4 users not shown)
Line 275:
* [[Code page 301|301]] – IBM-PC Japan (Kanji) DBCS
* [[Code page 437|437]] – Original IBM PC hardware code page
* [[Code page 720|720]] – Arabic (Transparent ASMO)
* [[Code page 737|737]] – [[Greek language|Greek]]
* [[Code page 775|775]] – Latin-7
* [[Code page 808|808]] – Russian with euro (same without euro: [[Code page 866|866]])
* [[Code page 848|848]] – Ukrainian with euro (same without euro: [[Code page 1125|1125]])
* [[Code page 849|849]] – Belarusian with euro (same without euro: [[Code page 1131|1131]])
* [[Code page 850|850]] – Latin-1
* [[Code page 851|851]] – Greek
* [[Code page 852|852]] – Latin-2
* 853 – Latin-3
* [[Code page 855|855]] – [[Cyrillic script|Cyrillic]] (same with euro: [[Code page 872|872]])
* [[Code page 856|856]] – [[Hebrew alphabet|Hebrew]]
* [[Code page 857|857]] – Latin-5
* [[Code page 858|858]] – Latin-1 with [[euro]] symbol
* [[Code page 859|859]] – Latin-9
* 860 – [[Portuguese language|Portuguese]]
* [[Code page 861|861]] – [[Icelandic language|Icelandic]]
Line 300:
* [[Code page 868|868]] – [[Urdu language|Urdu]]
* [[Code page 869|869]] – [[Greek alphabet|Greek]]
* [[Code page 872|872]] – Cyrillic with euro (same without euro: [[Code page 855|855]])
* [[Code page 874|874]] – Thai with Low Tone Marks & Ancient Chars (conflictive ID with Windows 874; version with euro: [[Code page 1161|1161]] Windows version: is IBM [[Code page 1162|1162]])<!-- Attention! Neither IBM 874 nor Windows 874 are rigorously the same as ISO 8859-11 / TIS 620-2533 ISO 8859-11 is probably IBM 873-->
* [[Code page 876|876]] – OCR A
* [[Code page 877|877]] – OCR B
* [[Code page 878|878]] – [[KOI8-R]]
* [[Code page 891|891]] – Korean PC SBCS
* [[Code page 898|898]] – IBM-PC WP Multilingual
* [[Code page 899|899]] – IBM-PC Symbol
* [[Code page 903|903]] – Simplified Chinese PC SBCS
* [[Code page 904|904]] – Traditional Chinese PC SBCS
* [[Code page 906|906]] – International Set #5 3812/3820
* [[Code page 907|907]] – ASCII APL (3812)
* [[Code page 909|909]] – IBM-PC APL2 Extended
Line 330:
* [[Code page 949 (IBM)|949]] – Korean (Extended Wansung (ks_c_5601-1987)) ([[Code page 1088|1088]] + [[Code page 951|951]]) (conflictive ID with Windows 949 (Unified Hangul Code); Windows version is IBM 1363)
* [[Code page 951|951]] – Korean DBCS (IBM KS Code) (conflictive ID with Windows 951, a hack of Windows 950 with Unicode mappings for some PUA Unicode characters found in HKSCS, based on the file name)
* [[Code page 1034|1034]] – Printer Application - Shipping Label, Set #2
* [[Code page 1040|1040]] – Korean Extended
* [[Code page 1041|1041]] – Japanese Extended (JIS X 0201 Extended)
* [[Code page 1042|1042]] – Simplified Chinese Extended
* [[Code page 1043|1043]] – Traditional Chinese Extended
* [[Code page 1044|1044]] – Printer Application - Shipping Label, Set #1
* [[Code page 1086|1086]] – IBM-PC Japan #1
* [[Code page 1088|1088]] – Revised Korean (SBCS)
* [[Code page 1092|1092]] – IBM-PC Modified Symbols
* [[Code page 1098|1098]] – [[Persian language|Farsi]]
* [[Code page 1108|1108]] – DITROFF Base Compatibility
* [[Code page 1109|1109]] – DITROFF Specials Compatibility
* [[Code page 1115|1115]] – IBM-PC People's Republic of China
* [[Code page 1116|1116]] – Estonian
Line 447:
* [[Code page 1126|1126]] – IBM-PC Korean SBCS
* [[Code page 1162|1162]] – Windows Thai (Extension of [[Code page 874|874]]; but still called that in Windows)
* [[Code page 1169|1169]] – Windows Cyrillic Asian
* [[Code page 1174|1174]] – Windows Kazakh<ref name="Kazakh_1174"/><!-- Is it one of these (https://web.archive.org/web/20190419163948/http://www.sci.kz/~sairan/kazcode/ — check the tables 5.2.1)? -->
* [[Code page 1250|1250]] – Windows [[Central Europe]]
* [[Code page 1251|1251]] – Windows [[Cyrillic script|Cyrillic]]
Line 578:
{{Div col|colwidth=30em}}
* [[Code page 932 (Microsoft Windows)|932]] – Supports [[Japanese writing system|Japanese]] [[Shift-JIS]]
* [[Code page 936 (Microsoft Windows)|936]] – Supports [[Simplified Chinese characters|Simplified Chinese]] [[GB2312]] or [[GBK (character encoding)|GBK]]
* [[Unified Hangul Code|949]] – Supports [[Hangul|Korean]] Unified Hangul Code
* [[Code page 950|950]] – Supports [[Traditional Chinese characters|Traditional Chinese]] [[Big5]]
** [[Code page 950|951]] – Supports [[Traditional Chinese characters|Traditional Chinese]] [[Big5]] with [[HKSCS]]
 
{{div col end}}
 
Line 587 ⟶ 589:
 
{{Div col|colwidth=30em}}
* [[Code page 708|708]] – Arabic (ASMO 708)
* [[Code page 720|720]] – Arabic (Transparent ASMO)
* [[Code page 709|709]] – Arabic ([[Code page ASMO449+|ASMO 449+]]/BCON V4)<!-- not sure if available in any DOS -->
* [[Code page 710|710]] – Arabic (Transparent Arabic)<!-- not sure if available in any DOS -->
* [[Code page 720|720]] – Arabic (Transparent ASMO)
* [[Code page 737|737]] – [[Greek language|Greek]]
* [[Code page 850|850]] – Latin-1
* [[Code page 851|851]] – Greek
* [[Code page 852|852]] – Latin-2
* [[Code page 855|855]] – [[Cyrillic script|Cyrillic]]
* [[Code page 857|857]] – Latin-5
* [[Code page 858|858]] – Latin-1 with [[euro]] symbol
* [[Code page 859|859]] – Latin-9
* 860 – [[Portuguese language|Portuguese]]
* [[Code page 861|861]] – [[Icelandic language|Icelandic]]
Line 732:
* Symbol Set 8V — HP Arabic-8<!-- Contradictory sources about "Arabic-8"; http://h30434.www3.hp.com/t5/Printer-Software-and-Drivers/Arabic-fonts-on-Network-Printers/td-p/2231625 and http://printronix.com/emea/wp-content/uploads/manuals/PTX_PRM_ACA_P8_258187a.pdf -->
* Symbol Set 9K — HP Korean-8<!-- (ASCII + Jamo Code Table?) -->
* Symbol Set 9T — PC 8T (also known as Code Page 437-T; this is '''not''' [[code page 857]])
* Symbol Set 9V — Latin / Arabic for Windows (this is '''not''' [[code page 1256]])
* Symbol Set 11U — PC 8D/N (also known as Code Page 437-N; coded by IBM as [[code page 1058]]; this is '''not''' [[code page 865]])
Line 785:
* Symbol Set 9R — Windows 98 Cyrillic (Practically the same as [[code page 1251]])
* Symbol Set 9U — Windows 3.0
* Symbol Set 10G — PC-851 Latin/Greek (Practically the same as [[code page 851]])
* Symbol Set 10J — PS Text (Practically the same as [[PostScript Standard Encoding|Adobe Standard]])
* Symbol Set 10L — PS ITC Zapf Dingbats (Practically the same as [[Adobe Dingbats]])
* Symbol Set 10N — ISO 8859-5 Latin/Cyrillic (1988 version — IR 144)
* Symbol Set 10R — PC-855 Cyrillic (Practically the same as [[code page 855]])
* Symbol Set 10T — Teletex<!-- (CCITT T.61?) -->
* Symbol Set 10U — PC-8 (Practically the same as [[code page 437]]; coded by IBM as code page 1057)
* Symbol Set 10V — CP-864 (Practically the same as [[code page 864]])
* Symbol Set 11G — CP-869 (Practically the same as [[code page 869]])
* Symbol Set 11J — PS ISO Latin-1 (Practically the same as [[Adobe Latin-1]])
* Symbol Set 11N — ISO 8859-6 Latin/Arabic
* Symbol Set 12G — PC Latin/Greek (Practically the same as [[code page 737]])
* Symbol Set 12J — MC Text (Practically the same as [[Mac OS Roman|Macintosh Roman]])
* Symbol Set 12N — ISO 8859-7 Latin/Greek
* Symbol Set 12R — PC Gost (Practically the same as [[PC GOST Main character set|PC GOST Main]])
* Symbol Set 12U — PC-850 Latin 1 (Practically the same as [[code page 850]])
* Symbol Set 13J — Ventura International
Line 809:
* Symbol Set 14R — PC Ukrainian (Practically the same as [[RUSCII]])
* Symbol Set 15H — PC-862 Israel (Practically the same as [[code page 862]])
* Symbol Set 16U — PC-857 Latin 5 (Practically the same as [[code page 857]])
* Symbol Set 17U — PC-852 Latin 2 (Practically the same as [[code page 852]])
* Symbol Set 18N — [[UTF-8]]
* Symbol Set 18U — PC-853 Latin 3 (Practically the same as code page 853)
Line 821:
* Symbol Set 24Q — PC-Polish Mazowia (Practically the same as [[Mazovia encoding]])
* Symbol Set 25U — PC-865 Denmark/Norway (Practically the same as [[code page 865]])
* Symbol Set 26U — PC-775 Latin 7 (Practically the same as [[code page 775]])
* Symbol Set 27Q — PC-8 PC Nova (Practically the same as [[code page 999|PC Nova]])
* Symbol Set 27U — PC Latvian Russian (also known as 866-Latvian)
* Symbol Set 28U — PC Lithuanian/Russian (Practically the same as [[code page 774]])
Line 835:
{{Div col|colwidth=30em}}
* [[Code page 100|100]] – DOS Hebrew hardware fontpage (Not from IBM; [[Hebrew MS-DOS|HDOS]])<ref name="Paul_2002"/>
* [[Code page 111|111]] – DOS Greek (Not from IBM; [[AST Premium Exec DOS 5.0]]<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>)
* [[Code page 112|112]] – DOS Turkish (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>)
* [[Code page 113|113]] – DOS Yugoslavian (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>)
* [[Code page 151|151]] – DOS Nafitha Arabic (Not from IBM; [[Arabic MS-DOS|ADOS]])<!-- EPROM fontpage -->
* [[Code page 152|152]] – DOS Nafitha Arabic (Not from IBM; [[Arabic MS-DOS|ADOS]])<!-- EPROM fontpage -->
* [[Code page 161|161]] – DOS [[Arabic language|Arabic]] (Not from IBM; [[Arabic MS-DOS|ADOS]])<ref name="Paul_2002"/>
* [[Code page 162|162]] – DOS Arabic with vowel diacritics (Not from IBM; ADOS)
* [[Code page 163|163]] – DOS Arabic and French (Not from IBM; ADOS)<ref name="Paul_2002"/>
* [[Code page 164|164]] – DOS Arabic and French with vowel diacritics (Not from IBM; ADOS)
* [[Code page 165|165]] – DOS Arabic (864 Extended) (Not from IBM; ADOS)<ref name="Paul_2002"/>
* [[Code page 166|166]] – IBM Arabic PC (ADOS)<!-- hardware fontpage --><ref name="Paul_2002"/>
* [[Code page 437|190]] – DEC DOS German (appears to be identical to Code page 437)
* [[Code page 210|210]] – DEC DOS Greek (NEC Jetmate printers)
* 220 – DEC DOS Spanish (Not from IBM)
* [[Code page 489|489]] – Czechoslovakian [OCR software 1993]
* [[Code page 620|620]] – DOS [[Mazovia encoding|Polish (Mazovia)]] (Not from IBM)<!-- Fido Mazowia? Variant with characters "Ć" and "ć" in positions 80 and 87? -->
* [[Code page 667|667]] – DOS [[Mazovia encoding|Polish (Mazovia)]] (Not from IBM)
* [[Code page 668|668]] – DOS Polish<!--Different than Mazovia! --> (Not from IBM)
* [[Code page 706|706]] – MS-DOS Server Arabic Sakhr (Not from IBM; [[Sakhr Computers|Sakhr Software]] from [[MSX]] Computers)<!--Not to be confused with Arabic Sakr, below. -->
* [[Code page 707|707]] – MS-DOS Arabic Sakhr (Not from IBM; [[Sakhr Computers|Sakhr Software]] from [[MSX]] Computers)<!--Not to be confused with Arabic Sakr, below. -->
* 709 – MS-DOS Arabic ([[Code page 711ASMO449+|711ASMO 449+]]/BCON – MSV4)<!--DOS Arabicnot Nafithasure Enhancedif (Notavailable fromin IBM)any DOS -->
* [[Code page 714|714]]710 – MS-DOS Arabic Sakr (NotTransparent from IBMArabic)<!--Not tonot besure confusedif withavailable Arabicin Sakhr,any above.DOS -->
* [[Code page 715|715]]711 – MS-DOS Arabic APTECNafitha Enhanced (Not from IBM)
* [[Code page 721|721]]714 – MS-DOS Arabic Nafitha InternationalSakr (Not from IBM)<!--Not to be confused with Arabic Sakhr, above. -->
* [[Code715 page 768|768]] –MS-DOS Arabic Al-ArabiAPTEC (Not from IBM)
* 721 – MS-DOS Arabic Nafitha International (Not from IBM)
* [[Code page 770|770]] – DOS Estonian, Latvian, Lithuanian<ref name="CP770"/> (From Lithuanian Lika Software;<ref name="lika"/> Lithuanian RST 1095-89 National Standard)
* 768 – Arabic Al-Arabi (Not from IBM)
* [[Code page 770|770]] – DOS Estonian, Latvian, Lithuanian<ref name="CP770"/> (From Lithuanian Lika Software;<ref name="lika"/> Lithuanian RST 1095-89 National Standard)
* [[Code page 771|771]] – DOS Lithuanian/Cyrillic — KBL<ref name="CP771"/> (From Lithuanian Lika Software<ref name="lika"/>)
* [[Code page 772|772]] – DOS Lithuanian/Cyrillic<ref name="CP772"/> (From Lithuanian Lika Software;<ref name="lika"/> Lithuanian LST 1284:1993 National Standard; adopted by IBM as [[code page 1119]])
* 773 – DOS Latin-7 — KBL (From Lithuanian Lika Software)
* [[Code page 774|774]] – DOS Lithuanian<ref name="CP774"/> (From Lithuanian Lika Software;<ref name="lika"/> Lithuanian LST 1283:1993 National Standard; adopted by IBM as [[code page 1118]])
* [[Code page 775|775]] – DOS Latin-7 Baltic Rim (From Lithuanian Lika Software;<ref name="lika"/> Lithuanian LST 1590-1 National Standard; adopted by IBM and Microsoft as [[code page 775]])
* 776 – DOS Lithuanian (extended CP770)<ref name="lithuanian-charsets"/> (From Lithuanian Lika Software<ref name="lika"/>)
* 777 – DOS Accented Lithuanian (old) (extended CP773) — KBL<ref name="lithuanian-charsets"/> (From Lithuanian Lika Software<ref name="lika"/>)
* 778 – DOS Accented Lithuanian (extended CP775)<ref name="lithuanian-charsets"/> (From Lithuanian Lika Software<ref name="lika"/>)
* [[Code page 790|790]] – DOS [[Mazovia encoding|Polish (Mazovia)]] with curly quotation marks
* [[Code page 854|854]] – Spanish<ref name="Hogan_1992_REF-DE"/><ref name="Paul_2001_CODEPAGE"/><!-- Cites the Hogan book for CP854 as well since 1995. No other source found so far. May actually have been the Latin 4 code page! -->
* [[Code page 881|881]] – Latin 1 (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM [[EBCDIC 881]])
* [[Code page 882|882]] – Latin 2 (ISO 8859-2) (Not from IBM; same as Code page 912; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM [[EBCDIC 882]])
* [[Code page 883|883]] – Latin 3 (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM [[EBCDIC 883]])
* [[Code page 884|884]] – Latin 4 (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM [[EBCDIC 884]])
* [[Code page 885|885]] – Latin 5 (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM [[EBCDIC 885]])
* [[Code page 895|895]] – [[Kamenický encoding|Czech (Kamenický)]], (Not from IBM; conflictive ID with IBM CP895 — 7-bit EUC Japanese Roman)
* [[Mazovia encoding|896]] – DOS [[Mazovia encoding|Polish (Mazovia)]] (Not from IBM; conflictive ID with IBM CP896 — 7-bit EUC Japanese Katakana)<!-- Variant with the character "zł" in position 9B? -->
Line 891 ⟶ 893:
* [[Code page 1116|3001]] – Estonian 1 (on Star<ref name="star"/> printers); same as code page 1116
* [[Code page 922|3002]] – Estonian 2 (on Star<ref name="star"/> printers); same as code page 922
* [[Code page 437-Latvian|3011]] – Latvian 1 (on Star<ref name="star"/> printers); same as code page 437-Latvian
* [[Code page 866-Latvian|3012]] – Latvian-2 (on Star<ref name="star"/> printers); same as code page 866-Latvian (Latvian RST 1040-90 National Standard)
* [[MIK (character set)|3021]] – Bulgarian (on Star<ref name="star"/> printers); same as MIK
Line 925 ⟶ 927:
! ID !! Names !! Description !! Origin !! Platform !! DOS !! OS/2 !! Windows !! Mac !! Else !! Encoding !! Comment
|-
| 0 || {{N/A}} || Reserved || IBM, Microsoft || {{N/A}} || 3.3+ || 1.0+ || ?{{dunno}} || ?{{dunno}} || ?{{dunno}} || || Internal OS use<ref name="Paul_2002"/>
|-
| 437 || CP437, IBM437 || PC US || IBM<ref name="CP437"/> || IBM PC || 3.3+ || 1.0+ || {{Yes}} || ?{{dunno}} || {{Yes}} || 8-bit [[SBCS]] ||
|-
| 57344&nbsp;- 61439 || {{N/A}} || Private use derivations || IBM || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{varies|various}} || Private use code page derivations (E000h-EFFFh)
|-
| 65280&nbsp;- 65533 || {{N/A}} || Private use definitions || IBM || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{varies|various}} || Private use code page definitions (FF00h-FFFDh)
|-
| 65534 || {{N/A}} || Reserved || IBM, Microsoft || {{N/A}} || ?{{dunno}} || ?{{dunno}} || ?{{dunno}} || ?{{dunno}} || ?{{dunno}} || {{varies|various}} || Internal OS use (FFFEh)
|-
| 65535 || {{N/A}} || Reserved || IBM, Microsoft || {{N/A}} || 3.3+ || 1.0+ || ?{{dunno}} || ?{{dunno}} || ?{{dunno}} || {{varies|various}} || Internal OS use (FFFFh)<ref name="Paul_2002"/>
|}
 
Line 1,012 ⟶ 1,014:
 
== External links ==
{{Wikibooks|Character Encodings/Code Tables}}
* [http://www.ibm.com/software/globalization/cdra/glossary.jsp#SPTGLCDPG IBM CDRA glossary]
* {{webarchive|url=https://web.archive.org/web/20160205110331/http://www-01.ibm.com/software/globalization/g11n-res.html|date=2016-02-05|title=IBM code pages}}
Line 1,020 ⟶ 1,023:
* [http://www.i18nguy.com/unicode/codepages.html Character Sets And Code Pages At The Push Of A Button]
* [https://docs.microsoft.com/en-us/windows-server/administration/windows-commands/chcp Microsoft Chcp command: Display and set the console active code page]
* [https://en.wikibooks.org/wiki/Character_Encodings/Code_Tables "Code Tables" ''Character Encoding'' Wikibook]
 
{{character encoding}}