Code page 949 (IBM): Difference between revisions

Line 344:
When mapped to Unicode, 0xC9A1–C9FE (between the syllable and hanja ranges) are mapped to the Unicode [[Private Use Areas|Private Use Area]] code points U+E000–E05D, while 0xFEA1–FEFE (between the end of the hanja range and the end of the plane) are mapped to U+E05E–E0BB. Outside the Wansung plane, 0x8FA1–9AA5 (where the second byte is in the range 0xA1–FE) are mapped to the Private Use Area code points U+E0BC–E4CA.<ref name="icu"/> The last of these ranges cuts into the start of the [[#0x9A|0x9A row]] (shown below).
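
The arithmetic behind these three ranges can be sketched in a few lines of Python. This is only an illustration, not code from ICU or IBM: the function name <code>ibm949_udc_to_pua</code> is hypothetical, and the sketch assumes that each range is filled linearly in row-major order (94 trail bytes per lead byte), which is consistent with the range sizes given above but should be checked against the mapping table itself.

```python
from typing import Optional

TRAIL_FIRST = 0xA1   # first trail byte of a row
TRAIL_LAST = 0xFE    # last trail byte of a row
ROW_SIZE = TRAIL_LAST - TRAIL_FIRST + 1  # 94 cells per lead byte


def ibm949_udc_to_pua(lead: int, trail: int) -> Optional[int]:
    """Return the PUA code point for a user-defined byte pair, else None.

    Hypothetical helper: assumes a linear, row-major assignment
    within each of the three ranges described in the text.
    """
    if not TRAIL_FIRST <= trail <= TRAIL_LAST:
        return None
    col = trail - TRAIL_FIRST
    if lead == 0xC9:                      # 0xC9A1-C9FE -> U+E000-E05D
        return 0xE000 + col
    if lead == 0xFE:                      # 0xFEA1-FEFE -> U+E05E-E0BB
        return 0xE05E + col
    # 0x8FA1-9AA5 -> U+E0BC-E4CA; only the first five cells of row 0x9A
    if 0x8F <= lead <= 0x99 or (lead == 0x9A and trail <= 0xA5):
        return 0xE0BC + (lead - 0x8F) * ROW_SIZE + col
    return None
```

Under that assumption, <code>ibm949_udc_to_pua(0x9A, 0xA5)</code> yields 0xE4CA, the last code point of the third range, and byte pairs outside the three ranges return <code>None</code>.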
 
Collectively, these private use ranges cover the code points U+E000..E4CA, allowing 1227 user-defined characters (UDC) to be mapped from IBM-949 to Unicode.<ref name="ucm"/> The separate private use area range U+F843..F86E is used by IBM to map some characters within the extended hanja range.<ref name="ucm"/> This follows early recommendations from the Unicode Consortium that corporate characters be allocated from U+F8FF downward and user-defined characters be allocated from U+E000 upward,<ref>{{cite book |section-url=https://www.unicode.org/versions/Unicode1.1.0/ch02.pdf |section=2.0: Changes in Unicode 1.0 |title=The Unicode Standard, Version 1.1 |id=UTR #4 |publisher=[[Unicode Consortium]] |pages=3–4}}</ref> and is part of a larger corporate private use area scheme which is defined internally by IBM, and which includes 192 characters and three unused positions in the range U+F83D..F8FF.<ref name="ibmpua">{{cite web |archive-url=https://web.archive.org/web/20150916190822/http://www-01.ibm.com/software/globalization/cp/cp01449.html |archive-date=2015-09-16 |url=http://www-01.ibm.com/software/globalization/cp/cp01449.html |url-status=dead |title=CPGID 01449: IBM default PUA |work=IBM Globalization: Code page identifiers |publisher=[[IBM]] |quotation=IBM has designated 195 positions from U+F83D to U+F8FF for use as IBM Corporate-zone and intends to use them consistently within IBM whenever there is a need to maintain the round-trip integrity of IBM characters. […] At present CS 3099 containing 192 IBM {{sic|Corportate}} characters has been defined.}}</ref>
 
=== {{anchor|0x9A}}Lead bytes 0x9A–9D (extended symbols and hanja) ===