→Character repertoire: Note private use area (code page 1449) mappings. |
Line 1,971:
Unicode does not encode the traditional [[underlining|underscored]] alphabetic characters included in some of the APL code pages; most APL implementations have removed or deprecated them. These were produced on APL printing terminals by over-striking a straight capital letter with an underscore character. Some code page tables simulate them with underlined italic markup and list no Unicode mappings for them.<ref name="tachyon310"/>
IBM assigns them GCGIDs of the form "LA480000" (which it names "A Line Below Capital/A Underscore (APL)"), "LB480000" ("B Line Below Capital/B Underscore (APL)") and so forth, under the "L" series used for Latin letters.<ref name="cs963" /> The number is even (48) rather than odd (47) because the letters are uppercase: compare SD110000 for a lone acute accent {{code|´}}, LA110000 for the lowercase {{code|á}}, and LA120000 for the uppercase {{code|Á}}.<ref name="cp1252">{{cite web |url=ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP01252.txt |title=Windows, Latin 1 |id=CPGID 01252 |publisher=[[IBM]]}}</ref> They are included in IBM's [[Private Use Areas|private use area]] scheme, encoded in reverse-alphabetical order at the odd-numbered code points from U+F8BF to U+F8F1.<ref name="unicodenam"/>
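The private use area layout described above can be sketched in a few lines of Python. This is an illustration only, not IBM's published table: the helper names are hypothetical, and the sketch assumes that the 26 letters occupy consecutive odd code points with no gaps, so that underscored A sits at U+F8F1 and underscored Z at U+F8BF. It also assumes the "and so forth" GCGID pattern extends uniformly from LA480000 through LZ480000.

```python
import string

# Assumption: reverse-alphabetical order means the highest code point in the
# stated range holds underscored A and the lowest holds underscored Z.
PUA_UNDERSCORED_A = 0xF8F1
PUA_UNDERSCORED_Z = 0xF8BF


def _check_letter(letter: str) -> None:
    """Reject anything that is not a single ASCII capital letter."""
    if len(letter) != 1 or letter not in string.ascii_uppercase:
        raise ValueError(f"expected one ASCII capital letter, got {letter!r}")


def underscored_pua(letter: str) -> int:
    """Return the assumed private use code point for an underscored capital.

    Each step forward in the alphabet moves two code points down, which
    keeps every result odd-numbered.
    """
    _check_letter(letter)
    return PUA_UNDERSCORED_A - 2 * (ord(letter) - ord("A"))


def underscored_gcgid(letter: str) -> str:
    """Return the GCGID under the assumed L<letter>480000 pattern."""
    _check_letter(letter)
    return f"L{letter}480000"


if __name__ == "__main__":
    # Print the whole assumed table, one letter per line.
    for letter in string.ascii_uppercase:
        print(f"{underscored_gcgid(letter)}  U+{underscored_pua(letter):04X}")
```

Under these assumptions the arithmetic covers exactly the stated range: 26 letters at a spacing of two code points span 0x32 positions, which is the distance from U+F8BF to U+F8F1.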
Comparable uses of 47 elsewhere include the "SD" (diacritic) series GCGID SD470000 for "Line Below/Discontinuous Underscore"<ref name="cs969">{{cite web |url=ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CS00969.txt |title=OCR B |id=GCSGID 00969 |publisher=[[IBM]]}}</ref> (i.e. [[macron below]], distinct from the ASCII underscore, which is SP090000, "Underline/Continuous Underscore"<ref name="cp1252"/>) and the "A" ([[Arabic script|Arabic letter]]) series GCGID AD470009 for the [[ḏāl]].<ref name="cp1256pdf">{{cite web |url=ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP01256.pdf |title=Windows, Arabic (PDF) |id=CPGID 01256 |publisher=[[IBM]]}}</ref> Unicode's [[Latin Extended Additional]] block includes the following capital "Line Below" characters with the macron below diacritic, for Semitic transcription (it includes a pre-composed ẖ only in lowercase):
|