Digital encoding of APL symbols: Difference between revisions

Content deleted Content added
rm stray }}
m Prefer HTTPS to FTP
Line 9:
{|class="wikitable collapsible"
|+"SL" (APL functional symbol) series GCGIDs
!GCGID<ref name="cs963pdf">{{cite web |url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CS00963.pdf |title=Graphic Escape APL2/TN - 138 (chart) |id=GCSGID 00963 |publisher=[[IBM]]}}</ref> !! IBM name<ref name="cs963">{{cite web |url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CS00963.txt |title=Graphic Escape APL2/TN - 138 |id=GCSGID 00963 |publisher=[[IBM]]}}</ref><ref name="slgcgidchart">{{cite web |url=http://www-01.ibm.com/software/globalization/gcgid/aplsym.html |url-status=dead |archive-url=https://web.archive.org/web/20160122051352/http://www-01.ibm.com/software/globalization/gcgid/aplsym.html |archive-date=2016-01-22 |title=Graphic character identifiers: Specials, APL symbols |publisher=[[IBM]]}}</ref> !! Unicode !! Notes and other mappings
|-
|SL010000 || Up Stile (APL) || {{unichar|2308|LEFT CEILING}} ||
Line 25:
|SL070000 || Delta Stile (APL) || {{unichar|234B|APL FUNCTIONAL SYMBOL DELTA STILE}} ||
|-
|SL080000 || Circle (APL) || {{unichar|25CB|WHITE CIRCLE}}<ref name="tachyon310"/><ref name="consortiumir68"/><ref name="x3270"/><ref name="dyalogsheet">{{cite web |url=https://docs.dyalog.com/17.0/CheatSheet%20-%20Nomenclature%20-%20Functions%20and%20Operators.pdf |title=Dyalog Nomenclature: Functions and Operators |website=Dyalog - Documentation Centre |publisher=Dyalog APL}}</ref> || This is SM750000 in a non-APL context, for example, in the [[C0 control codes|C0]] replacement graphics from [[code page 437]],<ref name="cpgid437">{{cite web |url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CP00437.txt |title=Personal Computer |id=CPGID 00437 |publisher=[[IBM]]}}</ref> which code pages 907, 909 and 910 inherit some or all of, retaining SM750000 in the C0 area and also including SL080000 outside of it.<ref name="cpgid907"/><ref name="cpgid909"/><ref name="cpgid910"/> Both map to U+25CB when APL is represented using Unicode characters, although SL080000 can be mapped to U+F890 in IBM's [[Private Use Areas|private use area]] scheme.<ref name="unicodenam">{{citation|mode=cs1 |title=unicode.nam: Allow the Unicode characters to be specified using either the IBM or PostScript like names. |author=IBM |author-link=IBM |date=1997}} (Included with {{citation|mode=cs2 |title=OS/2 Codepage and Keyboard Display Tools |last=Borgendale |first=Ken |url=http://www.borgendale.com/tools/tools.htm}})</ref> Compare SL590000 through SL620000 below.
|-
|SL090000 || Circle Stile (APL) || {{unichar|233D|APL FUNCTIONAL SYMBOL CIRCLE STILE}} ||
Line 123:
|SL580000 || Quote Dot (APL) || {{unichar|0021|EXCLAMATION MARK}}<ref name="mainframew"/><ref name="x3270"/><ref name="consortiumir68"/><ref name="dyalogsheet"/> || U+F88E in IBM's private use area scheme.<ref name="unicodenam"/> SP020000 ({{unichar|0021|EXCLAMATION MARK}} in a non-APL context) co-occurs at 0x5A in code page 293 (SL580000 is at 0xDB in code pages 293 and 310).<ref name="cp293txt"/> Tachyonsoft lists {{unichar|01C3|LATIN LETTER RETROFLEX CLICK}} for SL580000.<ref name="tachyon310"/>
|-
|SL590000 || Left Arrow (APL) || {{unichar|2190|LEFTWARDS ARROW}}<ref name="tachyon310"/><ref name="consortiumir68"/><ref name="x3270"/> ||rowspan=4| These arrows are SM300000, SM310000, SM320000 and SM330000 respectively in a non-APL context, for example, in the [[C0 control codes|C0]] replacement graphics from [[code page 437]],<ref name="cpgid437">{{cite web |url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CP00437.txt |title=Personal Computer |id=CPGID 00437 |publisher=[[IBM]]}}</ref> which code pages 907, 909 and 910 inherit some or all of. Their APL GCGIDs can be mapped to U+F88D, U+F88C, U+F88B and U+F88A respectively in IBM's private use area scheme.<ref name="unicodenam"/>
 
Code pages 907 and 910 keep the non-APL GCGIDs for the C0 replacements but use the APL GCGIDs where the arrows appear outside of the C0 area, while code page 909 uses the APL GCGIDs multiple times, both for the C0 replacements and for between one and two occurrences of each of these arrows outside of the C0 area.<ref name="cpgid907"/><ref name="cpgid909"/><ref name="cpgid910"/>
Line 188:
===EBCDIC code pages===
====Code page 293====
'''Code page 293''' ([[CCSID]] 293),<ref>{{cite web|title=CCSID 293 information document|archive-url=https://web.archive.org/web/20160327100212/http://www-01.ibm.com/software/globalization/ccsid/ccsid293.html|archive-date=2016-03-27|url=http://www-01.ibm.com/software/globalization/ccsid/ccsid293.html}}</ref> called "APL (USA)", is an EBCDIC code page which includes APL symbols, in addition to preserving the [[ISO basic Latin alphabet|basic Latin]] letters and [[Western Arabic numerals]] at their usual EBCDIC locations.<ref name="cp293txt">{{cite web |url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CP00293.txt |title=Code Page (CPGID) 00293: APL (USA) |work=REGISTRY, Graphic Character Sets and Code Pages |date=1996 |orig-year=1984 |author=IBM |author-link=IBM}}</ref><ref name="tachyon293">{{citation|mode=cs1 |url=http://www.tachyonsoft.com/cp00293.htm |title=Code Page 00293: EBCDIC APL |date=2007-07-31 |author=Tachyon Software}}</ref>
 
{|{{chset-table-header1|Code page 293<ref>{{Citation|title=Code Page CPGID 00293 (pdf)|url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CP00293.pdf|publisher=IBM}}</ref><ref name="cp293txt"/><ref name="tachyon293"/>}}
|-
|{{chset-left1|0x}}
Line 483:
 
====Code page 310====
'''Code page 310''' ("Graphic Escape APL/TN") includes a larger gamut of symbols, but does not itself include the basic Latin letters or the basic digits.<ref name="ibm">{{citation|mode=cs1 |url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CP00310.txt |title=Code Page (CPGID) 00310: Graphic Escape APL/TN |date=1987 |work=REGISTRY, Graphic Character Sets and Code Pages |author=IBM |author-link=IBM}}</ref><ref name="tachyon310">{{citation|mode=cs1 |url=http://www.tachyonsoft.com/cp00310.htm |title=Code Page 00310: 3270 APL Graphic Escape |date=2007-07-31 |author=Tachyon Software}}</ref> It is used alongside [[Code page 37#37-2|Code page 37-2]],<ref name="IBM_2010">{{citation |title=xlate - Transliterate Contents of Records |publisher=[[IBM Corporation]] |orig-year=1986 |date=2010 |url=http://vm.marist.edu/%7Epipeline/bhelp/ACH0275.html |access-date=2016-10-18 |archive-date=2019-06-16 |archive-url=https://web.archive.org/web/20190616013546/http://vm.marist.edu/~pipeline/bhelp/ACH0275.html |url-status=live }}</ref> with the Code page 310 codes being prefixed by the Graphic Escape (EBCDIC 0x08)<ref>{{cite web |archive-url=https://web.archive.org/web/20180911044845/https://www-01.ibm.com/software/globalization/cdra/appendix_g1.html |archive-date=2018-09-11 |url=https://www-01.ibm.com/software/globalization/cdra/appendix_g1.html |title=Appendix G-1. EBCDIC control character definitions. |work=IBM Globalization |author=IBM |author-link=IBM}}</ref> control character.<ref name="x3270">{{cite web |url=http://x3270.bgp.nu/Charset.html |title=x3270 Character Set |work=x3270 Documentation |archive-url=https://web.archive.org/web/20181110212110/http://x3270.bgp.nu/Charset.html |archive-date=2018-11-10}}</ref><ref name="ibm3174">{{cite web |url=http://www.bitsavers.org/pdf/ibm/3174/GA27-3831-02_3174_Character_Set_Reference_199003.pdf#page=147 |title=Chapter 5. 3270 Display Station and Printer Code Page Charts (§ APL/APL2 Text Feature Code Page) |work=3174 Establishment Controller Character Set Reference |id=GA27-3831-02 |year=1990 |orig-year=1988 |edition=3rd |publisher=[[IBM]] |page=5·6}}</ref>
 
{|{{chset-table-header1|Code page 310 (prefixed with 0x08)<ref name="ibmpdf">{{Citation|title=Code Page CPGID 00310 (pdf)|url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CP00310.pdf|publisher=IBM}}</ref><ref name="ibm"/><ref name="tachyon310"/><ref name="x3270"/>{{efn|Documented mappings vary.<ref name="tachyon310"/><ref name="x3270"/><ref name="ibmpdf"/><ref name="mainframew">{{cite web |archive-url=https://web.archive.org/web/20160827044130/http://mainframe.wiki/ebcdicTbl.php |archive-date=2016-08-27 |url=http://mainframe.wiki/ebcdicTbl.php |title=EBCDIC and ASCII Tables}}</ref>|name=different}}}}
|-
|{{chset-left1|0x}}
Line 1,971:
Missing from Unicode are the traditional [[underlining|underscored]] alphabetic characters included in some of the APL code pages; their usage has been eliminated or deprecated in most APL implementations. These were produced on APL printing terminals by over-striking a straight capital letter with an underscore character. Some tables show them simulated with underlined and italic markup, not listing Unicode mappings.<ref name="tachyon310"/>
 
IBM assigns them GCGIDs as "LA480000" (which they name "A Line Below Capital/A Underscore (APL)"), "LB480000" ("B Line Below Capital/B Underscore (APL)") and so forth, under the "L" series used for Latin letters.<ref name="cs963" /> The use of an even number (48) rather than an odd number (47) is due to being uppercase: compare the use of SD110000 for a lone acute accent {{code|´}}, LA110000 for the lowercase {{code|á}}, and LA120000 for the uppercase {{code|Á}}.<ref name="cp1252">{{cite web |url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CP01252.txt |title=Windows, Latin 1 |id=CPGID 01252 |publisher=[[IBM]]}}</ref> They are included in IBM's [[Private Use Areas|private use area]] scheme, encoded in reverse‑alphabetical order in the odd-numbered code points from U+F8BF to U+F8F1.<ref name="unicodenam"/>
 
Homologous uses of 47 include the "SD" (diacritic) series GCGID SD470000 for "Line Below/Discontinuous Underscore"<ref name="cs969">{{cite web |url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CS00969.txt |title=OCR B |id=GCSGID 00969 |publisher=[[IBM]]}}</ref>—i.e. [[macron below]], distinct from the ASCII underscore which is SP090000 ("Underline/Continuous Underscore")<ref name="cp1252"/>—and the "A" ([[Arabic script|Arabic letter]]) series GCGID AD470009 for the [[ḏāl]],<ref name="cp1256pdf">{{cite web |url=ftphttps://ftppublic.softwaredhe.ibm.com/software/globalization/gcoc/attachments/CP01256.pdf |title=Windows, Arabic (PDF) |id=CPGID 01256 |publisher=[[IBM]]}}</ref> for example. Unicode's [[Latin Extended Additional]] block includes the following capital "Line Below" characters with the macron below diacritic, for Semitic transcription (it includes a pre-composed ẖ only in lowercase):
 
* {{unichar|1E06|LATIN CAPITAL LETTER B WITH LINE BELOW}}