Halfwidth and Fullwidth Forms (Unicode block): Difference between revisions

m remove un-needed options from tables
m Block: {{not a typo}}
 
(18 intermediate revisions by 13 users not shown)
Line 9:
|symbols = Variant width characters
|1_0_0 = 216
|1_0_11_1 = 7
|3_2 = 2
|note = <ref>{{cite web|url=https://www.unicode.org/versions/Unicode1.0.0/Notice.pdf|title=Unicode 1.0.1 Addendum|work=The Unicode Standard|date=1992-11-03|access-date=2016-07-09|url-status=live|archive-url=https://web.archive.org/web/20160702004420/http://www.unicode.org/versions/Unicode1.0.0/Notice.pdf|archive-date=2016-07-02|df=}}</ref><ref>{{cite web|url=https://www.unicode.org/ucd/|title=Unicode character database|work=The Unicode Standard|access-date=2023-07-09|url-status=live|archive-url=https://web.archive.org/web/20160710080729/http://www.unicode.org/|archive-date=2016-07-10|df=26}}</ref><ref>{{cite web|url=https://www.unicode.org/versions/enumeratedversions.html|title=Enumerated Versions of The Unicode Standard|work=The Unicode Standard|access-date=2023-07-09|url-status=live|archive-url=https://web.archive.org/web/20160629001311/http://www.unicode.org/versions/enumeratedversions.html|archive-date=2016-06-29|df=26}}</ref>
}}
 
'''Halfwidth and Fullwidth Forms''' is the name of a [[Unicode block]] U+FF00&ndash;FFEF, provided so that older encodings containing both [[Halfwidth and fullwidth forms|halfwidth and fullwidth]] characters can have lossless translation to/from Unicode. It is the second-to-last block of the [[Basic Multilingual Plane]], followed only by the short [[Specials (Unicode block)|Specials]] block at U+FFF0&ndash;FFFF. Its block name in Unicode 1.0 was '''Halfwidth and Fullwidth Variants'''.<ref>{{cite web |url=https://www.unicode.org/versions/Unicode1.0.0/CodeCharts2.pdf |work=The Unicode Standard |version=version 1.0 |title=3.8: Block-by-Block Charts |publisher=[[Unicode Consortium]]}}</ref>
 
Range U+FF01&ndash;FF5E reproduces the characters of [[ASCII]] 21 to 7E as fullwidth forms. U+FF00 does not correspond to a fullwidth ASCII 20 (space character), since that role is already fulfilled by U+3000 "[[ideographic space]]".
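Because the fullwidth range mirrors ASCII 21 to 7E in order, the correspondence amounts to a constant code point offset (FF01 − 21 = FEE0, hexadecimal), with the space handled separately through U+3000. The following is a minimal illustrative sketch of that arithmetic; the function names <code>to_fullwidth</code> and <code>to_halfwidth</code> are hypothetical and not part of any standard library.

```python
# Offset between an ASCII character and its fullwidth form: 0xFF01 - 0x21.
FULLWIDTH_OFFSET = 0xFEE0
IDEOGRAPHIC_SPACE = "\u3000"  # plays the role of a fullwidth space


def to_fullwidth(text: str) -> str:
    """Map ASCII 0x21..0x7E to U+FF01..U+FF5E and space to U+3000."""
    out = []
    for ch in text:
        cp = ord(ch)
        if cp == 0x20:
            out.append(IDEOGRAPHIC_SPACE)
        elif 0x21 <= cp <= 0x7E:
            out.append(chr(cp + FULLWIDTH_OFFSET))
        else:
            out.append(ch)  # leave everything else untouched
    return "".join(out)


def to_halfwidth(text: str) -> str:
    """Inverse mapping: U+FF01..U+FF5E back to ASCII, U+3000 to space."""
    out = []
    for ch in text:
        cp = ord(ch)
        if ch == IDEOGRAPHIC_SPACE:
            out.append(" ")
        elif 0xFF01 <= cp <= 0xFF5E:
            out.append(chr(cp - FULLWIDTH_OFFSET))
        else:
            out.append(ch)
    return "".join(out)
```

For example, <code>to_fullwidth("A1!")</code> yields the three characters U+FF21, U+FF11 and U+FF01, and <code>to_halfwidth</code> reverses the conversion.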
 
Range U+FF61&ndash;FF9F encodes halfwidth forms of [[katakana]] and related punctuation in a transposition of A1 to DF in the [[JIS X 0201]] encoding – see [[half-width kana]].
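The transposition described above can be sketched as a fixed offset between the single-byte values A1 to DF and the code points U+FF61 to U+FF9F (FF61 − A1 = FEC0, hexadecimal). The helper names below are hypothetical and serve only to illustrate the arithmetic; real converters normally rely on full codec tables.

```python
import unicodedata

JIS_FIRST, JIS_LAST = 0xA1, 0xDF        # katakana area of JIS X 0201
HALFWIDTH_KANA_OFFSET = 0xFF61 - 0xA1   # 0xFEC0


def jis_x0201_kana_to_unicode(byte: int) -> str:
    """Map a JIS X 0201 byte in A1..DF to its halfwidth form in U+FF61..U+FF9F."""
    if not JIS_FIRST <= byte <= JIS_LAST:
        raise ValueError(f"byte {byte:#04x} is outside A1..DF")
    return chr(byte + HALFWIDTH_KANA_OFFSET)


def unicode_to_jis_x0201_kana(ch: str) -> int:
    """Inverse mapping, for round-tripping back to the single-byte value."""
    cp = ord(ch)
    if not 0xFF61 <= cp <= 0xFF9F:
        raise ValueError(f"U+{cp:04X} is outside U+FF61..U+FF9F")
    return cp - HALFWIDTH_KANA_OFFSET


def describe(byte: int) -> str:
    """Return the Unicode character name for a JIS X 0201 katakana byte."""
    return unicodedata.name(jis_x0201_kana_to_unicode(byte))
```

Under this sketch, <code>describe(0xB1)</code> returns <code>HALFWIDTH KATAKANA LETTER A</code>.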
 
The range U+FFA0&ndash;FFDC encodes halfwidth forms of [[Hangul Compatibility Jamo|compatibility jamo]] characters for [[Hangul]], in a transposition of their [[KS C 5601#1974|1974 standard]] layout. It is used in the mapping of some IBM encodings for Korean, such as IBM code page 933, which allows the use of the [[Shift Out and Shift In characters]] to shift to a double-byte character set.<ref name="ibm933">{{cite web|url=http://demo.icu-project.org/icu-bin/convexp?conv=ibm-933|title=ICU Demonstration - Converter Explorer|author=|date=|website=demo.icu-project.org|access-date=7 May 2018}}</ref> Since the double-byte character set could contain compatibility jamo, halfwidth variants are needed to provide round-trip compatibility.<ref name=hwfwblame>{{Cite web|url=https://harjit.moe/hwfwblame.html|title=Halfwidth and Fullwidth blame}}</ref><ref>{{Cite web|url=http://userguide.icu-project.org/conversion/data|title=Conversion Data - Old location of the ICU User Guide}}</ref>
 
Range U+FFE0&ndash;FFEE includes fullwidth and halfwidth symbols.
Line 40:
|}
 
An additional variant is defined for a fullwidth [[slashed zero|zero with a short diagonal stroke]]: U+FF10 FULLWIDTH DIGIT ZERO, U+FE00 VS1 ({{not a typo|&#xff10;&#xfe00;}}).<ref>{{cite web|url=https://www.unicode.org/L2/L2015/15268-slashed-zero.pdf|title=L2/15-268: Proposal to Represent the Slashed Zero Variant of Empty Set|date=2015-10-30|first1=Barbara|last1=Beeton|first2=Asmus|last2=Freytag|first3=Laurențiu|last3=Iancu|first4=Murray|last4=Sargent}}</ref><ref name="stdvar"/>
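As an illustration of the sequence just described, the variant is written as the base character immediately followed by the variation selector. The sketch below builds that two-code-point string and lists its parts using Python's standard <code>unicodedata</code> module; the variable and function names are hypothetical, and whether the slashed glyph is actually displayed depends on font support.

```python
import unicodedata

FULLWIDTH_DIGIT_ZERO = "\uFF10"
VS1 = "\uFE00"  # VARIATION SELECTOR-1

# The variation sequence is the base character followed by the selector.
slashed_fullwidth_zero = FULLWIDTH_DIGIT_ZERO + VS1


def describe(seq: str) -> list[str]:
    """List each code point in the sequence with its Unicode character name."""
    return [f"U+{ord(c):04X} {unicodedata.name(c)}" for c in seq]


def fold_width(text: str) -> str:
    """Fold width variants with NFKC compatibility normalization."""
    return unicodedata.normalize("NFKC", text)
```

Here <code>describe(slashed_fullwidth_zero)</code> lists U+FF10 FULLWIDTH DIGIT ZERO followed by U+FE00 VARIATION SELECTOR-1, while <code>fold_width</code> shows that compatibility normalization replaces the fullwidth digit with ASCII <code>0</code>, which is why width-preserving round trips should avoid NFKC.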
 
==History==
The following Unicode-related documents record the purpose and process of defining specific characters in the Halfwidth and Fullwidth Forms block:
 
{{sticky header}}
{| class="wikitable collapsible sticky-header"
|-
! [[Unicode#Versions|Version]] !! {{nobr|Final code points<ref group=lower-alpha name=final/>}} !! Count !! [[International Committee for Information Technology Standards|L2]]&nbsp;ID !! [[ISO/IEC JTC 1/SC 2|WG2]]&nbsp;ID !! Document
Line 53 ⟶ 54:
| || {{nobr|[https://www.unicode.org/wg2/docs/n4403.pdf N4403 (pdf],}} [https://www.unicode.org/wg2/docs/n4403.doc doc]) || {{Citation|title=Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14|date=2014-01-28|first=V. S.|last=Umamaheswaran|section=Resolution M61.01}}
|-
| {{nobr|[https://www.unicode.org/L2/L2017/17056-sv-western-vs-eastasian.pdf L2/17-056]}} || || {{Citation|title=Proposal to add standardized variation sequences|date=2017-02-13|first=Ken|last=Lunde|author-link=Ken Lunde}}
|-
| {{nobr|[https://www.unicode.org/L2/L2017/17436r-sv-eastsian-punct.pdf L2/17-436]}} || || {{Citation|title=Proposal to add standardized variation sequences for fullwidth East Asian punctuation|date=2018-01-21|first=Ken|last=Lunde}}
Line 63 ⟶ 64:
| {{nobr|[https://www.unicode.org/L2/L2018/18115.htm L2/18-115]}} || || {{Citation|title=UTC #155 Minutes|date=2018-05-09|first=Lisa|last=Moore|section=Consensus 154-C17|quote=Add 16 standardized variation sequences based on L2/17-436R, for Unicode 12.0.}}
|-
| {{nobr|[https://www.unicode.org/L2/L2019/19055-segment-fullwd-digits.txt L2/19-055]}} || || {{Citation|title=Proposed Changes in the Segmentation Property Values for Fullwidth Digits |date=2019-01-14|first=Laurențiu|last=Iancu}}
|-
| {{nobr|[https://www.unicode.org/L2/L2019/19008.htm L2/19-008]}} || || {{Citation|title=UTC #158 Minutes |date=2019-02-08|first=Lisa|last=Moore|section=B.11.11.1.2 Proposed changes in the segmentation property values for fullwidth digits}}
|-
| 1.0.1 || width="180" | U+FFE8..FFEE || 7 || || || (to be determined)
|-
| rowspan="11" | 3.2 || rowspan="11" width="180" | U+FF5F..FF60 || rowspan="11" | 2 || {{nobr|[https://www.unicode.org/L2/L1999/99052.htm L2/99-052]}} || || {{Citation|title=The math pieces from the symbol font|date=1999-02-05|first=Asmus|last=Freytag}}
|-
| {{nobr|[https://www.unicode.org/L2/L2001/01033-addbrackets.htm L2/01-033]}} || || {{Citation|title=Disunify braces/brackets for math, computing science, and Z notation from similar-looking CJK braces/brackets|date=2001-01-16|first1=Kent|last1=Karlsson|first2=Asmus|last2=Freytag}}
Line 91 ⟶ 92:
| {{nobr|[https://www.unicode.org/L2/L2002/02154-n2403-minutes.pdf L2/02-154]}} || [https://www.unicode.org/wg2/docs/n2403.pdf N2403] || {{Citation|title=Draft minutes of WG 2 meeting 41, Hotel Phoenix, Singapore, 2001-10-15/19|date=2002-04-22|first=V. S.|last=Umamaheswaran|section=Resolution M41.1}}
|- class="sortbottom"
| colspan="6" | {{reflist|group=lower-alpha|refs=<ref name=final>Proposed code points and character names may differ from final code points and names.</ref>}}
|}