Chinese input method: Difference between revisions

Content deleted Content added
Hybrid: these don't have articles, can be readded later
 
(10 intermediate revisions by 9 users not shown)
Line 9:
== History ==
[[Image:Large chinese keyboard.jpg|thumb|right|An early experimental Chinese radical keyboard using 496 keys for input was developed by researchers of [[National Chiao Tung University]] in Taiwan, but was never widely used.<ref name="xinzhu">{{cite web|url=https://hccg.culture.tw/home/zh-tw/NCTU_PC_EQUIPMENT/597871|title=1973年交大研製第一個中文鍵盤|website=The memory of Hsinchu city|language=zh|access-date=2022-08-25}}</ref>]]
Chinese input methods predate the computer. One of the early attempts was an electro-mechanical [[Chinese typewriter]] Ming kwaiMingkwai ({{zh|c=明快 |p=míngkuài |w=ming-k'uai}}) which was invented by [[Lin Yutang]], a prominent Chinese writer, in the 1940s. It assigned thirty base shapes or strokes to different keys and adopted a new way of categorizing Chinese characters. But the typewriter was not produced commercially and Lin soon found himself deeply in debt.<ref>[http://203.68.20.65/science/content/1972/00110035/0018.htm 中文與計算機] {{webarchive|url=https://archive.today/20030513120935/http://203.68.20.65/science/content/1972/00110035/0018.htm |date=2003-05-13 }}</ref>
 
Before the 1980s, Chinese publishers hired teams of workers and selected a few thousand type pieces from an enormous Chinese character set. Chinese government agencies entered characters using a long, complicated list of [[Chinese telegraph code]]s, which assigned different numbers to each character. During the early computer era, Chinese characters were categorized by their radicals or Pinyin romanization, but results were less than satisfactory.
 
In the 1970s to 1980s, large keyboards with thousands of keys were used to input Chinese. Each key was mapped to several Chinese characters. To type a character, one pressed the character key and then a selection key.<ref>{{cite web |url=https://openstd.samr.gov.cn/bzgk/gb/newGbInfo?hcno=3D8D8DCA190E4CB90399EE2FB91F93CC|title=汉字整字键盘盘面字排列 |website=Standardization Administration of China |year=1987|access-date=2022-08-26}}</ref> There were also experimental "radical keyboards" with dozens to several hundreds keys. Chinese characters were decomposed into "radicals", each of which was represented by a key.<ref name="xinzhu"/><ref>{{cite journal|url=https://ir.nctu.edu.tw/handle/11536/137495 |author1=謝清俊 |author2=黃永文 |author3=林樹|title=中文字根之分析|journal=Science Bulletin National Chiao-Tung University|volume=6|issue=1|year=1973}}</ref><ref name="zhzn"/> Unwieldy and difficult to use, these keyboards became obsolete after the introduction of Cangjie input method, the first method to use only the standard QWERTY keyboard and make Chinese [[touch typing]] possible.<ref name="zhzn">{{cite book|url=http://www.open-lit.com/bid=506&id=18353|chapter=三、電腦 倉頡、天龍、零壹、漢卡|title=智慧之旅. 第3部, 炎夏(一九七三-一九九五) |author=朱邦復|publisher=時報出版|year=1995}}</ref>
 
[[Image:Keyboard layout cangjie.png|thumb|A typical [[keyboard layout]] for the Cangjie method, which is based on the [[keyboard layout#United States|United States keyboard layout]].]]
[[Chu Bong-Foo]] invented a common input method in 1976 with his [[Cangjie input method]], which assigns different "roots" to each key on a standard computer keyboard. With this method, for example, the character {{lang|zh|}} is assigned to the A key, and 月 is assigned to B. Typing them together will result in the character {{lang|zh|}} ("bright").
 
[[Image:2008 Taipei IT Month Day1 InstantDict MD6800.jpg|thumb|An electronic dictionary with Cangjie keyboard]]
Line 25:
Due to these complexities, there is no "standard" method.
 
By 1989, [[bopomofo]] and pinyin were available for the [[IBM PC]].<ref name="pournelle198902">{{Cite magazine |last=Pournelle |first=Jerry |date=February 1989 |title=Ready Line Overload |url=https://archive.org/details/eu_BYTE-1989-02_OCR/page/n175/mode/2up?view=theater |access-date=2024-10-08 |magazine=BYTE |pages=121-137}}</ref> In mainland China, pinyin methods such as [[Sogou Pinyin]] and [[Google Pinyin]] are the most popular. In [[Taiwan]], use of [[Cangjie method|Cangjie]], [[Dayi method|Dayi]], [[Boshiamy method|Boshiamy]], and [[bopomofo]] predominate; and in [[Hong Kong]] and [[Macau]], the [[Cangjie method|Cangjie]] is most often taught in schools, while a few schools teach [[CKC Chinese Input System]].<ref>{{cite magazine|url=https://www.dsedj.gov.mo/cre/tmag/07/9-2.htm|title=倉頡以外的另一個選擇 ─"縱橫輸入法"|journal=教師雜誌|issue=7|year=2004|access-date=2022-08-26}}</ref>
 
Other methods include [[handwriting recognition]], [[Optical character recognition|OCR]] and [[speech recognition]]. The computer itself must first be "trained" before the first or second of these methods are used; that is, the new user enters the system in a special "learning mode" so that the system can learn to identify their handwriting or speech patterns. The latter two methods are used less frequently than keyboard-based input methods and suffer from relatively high error rates, especially when used without proper "training", though higher error rates are an acceptable trade-off to many users.
Line 34:
{{See also|Pinyin input method|Bopomofo|Jyutping}}
[[File:Interface of Weasel Input Method.png|thumb|Interface of a [[Pinyin]] input method, showing the need to choose an appropriate word out of a list of options. The word typed is "Wikipedia" in Mandarin Chinese, but the options shown include (from top to bottom) Wikipedia, [[Uncyclopedia]], Wiki, Crisis, and Rules Violation.]]
The user enters pronunciations that are converted into relevant Chinese characters. The user must select the desired character from homophones, which are common in Chinese. Modern systems, such as [[Sogou Pinyin]] and [[Google Pinyin]], predict the desired characters based on context and user preferences. For example, if one enters the sounds ''jicheng'', the software will type {{lang|zh-Hant|繼承}} (to inherit), but if ''jichengche'' is entered, {{lang|zh-Hant|計程車}} (taxi) will appear.
 
Various Chinese dialects complicate the system. Phonetic methods are mainly based on standard [[pinyin]], [[Zhùyīn fúhào|Zhuyin]]/Bopomofo, and [[Jyutping]] in China, Taiwan, and Hong Kong, respectively. Input methods based on other [[varieties of Chinese]], like [[Hakka Chinese|Hakka]] or [[Southern Min|Minnan]], also exist.
Line 42:
==== Shuangpin ====
[[File:Microsoft Double Pinyin Scheme.png|thumb|upright=1.35|The Microsoft pinyin 2003 shuangpin scheme.]]
Shuangpin ({{lang|zh-Hans|双}}; {{lang|zh-Hant|雙}}), literally dual spell, is a stenographical phonetic [[input method]] based on [[hanyu pinyin]] that reduces the number of keystrokes for one [[Chinese character]] to two by distributing every vowel and consonant composed of more than one letter to a specific key. In most Shuangpin layout schemes such as Xiaohe, Microsoft 2003 and Ziranma, the most frequently used vowels are placed on the middle layer, reducing the risk of [[repetitive strain injury]].
 
Shuangpin is supported by a large number of pinyin input software including QQ, Microsoft Bing Pinyin, [[Sogou Pinyin]] and [[Google Pinyin]].
 
=== Shape-based ===
[[File:TypingChineseCharactersWithCangjieGedit383Ubuntu1310Screencast.webm|thumb|Typing [[Standard Chinese|Chinese]] with [[the Cangjie input method|Cangjie]]]]
* [[Cangjie input method]] (倉頡; 仓颉; Tsang-chieh)
* [[Simplified Cangjie]]
* [[Simplified Cangjie]] (簡易倉頡, known as 速成 or 'Quick' on Windows systems and 'Sucheng' on Mac OS X systems)
* [[Dayi method]]
* [[CKC Chinese Input System]] (縱橫輸入法)
* [[BoshiamyArray input method]] (嘸蝦米{{lang|zh|行列}})
* [[DayiFour-corner method]] (大易)
* [[ArrayStroke Input Method|Array inputcount method]] (行列)
* [[Wubi method]]
* [[Four-Corner Method]] (四角碼; 四角码)
* [[Zhengma method]] (鄭碼; 郑码)
* [[Oxis Chinese Character Finder]]
* [[Q9Biaoxingma method]] (九方)
* [[ZYQ method]] ({{lang|zh|正易全}})<ref>{{Cite journal |last=Zhang |first=Xiao-heng |date=2003 |title=正易全:一个动态结构笔组汉字编码输入法 (Towards Correctness, Easiness and Completeness : Building a Chinese Character Coding Input Method Based on Dynamic Structured Stroke Groups) |url=http://jcip.cipsc.org.cn/CN/abstract/abstract1316.shtml |journal=Journal of Chinese Information Processing |volume=17 (2003) |issue=3 |pages=60–66}}</ref>
* [[Shouwei method]] (首尾字型)
* [[Stroke count method]] (筆畫; 笔画)
* [[Stroke method]] (筆劃; 笔划)
* [[Wubi method]] (五筆字型; 五笔字型)
* [[Wubihua method]] (五筆畫; 五笔画)
* [[Zhengma method]] (鄭碼; 郑码)
* [[Biaoxingma method]] (表形碼; 表形码)
* [[Shou-wei Hao-ma method]] (首尾號碼)
* [[Knot DNA method]] (筆結碼)<ref>{{Citation |last=Lee (FourDollars) |first=Shih-Yuan |title=DNA Chinese Input Method |date=2013-10-07 |url=https://github.com/fourdollars/dna |access-date=2022-10-15}}</ref>
* [[TigerCode method]] (虎碼; 虎码)<ref>{{Citation |title=TigerCode home website |access-date=2023-03-04 |url=https://tiger-code.com/}}</ref>
* [[ZYQ method]] (正易全)<ref>{{Cite journal |last=Zhang |first=Xiao-heng |date=2003 |title=正易全:一个动态结构笔组汉字编码输入法 (Towards Correctness, Easiness and Completeness : Building a Chinese Character Coding Input Method Based on Dynamic Structured Stroke Groups) |url=http://jcip.cipsc.org.cn/CN/abstract/abstract1316.shtml |journal=Journal of Chinese Information Processing |volume=17 (2003) |issue=3 |pages=60–66}}</ref>
 
=== Others ===
* [[Chinese telegraph code]] ({{lang|zh-Hant|中文電碼}})
 
=== Examples of keyboard layouts ===
<gallery heights="60px" widths="140px">
Image:Keyboard layout Zhuyin.svg|A typical [[keyboard layout]] for zhuyin on computers, which can be used as an input method
Image:Wubi keyboard.png|TheA keyboard using the [[Wubi method|Wubi]] keyboard which is an input method
Image:Keyboard layout cangjie.png|A typical keyboard layout for the [[Cangjie method]], which is based on the [[keyboard layout#US|United StatesU.S. keyboard layout]]. Note the non-standard use of Z as the collision key.
Image:Keyboard layout Dayi.svg|A typical keyboard layout for the [[Dayi method]]
Image:Keyboard layout Chinese Traditional.png|Chinese (traditional) keyboard layout, a US keyboard with Zhuyin, Cangjie and Dayi key labels, which can all be used to input Chinese characters into a computer
</gallery>
 
== Software ==
* [[Microsoft Pinyin IME|Microsoft IME]]
* [[Sogou Pinyin]]
* [[Google Pinyin]]
Line 101 ⟶ 91:
** [[Big5]]
** [[Guobiao code]] (GB)
** [[Neima]] (內碼)
** [[Unicode]]
** [[Chinese telegraph code|Telegraph code]] (電報碼)
* [[Chinese character IT]]
{{div col end}}
 
== NotesReferences ==
{{reflist}}