Pinyin input method: Difference between revisions

Content deleted Content added
mNo edit summary
Language mixing: two different ways to toggle languages
 
(38 intermediate revisions by 33 users not shown)
Line 1:
{{Short description|Method of entering Chinese characters into a computer}}
[[Image:Scim-pinyin-weijibaike.png|thumb|300px|Screenshot of [[Smart Common Input Method|SCIM]]'s Smart Pinyin.]]
{{cleanup lang|date=October 2023}}
The '''pinyin method''' ({{zh|s=拼音输入法|t=拼音輸入法|p=pīnyīn shūrù fǎ}}) refers to a family of [[Chinese input methods for computers|input methods]] based on the [[pinyin]] method of romanization.
{{External links|date=January 2022}}
[[Image:Scim-pinyin-weijibaike.png|thumb|300px|Screenshot of [[Smart Common Input Method|SCIM]]'s Smart Pinyin.]]
[[File:Typing Chinese characters with SunPinyin screencast.webm|thumb|Typing using [[Intelligent Input Bus|IBus]] and SunPinyin]]
The '''pinyin method''' ({{lang-zh|s=拼音输入法|t=拼音輸入法|p=pīnyīn shūrù fǎ}}) refers to a family of [[Chinese input methods for computers|input methods]] based on the [[pinyin]] method of romanization.
 
In the most basic form, the pinyin method allows a user to input [[Chinese characters]] by entering the pinyin of a Chinese character and then presenting the user with a list of possible characters with that [[Standard Chinese phonology|pronunciation]]. However, there are a number of slightly different such systems in use, and modern pinyin methods provide a number of convenient features.
 
==Advantages and disadvantages==
The obvious advantage of pinyin-based input methods is the ease of learning for [[Standard Chinese]] speakers. Those who are familiar with pinyin and are able to recognize the resulting characters would be able to input Chinese charactersthem with almost no training, compared to other input methods.<ref>{{Cite web|url=https://www.pinyinjoe.com/about-pinyin-input.htm|title=About Pinyin input (and other methods): Chinese typing basics and background :: Pinyin Joe|website=www.pinyinjoe.com|access-date=2020-03-31}}</ref> It does not require the user to be able to construct the character from scratch, as one would do in [[written Chinese]]. Since all children in [[mainland China]] are required to learn pinyin in [[Education in China|school]], pinyin is among the most popular input methods there.<ref>{{Cite web|url=https://www.justincheuk.com/just-how-do-you-type-in-chinese-introduction-to-chinese-input-methods/|title=Just How do you Type in Chinese? Introduction to Chinese Input Methods|last=Cheuk|first=Justin|date=2016-12-09|website=Justin Cheuk: When HK Meets UK|language=en-GB|access-date=2020-03-31}}</ref>
 
For people who cannot speak Standard Chinese, the main advantage of pinyin becomes a disadvantage, as they will need to learn the Standard Chinese pronunciation of characters before they are able to use this input method.<ref>{{Cite web|url=https://www.writtenchinese.com/pros-cons-learning-chinese-pinyin/|title=Pros and Cons of Learning Chinese with Pinyin|date=2014-04-18|website=Written Chinese|language=en-US|access-date=2020-03-31}}</ref> Also, because pinyin and other [[pronunciation]]-based input methods do not rely on the written formation of the character for input (as do [[Stroke (CJK character)|stroke-based]] input methods like [[Cangjie input method|Cangjie]]), they may cause [[language attrition]] and skill loss in adults, and it may be a learning barrier for written Chinese in children.<ref>{{cite web|url=https://query.nytimes.com/gst/fullpage.html?res=9B01E1D9163EF932A35751C0A9679C8B63|title=Where the PC Is Mightier Than the Pen|last=Lee|first=Jennifer|date=2001-02-01|work=[[New York Times]]|access-date=2008-07-29}}</ref>
For people who do not speak Chinese, the main advantage of pinyin becomes its disadvantage. They will need to learn the Standard Chinese pronunciation of characters before they are able to use this input method.
 
However, since all children in [[Mainland China]] are required to learn [[pinyin]] in school, pinyin is in fact very popular there.
 
Unlike stroke-based input methods, the pinyin method only requires the user to know how to speak Mandarin and be able to recognize the characters. It does not require the user to be able to construct the character from scratch as one would do in writing Chinese. This is both an advantage and a disadvantage. It is an advantage in that people will be able to type all the characters they can recognize. It is a disadvantage in that it may cause [[language attrition]] and skill loss in adults, and it may be a learning barrier for written Chinese in children.<ref>{{cite web|url=https://query.nytimes.com/gst/fullpage.html?res=9B01E1D9163EF932A35751C0A9679C8B63|title=Where the PC Is Mightier Than the Pen|last=Lee|first=Jennifer|date=2001-02-01|publisher=[[New York Times]]|accessdate=2008-07-29}}</ref>
 
== Elements and features ==
{{More citations needed section|date=March 2020}}{{how-to|date=March 2020}}
Pinyin input methods differ in a number of possible aspects. Most pinyin input methods provide convenience features to speed up input. Some of these features can speed up typing immensely.
 
=== Conversion length ===
 
TheConversion basic idea of anlength input method is to have athe buffer that holds the user input until it is converted into characters that would otherwise be unavailable from the keyboard.
 
In the most basic systems, one character is converted at a time. This makes a very time -consuming input process. Not only does the user have to select characters one at a time, it also means that the input system does not have the ability to prioritize character choices using word phrases, grammatical structure, or context. In addition, since the input method only supports one character at a time, it likely requires the user to type out the full pinyin spelling to narrow down the selection. This system still exists in embedded applications such as cell phones.
 
Common pinyin implementations on the computer today can hold up to a clause in pinyin before requiring a conversion. The method attempts to guess the appropriate characters by using word phrases from a dictionary, grammatical structure, and context.
Line 32 ⟶ 33:
With the exception of intonation, there are two extended Latin vowels in pinyin. They are [[ü]] (U-umlaut) and [[ê]] (E-circumflex). Given that the US keyboard layout is the most common keyboard layout in China, any pinyin method implementation would need to be able to facilitate the input of those vowels on US keyboard.
 
Since the letter "v" is unused in Mandarin pinyin, it is universally used as an alias for ü. For example, typing "nv" into the input method would bring up the candidate list for {{lang-zh|p=nǚ}}.
 
The handling of ê is not as universal, since the character [[wikt:欸|欸]] is the only commonly used character with this pronunciation. It is an [[interjection]] roughly equivalent to "[[Eh]]" in English. Some IMEs, such as Google Pinyin, merge it into "e", while others create an additional letter combination for it, such as "ea" or "eh", or "ei" in iOS. Others would simply drop this sound.
 
=== Treatment of hm, hng, ng, n ===
The character 嗯 (ng) can (or should<ref>{{Cite web |url=http://news.xwh.cn/news/system/2013/05/16/010353431.shtml |title="嗯"字典里发"ng"音为啥电脑要打"en"_新闻中心_新文化网_吉林_长春_&#91;嗯 字典 输入法&#93; |access-date=2015-12-04 |archive-date=2015-12-08 |archive-url=https://web.archive.org/web/20151208131409/http://news.xwh.cn/news/system/2013/05/16/010353431.shtml |url-status=dead }}</ref>) be written using the IBUS linuxGNU/Linux and the Microsoft input method by typing "en".
 
=== Usage statistics and user dictionaries ===
Line 43 ⟶ 44:
 
=== Abbreviation ===
Abbreviation is a feature that allows the user to omit all but the first or first couple of letters in the pinyin spelling. This feature can speed up the input of long word phrases significantly. Under this feature, the user can enter the word for "concert" ({{lang-zh|s=音乐会|t=音樂會|p=yīnyuèhuì}}) by typing "yyh" as opposed to "yinyuehui".
 
In systems that support user-defined phrases, users can even define their own abbreviations that might not follow standard pinyin rules.
Line 51 ⟶ 52:
 
=== Word prediction ===
[[Word prediction]] ({{lang-zh|s=联想|t=聯想|p=liánxiǎng|l=association}}) is a feature of an input method that attempts to guess the next series of characters that the user is attempting to enter. This feature is often used to refer to two different mechanisms that have similar functions.
 
One of these mechanisms is akin to an [[auto-complete]] function for user input. While the user is typing the appropriate pinyin, the input method would take the input and look up all possible word phrases that might match the user input even though the input is incomplete. For example, when the user enters "shang", the input method would show "上海" (Shanghai) as a word candidate under this feature.
Line 58 ⟶ 59:
 
=== Double pinyin ===
[[File:MSPY-double-pinyin.svg|thumb|The default double pinyin scheme in [[Microsoft Pinyin IME]]. Many IME, including ibus-pinyin, support this scheme.]]
Vowel groups in pinyin can be up to four letters long. ''Double pinyin'' (双拼) is a method whereby longer vowel groups are assigned to consonant keys as shortcuts, and zh, ch, sh are assigned to vowel keys as shortcuts. Thus, when the input method expects a vowel, the user can use the shortcuts to speed up typing.
 
In the [[Microsoft Pinyin IME]], for example, if a user wants to input “中华人民共和国 (zhōnghuárénmíngònghéguó)”, "People's Republic of China" into the computer, they need to type "zhonghuarenmingongheguo" in Full Pinyin. In Double Pinyin, however, one only needs to type "vshwrfmngshego" (v=zh, s=ong, h=h, w=ua, r=r, f=en, m=m, n=in, g=g, s=ong, h=h, e=e, g=g, o=uo). However, with a system that uses abbreviation, the same result can be achieved by just typing in "zhrmghg".
 
=== Typo correction ===
Line 67 ⟶ 68:
 
=== Language mixing ===
Most advanced pinyin method implementations allow the mixing of English into an input stream without requiring the user to change the language mode. However,The itShift oftenkey comes(on withWindows) someor limitationsthe suchCaps asLock requiringkey (on macOS, labelled as the input中/英 key on Mac keyboards sold in China) is most commonly used tofor bethis uppercasepurpose.
 
The following examples show thetwo differencedifferent if user wishesways to enter "这个SQL漏洞可以瘫痪整个系统。" (This [[SQL]] vulnerability could paralyze the entire system.):
* "zhe ge [switch to English] SQL [switch to Chinese] loudong keyi tanhuan zhengge xitong." (Unsupported)
* "zhe ge [enable Caps Lock] SQL [disable Caps Lock] loudong keyi tanhuan zhengge xitong." (Supported)
 
==Implementations==
{{External links|section|date=September 2017}}
The following are the most popular pinyin method editors used in [[Mainlandmainland China]]. They are free to download at their official websites.
 
=== Cross Platformplatform ===
* Rime input method engine,<ref>[https://github.com/rime/librime librime homepage], [https://packages.debian.org/sid/librime-bin librime Debian package]</ref> an open source input method engine for pinyin and others, which supports Windows, macOS, and Linux (中州韻).
 
=== Windows ===
*[[Microsoft Pinyin IME]], bundled with Windows 20003.1x or higher, and bundled with all Simplified Chinese editions of Windows, developed by [[Harbin Institute of Technology]] (微软拼音输入法).
*[[ZNABC]], bundled with Simplified Chinese edition of Windows XP, developed by [[Peking University]] (智能ABC输入法).
*[[Sogou Pinyin]], [[Sogou]] created the IME on which Google based [[Google Pinyin]] (搜狗拼音输入法).
*[[Google Pinyin]], Google's implementation for Windows XPand Android. (谷歌拼音输入法)
*[http://www.unispim.com/ Ziguang Pinyin] ({{lang|zh-Hans|紫光拼音输入法}})
*[http://py.qq.com/ QQ Pinyin] ({{lang|zh-Hans|QQ拼音输入法}})
*[http://ime.baidu.com/index.html?from=fc_client Baidu Pinyin] ({{lang|zh-Hans|百度拼音输入法}})
*[http://dir.jjol.cn/Pyjj/ Pinyin Jiajia] ({{lang|zh-Hans|拼音加加输入法}})
 
=== Linux/Unix ===
*[[Fcitx]], general input method that supports Pinyin with fcitx-pinyin and fcitx-rime, among many others schemes.
*Smart Pinyin (scim-pinyin), pinyin implementation for the [[Smart_Common_Input_MethodSmart Common Input Method|SCIM]] input platform on [[Linux]], [[BSD]], and other [[Unices]].
*Bimspinyin, pinyin implementation for the [[xcin]] input platform on [[Linux]], [[BSD]], and other [[Unices]].
*[[OpenVanilla]], a cross-platform framework for Chinese and more.
Line 103 ⟶ 104:
 
=== Web ===
*[https://www.chineseconverter.com/en/convert/type-in-chinese-online-ime Type in Chinese Online (IME)] web-based IME with cross-browser support.
*[http://pinyin.sogou.com/cloud/ Sogou Cloud Input Method] web-based IME through browsers, can be embedded into other pages.
*[http://py.qq.com/web/ QQ Cloud Input Method] web-based IME through browsers, can be embedded into other pages. Similar to above.
*[https://www.google.com/inputtools/try/ Google web-based IME]
*[http://www.pinyinput.com Online Pinyin Input Method] web-based IME through browsers.
Line 112:
*[[Chinese input methods for computers]]
*[[Keyboard layout]]
*[[Wāpuro rōmaji]]
 
==References==
Line 117 ⟶ 118:
 
==External links==
{{Chinese language}}
* [http://i2pinyin.com Online Pinyin Input Method Editor (IME) 網上拼音輸入法]
{{Input method}}
* [http://chinesecharacteraday.com/where-to-start/setting-up-your-computer Chinese Pinyin Input Method Selection for Computers]
{{Authority control}}
 
{{DEFAULTSORT:Pinyin Input Method}}
[[Category:Han characterCJK input methods]]
[[Category:PinyinHan pinyin input]]