Japanese language and computers: Difference between revisions

Content deleted Content added
update
Tags: Reverted Mobile edit Mobile web edit
Tags: Reverted Mobile edit Mobile web edit
Line 53:
[[Unicode]] was intended to solve all encoding problems over all languages. The [[UTF-8]] encoding used to encode Unicode in web pages does not have the disadvantages that Shift-JIS has. Unicode is supported by international software, and it eliminates the need for gaiji. There are still controversies, however. For Japanese, the kanji characters have been [[Han unification|unified]] with Chinese; that is, a character considered to be the same in both Japanese and Chinese is given a single number, even if the appearance is actually somewhat different, with the precise appearance left to the use of a locale-appropriate font. This process, called [[Han unification]], has caused controversy.{{cn|date=October 2020}} The previous encodings in Japan, [[Free area of the Republic of China|Taiwan Area]], [[Mainland China]] and [[Korea]] have only handled one language and Unicode should handle all. The handling of Kanji/Chinese have however been designed by a committee composed of representatives from all four countries/areas.{{cn|date=October 2020}}
 
== Text inputOutput ==
{{main|Japanese inputoutput methods}}
Written Japanese uses several different scripts: [[kanji]] (Chinese characters), 2 sets of ''kana'' (phonetic syllabaries) and roman letters. While kana and roman letters can be typed directly into a computer, entering kanji is a more complicated process as there are far more kanji than there are keys on most keyboards. To inputoutput kanji on modern computers, the reading of kanji is usually entered first, then an [[input method editor]] (IME), also sometimes known as a front-end processor, shows a list of candidate kanji that are a phonetic match, and allows the user to choose the correct kanji. More-advanced IMEs work not by word but by phrase, thus increasing the likelihood of getting the desired characters as the first option presented. Kanji readings inputs can be either via [[romanization]] (''[[rōmaji]] nyūryoku,'' {{Nihongo2|[[:ja:ローマ字入力|ローマ字入力]]}}) or direct kana input (''kana nyūryoku,'' {{Nihongo2|[[:ja:かな入力|かな入力]]}}). Romaji input is more common on PCs and other full-size keyboards (although direct inputoutput is also widely supported), whereas direct kana inputoutput is typically used on mobile phones and similar devices – each of the 10 digits (1–9,0) corresponds to one of the 10 columns in the [[gojūon]] table of kana, and multiple presses select the row.
 
There are two main systems for the [[romanization]] of Japanese, known as ''[[Kunrei-shiki]]'' and ''[[Hepburn romanization|Hepburn]]''; in practice, "keyboard romaji" (also known as ''[[wapuro romaji|wāpuro rōmaji]]'' or "word processor romaji") generally allows a loose combination of both. IME implementations may even handle keys for letters unused in any romanization scheme, such as ''L'', converting them to the most appropriate equivalent. With kana inputoutput, each key on the keyboard directly corresponds to one kana. The [[JIS X 6002|JIS keyboard]] system is the national standard, but there are alternatives, like the [[thumb-shift keyboard]], commonly used among professional typists.
 
== Direction of text ==