Content deleted Content added
No edit summary |
|||
Line 3:
'''Indian Script Code for Information Interchange''' ('''ISCII''') is a coding scheme for representing various writing systems of [[India]]. It encodes the main [[Indic script]]s and a Roman transliteration. The supported scripts are: [[Assamese alphabet|Assamese]], [[Bengali alphabet|Bengal (Bangla)]], [[Devanagari]], [[Gujarāti script|Gujarati]], [[Gurmukhi]], [[Kannada script|Kannada]], [[Malayalam script|Malayalam]], [[Oriya script|Oriya]], [[Tamil script|Tamil]], and [[Telugu script|Telugu]]. ISCII does not encode the writing systems of India based on [[Persian language|Persian]], but its writing system switching codes nonetheless provide for [[Kashmiri language|Kashmiri]], [[Sindhi language|Sindhi]], [[Urdu]], [[Persian language|Persian]], [[Pashto language|Pashto]] and [[Arabic]]. The Persian-based writing systems were subsequently encoded in the [[Perso-Arabic Script Code for Information Interchange|PASCII]] encoding.
ISCII has not been widely used outside certain government institutions, although a variant without the
==Background==
The Brahmi-derived writing systems have similar structure. So ISCII encodes letters with the same phonetic value at the same code point, overlaying the various scripts. For example, the ISCII codes 0xB3 0xDB represent [ki]. This will be rendered as കി in [[Malayalam]], कि in Devanagari, as ਕਿ in Gurmukhi, and as கி in Tamil. The writing system can be selected in rich text by markup or in plain text by means of the {{ctrl|ATR|internal=yes}} code described below.
One motivation for the use of a single encoding is the idea that it will allow easy [[transliteration]] from one writing system to another. However, there are enough incompatibilities that this is not really a practical idea.
ISCII is an 8-bit encoding. The lower 128 code points are plain [[American Standard Code for Information Interchange|ASCII]], the upper 128 code points are ISCII-specific. In addition to the code points representing characters, ISCII makes use of a code point with mnemonic {{ctrl|ATR|internal=yes}} that indicates that the following byte contains one of two kinds of information. One set of values changes the writing system until the next writing system indicator or end-of-line. Another set of values select display modes such as bold and italic. ISCII does not provide a means of indicating the default writing system.
== Codepage layout ==
|