Content deleted Content added
transliteration clarification, list of supported scripts |
Copy-editing |
||
Line 1:
'''ISCII''' ('''I'''ndian '''S'''cript '''C'''ode for '''I'''nformation '''I'''nterchange) is a coding scheme for representing various [[Indic script]]s as well as a Latin-based script with diacritic marks used to depict [[Romanisation|Romanised]] Indic languages. Most of those scripts are rather similar in structure, but have different letter shapes. So ISCII tries to encode the logical structure of the [[Indic script]]s, while script-specific letter shape are expected to be selected by markup or font specification in rich text. For plain text documents the non-printing ATR character can be used to select script-specific letter shape (this mechanism is similar to the use of [[escape sequence]]s). The supported scripts are: [[Assamese script|Assamese]], [[Bengali script|Bengali]], [[Devanagari]], [[Gujarati script|Gujarati]], [[Gurumukhi]], [[Kannada script|Kannada]], [[Malayalam script|Malayalam]], [[Oriya script|Oriya]], [[Tamil script|Tamil]], and [[Telugu script|Telugu]].
It is claimed that manually switching between scripts will easily achieve automatic [[transliteration]], though this is not always straightforward as the
ISCII is a fixed-length 8-bit encoding. The lower 128 codepoints are plain [[ASCII]], the upper 128 codepoints are ISCII-specific.
ISCII has largely been obsoleted by [[Unicode]], which has however attempetd to preserve the ISCII layout for its Indic language blocks. (Unicode has a separate code-point range for each language.)
== External links ==
|