Indian Script Code for Information Interchange

This is an old revision of this page, as edited by Sietse Snel (talk | contribs) at 13:00, 17 October 2004 (sp/gr). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

ISCII (Indian Script Code for Information Interchange) is a coding scheme for representing various Indic scripts. Most of those scripts are rather similar in structure, but have different letter shapes. So ISCII tries to encode the logical structure of the Indic scripts, while script-specific letter shape are expected to be selected by markup or font specification in rich text. For plain text documents the non-printing ATR character can be used to select script-specific letter shape (this mechanism is similar to the use of [[escape sequence]s]).

By manually switching between scripts, an automatic transliteration is achieved.

ISCII is a fixed-length 8-bit encoding. The lower 128 codepoints are plain ASCII, the upper 128 codepoints are ISCII-specific.

Unicode has largely preserved the ISCII encoding strategy, but has assigned each language a separate codepoint range. So now there is a series of 128-codepoints-long blocks for Indic scripts.