Indian Script Code for Information Interchange: Difference between revisions

Content deleted Content added
No edit summary
Line 3:
'''Indian Script Code for Information Interchange''' ('''ISCII''') is a coding scheme for representing various writing systems of [[India]]. It encodes the main [[Indic script]]s and a Roman transliteration. The supported scripts are: [[Assamese alphabet|Assamese]], [[Bengali alphabet|Bengal (Bangla)]], [[Devanagari]], [[Gujarāti script|Gujarati]], [[Gurmukhi]], [[Kannada script|Kannada]], [[Malayalam script|Malayalam]], [[Oriya script|Oriya]], [[Tamil script|Tamil]], and [[Telugu script|Telugu]]. ISCII does not encode the writing systems of India based on [[Persian language|Persian]], but its writing system switching codes nonetheless provide for [[Kashmiri language|Kashmiri]], [[Sindhi language|Sindhi]], [[Urdu]], [[Persian language|Persian]], [[Pashto language|Pashto]] and [[Arabic]]. The Persian-based writing systems were subsequently encoded in the [[Perso-Arabic Script Code for Information Interchange|PASCII]] encoding.
 
ISCII has not been widely used outside certain government institutions, although a variant was used on [[classic Mac OS]],<ref name="appledevanagari"/> and it has now been rendered largely obsolete by [[Unicode]]. Unicode uses a separate block for each Indic writing system, and largely preserves the ISCII layout within each block.
 
==Background==
Line 312:
== Special code points ==
 
; {{anchor|INV}}INV character—code point D9 (217): The INV (invisible consonant) character is used as a pseudo-consonant to display combining elements in isolation. For example, क (ka) + ् (halant) + INV = क्‍ (half ka). The Unicode equivalent is {{unichar|200D|ZERO WIDTH JOINER}} ({{ctrl|ZWJ}}). However, as noted [[#virama|below]], the ISCII halant character can be doubled or combined with the ISCII nukta to achieve effects created by {{ctrl|ZWNJ}} or ZWJ in Unicode. For this reason, [[Apple Inc|Apple]] maps the ISCII INV character to the Unicode {{ctrl|LRM|left-to-right mark}}, so as to guarantee [[round-trip format conversion|round-tripping]].<ref name="appledevanagari">{{cite web |url=https://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/DEVANAGA.TXT |title=Map (external version) from Mac OS Devanagari encoding to Unicode 2.1 and later. |author=Apple |author-link=Apple Inc |publisher=[[Unicode Consortium]] |date=2005-04-05 |orig-year=1998-02-05}}</ref>
; {{anchor|ATR}}ATR character—code point EF (239): The ATR (attribute) character followed by a byte code is used to switch to a different font attribute (such as bold) or to a different ISCII or [[PASCII]] language (such as Bengali), up to the next ATR sequence or the end of the line. This has no direct Unicode equivalent, as font attributes are not part of Unicode, and each script has a distinct set of code points.
{| class="wikitable collapsible"