Indian Script Code for Information Interchange

This is an old revision of this page, as edited by 59.94.99.203 (talk) at 03:17, 13 June 2006 (External links). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

ISCII (Indian Script Code for Information Interchange) is a coding scheme for representing various Indic scripts as well as a Latin-based script with diacritic marks used to depict Romanised Indic languages. Most of those scripts are rather similar in structure, but have different letter shapes. So ISCII tries to encode the logical structure of the Indic scripts, while script-specific letter shape are expected to be selected by markup or font specification in rich text. For plain text documents the non-printing ATR character can be used to select script-specific letter shape (this mechanism is similar to the use of escape sequences). The supported scripts are: Assamese, Bengali, Devanagari, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu.

It is claimed that manually switching between scripts will easily achieve automatic transliteration, though this is not always straightforward as the various Indic scripts have incompatibilities among themselves that prevent round-tripping. See About ISCII.

ISCII is a fixed-length 8-bit encoding. The lower 128 codepoints are plain ASCII, the upper 128 codepoints are ISCII-specific.

ISCII has largely been obsoleted by Unicode, which has however attempted to preserve the ISCII layout for its Indic language blocks. (Unicode has a separate code-point range for each language.)

ISCII (Indian Script Code for Information Interchange) is a coding scheme for representing various Indic scripts as well as a Latin-based script with diacritic marks used to represent Romanized Indic languages. It is an 8-bit coded character set for information interchange and is intended for use in all computer and communication media that allow usage of 7 or 8 bit-character set - code extension techniques.

Since most of the Indic scripts are somewhat alike in structure, but have different letter shapes. So ISCII tries to encode the logical structure of the Indic scripts, while script-specific letter shape are expected to be selected by markup or font specification in rich text. For plain text documents the non-printing ATR character can be used to select script-specific letter shape. ISCII is a fixed-length 8-bit encoding. The lower 128 characters are same as ASCII Character set and the upper 128 characters cater to the 10 Indian scripts based on the ancient Brahmi script. The 10 scripts used for Indian languages have evolved from the ancient Brahmi script and have common phonetic structure, making a common character set possible. The Northern scripts are Devanagari, Punjabi, Gujarati, Oriya, Bengali and Assamese, while the Southern scripts are Telugu, Kannada, Malayalam and Tamil. The following are the ISCII characters: Dec Hex Character Description 161 A1 ँ Vowel-modifier CHANDRABINDU 162 A2 ं Vowel-modifier ANUSWAR 163 A3 ः Vowel-modifier VISARG 164 A4 अ Vowel A 165 A5 आ Vowel AA 166 A6 इ Vowel I 167 A7 ई Vowel II 168 A8 उ Vowel U 169 A9 ऊ Vowel UU 170 AA ऋ Vowel RI 171 AB ऎ Vowel E 172 AC ए Vowel EY 173 AD ऐ Vowel AI 174 AE एँ Vowel AYE (Devanagari Script) 175 AF ओ Vowel O 176 B0 औ Vowel OW 177 B1 आँ Vowel AU 178 B2 आँ Vowel AWE (Devanagari Script) 179 B3 क Consonant KA 180 B4 ख Consonant KHA 181 B5 ग Consonant GA 182 B6 घ Consonant GHA 183 B7 ङ Consonant NGA 184 B8 च Consonant CHA 185 B9 छ Consonant CHHA 186 BA ज Consonant JA 187 BB झ Consonant JHA 188 BC ञ Consonant JNA 189 BD ट Consonant Hard TA 190 BE ठ Consonant Hard THA 191 BF ड Consonant Hard DA 192 C0 ढ Consonant Hard DHA 193 C1 ण Consonant Soft NA 194 C2 त Consonant Soft TA 195 C3 थ Consonant Soft THA 196 C4 द Consonant Soft DA 197 C5 ध Consonant Soft DHA 198 C6 न Consonant Soft NA 199 C7 .न Consonant NA (Tamil) 200 C8 प Consonant PA 201 C9 फ Consonant PHA 202 CA ब Consonant BA 203 CB भ Consonant BHA 204 CC म Consonant MA 205 CD य Consonant YA 206 CE य़ Consonant JYA (Bengali, Assamese, Oriya) 207 CF र Consonant RA 208 D0 ऱ Consonant Hard RA (Southern Script) 209 D1 ल Consonant LA 210 D2 ळ Consonant Hard LA 211 D3 .ळ Consonant ZHA ( Tamil , Malayalam) 212 D4 व Consonant VA 213 D5 श Consonant SHA 214 D6 ष Consonant Hard SHA 215 D7 स Consonant SA 216 D8 ह Consonant HARD SHA 217 D9 INV Consonant INVISIBLE 218 DA ा Vowel Sign AA 219 DB ि Vowel Sign I 220 DC ी Vowel Sign II 221 DD ु Vowel Sign U 222 DE ू Vowel Sign UU 223 DF ृ Vowel Sign RI 224 E0 ॆ Vowel Sign E (Southern Script) 225 E1 े Vowel Sign EY 226 E2 ै Vowel Sign AI 227 E3 ॅ Vowel Sign AYE (Devanagari Script) 228 E4 ॊ Vowel Sign O (Southern Script) 229 E5 ो Vowel Sign OW 230 E6 ौ Vowel Sign AU 231 E7 अँ Vowel Sign AWE (Devanagari Script) 232 E8 ् Vowel Omission Sign (Halant) 233 E9 . Diacritic Sign (Nukta) 234 EA । Full Stop ( Viram) 235 EB This position shall not be used 236 EC This position shall not be used 237 ED This position shall not be used 238 EF This position shall not be used 239 FF ATR Attribute Code 240 F0 EXT Extension Code 241 F1 0 Digit 0 242 F2 1 Digit 1 243 F3 2 Digit 2 244 F4 3 Digit 3 245 F5 4 Digit 3 246 F6 5 Digit 4 247 F7 6 Digit 6 248 F8 7 Digit 7 249 F9 8 Digit 8 250 FA 9 Digit 9 251 FB This position shall not be used 252 FC This position shall not be used 253 FD This position shall not be used 254 FE This position shall not be used