Revision as of 07:09, 4 November 2010 edit ZanderSchubert (talk \| contribs) Extended confirmed users 1,644 edits Adding extended table ← Previous edit		Revision as of 11:50, 19 April 2011 edit undo Kevin Carmody (talk \| contribs) 106 edits Correct tables, add special character descriptions Next edit →
Line 209: \|- !{{chset-left\|A}} \|{{chset-color-~~intl~~undef}} \|~~{{chset-cell3\|0950\|ॐ\|160}}~~ \|{{chset-color-intl}} \|{{chset-cell3\|0901\|ँ\|161}} \|{{chset-color-intl}} \|{{chset-cell3\|0902\|ं\|162}} Line 258: \|{{chset-color-intl}} \|{{chset-cell3\|092D\|भ\|203}} \|{{chset-color-intl}} \|{{chset-cell3\|092E\|म\|204}} \|{{chset-color-intl}} \|{{chset-cell3\|0930\|र\|205}}▼ \|{{chset-color-intl}} \|{{chset-cell3\|095F\|य़\|206}}▼ \|{{chset-color-intl}} \|{{chset-cell3\|092F\|य\|207}} ▲\|{{chset-color-intl}} \|{{chset-cell3\|095F\|य़\|206}} ▲\|{{chset-color-intl}} \|{{chset-cell3\|0930\|र\|205}} \|- !{{chset-left\|D}} Line 291: \|{{chset-color-intl}} \|{{chset-cell3\|094D\|्\|232}} \|{{chset-color-intl}} \|{{chset-cell3\|093C\|़\|233}} \|{{chset-color-intl}} \|{{chset-cell3\|~~093D~~0964\|&#~~x093D~~x0964;\|234}} \|{{chset-color-undef}}\| \|{{chset-color-undef}}\| Line 317: \|} == Special code points == The [[nukta]] is used to create a number of characters which have precomposed forms in Unicode, as well as a number of rarer characters which don't exist in the main ISCII set, such as the Sanskrit character ॠ. '''INV character—code point D9 (217)''': The INV character is used as a pseudo-consonant to display combining elements in isolation. For example, क (ka) + ् (halant) + INV = क्‍ (half ka). The Unicode equivalent is no break space 00A0 or dotted circle ◌ 25CC. '''Halant character ़—code point E8 (232)''': The halant character removes the implicit vowel from a consonant and is used between consonants to represent conjunct consonants. For example, क (ka) + ् (halant) + त (ta) = क्त (kta). The sequence ् (halant) + ् (halant) displays a conjunct with an explicit halant, for example क (ka) + ् (halant) + ् (halant) + त (ta) = क्‌त. The sequence ् (halant) + ़ (nukta) displays a conjunct with half consonants, if available, for example क (ka) + ् (halant) + ़ (nukta) = क्‍त. Unicode equivalents are as follows: ISCII single halant E8 = Unicode 092D; ISCII halant + halant = Unicode 094D + zero width non-joiner (ZWNJ) 200C; ISCII halant + nukta = Unicode 034D + zero width joiner (ZWJ) 200D. '''Nukta character ़—code point E9 (233)''': The [[nukta]] character after another ISCII character is used for a number of rarer characters which don't exist in the main ISCII set. For example क (ka) + ़ (nukta) = क़ (qa). These characters have precomposed forms in Unicode, as shown in the following table. {\| class="wikitable Unicode" border="1" style="text-align:center; font-size:120%;" ! ISCII<br>code point !! Original<br>character !! Character<br>with nukta !! Unicode<br>code point \|- \| A1 (160) \|\| ँ \|\| ॐ \|\| 0950 \|- \| A6 (166) \|\| इ \|\| ऌ \|\| 090C Line 348 ⟶ 356: \| DF (223) \|\| ृ \|\| ॄ \|\| 0944 \|- \| EA (224) \|\| ऽ। \|\| ।ऽ \|\| 0964 \|} '''ATR character—code point EF (239)''': The ATR character followed by a byte code is used to switch to a different font attribute (such as bold) or language (such as Bengali), up to the next ATR sequence or the end of the line. This has no direct Unicode equivalent, as font attributes are not part of Unicode, and each script has a distinct set of code points. '''EXT character—code point F0 (240)''': The EXT character followed by a byte code indicates a Vedic accent. This has no direct Unicode equivalent, as Vedic accents are assigned to distinct code points. ==Code points for all languages== Each alphabet is listed in the order of its ISCII code point. Code points with asterisks ~~are~~() ~~formed~~indicate ~~with~~the acode ~~following~~point followed by nukta, e.g. क (k) + ़ = क़ (q); इ (i) + ़ = ऌ (ḷ). Each character is listed along with its Unicode code point. {\| class="wikitable collapsible collapsed" style="border:none;" Line 536 ⟶ 548: ! E9 \|\| Diacritic Sign (Nukta) \|\| \|\| ़ \|\| 093C \|\| ় \|\| 09BC \|\| ਼ \|\| 0A3C \|\| ઼ \|\| 0ABC \|\| ଼ \|\| 0B3C \|\| colspan="2"\| \|\| colspan="2"\| \|\| ಼ \|\| 0CBC \|\| colspan="2"\| \|- ! EA \|\| ~~Vowel~~Full ~~Stress~~Stop ~~Sign~~(Viram, ~~[[Avagraha\|AVAGRAH]]~~Northern Scripts) \|\| \|\| ऽ। \|\| ~~093D~~0964 \|\| ~~ঽ \|~~colspan="2"\| ~~09BD~~ \|\| colspan="2"\| \|\| ઽ colspan="2"\|~~\| 0ABD~~ \|\| ଽ colspan="2"\|~~\| 0B3D~~ \|\| colspan="2"\| \|\| ~~ఽ \|~~colspan="2"\| ~~0C3D~~ \|\| ~~ಽ \|~~colspan="2"\| ~~0CBD~~ \|\| ഽ colspan="2"\|~~\| 0D3D~~ \|- ! EA \|\| ~~Full~~Vowel ~~Stop~~Stress ~~(Viram,~~Sign ~~Northern Scripts)~~[[Avagraha\|AVAGRAH]] \|\| \|\| ।ऽ \|\| ~~0964~~093D \|\| ~~colspan="2"~~ঽ \|\| 09BD \|\| colspan="2"\| \|\| ~~colspan="2"~~ઽ \|\| 0ABD \|\| ~~colspan="2"~~ଽ \|\| 0B3D \|\| colspan="2"\| \|\| ~~colspan="2"~~ఽ \|\| 0C3D \|\| ~~colspan="2"~~ಽ \|\| 0CBD \|\| ~~colspan="2"~~ഽ \|\| 0D3D \|- ! EB \|\| colspan="20"\| Unused

Indian Script Code for Information Interchange: Difference between revisions