Revision as of 06:04, 26 August 2008 edit Radagast83 (talk \| contribs) 18,709 edits No clear consensus for merger ← Previous edit		Revision as of 21:35, 30 August 2008 edit undo Murray Langton (talk \| contribs) Extended confirmed users, Rollbackers 4,336 edits m →Character categories within scripts: spelling/grammar Next edit →
Line 522: == Character categories within scripts == {{UCS_characters}} Unicode provides a general category property for each character. So in addition to belonging to a script every character also has a general category. Typically scripts include letter characters including: uppercase letters, lowercase letter and modifier letters. Some characters are considered titlecase letters for a few [[Precomposed character\|precomposed]] ligatures such as ǲ (U+01F2). Such titlecase ligatures are all in the Latin and Greek scripts and are all compatibility characters and therefore Unicode discourages their use by authors. ~~Its~~It is unlikely ~~newt~~that new titlecase letters will be added in the future. Most writing systems do not differentiate between uppercase and lowercase letters. For those scripts all letters are categorized as “other letter” or “modifier letter”. Ideographs such as Unihan ideographs are also categorized as “other letters”. A few scripts do differentiate between uppercase and lowercase however: Latin, Cyrillic, Greek, Armenian, Georgian, and Deseret. Even for these scripts there are some letters that are nether uppercase nor lowercase.

Script (Unicode): Difference between revisions