Revision as of 13:46, 26 July 2007 edit Gudeldar (talk \| contribs) Extended confirmed users 3,190 edits m clean up using AWB ← Previous edit		Revision as of 15:19, 16 October 2007 edit undo 130.149.105.142 (talk) No edit summary Next edit →
Line 10: While this may be done manually, and usually is in the case of ad hoc and personal documents, many [[programming language]]s support mechanisms which enable text normalization. The text normalization is useful, for example, for comparing two sequence of characters which mean the same but are represented differently. The examples of this kind of normalization include, but not limited to, "don't" vs "do not", "I'm" vs "I am", "Can't" vs "Cannot". Further, "1" and "one" are same, "1st" is same as "first", and so on. Instead of treating these strings as different, through text processing, one can treat them as same. [[Category:Unicode]]

Text normalization: Difference between revisions