Revision as of 17:26, 5 March 2010 by Keisersoze (no edit summary)
Revision as of 17:31, 10 November 2010 by Jeffrey Beall (m, Copyedits)
Line 15:
While this may be done manually, and usually is in the case of ad hoc and personal documents, many [[programming language]]s support mechanisms which enable text normalization. Text normalization is useful, for example, for comparing two sequences of characters which mean the same but are represented differently. Examples of this kind of normalization include, but are not limited to, "don't" vs "do not", "I'm" vs "I am", and "Can't" vs "Cannot". Further, "1" and "one" are the same, "1st" is the same as "first", and so on. Through text processing, one can treat these strings as the same instead of as different.

[[Category:Unicode]]
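The comparison described above can be sketched in Python as follows. This is only an illustration: the `EQUIVALENTS` table and the names `normalize` and `same_text` are hypothetical, the table covers just the examples mentioned in the text, and typographic (curly) apostrophes are not handled.

```python
import re

# Hypothetical lookup table covering only the examples from the text;
# a practical normalizer would need a much larger, curated mapping.
EQUIVALENTS = {
    "don't": "do not",
    "i'm": "i am",
    "can't": "cannot",
    "1": "one",
    "1st": "first",
}


def normalize(text: str) -> str:
    """Lowercase the text and replace each known variant with its canonical form."""
    # Split into simple tokens of letters, digits and ASCII apostrophes.
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return " ".join(EQUIVALENTS.get(token, token) for token in tokens)


def same_text(a: str, b: str) -> bool:
    """Compare two strings after normalization rather than character by character."""
    return normalize(a) == normalize(b)
```

With this sketch, `same_text("I'm 1st", "I am first")` evaluates to `True`, while a plain `==` comparison of the two strings would be `False`.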

Text normalization: Difference between revisions