Text normalization: Difference between revisions

Fixed broken link
Ryanli (talk | contribs)
m add link
Line 3:
'''Text normalization''' is the process of transforming [[writing|text]] into a single [[canonical form]] that it might not have had before. Normalizing text before storing or processing it allows for [[separation of concerns]], since input is guaranteed to be consistent before operations are performed on it. Text normalization requires being aware of what type of text is to be normalized and how it is to be processed afterwards; there is no all-purpose normalization procedure.<ref name="cs506">{{cite web
| title = CS506/606: Txt Nrmlztn
| author = [[Richard Sproat]] and Steven Bedrick
| date = September 2011
| accessdate = October 2, 2012