Text normalization: Difference between revisions

Adamd1008 (talk | contribs)
mNo edit summary
Added stemming to the examples
Line 11:
* expanding abbreviations
* removing [[stopwords]] or "too common" words
* [[stemming]]
 
While this may be done manually, as is usual for ad hoc and personal documents, many [[programming language]]s provide mechanisms that enable text normalization.
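
As a rough illustration of how such steps might be combined in code, the following Python sketch expands abbreviations, removes stopwords, and applies a very naive form of stemming. The lookup tables, the suffix list, and the function names `naive_stem` and `normalize` are hypothetical and chosen only for this example. The suffix stripping shown here is not an established stemming algorithm; real systems would more likely use a dedicated library.

```python
import re

# Hypothetical lookup tables, chosen only for illustration.
ABBREVIATIONS = {"dr.": "doctor", "st.": "street", "approx.": "approximately"}
STOPWORDS = {"a", "an", "the", "is", "are", "on", "of", "and", "to", "in"}
SUFFIXES = ("ing", "ed", "s")  # naive suffix list, not a real stemming algorithm


def naive_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def normalize(text):
    """Lowercase, expand abbreviations, drop stopwords, then stem."""
    tokens = text.lower().split()
    # Expand abbreviations before punctuation is stripped, so "dr." still matches.
    expanded = [ABBREVIATIONS.get(tok, tok) for tok in tokens]
    # Remove any remaining non-word characters such as trailing periods.
    cleaned = [re.sub(r"[^\w]", "", tok) for tok in expanded]
    kept = [tok for tok in cleaned if tok and tok not in STOPWORDS]
    return [naive_stem(tok) for tok in kept]


print(normalize("Dr. Smith is walking on the streets."))
```

The order of the steps matters in this sketch: abbreviations are expanded while their trailing periods are still present, and stopwords are removed before stemming so that the stopword list can contain unstemmed forms.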