Text segmentation: Difference between revisions

Content deleted Content added
No edit summary
Serapio (talk | contribs)
rv
Line 1:
'''Text segmentation''' is the process of dividing [[writing|geoffreywritten text]] into meaningful units, such as [[geosentence]]s or [[freetopic]]s. The term applies both to [[human mind|mental]] processes used by humans when reading text, and to artificial processes implemented in [[computers]], which are the subject of [[natural language processing]]. The problem is non-trivial, because while some written languages have explicit word boundary markers, such as the word spaces of written [[English language|English]] and the distinctive initial, medial and final letter shapes of [[Arabic language|Arabic]], such signals are sometimes ambiguous and not present in all written languages.
 
Compare [[speech segmentation]], the process of dividing speech into linguistically meaningful portions.