Text segmentation: Difference between revisions

Content deleted Content added
Created stub
 
+ Some see alsos
Line 2:
 
The problem is relatively trivial for written languages that have explicit word boudary markers, such as the word spaces of written [[English language|English]] of the distinctive initial, medial and final letter shapes of [[Arabic language|Arabic]]. When such clues are not consistently available, the task often requires fairly non-trivial techniques, such as statistical decision-making, large dictionaries, as well as consideration of syntactic and semantic constraints.
 
==See also==
* [[Speech segmentation]]
* [[Hyphenation]]
* [[Japanese text processing]]
 
{{writingsystem-stub}}