Revision as of 15:28, 4 March 2006 edit Jorge Stolfi (talk \| contribs) Autopatrolled, Extended confirmed users, Rollbackers 27,656 edits Created stub		Revision as of 15:43, 4 March 2006 edit undo Jorge Stolfi (talk \| contribs) Autopatrolled, Extended confirmed users, Rollbackers 27,656 edits + Some see alsos Next edit →
Line 2: The problem is relatively trivial for written languages that have explicit word boudary markers, such as the word spaces of written [[English language\|English]] of the distinctive initial, medial and final letter shapes of [[Arabic language\|Arabic]]. When such clues are not consistently available, the task often requires fairly non-trivial techniques, such as statistical decision-making, large dictionaries, as well as consideration of syntactic and semantic constraints. ==See also== * [[Speech segmentation]] * [[Hyphenation]] * [[Japanese text processing]] {{writingsystem-stub}}

Text segmentation: Difference between revisions