'''Text segmentation''' is the process of dividing [[writing|written text]] into meaningful units, such as [[word]]s or [[sentence (linguistics)|sentences]]. The term applies both to the [[human mind|mental]] processes used by humans when reading text and to artificial processes implemented in [[computers]], which are the subject of [[natural language processing]]. The problem is non-trivial: although some written languages have explicit word boundary markers, such as the word spaces of written [[English language|English]] and the distinctive initial, medial and final letter shapes of [[Arabic language|Arabic]], such signals are sometimes ambiguous and are not present in all written languages.
Compare [[speech segmentation]], the process of dividing speech into linguistically meaningful portions.
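
The limits of explicit boundary markers can be illustrated with a minimal Python sketch. It is not a reference implementation of any particular segmenter: the function names <code>segment_on_spaces</code> and <code>segment_sentences</code> are hypothetical, and the sample strings (including the Chinese one) are chosen here only for illustration. It shows a naive word splitter that relies on word spaces and a naive sentence splitter that relies on terminal punctuation, and where each cue becomes ambiguous or is absent.

```python
import re


def segment_on_spaces(text: str) -> list[str]:
    """Naive word segmentation: treat every run of whitespace as a boundary."""
    return text.split()


def segment_sentences(text: str) -> list[str]:
    """Naive sentence segmentation: split after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


# Word spaces work as a boundary cue, but punctuation stays attached to "sat.".
english_words = segment_on_spaces("The cat sat.")

# Text written without word spaces comes back as a single unsegmented unit.
unspaced = segment_on_spaces("我喜欢读书")

# The period is ambiguous: the one in the abbreviation "Dr." is wrongly
# treated as the end of a sentence.
sentences = segment_sentences("Dr. Smith arrived. He sat down.")

print(english_words)
print(unspaced)
print(sentences)
```

In this sketch the space cue succeeds for the English words but fails entirely on the unspaced string, and the punctuation cue splits the abbreviation "Dr." into its own "sentence". Practical systems therefore generally need additional information beyond the surface markers, such as a lexicon or a statistical model.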