Text segmentation: Difference between revisions

Content deleted Content added
Remove stub markers
No edit summary
Line 3:
Compare [[speech segmentation]], the process of dividing speech into linguistically meaningful portions.
 
== AutomaticSegmentation segmentationproblems ==
 
Automatic segmentation is the problem in [[natural language processing]] of implementing a computer process to segment text.
 
=== Word segmentation ===
Line 42 ⟶ 40:
}}</ref>
 
=== Automatic segmentation approaches ===
=== Approaches ===
 
Automatic segmentation is the problem in [[natural language processing]] of implementing a computer process to segment text.
 
When punctuation and similar clues are not consistently available, the segmentation task often requires fairly non-trivial techniques, such as statistical decision-making, large dictionaries, as well as consideration of syntactic and semantic constraints. Effective natural language processing systems and text segmentation tools usually operate on text in specific domains and sources. As an example, processing text used in medical records is a very different problem than processing news articles or real estate advertisements.