Revision as of 03:12, 7 September 2019 edit AnomieBOT (talk \| contribs) Bots 6,858,685 edits m Dating maintenance tags: {{Confusing section}} ← Previous edit		Revision as of 22:10, 1 October 2019 edit undo Jarble (talk \| contribs) Autopatrolled, Extended confirmed users 150,084 edits m linking Next edit →
Line 41: Topic analysis consists of two main tasks: topic identification and text segmentation. While the first is a simple [[machine learning\|classification]] of a specific text, the latter case implies that a document may contain multiple topics, and the task of computerized text segmentation may be to discover these topics automatically and segment the text accordingly. The topic boundaries may be apparent from section titles and paragraphs. In other cases, one needs to use techniques similar to those used in [[document classification]]. Segmenting the text into [[topic (linguistics)\|topic]]s or [[discourse]] turns might be useful in some natural processing tasks: it can improve [[information retrieval]] or [[speech recognition]] significantly (by indexing/recognizing documents more precisely or by giving the specific part of a document corresponding to the query as a result). It is also needed in [[topic detection]] and tracking systems and [[text summarization\|text summarizing]] problems. Many different approaches have been tried:<ref>{{Cite conference

Text segmentation: Difference between revisions