Content deleted Content added
Undid revision 599956549 by 161.64.89.126 (talk) ner crf in 2013 not notable / relevant here |
→Text segmentation: duplicate |
||
Line 29:
=== Text segmentation ===
{{main
Topic analysis consists of two main tasks: topic identification and text segmentation. While the first is a simple [[machine learning|classification]] of a specific text, the latter case implies that a document may contain multiple topics, and the task of computerized text segmentation may be to discover these topics automatically and segment the text accordingly. The topic boundaries may be apparent from section titles and paragraphs. In other cases, one needs to use techniques similar to those used in [[document classification]].
|