Revision as of 21:26, 28 October 2022 by Pgan002 (→Document summarization: Move adaptive summarization sentence out of Evaluation section into a new dedicated section.)
Revision as of 01:29, 29 October 2022 by Pgan002 (m →Supervised learning approaches)
Line 89:
====Supervised learning approaches====
Supervised text summarization is very much like supervised keyphrase extraction. Given a collection of documents and human-generated summaries for them, you can learn features of sentences that make them good candidates for inclusion in the summary. Features might include the position in the document (e.g., the first few sentences are probably important), the number of words in the sentence, etc. The main difficulty in supervised extractive summarization is that the known summaries must be manually created by extracting sentences, so that the sentences in an original training document can be labeled as "in summary" or "not in summary". This is not typically how people create summaries, so simply using journal abstracts or existing summaries is usually not sufficient. The sentences in these summaries do not necessarily match up with sentences in the original text, so it would be difficult to assign labels to examples for training. Note, however, that these natural summaries can still be used for evaluation purposes, since ROUGE-1 evaluation only considers unigrams.
====Maximum entropy-based summarization====
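As an illustration of the closing point of the supervised learning paragraph above, the following is a minimal sketch of a ROUGE-1 style recall score. It shows why a human-written summary can serve as an evaluation reference even when its sentences do not align with sentences in the source: only unigram overlap is counted. The function name `rouge_1_recall`, the lowercasing, and the whitespace tokenization are assumptions made for this sketch, not part of any official ROUGE implementation.

```python
from collections import Counter


def rouge_1_recall(candidate: str, reference: str) -> float:
    """Fraction of reference unigrams that also appear in the candidate.

    Simplified sketch: tokens are lowercased and split on whitespace
    (an assumption; real ROUGE tooling applies its own tokenization).
    """
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())
    if not ref_counts:
        return 0.0
    # Clip each match by the candidate count so a repeated word
    # is not credited more often than it actually occurs.
    overlap = sum(min(count, cand_counts[token])
                  for token, count in ref_counts.items())
    return overlap / sum(ref_counts.values())


if __name__ == "__main__":
    extracted = "the cat sat on the mat"
    natural_summary = "the cat lay on the mat"
    print(rouge_1_recall(extracted, natural_summary))
```

Because the score depends only on word counts, the reference summary does not need to be built from sentences extracted from the original document, which is the requirement that makes such summaries unsuitable as training labels.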

Automatic summarization: Difference between revisions