==Overview==
The Lesk algorithm is based on the assumption that words in a given "neighborhood" (section of text) will tend to share a common topic. A simplified version of the Lesk algorithm compares the dictionary definition of an ambiguous word with the words in its neighborhood. Versions have been adapted to use [[WordNet]].<ref>Satanjeev Banerjee and Ted Pedersen. ''[http://www.cs.cmu.edu/~banerjee/Publications/cicling2002.ps.gz An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet]'', Lecture Notes in Computer Science, 2002.</ref> An implementation might look like this:
# for every sense of the word being disambiguated, count the number of words that appear in both the neighborhood of that word and the dictionary definition of that sense
# choose the sense with the highest such overlap count
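A minimal sketch of this simplified procedure in Python, assuming a plain mapping from sense names to gloss strings; the function name, sense labels, and example sentence below are illustrative and not part of the original algorithm description:

<syntaxhighlight lang="python">
def simplified_lesk(context_sentence: str, sense_glosses: dict) -> str:
    """Return the sense whose gloss shares the most words with the context."""
    context_words = set(context_sentence.lower().split())
    best_sense, best_overlap = None, -1
    for sense, gloss in sense_glosses.items():
        # Count words shared by the neighborhood and this sense's definition.
        overlap = len(context_words & set(gloss.lower().split()))
        if overlap > best_overlap:
            best_sense, best_overlap = sense, overlap
    return best_sense

# Illustrative example: disambiguating "bank" in a finance-related sentence.
senses = {
    "bank(finance)": "a financial institution that accepts deposits and lends money",
    "bank(river)": "sloping land beside a body of water such as a river",
}
print(simplified_lesk("I deposited money at the bank", senses))  # -> "bank(finance)"
</syntaxhighlight>

Here the financial sense is selected because its gloss shares the word "money" with the context, while the river sense shares none.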