Revision as of 17:51, 7 March 2013 edit Yobot (talk \| contribs) Bots 4,733,870 edits m →Criticisms and other Lesk-based methods: WP:CHECKWIKI errors fixed + general fixes using AWB (8961) ← Previous edit		Revision as of 20:59, 14 March 2013 edit undo Aednichols (talk \| contribs) 426 edits →Overview: Smoothed awkward language Next edit →
Line 5: ==Overview== The Lesk algorithm is based on the assumption that words in a given "neighborhood" (section of text) will tend to share a common topic. A simplified version of the Lesk algorithm is to compare the dictionary definition of an ambiguous word with the terms contained ofin ~~the~~its neighborhood. Versions have been adapted to use [[WordNet]].<ref>Satanjeev Banerjee and Ted Pedersen. ''[http://www.cs.cmu.edu/~banerjee/Publications/cicling2002.ps.gz An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet]'', Lecture Notes In Computer Science; Vol. 2276, Pages: 136 - 145, 2002. ISBN 3-540-43219-1 </ref> ItAn ~~would~~implementation bemight look like this: # for every sense of the word being disambiguated one should count the amount of words that are in both neighborhood of that word and in the definition of each sense in a dictionary # the sense that is to be chosen is the sense which has the biggest number of this count Line 19: 2. something of this shape whether solid or hollow 3. fruit of certain evergreen trees As can be seen, the best intersection is Pine #1 ⋂ Cone #3 = 2. ==Simplified Lesk algorithm==

Lesk algorithm: Difference between revisions