Content deleted Content added
Woohookitty (talk | contribs) m WikiCleaner 0.98 - Repairing link to disambiguation page - You can help! |
|||
Line 23:
GATE includes an information extraction system called ANNIE (A Nearly-New Information Extraction System) which is a set of modules comprising a [[Lexical analysis|tokenizer]], a [[Gazetteer|gazetteer]], a [[Sentence boundary disambiguation|sentence splitter]], a [[Part-of-speech tagging|part of speech tagger]], a [[Named entity recognition|named entities]] transducer and a [[Coreference|coreference]] tagger.
Languages currently handled in GATE include [[English language|English]], [[Spanish language|Spanish]], [[Mandarin Chinese|Chinese]], [[Arabic]], [[French language|French]], [[German language|German]], [[Hindi]], [[Italian language|Italian]], [[Cebuano]], [[Romanian language|Romanian]], [[Russian language|Russian]].
There is a large set of plugins for [[machine learning]] with [[Weka (machine learning)|Weka]], RASP, MAXENT, SVM Light, for managing [[Ontologies]] like [[WordNet]], for querying [[search engines]] like [[Google]] or [[Yahoo]], for part of speech tagging with [[Brill tagger|Brill]] or TreeTager, and many more.
|