Content deleted Content added
deleted the notable comment, for many reasons. for example if you google "gate", an extremely common word, hit number 2 is the gate website - that's an indication of notability?! also see publications |
|||
Line 23:
GATE includes an information extraction system called ANNIE (A Nearly-New Information Extraction System) which is a set of modules comprising a [[Lexical analysis|tokenizer]], a [[Gazetteer|gazetteer]], a [[Sentence boundary disambiguation|sentence splitter]], a [[Part-of-speech tagging|part of speech tagger]], a [[Named entity recognition|named entities]] transducer and a [[Coreference|coreference]] tagger.
Languages currently handled in GATE include [[English]], [[Spanish]], [[Chinese]], [[Arabic]], [[French]], [[German]], [[Hindi]], [[Italian]], [[Cebuano]], [[Romanian]], [[Russian]].
There is a large set of plugins for [[machine learning]] with [[Weka (machine learning)|Weka]], RASP, MAXENT, SVM Light, for managing [[Ontologies]] like [[WordNet]], for querying [[search engines]] like [[Google]] or [[Yahoo]], for part of speech tagging with [[Brill tagger|Brill]] or TreeTager, and many more.
|