Content deleted Content added
m →Features: bold |
|||
(32 intermediate revisions by 25 users not shown) | |||
Line 1:
{{Infobox software
| name = GATE
| screenshot =
| screenshot size = 250px
| caption = GATE Developer v5 main window
| developer = [
| released = {{start date and age |1995}}
| programming language = [[Java (programming language)|Java]]
| operating system = [[Cross-platform]]
| language = English
| genre = [[Text mining]] [[Information
| license = [[LGPL]]
| website = {{url|
}}
'''General Architecture for Text Engineering'''
As of May 28, 2011, 881 people are on the gate-users mailing list at SourceForge.net, and 111,932 downloads from [[SourceForge]] are recorded since the project moved to SourceForge in 2005.<ref>
GATE community and research has been involved in several European research projects including:
▲As of May 28, 2011, 881 people are on the gate-users mailing list at SourceForge.net, and 111,932 downloads from [[SourceForge]] are recorded since the project moved to SourceForge in 2005.<ref>[http://sourceforge.net/projects/gate/ GATE project page on SourceForge]</ref> The paper "GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications"<ref>[http://gate.ac.uk/sale/acl02/acl-main.pdf "GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications", by Cunningham H., Maynard D., Bontcheva K. and Tablan V. (In proc. of the 40th Anniversary Meeting of the Association for Computational Linguistics, 2002)]</ref> has received over 800 citations in the seven years since publication (according to Google Scholar). Books covering the use of GATE, in addition to the GATE User Guide,<ref>[http://gate.ac.uk/userguide/ GATE User Guide]</ref> include "Building Search Applications: Lucene, LingPipe, and Gate", by Manu Konchady,<ref>Konchady, Manu. [http://books.google.ca/books?id=mcM-OAAACAAJ&dq=Building+Search+Applications:+Lucene,+LingPipe,+and+Gate&hl=en&ei=avbDTczPJITqrQfk1IXQBA&sa=X&oi=book_result&ct=result&resnum=1&ved=0CDEQ6AEwAA Building Search Applications: Lucene, LingPipe, and Gate]. Mustru Publishing. 2008.</ref> and "Introduction to Linguistic Annotation and Text Analytics", by Graham Wilcock.<ref>[http://books.google.co.uk/books?id=TDQJb1UgVywC&dq=Introduction+to+Linguistic+Annotation+and+Text+Analytics&printsec=frontcover&source=bl&ots=bAF26ZQSTx&sig=TbxZ_-3tRy3IeDBKFofeVN6bAIc&hl=en&ei=vc0gS7PlLo-64QaSgqnfCQ&sa=X&oi=book_result&ct=result&resnum=2&ved=0CBcQ6AEwAQ#v=onepage&q=&f=false "Introduction to Linguistic Annotation and Text Analytics", by Graham Wilcock]</ref>
== Features ==
Line 25 ⟶ 23:
GATE includes an [[information extraction]] system called '''ANNIE''' ('''A Nearly-New Information Extraction System''') which is a set of modules comprising a [[Lexical analysis|tokenizer]], a [[gazetteer]], a [[Sentence boundary disambiguation|sentence splitter]], a [[Part-of-speech tagging|part of speech tagger]], a [[Named entity recognition|named entities]] transducer and a [[coreference]] tagger. ANNIE can be used as-is to provide basic [[information extraction]] functionality, or provide a starting point for more specific tasks.
Languages currently handled in GATE include [[English language|English]], [[
Plugins are included for [[machine learning]] with [[Weka (machine learning)|Weka]], RASP, MAXENT, SVM Light, as well as a [[LIBSVM]] integration and an in-house [[perceptron]] implementation, for managing [[
GATE accepts input in various formats, such as [[Text file|TXT]], [[HTML]], [[XML]], [[DOC (computing)|Doc]], [[PDF]] documents, and [[Serialization|Java Serial]], [[PostgreSQL]], [[Lucene]], [[Oracle database|Oracle]] Databases with help of [[RDBMS]] storage over [[JDBC]].
[[JAPE (linguistics)|JAPE]] transducers are used within GATE to manipulate annotations on text. Documentation is provided in the GATE User Guide.<ref>
== GATE Developer ==
Line 37 ⟶ 35:
[[Image:GATE5 main window.png|thumb|400px|GATE 5 main window.]]
The screenshot shows the document viewer used to display a document and its annotations. In pink are
== GATE Mímir ==
<!-- re-written to remove any lingering copyright worries -->
GATE
==See also==
{{Portal|Free and open-source software}}
* [[Unstructured Information Management Architecture]] (UIMA)
* [[OpenNLP]]
* [[Pheme (project)|Pheme]], a major EU project managed by the GATE group on early detection of false information in social media
==References==
<references/>
==External links==
* {{Official website|https://gate.ac.uk/}}
{{DEFAULTSORT:General Architecture For Text Engineering}}
|