Document-term matrix: Difference between revisions

Improving search results: disambiguate disambiguation
rm merge tag, no consensus to merge
{{merge to|Bag-of-words model|date=August 2012}}
{{Unreferenced stub|auto=yes|date=December 2009}}
A '''document-term matrix''' or '''term-document matrix''' is a mathematical [[Matrix (mathematics)|matrix]] that describes the frequency of terms that occur in a collection of documents. In a document-term matrix, rows correspond to documents in the collection and columns correspond to terms; in a term-document matrix the two are transposed. Each entry holds a value for one term in one document, and there are various schemes for determining that value. The simplest is the raw count of the term in the document; another is [[tf-idf]] weighting. Such matrices are used in the field of [[natural language processing]].
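
The following is a minimal illustrative sketch of the construction described above, not a reference implementation. It assumes whitespace tokenization and lowercasing, and the function names <code>build_document_term_matrix</code> and <code>tf_idf</code> are hypothetical. The tf-idf variant shown (raw count multiplied by the natural logarithm of the number of documents divided by the number of documents containing the term) is only one of several weighting schemes in use.

```python
import math
from collections import Counter


def build_document_term_matrix(documents):
    """Return (vocabulary, matrix) with one row per document and one column per term.

    Assumption: terms are lowercase, whitespace-separated tokens.
    """
    tokenized = [doc.lower().split() for doc in documents]
    # Columns are the distinct terms, sorted so the column order is reproducible.
    vocabulary = sorted({term for tokens in tokenized for term in tokens})
    matrix = []
    for tokens in tokenized:
        counts = Counter(tokens)
        # Counter returns 0 for terms that do not occur in this document.
        matrix.append([counts[term] for term in vocabulary])
    return vocabulary, matrix


def tf_idf(matrix):
    """Reweight raw counts as count * ln(N / df), one common tf-idf variant."""
    n_docs = len(matrix)
    n_terms = len(matrix[0]) if matrix else 0
    # Document frequency: number of documents in which each term occurs.
    df = [sum(1 for row in matrix if row[j] > 0) for j in range(n_terms)]
    return [
        [row[j] * math.log(n_docs / df[j]) for j in range(n_terms)]
        for row in matrix
    ]


documents = ["I like databases", "I dislike databases"]
vocabulary, matrix = build_document_term_matrix(documents)
weighted = tf_idf(matrix)

print(vocabulary)  # column labels
for row in matrix:
    print(row)     # one row of raw term counts per document
```

In this sketch the terms "i" and "databases" occur in both documents, so their tf-idf weight is zero, whereas "like" and "dislike" each occur in one document only and keep a positive weight.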