Revision as of 09:32, 6 October 2009 edit Janislaw (talk \| contribs) 18 edits m →Choice of Terms ← Previous edit		Revision as of 01:57, 2 November 2009 edit undo Danielx (talk \| contribs) 263 edits improved the intro based on the description found in http://en.wikipedia.org/wiki/Latent_semantic_analysis#Occurrence_matrix Next edit →
Line 1: '''Document-term matrix''' is a mathematical [[Matrix (mathematics)\|matrix]] that describes the frequency of terms that occur in a collection of documents. Each column corresponds to a document in the collection, and each row corresponds to a word or term. There are various schemes for determining the value that each entry in the matrix should take. One such scheme is [[tf-idf]]. They are useful in the field of [[natural language processing]]. '''Document-term matrix''' are used in [[natural language processing]] programs. They represent natural language documents as mathematical objects (a [[matrix (mathematics)\|matrix]]) and make it possible to process them as a whole. ==General Concept==

Document-term matrix: Difference between revisions