Document-term matrix: Difference between revisions

m See also
Line 27:
=== Finding topics ===
[[Multivariate analysis]] of the document-term matrix can reveal the topics or themes of a corpus. Specifically, [[latent semantic analysis]] and [[data clustering]] can be used for this purpose; more recently, [[probabilistic latent semantic analysis]] and [[non-negative matrix factorization]] have also been found to perform well for this task.
 
== See also ==
* [[Bag of words model]]