Document-term matrix

{{Natural Language Processing}}
 
===Finding topics===
[[Multivariate analysis]] of the document-term matrix can reveal the topics or themes of a corpus. [[Latent semantic analysis]] and [[data clustering]] are commonly used for this purpose; more recently, [[probabilistic latent semantic analysis]], its generalization [[latent Dirichlet allocation]], and [[non-negative matrix factorization]] have been found to perform well for this task.
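
As an illustration, latent semantic analysis can be sketched as a truncated [[singular value decomposition]] of the document-term matrix. The example below is a minimal sketch, not a reference implementation: the toy corpus, the helper names (<code>document_term_matrix</code>, <code>lsa</code>, <code>top_terms</code>), and the choice of two topics are assumptions made for illustration, and it assumes NumPy is available.

```python
import numpy as np

# Hypothetical toy corpus: three documents about pets, two about markets.
docs = [
    "cat dog pet",
    "dog pet vet",
    "pet dog cat vet",
    "stock market trade",
    "market trade price",
]

def document_term_matrix(docs):
    """Build a raw-count document-term matrix (rows: documents, columns: terms)."""
    vocab = sorted({word for doc in docs for word in doc.split()})
    column = {word: j for j, word in enumerate(vocab)}
    X = np.zeros((len(docs), len(vocab)))
    for i, doc in enumerate(docs):
        for word in doc.split():
            X[i, column[word]] += 1
    return X, vocab

def lsa(X, k):
    """Truncated SVD: keep the k largest singular values and their vectors."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    doc_topics = U[:, :k] * s[:k]   # document coordinates in topic space
    topic_terms = Vt[:k]            # term loadings of each topic
    return doc_topics, topic_terms

def top_terms(topic_terms, vocab, n=2):
    """Return the n terms with the largest absolute loading for each topic."""
    return [[vocab[j] for j in np.argsort(-np.abs(row))[:n]]
            for row in topic_terms]

X, vocab = document_term_matrix(docs)
doc_topics, topic_terms = lsa(X, k=2)

for t, terms in enumerate(top_terms(topic_terms, vocab)):
    print(f"topic {t}: {terms}")
```

On this toy corpus the two retained components separate the pet-related terms from the market-related terms, because the two groups of documents share no vocabulary. Real corpora are rarely this clean, and term weighting such as [[tf-idf]] is often applied to the matrix before the decomposition.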
 
==See also==