Content deleted Content added
Julie.larson (talk | contribs) |
m See also |
||
Line 27:
=== Finding topics ===
[[Multivariate analysis]] of the document-term matrix can reveal topics/themes of the corpus. Specifically, [[latent semantic analysis]] and [[data clustering]] can be used, and more recently [[probabilistic latent semantic analysis]] and [[non-negative matrix factorization]] have been found to perform well for this task.
== See also ==
* [[Bag of words model]]
|