Content deleted Content added
OpenNotes1 (talk | contribs) {{Natural Language Processing}} |
→Finding topics: +LDA |
||
Line 29:
===Finding topics===
[[Multivariate analysis]] of the document-term matrix can reveal topics/themes of the corpus. Specifically, [[latent semantic analysis]] and [[data clustering]] can be used, and, more recently, [[probabilistic latent semantic analysis]] with its generalization [[Latent Dirichlet allocation]], and [[non-negative matrix factorization]], have been found to perform well for this task.
==See also==
|