Revision as of 03:43, 10 April 2014 edit Jonsafari (talk \| contribs) Extended confirmed users 2,453 edits +Category:Language modeling ← Previous edit		Revision as of 18:26, 14 April 2014 edit undo Wavelength (talk \| contribs) Extended confirmed users, Pending changes reviewers 179,502 edits inserting 1 hyphen: —> "low-dimensional"—User talk:Wavelength#Hyphenation [to Archive 6] Next edit →
Line 1: '''Probabilistic latent semantic analysis (PLSA)''', also known as '''probabilistic latent semantic indexing''' ('''PLSI''', especially in information retrieval circles) is a [[statistical technique]] for the analysis of two-mode and co-occurrence data. In effect, one can derive a low -dimensional representation of the observed variables in terms of their affinity to certain hidden variables, just as in [[latent semantic analysis]]. PLSA evolved from [[latent semantic analysis]]. Compared to standard [[latent semantic analysis]] which stems from [[linear algebra]] and downsizes the occurrence tables (usually via a [[singular value decomposition]]), probabilistic latent semantic analysis is based on a mixture decomposition derived from a [[latent class model]].

Probabilistic latent semantic analysis: Difference between revisions