Revision as of 04:35, 27 January 2007 by ChrisDing (no edit summary)
Revision as of 20:03, 29 January 2007 by 132.246.126.153 (edit summary: Added ref. to Buntine's multinomial PCA + earlier ref to NMF-PLSA equivalence)
Line 17:
The different types arise from using different [[cost function]]s (divergence functions) and/or by [[regularization (mathematics)|regularization]] of the '''W''' and/or '''H''' matrices<ref>[[Inderjit S. Dhillon]], [[Suvrit Sra]], "[http://books.nips.cc/papers/files/nips18/NIPS2005_0203.pdf Generalized Nonnegative Matrix Approximations with Bregman Divergences]", [[NIPS]], 2005.</ref>.

== ~~Relatioing~~Relation to Data Clustering ==

Although initially NMF is considered to be different from vector quantization ([[K-means clustering]]), it was later shown <ref>Chris Ding, Xiaofeng He, and Horst D. Simon. "On the Equivalence of Nonnegative Matrix Factorization and Spectral Clustering". Proc. SIAM Int'l Conf. Data Mining (SDM'05), pp. 606-610, April 2005.</ref> that NMF is equivalent to the relaxed [[K-means clustering]] using the Frobenius norm objective function, matrix factor '''W''' contains cluster centroids and '''H''' contains cluster membership indicators; therefore NMF provides a ~~new~~ framework for data clustering. ~~It is also known <ref>Chris Ding and Xiaofeng He, "Nonnegative Matrix Factorization and Probabilistic Latent Semantic Indexing: Equivalence, Chi-square Statistic, and a Hybrid Method", Proc. AAAI National Conf. on Artificial Intelligence (AAAI-06), July 2006.</ref> that NMF is identical to [[probabilistic latent semantic analysis]] using the [[Kullback–Leibler divergence]] objective function, which can be simplified to the Chi-square statistic at first order approximation.~~ It is also known that NMF is an instance of so-called "multinomial PCA". <ref>Wray Buntine, "Variational Extensions to EM and Multinomial PCA", Proc. European Conference on Machine Learning (ECML-02), LNAI 2430, pp. 23-34, 2002.</ref>
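The clustering reading of the factors described above (columns of '''W''' as centroids, columns of '''H''' as membership indicators) can be illustrated with a small sketch. This is not taken from either revision: it assumes the widely used multiplicative update rules of Lee and Seung for the Frobenius norm objective, and the function name <code>nmf_frobenius</code>, the toy matrix <code>X</code>, and the hard assignment by largest coefficient are all hypothetical choices made for illustration.

```python
import numpy as np


def nmf_frobenius(X, k, n_iter=500, seed=0, eps=1e-9):
    """Approximate X (features x samples) as W @ H with W >= 0 and H >= 0.

    Illustrative helper (not from the article): multiplicative updates
    for the squared Frobenius norm objective ||X - W H||_F^2.
    """
    rng = np.random.default_rng(seed)
    n_features, n_samples = X.shape
    # Strictly positive random start so the updates can move every entry.
    W = rng.random((n_features, k)) + eps
    H = rng.random((k, n_samples)) + eps
    for _ in range(n_iter):
        # eps in the denominators only guards against division by zero.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Toy data (assumed): each column is one sample; the first three columns
# and the last three columns form two visibly different groups.
X = np.array([[5.0, 4.8, 5.2, 0.1, 0.2, 0.1],
              [0.1, 0.2, 0.1, 5.0, 5.1, 4.9]])

W, H = nmf_frobenius(X, k=2)

# W @ H is unchanged if a column of W is rescaled and the matching row of H
# is rescaled inversely, so fix the scale before comparing coefficients.
scale = np.linalg.norm(W, axis=0) + 1e-12
W_unit = W / scale              # columns play the role of cluster centroids
H_scaled = H * scale[:, None]   # columns play the role of soft memberships

# Hard cluster label per sample: index of the largest membership coefficient.
labels = H_scaled.argmax(axis=0)
rel_error = np.linalg.norm(X - W @ H) / np.linalg.norm(X)
print(labels, rel_error)
```

Normalising the columns of <code>W_unit</code> before taking the largest coefficient is a design choice of this sketch: without it, the arbitrary scaling between the two factors could change which coefficient is largest. The relaxed equivalence cited in the text concerns the objective function; the hard assignment here is only one simple way to read cluster labels off '''H'''.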
When NMF is obtained by minimizing the [[Kullback–Leibler divergence]], it is also equivalent to another instance of multinomial PCA, [[probabilistic latent semantic analysis]], <ref>Eric Gaussier and Cyril Goutte, "Relation between PLSA and NMF and Implications", Proc. 28th international ACM SIGIR conference on Research and development in information retrieval (SIGIR-05), pp. 601-602, 2005.</ref> which has long been used for analyzing and clustering textual data.

== Uniqueness ==

Non-negative matrix factorization: Difference between revisions