Principal component analysis: Difference between revisions

Content deleted Content added
A relevant reference was added to the page.
Population genetics: Elhaik's critique of the whole methodology
Line 453:
Since then, PCA has been ubiquitous in population genetics, with thousands of papers using PCA as a display mechanism. Genetics varies largely according to proximity, so the first two principal components actually show spatial distribution and may be used to map the relative geographical ___location of different population groups, thereby showing individuals who have wandered from their original locations.<ref>{{Cite journal |last1=Novembre |first1=John |last2=Stephens |first2=Matthew |date=2008 |title=Interpreting principal component analyses of spatial population genetic variation |journal=Nat Genet |volume=40 |issue=5 |pages=646–49 |doi=10.1038/ng.139 |pmid=18425127 |pmc=3989108 }}</ref>
 
PCA in genetics has been technically controversial, in that the technique has been performed on discrete non-normal variables and often on binary allele markers. The lack of any measures of standard error in PCA are also an impediment to more consistent usage. In August 2022, the molecular biologist [[Eran Elhaik]] published a theoretical paper in [[Nature (journal)|Nature]] analyzing 12 PCA applications. He concluded that it was easy to manipulate the method, which, in his view, generated results that were 'erroneous, contradictory, and absurd.' Speciifically, he argued, the results achieved in population genetics were characterized by cherry-picking and [[circular reasoning]]. <ref>[[Eran Elhaik]], [https://doi.org/10.1038/s41598-022-14395-4 'Principal Component Analyses (PCA)‑based findings in population genetic studies are highly biased and must be reevaluated.'] in [[Nature (journal)|Nature:Scientific Reports]] August 2022, 12:14683 </ref>
 
=== Market research and indexes of attitude ===