Finished first draft. Note that the "notable applications to human genetic data" section has a table and image that were wholesale copied from the older version of this article. All other text should be my own. |
→Genetic clustering algorithms and methods: quick addition |
Line 7:
== Genetic clustering algorithms and methods ==
A wide range of methods have been developed to assess the structure of human populations using genetic data. Early studies of within- and between-group genetic variation used physical phenotypes and blood groups, whereas modern genetic studies use genetic markers such as [[Alu element|Alu sequences]], [[Microsatellite|short tandem repeat polymorphisms]], and [[Single-nucleotide polymorphism|single nucleotide polymorphisms]] (SNPs), among others.<ref>{{Cite journal|last=Bamshad|first=Michael|last2=Wooding|first2=Stephen|last3=Salisbury|first3=Benjamin A.|last4=Stephens|first4=J. Claiborne|date=2004-08|title=Deconstructing the relationship between genetics and race|url=http://dx.doi.org/10.1038/nrg1401|journal=Nature Reviews Genetics|volume=5|issue=8|pages=598–609|doi=10.1038/nrg1401|issn=1471-0056}}</ref> Models for genetic clustering also vary in the algorithms and programs used to process the data. Most methods for determining clusters can be categorized as '''model-based clustering methods''' (such as the algorithm STRUCTURE) or '''multidimensional summaries''' (often through principal component analysis).<ref name=":0" /><ref name=":1">{{Cite journal|last=Lawson|first=Daniel John|last2=Falush|first2=Daniel|date=2012-09-22|title=Population Identification Using Genetic Data|url=http://dx.doi.org/10.1146/annurev-genom-082410-101510|journal=Annual Review of Genomics and Human Genetics|volume=13|issue=1|pages=337–361|doi=10.1146/annurev-genom-082410-101510|issn=1527-8204}}</ref> By processing a large number of SNPs (or other genetic marker data) in different ways, both approaches to genetic clustering tend to converge on similar patterns by identifying similarities among SNPs and/or [[haplotype]] tracts to reveal ancestral genetic similarities.<ref name=":1" />
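The multidimensional-summary approach described above can be illustrated with a minimal sketch of principal component analysis applied to a matrix of SNP genotypes. The code is an illustration under stated assumptions and not the procedure used by any cited study or program: the function names <code>pca_project</code> and <code>simulate_two_populations</code>, the allele-frequency model, and all parameter values are hypothetical, and the genotypes are simulated rather than real data.

```python
import numpy as np


def pca_project(genotypes, n_components=2):
    """Project individuals onto the top principal components.

    genotypes: (n_individuals, n_snps) array of allele counts coded 0, 1 or 2.
    """
    g = np.asarray(genotypes, dtype=float)
    # Center each SNP column so components reflect differences between individuals.
    centered = g - g.mean(axis=0)
    # SVD of the centered matrix; u * s gives the principal component scores.
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def simulate_two_populations(n_per_pop=50, n_snps=500, seed=0):
    """Simulate genotypes for two populations (hypothetical toy model)."""
    rng = np.random.default_rng(seed)
    # Assumed allele frequencies: population B drifts away from population A.
    freq_a = rng.uniform(0.1, 0.9, n_snps)
    freq_b = np.clip(freq_a + rng.normal(0.0, 0.2, n_snps), 0.05, 0.95)
    # Each genotype is the number of copies of one allele out of two.
    pop_a = rng.binomial(2, freq_a, size=(n_per_pop, n_snps))
    pop_b = rng.binomial(2, freq_b, size=(n_per_pop, n_snps))
    labels = np.array([0] * n_per_pop + [1] * n_per_pop)
    return np.vstack([pop_a, pop_b]), labels


genotypes, labels = simulate_two_populations()
scores = pca_project(genotypes)
```

In this toy setting, individuals from the two simulated populations fall on opposite sides of the first principal component, which is the kind of pattern a multidimensional summary is meant to expose. Model-based methods such as STRUCTURE instead fit an explicit statistical model of cluster membership and are not shown here.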
=== Model-based clustering ===