Sequence clustering: Difference between revisions

Content deleted Content added
{{biosci-stub}} {{compu-stub}}
m copyedit
Line 1:
In [[bioinformatics]], '''[[Primary structure|sequence]] clustering''' [[algorithm]]s attempt to group sequences that are somehow related. The sequences can be either of genomic, "transcriptomic" ([[expressed sequence tag|ESTs]]) or [[protein]] origin.
sequences that are somehow related. The sequences can be either of genomic, "transcriptomic" ([[expressed sequence tag|ESTs]]) or [[protein]] origin.
For proteins, [[Homology (biology)|homologous]] sequences are typically grouped into [[protein family|families]]. For EST data, clustering is important to group sequences originating from the same [[gene]] before the ESTs are [[sequence assembly|assembled]] to reconstruct the original [[mRNA]].
 
Line 7 ⟶ 6:
 
 
Sequence clusters are often synonymous with (but not identical to) [[protein family|protein families]]. Determining a representative [[tertarytertiary structure|structure]] for each ''sequence cluster''' is the aim of many [[structural genomics]] initativesinitiatives.
 
== External links ==