Revision as of 01:07, 20 February 2004 edit Lexor (talk \| contribs) Extended confirmed users 12,806 edits fmt ext lnks, rm 404 link ← Previous edit		Revision as of 07:52, 20 February 2004 edit undo 129.177.18.46 (talk) No edit summary Next edit →
Line 1: In [[bioinformatics]], '''sequence clustering''' [[algorithm]]s attempt to group ~~[[Homology (biology)\|homologous]] sequences into [[protein family\|families]]. Generally, clustering is based on [[sequence alignment]].~~ sequences that are somehow related. The sequences can be either of genomic, "transcriptomic" ([[EST (biology)\|ESTs]]) or [[protein]] origin. For proteins, one [[Homology (biology)\|homologous]] sequences into [[protein family\|families]]. For EST data, clustering is important to group sequences originating from the same [[gene]] before the ESTs are assembled to reconstruct the original [[mRNA]]. Generally, the clustering algorithms are single linkage clustering, constructing a [[transitive closure]] of sequences with a similarity over a particular threshold. The similarity score is often based on [[sequence alignment]]. == External links ==

Sequence clustering: Difference between revisions