Revision as of 21:17, 27 August 2004 edit 81.100.89.173 (talk) No edit summary ← Previous edit		Revision as of 21:28, 27 August 2004 edit undo 81.100.89.173 (talk) No edit summary Next edit →
Line 5: Generally, the clustering algorithms are [[single linkage clustering]], constructing a [[transitive closure]] of sequences with a similarity over a particular threshold. The similarity score is often based on [[sequence alignment]]. Sequence clustering is often used to make a [[Non redundant sequence\|non-redundant]] set of [[representative sequences]] sequences. Sequence clusters are often synonymous with (but not identical to) [[protein family\|protein families]]. Determining a representative [[tertary structure\|structure]] for each ''sequence cluster''' is the aim of many [[structural genomics]] initatives. == External links ==

Sequence clustering: Difference between revisions