Sequence clustering: Difference between revisions

Content deleted Content added
No edit summary
No edit summary
Line 5:
Generally, the clustering algorithms are [[single linkage clustering]], constructing a [[transitive closure]] of sequences with a similarity over a particular threshold. The similarity score is often based on [[sequence alignment]].
Sequence clustering is often used to make a [[Non redundant sequence|non-redundant]] set of [[representative sequences]] sequences.
 
 
Sequence clusters are often synonymous with (but not identical to) [[protein family|protein families]]. Determining a representative [[tertary structure|structure]] for each ''sequence cluster''' is the aim of many [[structural genomics]] initatives.
 
== External links ==