Content deleted Content added
No edit summary |
No edit summary |
||
Line 5:
Generally, the clustering algorithms are [[single linkage clustering]], constructing a [[transitive closure]] of sequences with a similarity over a particular threshold. The similarity score is often based on [[sequence alignment]].
Sequence clustering is often used to make a [[Non redundant sequence|non-redundant]] set of [[representative sequences]] sequences.
Sequence clusters are often synonymous with (but not identical to) [[protein family|protein families]]. Determining a representative [[tertary structure|structure]] for each ''sequence cluster''' is the aim of many [[structural genomics]] initatives.
== External links ==
|