Sequence clustering: Difference between revisions

Content deleted Content added
small correction to mmseqs2 description; move most cited tools (cd-hit, usearch) to the top of list of tools; removed nrdb90.pl (very slow and outdated perl script)
Tag: references removed
AnomieBOT (talk | contribs)
Rescuing orphaned refs ("rdb90" from rev 855133831)
Line 61:
== Non-redundant sequence databases ==
* PISCES: A Protein Sequence Culling Server<ref>{{cite web|url=http://dunbrack.fccc.edu/pisces/|title=Dunbrack Lab|work=fccc.edu}}</ref>
* RDB90<ref name=rdb90/>{{cite journal|pmid=9682055
|journal=Bioinformatics
| date=Jun 1998 |volume=14
|issue=5
|pages=423–9.
|title=Removing near-neighbour redundancy from large protein sequence collections.
|author=Holm L1, Sander C.
|doi=10.1093/bioinformatics/14.5.423
}}</ref>
* UniRef: A non-redundant [[UniProt]] sequence database<ref>{{cite web|url=https://www.uniprot.org/database/DBDescription.shtml#uniref|title=About UniProt|work=uniprot.org}}</ref>
* Uniclust: A clustered UniProtKB sequences at the level of 90%, 50% and 30% pairwise sequence identity.<ref>{{cite journal