Revision as of 05:20, 16 August 2018 by 134.76.223.13: small correction to mmseqs2 description; move most cited tools (cd-hit, usearch) to the top of list of tools; removed nrdb90.pl (very slow and outdated perl script). Tag: references removed
Revision as of 08:01, 16 August 2018 by AnomieBOT: Rescuing orphaned refs ("rdb90" from rev 855133831)
Line 61:
== Non-redundant sequence databases ==
* PISCES: A Protein Sequence Culling Server<ref>{{cite web|url=http://dunbrack.fccc.edu/pisces/|title=Dunbrack Lab|work=fccc.edu}}</ref>
* RDB90<ref name=rdb90>{{cite journal|pmid=9682055 |journal=Bioinformatics | date=Jun 1998 |volume=14 |issue=5 |pages=423–9 |title=Removing near-neighbour redundancy from large protein sequence collections. |author=Holm L, Sander C. |doi=10.1093/bioinformatics/14.5.423 }}</ref>
* UniRef: A non-redundant [[UniProt]] sequence database<ref>{{cite web|url=https://www.uniprot.org/database/DBDescription.shtml#uniref|title=About UniProt|work=uniprot.org}}</ref>
* Uniclust: UniProtKB sequences clustered at 90%, 50% and 30% pairwise sequence identity.<ref>{{cite journal

Sequence clustering: Difference between revisions