Sequence clustering: Difference between revisions

Content deleted Content added
No edit summary
ce
Line 9:
* Starcode:<ref>{{cite web|url=https://github.com/gui11aume/starcode|title=Starcode repository}}</ref> a fast sequence clustering algorithm based on exact all-pairs search.<ref>{{cite journal
|title=Starcode: sequence clustering based on all-pairs search
|author1=Zorita E |author2=Cuscó P |author3=Filion GJ. |journal=Bioinformatics.
|date=Jun 2015 |volume=31
|issue=12 |pages=1913–1919
Line 35:
* MMseqs2: software suite for fast and deep clustering of large protein sequence sets <ref>{{cite journal
|title=MMseqs software suite for fast and deep clustering and searching of large protein sequence sets
|author1=Hauser M. |author2=Steinegger M. |author3=Söding J. |journal=Bioinformatics.
|date=Jan 2016 |volume=32
|issue=9 |pages=1323–1330
Line 41:
|pmid= 26743509}}</ref> <ref>{{cite journal
|title=MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets
|author1=Steinegger M. |author2=Söding J. |journal=Nature Biotechnology.
|date=Oct 16, 2017 |volume=
|issue= |pages=
Line 85:
* Uniclust: A clustered UniProtKB sequences at the level of 90%, 50% and 30% pairwise sequence identity.<ref>{{cite journal
|title=Uniclust databases of clustered and deeply annotated protein sequences and alignments
|author1=Mirdita M |author2=von den Drisch L. |author3=Galiez C. |author4=Soeding J. |author5= Steinegger M. |journal=Nucleic Acids Res.
|date= Nov 2016 |volume=45
|issue=D1 |pages= D170–D176