TY  - JOUR
T1  - MMseqs2: sensitive protein sequence searching for the analysis of massive data sets
JF  - bioRxiv
DO  - 10.1101/079681
SP  - 079681
AU  - Martin Steinegger
AU  - Johannes Söding
Y1  - 2017/01/01
UR  - http://biorxiv.org/content/early/2017/04/25/079681.abstract
N2  - Sequencing costs have dropped much faster than Moore’s law in the past decade, and sensitive sequence searching has become the main bottleneck in the analysis of large metagenomic datasets. We therefore developed the open-source software MMseqs2 (mmseqs.org), which improves on current search tools over the full range of speed-sensitivity trade-off, achieving sensitivities better than PSI-BLAST at more than 400 times its speed.
ER  - 