metaSPAdes: a new versatile metagenomic assembler

  1. Pavel A. Pevzner1,3
  1. 1Center for Algorithmic Biotechnology, Institute for Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia 199004;
  2. 2Department of Statistical Modelling, St. Petersburg State University, St. Petersburg, Russia 198515;
  3. 3Department of Computer Science and Engineering, University of California, San Diego, California 92093-0404, USA
  1. Corresponding author: sergeynurk{at}gmail.com
  1. 4 These authors contributed equally to this work.

Abstract

While metagenomics has emerged as a technology of choice for analyzing bacterial populations, the assembly of metagenomic data remains challenging, thus stifling biological discoveries. Moreover, recent studies revealed that complex bacterial populations may be composed from dozens of related strains, thus further amplifying the challenge of metagenomic assembly. metaSPAdes addresses various challenges of metagenomic assembly by capitalizing on computational ideas that proved to be useful in assemblies of single cells and highly polymorphic diploid genomes. We benchmark metaSPAdes against other state-of-the-art metagenome assemblers and demonstrate that it results in high-quality assemblies across diverse data sets.

Footnotes

  • Received August 1, 2016.
  • Accepted March 13, 2017.

This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

| Table of Contents

Preprint Server