Retrieval of a million high-quality, full-length microbial 16S and 18S rRNA gene sequences without primer bias

Nat Biotechnol. 2018 Feb;36(2):190-195. doi: 10.1038/nbt.4045. Epub 2018 Jan 1.

Abstract

Small subunit ribosomal RNA (SSU rRNA) genes, 16S in bacteria and 18S in eukaryotes, have been the standard phylogenetic markers used to characterize microbial diversity and evolution for decades. However, the reference databases of full-length SSU rRNA gene sequences are skewed to well-studied ecosystems and subject to primer bias and chimerism, which results in an incomplete view of the diversity present in a sample. We combine poly(A)-tailing and reverse transcription of SSU rRNA molecules with synthetic long-read sequencing to generate high-quality, full-length SSU rRNA sequences, without primer bias, at high throughput. We apply our approach to samples from seven different ecosystems and obtain more than a million SSU rRNA sequences from all domains of life, with an estimated raw error rate of 0.17%. We observe a large proportion of novel diversity, including several deeply branching phylum-level lineages putatively related to the Asgard Archaea. Our approach will enable expansion of the SSU rRNA reference databases by orders of magnitude, and contribute to a comprehensive census of the tree of life.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Archaea / classification
  • Archaea / genetics
  • Bacteria / classification
  • Bacteria / genetics
  • Biodiversity
  • Eukaryota / classification
  • Eukaryota / genetics
  • High-Throughput Nucleotide Sequencing
  • Metagenome / genetics*
  • Phylogeny*
  • RNA, Ribosomal, 16S / classification
  • RNA, Ribosomal, 16S / genetics*
  • RNA, Ribosomal, 18S / genetics*

Substances

  • RNA, Ribosomal, 16S
  • RNA, Ribosomal, 18S