Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing

Sarah Goldstein, Lidia Beka, Joerg Graf, View ORCID ProfileJonathan L. Klassen
doi: https://doi.org/10.1101/362673
Sarah Goldstein
University of Connecticut, Department of Molecular and Cell Biology, Storrs CT USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lidia Beka
University of Connecticut, Department of Molecular and Cell Biology, Storrs CT USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joerg Graf
University of Connecticut, Department of Molecular and Cell Biology, Storrs CT USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jonathan.klassen@uconn.edu joerg.graf@uconn.edu
Jonathan L. Klassen
University of Connecticut, Department of Molecular and Cell Biology, Storrs CT USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jonathan L. Klassen
  • For correspondence: jonathan.klassen@uconn.edu joerg.graf@uconn.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Background Short-read sequencing technologies have made microbial genome sequencing cheap and accessible. However, closing genomes is often costly and assembling short reads from genomes that are repetitive and/or have extreme %GC content remains challenging. Long-read, single-molecule sequencing technologies such as the Oxford Nanopore MinION have the potential to overcome these difficulties, although the best approach for harnessing their potential remains poorly evaluated.

Results We sequenced nine bacterial genomes spanning a wide range of GC contents using Illumina MiSeq and Oxford Nanopore MinION sequencing technologies to determine the advantages of each approach, both individually and combined. Assemblies using only MiSeq reads were highly accurate but lacked contiguity, a deficiency that was partially overcome by adding MinION reads to these assemblies. Even more contiguous genome assemblies were generated by using MinION reads for initial assembly, but these were more error-prone and required further polishing. This was especially pronounced when Illumina libraries were biased, as was the case for our strains with both high and low GC content. Increased genome contiguity dramatically improved the annotation of insertion sequences and secondary metabolite biosynthetic gene clusters, likely because long-reads can disambiguate these highly repetitive but biologically important genomic regions.

Conclusions Genome assembly using short-reads is challenged by repetitive sequences and extreme GC contents. Our results indicate that these difficulties can be largely overcome by using single-molecule, long-read sequencing technologies such as the Oxford Nanopore MinION. Using MinION reads for assembly followed by polishing with Illumina reads generated the most contiguous genomes and enabled the accurate annotation of important but difficult to sequence genomic features such as insertion sequences and secondary metabolite biosynthetic gene clusters. The combination of Oxford Nanopore and Illumina sequencing is cost effective and dramatically advances studies of microbial evolution and genome-driven drug discovery.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted July 05, 2018.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing
Sarah Goldstein, Lidia Beka, Joerg Graf, Jonathan L. Klassen
bioRxiv 362673; doi: https://doi.org/10.1101/362673
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing
Sarah Goldstein, Lidia Beka, Joerg Graf, Jonathan L. Klassen
bioRxiv 362673; doi: https://doi.org/10.1101/362673

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4369)
  • Biochemistry (9550)
  • Bioengineering (7071)
  • Bioinformatics (24776)
  • Biophysics (12567)
  • Cancer Biology (9925)
  • Cell Biology (14299)
  • Clinical Trials (138)
  • Developmental Biology (7931)
  • Ecology (12078)
  • Epidemiology (2067)
  • Evolutionary Biology (15957)
  • Genetics (10904)
  • Genomics (14708)
  • Immunology (9848)
  • Microbiology (23585)
  • Molecular Biology (9456)
  • Neuroscience (50699)
  • Paleontology (369)
  • Pathology (1535)
  • Pharmacology and Toxicology (2674)
  • Physiology (4001)
  • Plant Biology (8643)
  • Scientific Communication and Education (1505)
  • Synthetic Biology (2388)
  • Systems Biology (6415)
  • Zoology (1345)