Skip to main content

Identification of Mutations in Laboratory-Evolved Microbes from Next-Generation Sequencing Data Using breseq

  • Protocol
  • First Online:

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1151))

Abstract

Next-generation DNA sequencing (NGS) can be used to reconstruct eco-evolutionary population dynamics and to identify the genetic basis of adaptation in laboratory evolution experiments. Here, we describe how to run the open-source breseq computational pipeline to identify and annotate genetic differences found in whole-genome and whole-population NGS data from haploid microbes where a high-quality reference genome is available. These methods can also be used to analyze mutants isolated in genetic screens and to detect unintended mutations that may occur during strain construction and genome editing.

This is a preview of subscription content, log in via an institution.

Buying options

Protocol
USD   49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   119.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Springer Nature is developing a new tool to find and evaluate Protocols. Learn more

References

  1. Mardis ER (2008) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet 9:387–402

    Article  CAS  Google Scholar 

  2. Eid J, Fehr A, Gray J et al (2009) Real-time DNA sequencing from single polymerase molecules. Science 323:133–138

    Article  CAS  Google Scholar 

  3. Trapnell C, Salzberg SL (2009) How to map billions of short reads onto genomes. Nat Biotechnol 27:455–457

    Article  CAS  Google Scholar 

  4. DePristo MA, Banks E, Poplin R et al (2011) A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43:491–498

    Article  CAS  Google Scholar 

  5. Kim D, Salzberg SL (2011) TopHat-fusion: an algorithm for discovery of novel fusion transcripts. Genome Biol 12:R72

    Article  CAS  Google Scholar 

  6. Barrick JE, Yu DS, Yoon SH et al (2009) Genome evolution and adaptation in a long-term experiment with Escherichia coli. Nature 461:1243–1247

    Article  CAS  Google Scholar 

  7. Barrick JE, Lenski RE (2009) Genome-wide mutational diversity in an evolving population of Escherichia coli. Cold Spring Harb Symp Quant Biol 74:119–129

    Article  CAS  Google Scholar 

  8. Woods RJ, Barrick JE, Cooper TF et al (2011) Second-order selection for evolvability in a large Escherichia coli population. Science 331:1433–1436

    Article  CAS  Google Scholar 

  9. Blount ZD, Barrick JE, Davidson CJ, Lenski RE (2012) Genomic analysis of a key innovation in an experimental Escherichia coli population. Nature 489:513–518

    Article  CAS  Google Scholar 

  10. Milne I, Stephen G, Bayer M et al (2013) Using Tablet for visual exploration of second-generation sequencing data. Brief Bioinform 14:193–202. doi:10.1093/bib/bbs012

    Article  CAS  Google Scholar 

  11. Thorvaldsdóttir H, Robinson JT, Mesirov JP (2013) Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform 14:178–192

    Article  Google Scholar 

  12. Jeong H, Barbe V, Lee CH et al (2009) Genome sequences of Escherichia coli B strains REL606 and BL21(DE3). J Mol Biol 394:644–652

    Article  CAS  Google Scholar 

  13. Schneider D, Duperchy E, Coursange E et al (2000) Long-term experimental evolution in Escherichia coli. IX. Characterization of insertion sequence-mediated mutations and rearrangements. Genetics 156:477–488

    CAS  Google Scholar 

  14. Danecek P, Auton A, Abecasis G et al (2011) The variant call format and VCF tools. Bioinformatics 27:2156–2158

    Article  CAS  Google Scholar 

  15. Andrews S FastQC: a quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/

  16. Dodt M, Roehr J, Ahmed R, Dieterich C (2012) FLEXBAR—flexible barcode and adapter processing for next-generation sequencing platforms. Biology 1:895–905

    Article  Google Scholar 

  17. Zerbino DR, Birney E (2008) Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821–829

    Article  CAS  Google Scholar 

  18. Ribeiro FJ, Przybylski D, Yin S et al (2012) Finished bacterial genomes from shotgun sequence data. Genome Res 22:2270–2277

    Article  CAS  Google Scholar 

  19. Kurtz S, Phillippy A, Delcher AL et al (2004) Versatile and open software for comparing large genomes. Genome Biol 5:R12

    Article  Google Scholar 

Download references

Acknowledgements

D.E.D. was supported by a University of Texas at Austin CPRIT Cancer Research Traineeship. Development of breseq has been supported by an NSF Postdoctoral Research Fellowship in Biological Informatics (DBI-0630687) and by grants from the NSF BEACON Center for the Study of Evolution in Action (DBI-0939454), NIH (R00-GM087550), and CPRIT (RP130124) to J.E.B. Additional programmers and users who have provided valuable feedback and bug reports are thanked in the breseq documentation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jeffrey E. Barrick .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this protocol

Cite this protocol

Deatherage, D.E., Barrick, J.E. (2014). Identification of Mutations in Laboratory-Evolved Microbes from Next-Generation Sequencing Data Using breseq . In: Sun, L., Shou, W. (eds) Engineering and Analyzing Multicellular Systems. Methods in Molecular Biology, vol 1151. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-0554-6_12

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-0554-6_12

  • Published:

  • Publisher Name: Humana Press, New York, NY

  • Print ISBN: 978-1-4939-0553-9

  • Online ISBN: 978-1-4939-0554-6

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics