Quantitative phenotyping via deep barcode sequencing

  1. Andrew M. Smith1,2,3,
  2. Lawrence E. Heisler3,4,
  3. Joseph Mellor5,6,
  4. Fiona Kaper7,
  5. Michael J. Thompson7,
  6. Mark Chee7,
  7. Frederick P. Roth5,6,
  8. Guri Giaever1,3,4,8 and
  9. Corey Nislow1,2,3,8
  1. 1 Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada;
  2. 2 Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5G 1L6, Canada;
  3. 3 Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada;
  4. 4 Department of Pharmaceutical Sciences, University of Toronto, Toronto, Ontario M5S 3M2, Canada;
  5. 5 Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA;
  6. 6 Center for Cancer Systems Biology, Dana Farber Cancer Institute, Boston, Massachusetts 02115, USA;
  7. 7 Prognosys Biosciences, Inc., La Jolla, California 92037, USA

    Abstract

    Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or “Bar-seq,” outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that ∼20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene–environment interactions on a genome-wide scale.

    Footnotes

    | Table of Contents

    Preprint Server