TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data

  1. Sohrab P. Shah1,4,7
  1. 1Department of Molecular Oncology, British Columbia Cancer Agency, Vancouver, BC V5Z 1L3, Canada;
  2. 2Bioinformatics Training Program, University of British Columbia, Vancouver, BC V5Z 4S6, Canada;
  3. 3Centre for Translational and Applied Genomics, Vancouver, BC V5Z 4E6, Canada;
  4. 4Department of Computer Science, University of British Columbia, Vancouver, BC V6T 1Z4, Canada;
  5. 5Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 1L3, Canada;
  6. 6Genetic Pathology Evaluation Centre, Vancouver General Hospital, Vancouver, BC V6H 3Z6, Canada;
  7. 7Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, BC V6T 2B5, Canada;
  8. 8Department of Gynecology and Obstetrics, University of British Columbia, Vancouver, BC V5Z 1M9, Canada
  1. Corresponding author: sshah{at}bccrc.ca

Abstract

The evolution of cancer genomes within a single tumor creates mixed cell populations with divergent somatic mutational landscapes. Inference of tumor subpopulations has been disproportionately focused on the assessment of somatic point mutations, whereas computational methods targeting evolutionary dynamics of copy number alterations (CNA) and loss of heterozygosity (LOH) in whole-genome sequencing data remain underdeveloped. We present a novel probabilistic model, TITAN, to infer CNA and LOH events while accounting for mixtures of cell populations, thereby estimating the proportion of cells harboring each event. We evaluate TITAN on idealized mixtures, simulating clonal populations from whole-genome sequences taken from genomically heterogeneous ovarian tumor sites collected from the same patient. In addition, we show in 23 whole genomes of breast tumors that the inference of CNA and LOH using TITAN critically informs population structure and the nature of the evolving cancer genome. Finally, we experimentally validated subclonal predictions using fluorescence in situ hybridization (FISH) and single-cell sequencing from an ovarian cancer patient sample, thereby recapitulating the key modeling assumptions of TITAN.

Footnotes

  • [Supplemental material is available for this article.]

  • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.180281.114.

    Freely available online through the Genome Research Open Access option.

  • Received September 6, 2013.
  • Accepted July 23, 2014.

This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0.

| Table of Contents
OPEN ACCESS ARTICLE

Preprint Server