Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Reconstructing complex cancer evolutionary histories from multiple bulk DNA samples using Pairtree

View ORCID ProfileJeff A. Wintersinger, Stephanie M. Dobson, View ORCID ProfileLincoln D. Stein, View ORCID ProfileJohn E. Dick, View ORCID ProfileQuaid D. Morris
doi: https://doi.org/10.1101/2020.11.06.372219
Jeff A. Wintersinger
1Department of Computer Science, University of Toronto, Toronto, ON, Canada
2Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
3Ontario Institute for Cancer Research, Toronto, ON, Canada
4Vector Institute for Artificial Intelligence, Toronto, ON, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jeff A. Wintersinger
Stephanie M. Dobson
5Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
6Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lincoln D. Stein
3Ontario Institute for Cancer Research, Toronto, ON, Canada
5Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lincoln D. Stein
John E. Dick
5Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
6Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for John E. Dick
Quaid D. Morris
1Department of Computer Science, University of Toronto, Toronto, ON, Canada
3Ontario Institute for Cancer Research, Toronto, ON, Canada
4Vector Institute for Artificial Intelligence, Toronto, ON, Canada
5Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
7Memorial Sloan Kettering Cancer Center, New York City, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Quaid D. Morris
  • For correspondence: morrisq@mskcc.org
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Cancers are composed of genetically distinct subpopulations of malignant cells. By sequencing DNA from cancer tissue samples, we can characterize the somatic mutations specific to each population and build clone trees describing the evolutionary relationships between these populations. These trees reveal critical points in disease development and inform treatment. Both liquid cancers and solid tumours can be profiled with this approach.

Pairtree is a new method for constructing clone trees using DNA sequencing data from one or more bulk samples of an individual cancer. It uses Bayesian inference to compute posterior distributions over the evolutionary relationships between every pair of identified subpopulations, then uses these distributions in a Markov Chain Monte Carlo algorithm to perform efficient inference of the posterior distribution over clone trees. Pairtree also uses the pairwise relationships to detect mutations that violate the infinite sites assumption. Unlike previous methods, Pairtree can perform clone tree reconstructions using as many as 100 samples per cancer that reveal 30 or more cell subpopulations. On simulated data, Pairtree is the only method whose performance reliably improves when provided with additional bulk samples from a cancer. On 14 B-progenitor acute lymphoblastic leukemias with up to 90 samples from each cancer, Pairtree was the only method that could reproduce or improve upon expert-derived clone tree reconstructions. By scaling to more challenging problems, Pairtree supports new biomedical research applications that can improve our understanding of the natural history of cancer, as well as better illustrate the interplay between cancer, host, and therapeutic interventions.

Significance Clone trees describe the evolutionary history of a cancer and can provide insights into how the disease changed through time (e.g., between diagnosis and relapse). Pairtree uses DNA sequencing data from many samples of the same cancer to reconstruct a cancer’s evolutionary history with much greater detail and accuracy than previously possible.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • Conflicts of interest, The authors declare no potential conflicts of interest.

  • * We have incorporated CALDER as one of the methods that we compare Pairtree to, and have updated all the relevant figures and text. * We have produced two new supplemental figures where we isolate our low depth simulations and show that Pairtree performs as well, or better, on the lower depth simulated datasets as on higher-depth ones. * We have added the "tree error" measure developed by Raphael lab as one of the metrics we use to compare algorithms and added a supplemental figure incorporating it. * We have added a section to the main text describing our ISA violation detection evaluations . * We have added discussion regarding the incorporation of alternative models for cancer evolution with Pairtree. This discussion paragraph is supported by three new sections in the supplemental materials where we prove that Pairtree's subclonal frequency likelihood is convex, argue that the mode is a good approximation to the marginal likelihood, and explain how various alternative models of cancer evolution may impact these two properties.

  • https://github.com/morrislab/pairtree

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted November 23, 2021.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Reconstructing complex cancer evolutionary histories from multiple bulk DNA samples using Pairtree
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Reconstructing complex cancer evolutionary histories from multiple bulk DNA samples using Pairtree
Jeff A. Wintersinger, Stephanie M. Dobson, Lincoln D. Stein, John E. Dick, Quaid D. Morris
bioRxiv 2020.11.06.372219; doi: https://doi.org/10.1101/2020.11.06.372219
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Reconstructing complex cancer evolutionary histories from multiple bulk DNA samples using Pairtree
Jeff A. Wintersinger, Stephanie M. Dobson, Lincoln D. Stein, John E. Dick, Quaid D. Morris
bioRxiv 2020.11.06.372219; doi: https://doi.org/10.1101/2020.11.06.372219

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3502)
  • Biochemistry (7343)
  • Bioengineering (5319)
  • Bioinformatics (20258)
  • Biophysics (10008)
  • Cancer Biology (7735)
  • Cell Biology (11293)
  • Clinical Trials (138)
  • Developmental Biology (6434)
  • Ecology (9947)
  • Epidemiology (2065)
  • Evolutionary Biology (13315)
  • Genetics (9359)
  • Genomics (12579)
  • Immunology (7696)
  • Microbiology (19008)
  • Molecular Biology (7437)
  • Neuroscience (41011)
  • Paleontology (300)
  • Pathology (1228)
  • Pharmacology and Toxicology (2134)
  • Physiology (3155)
  • Plant Biology (6858)
  • Scientific Communication and Education (1272)
  • Synthetic Biology (1895)
  • Systems Biology (5311)
  • Zoology (1087)