TY - JOUR T1 - Reconstructing complex cancer evolutionary histories from multiple bulk DNA samples using Pairtree JF - bioRxiv DO - 10.1101/2020.11.06.372219 SP - 2020.11.06.372219 AU - Jeff A. Wintersinger AU - Stephanie M. Dobson AU - Lincoln D. Stein AU - John E. Dick AU - Quaid D. Morris Y1 - 2021/01/01 UR - http://biorxiv.org/content/early/2021/11/23/2020.11.06.372219.abstract N2 - Cancers are composed of genetically distinct subpopulations of malignant cells. By sequencing DNA from cancer tissue samples, we can characterize the somatic mutations specific to each population and build clone trees describing the evolutionary relationships between these populations. These trees reveal critical points in disease development and inform treatment. Both liquid cancers and solid tumours can be profiled with this approach.Pairtree is a new method for constructing clone trees using DNA sequencing data from one or more bulk samples of an individual cancer. It uses Bayesian inference to compute posterior distributions over the evolutionary relationships between every pair of identified subpopulations, then uses these distributions in a Markov Chain Monte Carlo algorithm to perform efficient inference of the posterior distribution over clone trees. Pairtree also uses the pairwise relationships to detect mutations that violate the infinite sites assumption. Unlike previous methods, Pairtree can perform clone tree reconstructions using as many as 100 samples per cancer that reveal 30 or more cell subpopulations. On simulated data, Pairtree is the only method whose performance reliably improves when provided with additional bulk samples from a cancer. On 14 B-progenitor acute lymphoblastic leukemias with up to 90 samples from each cancer, Pairtree was the only method that could reproduce or improve upon expert-derived clone tree reconstructions. By scaling to more challenging problems, Pairtree supports new biomedical research applications that can improve our understanding of the natural history of cancer, as well as better illustrate the interplay between cancer, host, and therapeutic interventions.Significance Clone trees describe the evolutionary history of a cancer and can provide insights into how the disease changed through time (e.g., between diagnosis and relapse). Pairtree uses DNA sequencing data from many samples of the same cancer to reconstruct a cancer’s evolutionary history with much greater detail and accuracy than previously possible.Competing Interest StatementThe authors have declared no competing interest. ER -