Creating Standards for Evaluating Tumour Subclonal Reconstruction

Abstract
Tumours evolve through time and space. To infer these evolutionary dynamics for DNA sequencing data, many subclonal reconstruction techniques have been developed and applied to large datasets. Surprisingly, though, there has been no systematic evaluation of these methods, in part due to the complexity of the mathematical and biological questions and the difficulties in creating gold-standards. To fill this gap, we systematically elucidated key algorithmic problems in subclonal reconstruction, and developed mathematically valid quantitative metrics for evaluating them. We then developed approaches to simulate realistic tumour genomes that harbour all known mutation types and processes. Finally, we benchmarked a set of 500 subclonal reconstructions, creating a key resource, and quantified the impact of sequencing read-depth and somatic variant detection strategies on the accuracy of specific subclonal reconstruction approaches. Inference of tumour phylogenies is rapidly becoming standard practice in cancer genome analysis, and this work sets standards for evaluating its accuracy.
Footnotes
↵# These authors jointly directed the work
Subject Area
- Biochemistry (9158)
- Bioengineering (6797)
- Bioinformatics (24054)
- Biophysics (12149)
- Cancer Biology (9563)
- Cell Biology (13816)
- Clinical Trials (138)
- Developmental Biology (7653)
- Ecology (11731)
- Epidemiology (2066)
- Evolutionary Biology (15535)
- Genetics (10664)
- Genomics (14351)
- Immunology (9504)
- Microbiology (22883)
- Molecular Biology (9119)
- Neuroscience (49088)
- Paleontology (357)
- Pathology (1487)
- Pharmacology and Toxicology (2576)
- Physiology (3851)
- Plant Biology (8349)
- Synthetic Biology (2299)
- Systems Biology (6203)
- Zoology (1302)