A comprehensive multicenter comparison of whole genome sequencing pipelines using a uniform tumor-normal sample pair
Abstract
As next-generation sequencing becomes a clinical tool, a full understanding of the variables affecting sequencing analysis output is required. Through the International Cancer Genome Consortium (ICGC), we compared sequencing pipelines at five independent centers (CNAG, DKFZ, OICR, RIKEN and WTSI) using a single tumor-blood DNA pair. Analyses by each center and with one standardized algorithm revealed significant discrepancies. Although most pipelines performed well for coding mutations, library preparation methods and sequencing coverage metrics clearly influenced downstream results. PCR-free methods showed reduced GC-bias and more even coverage. Increasing sequencing depth to ∼100x (two- to three-fold higher than current standards) showed a benefit, as long as the tumor:control coverage ratio remained balanced. To become part of routine clinical care, high-throughput sequencing must be globally compatible and comparable. This benchmarking exercise has highlighted several fundamental parameters to consider in this regard, which will allow for better optimization and planning of both basic and translational studies.
Subject Area
- Biochemistry (13659)
- Bioengineering (10389)
- Bioinformatics (33063)
- Biophysics (17045)
- Cancer Biology (14113)
- Cell Biology (20012)
- Clinical Trials (138)
- Developmental Biology (10829)
- Ecology (15960)
- Epidemiology (2067)
- Evolutionary Biology (20288)
- Genetics (13371)
- Genomics (18595)
- Immunology (13697)
- Microbiology (32054)
- Molecular Biology (13347)
- Neuroscience (69845)
- Paleontology (521)
- Pathology (2175)
- Pharmacology and Toxicology (3728)
- Physiology (5841)
- Plant Biology (11981)
- Synthetic Biology (3358)
- Systems Biology (8143)
- Zoology (1839)