Development and extensive sequencing of a broadly-consented Genome in a Bottle matched tumor-normal pair for somatic benchmarks
Abstract
The Genome in a Bottle Consortium (GIAB), hosted by the National Institute of Standards and Technology (NIST), is developing new matched tumor-normal samples, the first to be explicitly consented for public dissemination of genomic data and cell lines. Here, we describe a comprehensive genomic dataset from the first individual, HG008, including DNA from an adherent, epithelial-like pancreatic ductal adenocarcinoma (PDAC) tumor cell line (HG008-T) and matched normal cells from duodenal tissue (HG008-N-D) and pancreatic tissue (HG008-N-P). The data come from thirteen whole genome measurement technologies: Illumina paired-end, Element standard and long insert, Ultima UG100, PacBio (HiFi and Onso), Oxford Nanopore (standard and ultra-long), Bionano Optical Mapping, Arima and Phase Genomics Hi-C, G-banded karyotyping, directional genomic hybridization, and BioSkryb Genomics single-cell ResolveDNA. Most tumor data are from a large homogeneous batch of non-viable cells after 23 passages of the primary tumor cells, along with some data from different passages to enable an initial understanding of genomic instability. These data will be used by the GIAB Consortium to develop matched tumor-normal benchmarks for somatic variant detection. In addition, extensive data from two different normal tissues from the same individual can enable understanding of mosaicism. Long reads also contain methylation tags for epigenetic analyses. We expect these data to facilitate innovation for whole genome measurement technologies, de novo assembly of tumor and normal genomes, and bioinformatic tools to identify small and structural somatic mutations. This first-of-its-kind broadly consented open-access resource will facilitate further understanding of sequencing methods used for cancer biology.
Competing Interest Statement
A.S. and K.S. are employees of Arima Genomics. L.F.P., from BCM, was sponsored by Genentech Inc. until September 2023. F.J.S., from BCM, received research support from Illumina, ONT, and PacBio. A.R.H. and H-C.Y. are employees of Bionano Genomics and own stock shares and options of Bionano Genomics, Inc. V.W., K.K., J.R., and I.G. are employees of BioSkryb Genomics. M.S., K.B., B.L., and S.L. are employees of Element Biosciences. S.B.K., C.L., P.B., A.M.W., I.J.M., A.A., C.K., M.W., and Y.K. are employees and shareholders of PacBio, Inc. D.L., H.B., N.I., and I.S. are employees and shareholders of Ultima Genomics. S.E. and M.W. are employees of Phase Genomics. E.C., G.H., S.G., and M.V. are employees of KromaTiD, Inc.; E.C. is also a shareholder.