Abstract
Background The contribution of somatic mosaicism, or genetic mutations arising after oocyte fertilization, to congenital heart disease (CHD) is not well understood. Further, the relationship between mosaicism in blood and cardiovascular tissue has not been determined.
Results We developed a computational method, Expectation-Maximization-based detection of Mosaicism (EM-mosaic), to analyze mosaicism in exome sequences of 2530 CHD proband-parent trios. EM-mosaic detected 326 mosaic mutations in blood and/or cardiac tissue DNA. Of the 309 detected in blood DNA, 85/97 (88%) tested were independently confirmed, while 7/17 (41%) candidates of 17 detected in cardiac tissue were confirmed. MosaicHunter detected an additional 64 mosaics, of which 23/46 (50%) among 58 candidates from blood and 4/6 (67%) of 6 candidates from cardiac tissue confirmed. Twenty-five mosaic variants altered CHD-risk genes, affecting 1% of our cohort. Of these 25, 22/22 candidates tested were confirmed. Variants predicted as damaging had higher variant allele fraction than benign variants, suggesting a role in CHD. The frequency of mosaic variants above 10% mosaicism was 0.13/person in blood and 0.14/person in cardiac tissue. Analysis of 66 individuals with matched cardiac tissue available revealed both tissue-specific and shared mosaicism, with shared mosaics generally having higher allele fraction.
Conclusions We estimate that ~1% of CHD probands have a mosaic variant detectable in blood that could contribute to cardiac malformations, particularly those damaging variants expressed at higher allele fraction compared to benign variants. Although blood is a readily-available DNA source, cardiac tissues analyzed contributed ~5% of somatic mosaic variants identified, indicating the value of tissue mosaicism analyses.
Footnotes
alh2194{at}cumc.columbia.edu (A.H.), Sarah.Morton{at}childrens.harvard.edu (S.U.M.), Jon_Willcox{at}hms.harvard.edu (J.A.L.W.), jgorham{at}genetics.med.harvard.edu (J.M.G.), angela_tai{at}hms.harvard.edu (A.C.T.), hq2130{at}columbia.edu (H.Q.), depalma{at}genetics.med.harvard.edu (S.D.), dmckean{at}genetics.med.harvard.edu (D.M.), eg2871{at}cumc.columbia.edu (E.G.), kathryn.manheimer{at}icahn.mssm.edu (K.B.M.), danb{at}stanford.edu (D.B.), rikim{at}chla.usc.edu (R.W.K.), jane.newburger{at}cardio.chboston.org (J.W.N.), george_porter{at}urmc.rochester.edu (G.A.P.), deepak.srivastava{at}gladstone.ucsf.edu (D.S.), martin.tristani{at}utah.edu (M.T.F.), martina.brueckner{at}yale.edu (M.B.), rickl{at}mail.rockefeller.edu (R.P.L.) Goldmuntz{at}email.chop.edu (E.G.) bruce.gelb{at}mssm.edu (B.D.G.) wkc15{at}cumc.columbia.edu (W.K.C.) cseidman{at}genetics.med.harvard.edu (C.E.S.), seidman{at}genetics.med.harvard.edu (J.G.S.), ys2411{at}cumc.columbia.edu (Y.S.)
Abbreviations
- ASD
- Autism Spectrum Disorder
- CHD
- Congenital Heart Disease
- dnSNV
- de novo SNV
- Dmis
- Deleterious Missense mutation
- DPsite
- Total Read Depth at a variant site
- DPsample
- Sample-wide Average Read Depth
- ExAC
- Exome Aggregation Consortium
- FDR
- False Discovery Rate
- gnomAD
- Genome Aggregation Database
- HHE
- High Heart Expression
- LOF
- Loss-of-Function
- LR
- Likelihood Ratio
- MAF
- Minor Allele Fraction
- N
- Total Read Depth
- Nalt
- Alternate Allele Read Depth
- OR
- Odds Ratio
- PCGC
- Pediatric Cardiac Genomics Consortium
- pLI
- Probability of Loss-of-Function Intolerance
- PV4
- P-value for strand bias, baseQ bias, mapQ bias and tail distance bias (SAMtools)
- SNV
- Single Nucleotide Variant
- VAF
- Variant Allele Fraction
- WES
- Whole Exome Sequencing