Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

A mutation rate model at the basepair resolution identifies the mutagenic effect of Polymerase III transcription

Vladimir Seplyarskiy, Daniel J. Lee, Evan M. Koch, Joshua S. Lichtman, Harding H. Luan, Shamil R. Sunyaev
doi: https://doi.org/10.1101/2022.08.20.504670
Vladimir Seplyarskiy
1Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
2Brigham and Women’s Hospital, Division of Genetics, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel J. Lee
1Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
2Brigham and Women’s Hospital, Division of Genetics, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Evan M. Koch
1Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
2Brigham and Women’s Hospital, Division of Genetics, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joshua S. Lichtman
3NGM Biopharmaceuticals, South San Francisco, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Harding H. Luan
3NGM Biopharmaceuticals, South San Francisco, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shamil R. Sunyaev
1Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
2Brigham and Women’s Hospital, Division of Genetics, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: ssunyaev@hms.harvard.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

De novo mutations occur with substantially different rates depending on genomic location, sequence context and DNA strand1–4. The success of many human genetics techniques, especially when applied to large population sequencing datasets with numerous recurrent mutations5–7, depends strongly on assumptions about the local mutation rate. Such techniques include estimation of selection intensity8, inference of demographic history9, and mapping of rare disease genes10. Here, we present Roulette, a genome-wide mutation rate model at the basepair resolution that incorporates known determinants of local mutation rate (http://genetics.bwh.harvard.edu/downloads/Vova/Roulette/). Roulette is shown to be more accurate than existing models1,6. Roulette has sufficient resolution at high mutation rate sites to model allele frequencies under recurrent mutation. We use Roulette to refine estimates of population growth within Europe by incorporating the full range of human mutation rates. The analysis of significant deviations from the model predictions revealed a 10-fold increase in mutation rate in nearly all genes transcribed by Polymerase III, suggesting a new mutagenic mechanism. We also detected an elevated mutation rate within transcription factor binding sites restricted to sites actively utilized in testis and residing in promoters.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • We added additional validation of the Roulette model and showed that SNVs in RNU genes are enriched with recurrent mutations

  • http://genetics.bwh.harvard.edu/downloads/Vova/Roulette/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted May 17, 2023.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
A mutation rate model at the basepair resolution identifies the mutagenic effect of Polymerase III transcription
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
A mutation rate model at the basepair resolution identifies the mutagenic effect of Polymerase III transcription
Vladimir Seplyarskiy, Daniel J. Lee, Evan M. Koch, Joshua S. Lichtman, Harding H. Luan, Shamil R. Sunyaev
bioRxiv 2022.08.20.504670; doi: https://doi.org/10.1101/2022.08.20.504670
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
A mutation rate model at the basepair resolution identifies the mutagenic effect of Polymerase III transcription
Vladimir Seplyarskiy, Daniel J. Lee, Evan M. Koch, Joshua S. Lichtman, Harding H. Luan, Shamil R. Sunyaev
bioRxiv 2022.08.20.504670; doi: https://doi.org/10.1101/2022.08.20.504670

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4395)
  • Biochemistry (9619)
  • Bioengineering (7111)
  • Bioinformatics (24915)
  • Biophysics (12642)
  • Cancer Biology (9980)
  • Cell Biology (14388)
  • Clinical Trials (138)
  • Developmental Biology (7977)
  • Ecology (12135)
  • Epidemiology (2067)
  • Evolutionary Biology (16010)
  • Genetics (10938)
  • Genomics (14764)
  • Immunology (9889)
  • Microbiology (23719)
  • Molecular Biology (9493)
  • Neuroscience (50969)
  • Paleontology (370)
  • Pathology (1544)
  • Pharmacology and Toxicology (2688)
  • Physiology (4031)
  • Plant Biology (8685)
  • Scientific Communication and Education (1512)
  • Synthetic Biology (2403)
  • Systems Biology (6446)
  • Zoology (1346)