Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

satmut_utils: a simulation and variant calling package for multiplexed assays of variant effect

Ian Hoskins, Song Sun, Atina Cote, Frederick P. Roth, Can Cenik
doi: https://doi.org/10.1101/2022.04.25.489390
Ian Hoskins
aDepartment of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Song Sun
bThe Donnelly Centre and Departments of Molecular Genetics and Computer Science, University of Toronto, Toronto, Ontario, Canada
cLunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Atina Cote
bThe Donnelly Centre and Departments of Molecular Genetics and Computer Science, University of Toronto, Toronto, Ontario, Canada
cLunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Frederick P. Roth
bThe Donnelly Centre and Departments of Molecular Genetics and Computer Science, University of Toronto, Toronto, Ontario, Canada
cLunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Can Cenik
aDepartment of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: ccenik@austin.utexas.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background The impact of thousands of individual genetic variants on molecular phenotypes for disease-relevant genes remains unknown. Multiplexed assays for variant effect (MAVEs) are highly scalable methods to annotate the relevant variants. However, current software methods for analyzing MAVEs lack standardized annotation, can require cumbersome configuration, and do not easily scale to large target regions.

Results Here, we present satmut_utils as a flexible solution for 1) simulation of saturation mutagenesis data; and 2) quantification of variants across four orders of magnitude from multiplexed assay data. Improvements of satmut_utils over existing solutions include support for multiple experimental strategies, unique molecular identifier-based consensus deduplication, and machine learning-based error correction. We developed a rigorous simulation workflow to validate the performance of satmut_utils and carried out the first benchmarking of existing software for variant calling. Finally, we used satmut_utils to determine the mRNA abundance of thousands of coding variants in cystathionine beta-synthase (CBS) by two library preparation methods. We identified an association between variants near chemical cofactor binding sites and decreased mRNA abundance. We also found a correlation between codon optimality and the magnitude of variant effects, emphasizing the potential of single-nucleotide variants to alter mRNA abundance.

Conclusions satmut_utils enables high-performance analysis of saturation mutagenesis data, achieves unprecedented specificity through novel error correction approaches, and reveals the capability of single-codon variants to alter mRNA abundance in native coding sequences.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://github.com/CenikLab/satmut_utils

  • https://github.com/ijhoskins/satmut_utils

  • https://github.com/ijhoskins/satmut_utils_supplementary

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted April 26, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
satmut_utils: a simulation and variant calling package for multiplexed assays of variant effect
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
satmut_utils: a simulation and variant calling package for multiplexed assays of variant effect
Ian Hoskins, Song Sun, Atina Cote, Frederick P. Roth, Can Cenik
bioRxiv 2022.04.25.489390; doi: https://doi.org/10.1101/2022.04.25.489390
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
satmut_utils: a simulation and variant calling package for multiplexed assays of variant effect
Ian Hoskins, Song Sun, Atina Cote, Frederick P. Roth, Can Cenik
bioRxiv 2022.04.25.489390; doi: https://doi.org/10.1101/2022.04.25.489390

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4654)
  • Biochemistry (10299)
  • Bioengineering (7614)
  • Bioinformatics (26189)
  • Biophysics (13446)
  • Cancer Biology (10620)
  • Cell Biology (15333)
  • Clinical Trials (138)
  • Developmental Biology (8452)
  • Ecology (12754)
  • Epidemiology (2067)
  • Evolutionary Biology (16763)
  • Genetics (11356)
  • Genomics (15400)
  • Immunology (10548)
  • Microbiology (25041)
  • Molecular Biology (10152)
  • Neuroscience (54096)
  • Paleontology (398)
  • Pathology (1655)
  • Pharmacology and Toxicology (2877)
  • Physiology (4314)
  • Plant Biology (9196)
  • Scientific Communication and Education (1579)
  • Synthetic Biology (2541)
  • Systems Biology (6752)
  • Zoology (1452)