MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect

Ammar Tareen; Mahdi Kooshkbaghi; Anna Posfai; William T. Ireland; David M. McCandlish; Justin B. Kinney

doi:10.1101/2020.07.14.201475

Abstract

Multiplex assays of variant effect (MAVEs) are a family of methods that includes deep mutational scanning (DMS) experiments on proteins and massively parallel reporter assays (MPRAs) on gene regulatory sequences. However, a general strategy for inferring quantitative models of genotype-phenotype (G-P) maps from MAVE data is lacking. Here we introduce MAVE-NN, a neural-network-based Python package that implements a broadly applicable information-theoretic framework for learning G-P maps—including biophysically interpretable models—from MAVE datasets. We demonstrate MAVE-NN in multiple biological contexts, and highlight the ability of our approach to deconvolve mutational effects from otherwise confounding experimental nonlinearities and noise.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

Moderate revisions throughout.
https://mavenn.readthedocs.io/

The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.