Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

An extension of the Walsh-Hadamard transform to calculate and model epistasis in genetic landscapes of arbitrary shape and complexity

View ORCID ProfileAndre J. Faure, View ORCID ProfileBen Lehner, View ORCID ProfileVerónica Miró Pina, View ORCID ProfileClaudia Serrano Colome, View ORCID ProfileDonate Weghorn
doi: https://doi.org/10.1101/2023.03.06.531391
Andre J. Faure
1Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Andre J. Faure
  • For correspondence: [email protected]
Ben Lehner
1Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
2Universitat Pompeu Fabra (UPF), Barcelona, Spain
3Institució Catalana de Recerca i estudis Avançats (ICREA), Barcelona, Spain
4Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ben Lehner
Verónica Miró Pina
1Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Verónica Miró Pina
Claudia Serrano Colome
1Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Claudia Serrano Colome
Donate Weghorn
1Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
2Universitat Pompeu Fabra (UPF), Barcelona, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Donate Weghorn
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Accurate models describing the relationship between genotype and phenotype are necessary in order to understand and predict how mutations to biological sequences affect the fitness and evolution of living organisms. The apparent abundance of epistasis (genetic interactions), both between and within genes, complicates this task and how to build mechanistic models that incorporate epistatic coefficients (genetic interaction terms) is an open question. The Walsh-Hadamard transform represents a rigorous computational framework for calculating and modeling epistatic interactions at the level of individual genotypic values (known as genetical, biological or physiological epistasis), and can therefore be used to address fundamental questions related to sequence-to-function encodings. However, one of its main limitations is that it can only accommodate two alleles (amino acid or nucleotide states) per sequence position. In this paper we provide an extension of the Walsh-Hadamard transform that allows the calculation and modeling of background-averaged epistasis (also known as ensemble epistasis) in genetic landscapes with an arbitrary number of states per position (20 for amino acids, 4 for nucleotides, etc.). We also provide a recursive formula for the inverse matrix and then derive formulae to directly extract any element of either matrix without having to rely on the computationally intensive task of constructing or inverting large matrices. Finally, we demonstrate the utility of our theory by using it to model epistasis within both simulated and empirical multiallelic fitness landscapes, revealing that both pairwise and higher-order genetic interactions are enriched between physically interacting positions.

Author Summary An important question in genetics is how the effects of mutations combine to alter phenotypes. Genetic interactions (epistasis) describe non-additive effects of pairs of mutations, but can also involve higher-order (three- and four-way etc.) combinations. Quantifying higher-order interactions is experimentally very challenging requiring a large number of measurements. Techniques based on deep mutational scanning (DMS) represent valuable sources of data to study epistasis. However, the best way to extract the relevant pairwise and higher-order epistatic coefficients (genetic interaction terms) from this data for the task of phenotypic prediction remains an unresolved problem. The Walsh-Hadamard transform represents a rigorous computational framework for calculating and modeling epistatic interactions at the level of individual genotypic values. Critically, this formalism currently only allows for two alleles (amino acid or nucleotide states) per sequence position, hampering applications in more biologically realistic scenarios. Here we present an extension of the Walsh-Hadamard transform that overcomes this limitation and demonstrate the utility of our theory by using it to model epistasis within both simulated and empirical multiallelic genetic landscapes.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • Redesigned and simplified Figure 2 (and supplementary figures); added results from a simulated genotype-phenotype landscape; added results from an additional empirical (DMS) dataset; added a theory section related to the derivation of the error propagation.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted November 10, 2023.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
An extension of the Walsh-Hadamard transform to calculate and model epistasis in genetic landscapes of arbitrary shape and complexity
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
An extension of the Walsh-Hadamard transform to calculate and model epistasis in genetic landscapes of arbitrary shape and complexity
Andre J. Faure, Ben Lehner, Verónica Miró Pina, Claudia Serrano Colome, Donate Weghorn
bioRxiv 2023.03.06.531391; doi: https://doi.org/10.1101/2023.03.06.531391
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
An extension of the Walsh-Hadamard transform to calculate and model epistasis in genetic landscapes of arbitrary shape and complexity
Andre J. Faure, Ben Lehner, Verónica Miró Pina, Claudia Serrano Colome, Donate Weghorn
bioRxiv 2023.03.06.531391; doi: https://doi.org/10.1101/2023.03.06.531391

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Systems Biology
Subject Areas
All Articles
  • Animal Behavior and Cognition (6022)
  • Biochemistry (13708)
  • Bioengineering (10436)
  • Bioinformatics (33157)
  • Biophysics (17109)
  • Cancer Biology (14173)
  • Cell Biology (20106)
  • Clinical Trials (138)
  • Developmental Biology (10868)
  • Ecology (16018)
  • Epidemiology (2067)
  • Evolutionary Biology (20346)
  • Genetics (13395)
  • Genomics (18634)
  • Immunology (13750)
  • Microbiology (32164)
  • Molecular Biology (13392)
  • Neuroscience (70069)
  • Paleontology (526)
  • Pathology (2190)
  • Pharmacology and Toxicology (3741)
  • Physiology (5864)
  • Plant Biology (12020)
  • Scientific Communication and Education (1814)
  • Synthetic Biology (3367)
  • Systems Biology (8166)
  • Zoology (1841)