Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis

Alex Nisthal, Connie Y. Wang, Marie L. Ary, Stephen L. Mayo
doi: https://doi.org/10.1101/484949
Alex Nisthal
aDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: nisthal.alex@gmail.com steve@mayo.caltech.edu
Connie Y. Wang
bProtabit LLC, 1010 E. Union Street, Suite 110, Pasadena, CA 91106
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Marie L. Ary
bProtabit LLC, 1010 E. Union Street, Suite 110, Pasadena, CA 91106
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stephen L. Mayo
aDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
cDivision of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: nisthal.alex@gmail.com steve@mayo.caltech.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

The accurate prediction of protein stability upon sequence mutation is an important but unsolved challenge in protein engineering. Large mutational datasets are required to train computational predictors, but traditional methods for collecting stability data are either low-throughput or measure protein stability indirectly. Here, we develop an automated method to generate thermodynamic stability data for nearly every single mutant in a small 56-residue protein. Analysis reveals that most single mutants have a neutral effect on stability, mutational sensitivity is largely governed by residue burial, and unexpectedly, hydrophobics are the best tolerated amino acid type. Correlating the output of various stability prediction algorithms against our data shows that nearly all perform better on boundary and surface positions than for those in the core, and are better at predicting large to small mutations than small to large ones. We show that the most stable variants in the single mutant landscape are better identified using combinations of two prediction algorithms, and that including more algorithms can provide diminishing returns. In most cases, poor in silico predictions were tied to compositional differences between the data being analyzed and the datasets used to train the algorithm. Finally, we find that strategies to extract stabilities from high-throughput fitness data such as deep mutational scanning are promising and that data produced by these methods may be applicable toward training future stability prediction tools.

Significance Statement Using liquid-handling automation, we constructed and measured the thermodynamic stability of almost every single mutant of protein G (Gβ1), a small domain. This self-consistent dataset is the largest of its kind and offers unique opportunities on two fronts: (i) insight into protein domain properties such as positional sensitivity and incorporated amino acid tolerance, and (ii) service as a validation set for future efforts in protein stability prediction. As Gβ1 is a model system for protein folding and design, and its single mutant landscape has been measured by deep mutational scanning, we expect our dataset to serve as a reference for studies aimed at extracting stability information from fitness data or developing novel high-throughput stability assays.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
Back to top
PreviousNext
Posted January 31, 2019.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis
Alex Nisthal, Connie Y. Wang, Marie L. Ary, Stephen L. Mayo
bioRxiv 484949; doi: https://doi.org/10.1101/484949
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis
Alex Nisthal, Connie Y. Wang, Marie L. Ary, Stephen L. Mayo
bioRxiv 484949; doi: https://doi.org/10.1101/484949

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One
Subject Areas
All Articles
  • Animal Behavior and Cognition (4089)
  • Biochemistry (8772)
  • Bioengineering (6487)
  • Bioinformatics (23356)
  • Biophysics (11756)
  • Cancer Biology (9154)
  • Cell Biology (13256)
  • Clinical Trials (138)
  • Developmental Biology (7417)
  • Ecology (11376)
  • Epidemiology (2066)
  • Evolutionary Biology (15094)
  • Genetics (10403)
  • Genomics (14014)
  • Immunology (9126)
  • Microbiology (22070)
  • Molecular Biology (8783)
  • Neuroscience (47393)
  • Paleontology (350)
  • Pathology (1421)
  • Pharmacology and Toxicology (2482)
  • Physiology (3705)
  • Plant Biology (8054)
  • Scientific Communication and Education (1433)
  • Synthetic Biology (2211)
  • Systems Biology (6017)
  • Zoology (1250)