Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Broadly applicable and accurate protein design by integrating structure prediction networks and diffusion generative models

View ORCID ProfileJoseph L. Watson, View ORCID ProfileDavid Juergens, View ORCID ProfileNathaniel R. Bennett, View ORCID ProfileBrian L. Trippe, View ORCID ProfileJason Yim, View ORCID ProfileHelen E. Eisenach, Woody Ahern, View ORCID ProfileAndrew J. Borst, View ORCID ProfileRobert J. Ragotte, View ORCID ProfileLukas F. Milles, View ORCID ProfileBasile I. M. Wicky, View ORCID ProfileNikita Hanikel, Samuel J. Pellock, View ORCID ProfileAlexis Courbet, William Sheffler, View ORCID ProfileJue Wang, Preetham Venkatesh, View ORCID ProfileIsaac Sappington, Susana Vázquez Torres, View ORCID ProfileAnna Lauko, View ORCID ProfileValentin De Bortoli, Emile Mathieu, Regina Barzilay, View ORCID ProfileTommi S. Jaakkola, View ORCID ProfileFrank DiMaio, View ORCID ProfileMinkyung Baek, View ORCID ProfileDavid Baker
doi: https://doi.org/10.1101/2022.12.09.519842
Joseph L. Watson
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Joseph L. Watson
David Juergens
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
3Graduate Program in Molecular Engineering, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for David Juergens
Nathaniel R. Bennett
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
3Graduate Program in Molecular Engineering, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nathaniel R. Bennett
Brian L. Trippe
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
4Columbia University, Department of Statistics, New York, NY 10027, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Brian L. Trippe
Jason Yim
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
6Massachusetts Institute of Technology, Cambridge, MA 02139, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jason Yim
Helen E. Eisenach
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Helen E. Eisenach
Woody Ahern
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
7Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andrew J. Borst
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Andrew J. Borst
Robert J. Ragotte
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Robert J. Ragotte
Lukas F. Milles
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lukas F. Milles
Basile I. M. Wicky
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Basile I. M. Wicky
Nikita Hanikel
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nikita Hanikel
Samuel J. Pellock
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexis Courbet
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
9Centre National de la recherche scientifique, École Normale Supérieure rue d’Ulm, Paris 75005, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alexis Courbet
William Sheffler
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jue Wang
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jue Wang
Preetham Venkatesh
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
8Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Isaac Sappington
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
8Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Isaac Sappington
Susana Vázquez Torres
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
8Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Anna Lauko
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
8Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Anna Lauko
Valentin De Bortoli
9Centre National de la recherche scientifique, École Normale Supérieure rue d’Ulm, Paris 75005, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Valentin De Bortoli
Emile Mathieu
10Department of Engineering, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Regina Barzilay
6Massachusetts Institute of Technology, Cambridge, MA 02139, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tommi S. Jaakkola
6Massachusetts Institute of Technology, Cambridge, MA 02139, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Tommi S. Jaakkola
Frank DiMaio
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Frank DiMaio
Minkyung Baek
12School of Biological Sciences, Seoul National University, Seoul 08826, Republic of Korea
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Minkyung Baek
David Baker
1Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
2Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
11Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for David Baker
  • For correspondence: dabaker@uw.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

There has been considerable recent progress in designing new proteins using deep learning methods1–9. Despite this progress, a general deep learning framework for protein design that enables solution of a wide range of design challenges, including de novo binder design and design of higher order symmetric architectures, has yet to be described. Diffusion models10,11 have had considerable success in image and language generative modeling but limited success when applied to protein modeling, likely due to the complexity of protein backbone geometry and sequence-structure relationships. Here we show that by fine tuning the RoseTTAFold structure prediction network on protein structure denoising tasks, we obtain a generative model of protein backbones that achieves outstanding performance on unconditional and topology-constrained protein monomer design, protein binder design, symmetric oligomer design, enzyme active site scaffolding, and symmetric motif scaffolding for therapeutic and metal-binding protein design. We demonstrate the power and generality of the method, called RoseTTAFold Diffusion (RFdiffusion), by experimentally characterizing the structures and functions of hundreds of new designs. In a manner analogous to networks which produce images from user-specified inputs, RFdiffusion enables the design of diverse, complex, functional proteins from simple molecular specifications.

Competing Interest Statement

The authors have declared no competing interest.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
Back to top
PreviousNext
Posted December 10, 2022.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Broadly applicable and accurate protein design by integrating structure prediction networks and diffusion generative models
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Broadly applicable and accurate protein design by integrating structure prediction networks and diffusion generative models
Joseph L. Watson, David Juergens, Nathaniel R. Bennett, Brian L. Trippe, Jason Yim, Helen E. Eisenach, Woody Ahern, Andrew J. Borst, Robert J. Ragotte, Lukas F. Milles, Basile I. M. Wicky, Nikita Hanikel, Samuel J. Pellock, Alexis Courbet, William Sheffler, Jue Wang, Preetham Venkatesh, Isaac Sappington, Susana Vázquez Torres, Anna Lauko, Valentin De Bortoli, Emile Mathieu, Regina Barzilay, Tommi S. Jaakkola, Frank DiMaio, Minkyung Baek, David Baker
bioRxiv 2022.12.09.519842; doi: https://doi.org/10.1101/2022.12.09.519842
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Broadly applicable and accurate protein design by integrating structure prediction networks and diffusion generative models
Joseph L. Watson, David Juergens, Nathaniel R. Bennett, Brian L. Trippe, Jason Yim, Helen E. Eisenach, Woody Ahern, Andrew J. Borst, Robert J. Ragotte, Lukas F. Milles, Basile I. M. Wicky, Nikita Hanikel, Samuel J. Pellock, Alexis Courbet, William Sheffler, Jue Wang, Preetham Venkatesh, Isaac Sappington, Susana Vázquez Torres, Anna Lauko, Valentin De Bortoli, Emile Mathieu, Regina Barzilay, Tommi S. Jaakkola, Frank DiMaio, Minkyung Baek, David Baker
bioRxiv 2022.12.09.519842; doi: https://doi.org/10.1101/2022.12.09.519842

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Biochemistry
Subject Areas
All Articles
  • Animal Behavior and Cognition (4237)
  • Biochemistry (9155)
  • Bioengineering (6797)
  • Bioinformatics (24052)
  • Biophysics (12149)
  • Cancer Biology (9562)
  • Cell Biology (13814)
  • Clinical Trials (138)
  • Developmental Biology (7653)
  • Ecology (11729)
  • Epidemiology (2066)
  • Evolutionary Biology (15534)
  • Genetics (10663)
  • Genomics (14346)
  • Immunology (9503)
  • Microbiology (22876)
  • Molecular Biology (9113)
  • Neuroscience (49080)
  • Paleontology (357)
  • Pathology (1487)
  • Pharmacology and Toxicology (2576)
  • Physiology (3851)
  • Plant Biology (8347)
  • Scientific Communication and Education (1473)
  • Synthetic Biology (2299)
  • Systems Biology (6202)
  • Zoology (1302)