Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

ProteInfer: deep networks for protein functional inference

View ORCID ProfileTheo Sanderson, View ORCID ProfileMaxwell L. Bileschi, View ORCID ProfileDavid Belanger, View ORCID ProfileLucy J. Colwell
doi: https://doi.org/10.1101/2021.09.20.461077
Theo Sanderson
1Francis Crick Institute
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Theo Sanderson
  • For correspondence: [email protected]
Maxwell L. Bileschi
2Google AI
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Maxwell L. Bileschi
David Belanger
2Google AI
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for David Belanger
Lucy J. Colwell
2Google AI
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lucy J. Colwell
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Predicting the function of a protein from its amino acid sequence is a long-standing challenge in bioinformatics. Traditional approaches use sequence alignment to compare a query sequence either to thousands of models of protein families or to large databases of individual protein sequences. Here we instead employ deep convolutional neural networks to directly predict a variety of protein functions – EC numbers and GO terms – directly from an unaligned amino acid sequence. This approach provides precise predictions which complement alignment-based methods, and the computational efficiency of a single neural network permits novel and lightweight software interfaces, which we demonstrate with an in-browser graphical interface for protein function prediction in which all computation is performed on the user’s personal computer with no data uploaded to remote servers. Moreover, these models place full-length amino acid sequences into a generalised functional space, facilitating downstream analysis and interpretation. To read the interactive version of this paper, please visit https://google-research.github.io/proteinfer/

Figure
  • Download figure
  • Open in new tab

QR code for the interactive version of this preprint at https://google-research.github.io/proteinfer/

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://google-research.github.io/proteinfer/

  • https://github.com/google-research/proteinfer

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted October 06, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
ProteInfer: deep networks for protein functional inference
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
ProteInfer: deep networks for protein functional inference
Theo Sanderson, Maxwell L. Bileschi, David Belanger, Lucy J. Colwell
bioRxiv 2021.09.20.461077; doi: https://doi.org/10.1101/2021.09.20.461077
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
ProteInfer: deep networks for protein functional inference
Theo Sanderson, Maxwell L. Bileschi, David Belanger, Lucy J. Colwell
bioRxiv 2021.09.20.461077; doi: https://doi.org/10.1101/2021.09.20.461077

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (6101)
  • Biochemistry (13882)
  • Bioengineering (10580)
  • Bioinformatics (33622)
  • Biophysics (17327)
  • Cancer Biology (14391)
  • Cell Biology (20386)
  • Clinical Trials (138)
  • Developmental Biology (10994)
  • Ecology (16223)
  • Epidemiology (2067)
  • Evolutionary Biology (20531)
  • Genetics (13527)
  • Genomics (18822)
  • Immunology (13949)
  • Microbiology (32536)
  • Molecular Biology (13547)
  • Neuroscience (70957)
  • Paleontology (533)
  • Pathology (2222)
  • Pharmacology and Toxicology (3781)
  • Physiology (5961)
  • Plant Biology (12170)
  • Scientific Communication and Education (1826)
  • Synthetic Biology (3406)
  • Systems Biology (8246)
  • Zoology (1875)