Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model

Sheng Wang, Siqi Sun, Zhen Li, Renyu Zhang, Jinbo Xu
doi: https://doi.org/10.1101/073239
Sheng Wang
Toyota Technological Institute at Chicago
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jinboxu@gmail.com
Siqi Sun
Toyota Technological Institute at Chicago
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jinboxu@gmail.com
Zhen Li
Toyota Technological Institute at Chicago
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jinboxu@gmail.com
Renyu Zhang
Toyota Technological Institute at Chicago
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jinboxu@gmail.com
Jinbo Xu
Toyota Technological Institute at Chicago
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jinboxu@gmail.com
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Motivation Protein contacts contain key information for the understanding of protein structure and function and thus, contact prediction from sequence is an important problem. Recently exciting progress has been made on this problem, but the predicted contacts for proteins without many sequence homologs is still of low quality and not extremely useful for de novo structure prediction.

Method This paper presents a new deep learning method that predicts contacts by integrating both evolutionary coupling (EC) information and sequence conservation information through an ultra-deep neural network formed by two deep residual neural networks. The first residual network conducts a series of 1-dimensional convolutional transformation of sequential features; the second residual network conducts a series of 2-dimensional convolutional transformation of pairwise information including output of the first residual network, EC information and pairwise potential. This neural network allows us to model very complex relationship between sequence and contact map as well as long-range interdependency between contacts and thus, obtain high-quality contact prediction.

Results Our method greatly outperforms existing contact prediction methods and leads to much more accurate contact-assisted protein folding. For example, on the 105 CASP11 test proteins, the L/10 long-range accuracy obtained by our method is 83.3% while that by the state-of-the-art methods CCMpred and MetaPSICOV (the CASP11 winner) is 43.4% and 60.2%, respectively. On the 398 membrane proteins, the L/10 long-range accuracy obtained by our method is 77.3% while that by CCMpred and MetaPSICOV is 51.8% and 61.2%, respectively. Ab initio folding using our predicted contacts as restraints can yield correct folds (i.e., TMscore>0.6) for 224 of the 579 test proteins, while that using MetaPSICOV- and CCMpred-predicted contacts can do so for only 79 and 62 of them, respectively. Further, our contact-assisted models also have much better quality (especially for membrane proteins) than template-based models. Using our predicted contacts as restraints, we can fold 240 of the 398 membrane proteins with TMscore>0.5. However, when the training proteins of our method are used as templates, homology modeling can only do so for 10 of the membrane proteins.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
Back to top
PreviousNext
Posted September 06, 2016.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model
Sheng Wang, Siqi Sun, Zhen Li, Renyu Zhang, Jinbo Xu
bioRxiv 073239; doi: https://doi.org/10.1101/073239
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model
Sheng Wang, Siqi Sun, Zhen Li, Renyu Zhang, Jinbo Xu
bioRxiv 073239; doi: https://doi.org/10.1101/073239

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4113)
  • Biochemistry (8815)
  • Bioengineering (6518)
  • Bioinformatics (23459)
  • Biophysics (11789)
  • Cancer Biology (9207)
  • Cell Biology (13322)
  • Clinical Trials (138)
  • Developmental Biology (7436)
  • Ecology (11409)
  • Epidemiology (2066)
  • Evolutionary Biology (15149)
  • Genetics (10436)
  • Genomics (14043)
  • Immunology (9171)
  • Microbiology (22153)
  • Molecular Biology (8812)
  • Neuroscience (47566)
  • Paleontology (350)
  • Pathology (1428)
  • Pharmacology and Toxicology (2491)
  • Physiology (3730)
  • Plant Biology (8079)
  • Scientific Communication and Education (1437)
  • Synthetic Biology (2220)
  • Systems Biology (6037)
  • Zoology (1253)