Prediction of disordered regions in proteins from position specific score matrices

David T Jones; Jonathan J Ward

doi:10.1002/prot.10528

Prediction of disordered regions in proteins from position specific score matrices

Proteins. 2003:53 Suppl 6:573-8. doi: 10.1002/prot.10528.

Authors

David T Jones¹, Jonathan J Ward

Affiliation

¹ Department of Computer Science, Bioinformatics Unit, University College London, London, United Kingdom. dtj@cs.ucl.ac.uk

PMID: 14579348
DOI: 10.1002/prot.10528

Abstract

We describe here the results of using a neural network based method (DISOPRED) for predicting disordered regions in 55 proteins in the 5(th) CASP experiment. A set of 715 highly resolved proteins with regions of disorder was used to train the network. The inputs to the network were derived from sequence profiles generated by PSI-BLAST. A post-filter was applied to the output of the network to prevent regions being predicted as disordered in regions of confidently predicted alpha helix or beta sheet structure. The overall two-state prediction accuracy for the method is very high (90%) but this is highly skewed by the fact that most residues are observed to be ordered. The overall Matthews' correlation coefficient for the submitted predictions is 0.34, which gives a more realistic impression of the overall accuracy of the method, though still indicates significant predictive power.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Computational Biology / methods*
Magnetic Resonance Spectroscopy
Models, Molecular
Neural Networks, Computer*
Protein Conformation*
Proteins / chemistry*
Reproducibility of Results

Substances

Proteins