The importance of intrinsic disorder for protein phosphorylation

Nucleic Acids Res. 2004 Feb 11;32(3):1037-49. doi: 10.1093/nar/gkh253. Print 2004.

Abstract

Reversible protein phosphorylation provides a major regulatory mechanism in eukaryotic cells. Due to the high variability of amino acid residues flanking a relatively limited number of experimentally identified phosphorylation sites, reliable prediction of such sites still remains an important issue. Here we report the development of a new web-based tool for the prediction of protein phosphorylation sites, DISPHOS (DISorder-enhanced PHOSphorylation predictor, http://www.ist.temple. edu/DISPHOS). We observed that amino acid compositions, sequence complexity, hydrophobicity, charge and other sequence attributes of regions adjacent to phosphorylation sites are very similar to those of intrinsically disordered protein regions. Thus, DISPHOS uses position-specific amino acid frequencies and disorder information to improve the discrimination between phosphorylation and non-phosphorylation sites. Based on the estimates of phosphorylation rates in various protein categories, the outputs of DISPHOS are adjusted in order to reduce the total number of misclassified residues. When tested on an equal number of phosphorylated and non-phosphorylated residues, the accuracy of DISPHOS reaches 76% for serine, 81% for threonine and 83% for tyrosine. The significant enrichment in disorder-promoting residues surrounding phosphorylation sites together with the results obtained by applying DISPHOS to various protein functional classes and proteomes, provide strong support for the hypothesis that protein phosphorylation predominantly occurs within intrinsically disordered protein regions.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acids / analysis
  • Amino Acids / chemistry
  • Animals
  • Databases, Protein
  • Internet
  • Phosphorylation
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Proteome / chemistry
  • Proteome / metabolism
  • Reproducibility of Results
  • Sequence Analysis, Protein / methods*

Substances

  • Amino Acids
  • Proteins
  • Proteome

Associated data

  • PDB/1ATP
  • PDB/1CDK
  • PDB/1GY3
  • PDB/1H4X
  • PDB/1H4Z
  • PDB/1IR3
  • PDB/1JBP
  • PDB/1O6I
  • PDB/1O6K
  • PDB/1PHK
  • PDB/1QMZ