Sequence-derived structural features driving proteolytic processing

Proteomics. 2014 Jan;14(1):42-50. doi: 10.1002/pmic.201300416. Epub 2013 Dec 11.

Abstract

Proteolytic signaling, or regulated proteolysis, is an essential part of many important pathways such as Notch, Wnt, and Hedgehog. How the structure of the cleaved substrate regions influences the efficacy of proteolytic processing remains underexplored. Here, we analyzed the relative importance in proteolysis of various structural features derived from substrate sequences using a dataset of more than 5000 experimentally verified proteolytic events captured in CutDB. Accessibility to the solvent was recognized as an essential property of a proteolytically processed polypeptide chain. Proteolytic events were found nearly uniformly distributed among three types of secondary structure, although with some enrichment in loops. Cleavages in α-helices were found to be relatively abundant in regions apparently prone to unfolding, while cleavages in β-structures tended to be located at the periphery of β-sheets. Application of the same statistical procedures to proteolytic events divided into separate sets according to the catalytic classes of proteases proved consistency of the results and confirmed that the structural mechanisms of proteolysis are universal. The estimated prediction power of sequence-derived structural features, which turned out to be sufficiently high, presents a rationale for their use in bioinformatic prediction of proteolytic events.

Keywords: Bioinformatics; Cleavage site; Limited proteolysis; Protease; Proteolytic processing; Regulated proteolysis.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence*
  • Computational Biology / methods*
  • Models, Statistical
  • Protein Conformation
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Proteolysis*
  • ROC Curve

Substances

  • Proteins