A pentapeptide-based method for protein secondary structure prediction

Protein Eng. 2003 Feb;16(2):103-7. doi: 10.1093/proeng/gzg019.

Abstract

We present a new method for protein secondary structure prediction, based on the recognition of well-defined pentapeptides, in a large databank. Using a databank of 635 protein chains, we obtained a success rate of 68.6%. We show that progress is achieved when the databank is enlarged, when the 20 amino acids are adequately grouped in 10 sets and when more pentapeptides are attributed one of the defined conformations, alpha-helices or beta-strands. The analysis of the model indicates that the essential variable is the number of pentapeptides of well-defined structure in the database. Our model is simple, does not rely on arbitrary parameters and allows the analysis in detail of the results of each chosen hypothesis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Protein
  • Oligopeptides / chemistry*
  • Pattern Recognition, Automated
  • Protein Structure, Secondary*
  • Reproducibility of Results

Substances

  • Oligopeptides