A pentapeptide-based method for protein secondary structure prediction

A Figureau; M A Soto; J Tohá

doi:10.1093/proeng/gzg019

A pentapeptide-based method for protein secondary structure prediction

Protein Eng. 2003 Feb;16(2):103-7. doi: 10.1093/proeng/gzg019.

Authors

A Figureau¹, M A Soto, J Tohá

Affiliation

¹ Institut de Physique Nucléaire de Lyon, Université Claude Bernard, 69622 Villeurbanne Cedex, France.

PMID: 12676978
DOI: 10.1093/proeng/gzg019

Abstract

We present a new method for protein secondary structure prediction, based on the recognition of well-defined pentapeptides, in a large databank. Using a databank of 635 protein chains, we obtained a success rate of 68.6%. We show that progress is achieved when the databank is enlarged, when the 20 amino acids are adequately grouped in 10 sets and when more pentapeptides are attributed one of the defined conformations, alpha-helices or beta-strands. The analysis of the model indicates that the essential variable is the number of pentapeptides of well-defined structure in the database. Our model is simple, does not rely on arbitrary parameters and allows the analysis in detail of the results of each chosen hypothesis.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Databases, Protein
Oligopeptides / chemistry*
Pattern Recognition, Automated
Protein Structure, Secondary*
Reproducibility of Results

Substances

Oligopeptides