Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction

Robin D Dowell; Sean R Eddy

doi:10.1186/1471-2105-5-71

Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction

BMC Bioinformatics. 2004 Jun 4:5:71. doi: 10.1186/1471-2105-5-71.

Authors

Robin D Dowell¹, Sean R Eddy

Affiliation

¹ Howard Hughes Medical Institute and Department of Genetics, Washington University School of Medicine, St, Louis, MO 63108 USA. robin@genetics.wustl.edu

Abstract

Background: RNA secondary structure prediction methods based on probabilistic modeling can be developed using stochastic context-free grammars (SCFGs). Such methods can readily combine different sources of information that can be expressed probabilistically, such as an evolutionary model of comparative RNA sequence analysis and a biophysical model of structure plausibility. However, the number of free parameters in an integrated model for consensus RNA structure prediction can become untenable if the underlying SCFG design is too complex. Thus a key question is, what small, simple SCFG designs perform best for RNA secondary structure prediction?

Results: Nine different small SCFGs were implemented to explore the tradeoffs between model complexity and prediction accuracy. Each model was tested for single sequence structure prediction accuracy on a benchmark set of RNA secondary structures.

Conclusions: Four SCFG designs had prediction accuracies near the performance of current energy minimization programs. One of these designs, introduced by Knudsen and Hein in their PFOLD algorithm, has only 21 free parameters and is significantly simpler than the others.

Publication types

Evaluation Study
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Computational Biology / methods
Models, Statistical
Nucleic Acid Conformation
Predictive Value of Tests
RNA / chemistry*
Stochastic Processes

Substances

RNA

Abstract

Publication types

MeSH terms

Substances

Grants and funding