ABSTRACT
Recently developed maximum entropy methods infer evolutionary constraints on protein function and structure from the millions of protein sequences available in genomic databases. The EVfold web server (at EVfold.org) makes these methods available to predict functional and structural interactions in proteins. The key algorithmic development has been to disentangle direct and indirect residue-residue correlations in large multiple sequence alignments and derive direct residue-residue evolutionary couplings (EVcouplings or ECs). For proteins of unknown structure, distance constraints obtained from evolutionarily couplings between residue pairs are used to de novo predict all-atom 3D structures, often to good accuracy. Given sufficient sequence information in a protein family, this is a major advance toward solving the problem of computing the native 3D fold of proteins from sequence information alone.
Availability EVfold server at http://evfold.org/
Contact evfoldtest{at}gmail.com
- DI
- direct information
- EC
- evolutionary coupling
- EV
- evolutionary
- MSA
- multiple sequence alignment
- PLM
- pseudo-likelihood maximization
- PPV
- positive predictive value (number of true positives divided by the sum of true and false positives)
- TM-score
- template modeling score