The Pfam protein families database

Nucleic Acids Res. 2010 Jan;38(Database issue):D211-22. doi: 10.1093/nar/gkp985. Epub 2009 Nov 17.

Abstract

Pfam is a widely used database of protein families and domains. This article describes a set of major updates that we have implemented in the latest release (version 24.0). The most important change is that we now use HMMER3, the latest version of the popular profile hidden Markov model package. This software is approximately 100 times faster than HMMER2 and is more sensitive due to the routine use of the forward algorithm. The move to HMMER3 has necessitated numerous changes to Pfam that are described in detail. Pfam release 24.0 contains 11,912 families, of which a large number have been significantly updated during the past two years. Pfam is available via servers in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Computational Biology / methods*
  • Computational Biology / trends
  • Databases, Nucleic Acid*
  • Databases, Protein*
  • Genome, Archaeal
  • Genome, Fungal
  • Humans
  • Information Storage and Retrieval / methods
  • Internet
  • Molecular Sequence Data
  • Protein Structure, Tertiary
  • Sequence Alignment
  • Sequence Homology, Amino Acid
  • Software