CORUM: the comprehensive resource of mammalian protein complexes--2009

Nucleic Acids Res. 2010 Jan;38(Database issue):D497-501. doi: 10.1093/nar/gkp914. Epub 2009 Nov 1.

Abstract

CORUM is a manually curated repository of experimentally characterized protein complexes from mammalian organisms, mainly human (64%), mouse (16%) and rat (12%). Protein complexes are key molecular entities that integrate multiple gene products to perform cellular functions. The new CORUM 2.0 release encompasses 2837 protein complexes, making it the largest and most comprehensive publicly available dataset of mammalian protein complexes. The CORUM dataset is built from 3198 different genes, representing approximately 16% of the protein-coding genes in humans. Each protein complex is described by its name, subunit composition and function, as well as the literature reference that characterizes it. Recent developments include the mapping of functional annotation to Gene Ontology terms and cross-references to Entrez Gene identifiers. In addition, a 'Phylogenetic Conservation' analysis tool was implemented that analyses the potential occurrence of orthologous protein complex subunits in mammals and other selected groups of organisms. This allows one to predict the occurrence of protein complexes in different phylogenetic groups. CORUM is freely accessible at http://mips.helmholtz-muenchen.de/genre/proj/corum/index.html.
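
As a rough illustration of the annotation described above, a complex entry and a simple conservation measure might be sketched as follows. This is not CORUM's actual schema or the algorithm behind the 'Phylogenetic Conservation' tool: the class name, field names, the `conserved_fraction` function and all identifiers in the example are hypothetical placeholders, and the ortholog table is assumed to come from an external source.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ComplexRecord:
    """Hypothetical record mirroring the fields named in the abstract."""
    name: str            # protein complex name
    organism: str        # e.g. "Human", "Mouse", "Rat"
    subunits: tuple      # gene identifiers of the subunits (placeholders here)
    go_terms: tuple      # functional annotation mapped to Gene Ontology terms
    reference: str       # literature reference characterizing the complex


def conserved_fraction(record, orthologs, group):
    """Return the fraction of subunits with a known ortholog in `group`.

    `orthologs` maps a gene identifier to the set of phylogenetic groups
    in which an ortholog is assumed to have been found.
    """
    if not record.subunits:
        return 0.0
    hits = sum(1 for gene in record.subunits
               if group in orthologs.get(gene, set()))
    return hits / len(record.subunits)


# Placeholder data for illustration only; not taken from CORUM.
example = ComplexRecord(
    name="example complex",
    organism="Human",
    subunits=("GENE_A", "GENE_B", "GENE_C", "GENE_D"),
    go_terms=("GO:placeholder",),
    reference="PMID:placeholder",
)
example_orthologs = {
    "GENE_A": {"yeast", "mouse"},
    "GENE_B": {"mouse"},
    "GENE_C": {"yeast"},
}

yeast_fraction = conserved_fraction(example, example_orthologs, "yeast")
```

A high fraction would suggest that the complex could plausibly occur in the target group; any threshold for making that call is a design choice that the abstract does not specify.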

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Computational Biology / trends
  • Databases, Genetic*
  • Databases, Protein*
  • Humans
  • Information Storage and Retrieval / methods
  • Internet
  • Mice
  • Multiprotein Complexes*
  • Phylogeny
  • Protein Structure, Tertiary
  • Rats
  • Saccharomyces cerevisiae / genetics
  • Software

Substances

  • Multiprotein Complexes