COMBREX-DB: an experiment centered database of protein function: knowledge, predictions and knowledge gaps

Nucleic Acids Res. 2016 Jan 4;44(D1):D330-5. doi: 10.1093/nar/gkv1324. Epub 2015 Dec 3.

Abstract

The COMBREX database (COMBREX-DB; combrex.bu.edu) is an online repository of information related to (i) experimentally determined protein function, (ii) predicted protein function, (iii) relationships among proteins of unknown function and various types of experimental data, including molecular function, protein structure, and associated phenotypes. The database was created as part of the novel COMBREX (COMputational BRidges to EXperiments) effort aimed at accelerating the rate of gene function validation. It currently holds information on ∼ 3.3 million known and predicted proteins from over 1000 completely sequenced bacterial and archaeal genomes. The database also contains a prototype recommendation system for helping users identify those proteins whose experimental determination of function would be most informative for predicting function for other proteins within protein families. The emphasis on documenting experimental evidence for function predictions, and the prioritization of uncharacterized proteins for experimental testing distinguish COMBREX from other publicly available microbial genomics resources. This article describes updates to COMBREX-DB since an initial description in the 2011 NAR Database Issue.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Archaeal Proteins / chemistry
  • Archaeal Proteins / genetics
  • Archaeal Proteins / physiology*
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics
  • Bacterial Proteins / physiology*
  • Databases, Protein*
  • Molecular Sequence Annotation

Substances

  • Archaeal Proteins
  • Bacterial Proteins