KAAS: an automatic genome annotation and pathway reconstruction server

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W182-5. doi: 10.1093/nar/gkm321. Epub 2007 May 25.

Abstract

The number of complete and draft genomes is rapidly growing in recent years, and it has become increasingly important to automate the identification of functional properties and biological roles of genes in these genomes. In the KEGG database, genes in complete genomes are annotated with the KEGG orthology (KO) identifiers, or the K numbers, based on the best hit information using Smith-Waterman scores as well as by the manual curation. Each K number represents an ortholog group of genes, and it is directly linked to an object in the KEGG pathway map or the BRITE functional hierarchy. Here, we have developed a web-based server called KAAS (KEGG Automatic Annotation Server: http://www.genome.jp/kegg/kaas/) i.e. an implementation of a rapid method to automatically assign K numbers to genes in the genome, enabling reconstruction of KEGG pathways and BRITE hierarchies. The method is based on sequence similarities, bi-directional best hit information and some heuristics, and has achieved a high degree of accuracy when compared with the manually curated KEGG GENES database.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Artificial Intelligence
  • Automation
  • Chromosome Mapping / methods*
  • Computational Biology / methods*
  • Database Management Systems
  • Documentation / methods*
  • Genome*
  • Humans
  • Information Storage and Retrieval / methods
  • Internet
  • Proteome / classification*
  • Proteome / metabolism*
  • Sequence Analysis / methods*
  • Signal Transduction / physiology*
  • Vocabulary, Controlled*

Substances

  • Proteome