GenBank

Nucleic Acids Res. 2015 Jan;43(Database issue):D30-5. doi: 10.1093/nar/gku1216. Epub 2014 Nov 20.

Abstract

GenBank® (http://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for over 300 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assign accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Bacteria / classification
  • Databases, Nucleic Acid*
  • Genomics
  • Internet
  • Sequence Analysis, DNA
  • Sequence Analysis, Protein