The Gene Expression Omnibus Database

Methods Mol Biol. 2016:1418:93-110. doi: 10.1007/978-1-4939-3578-9_5.

Abstract

The Gene Expression Omnibus (GEO) database is an international public repository that archives and freely distributes high-throughput gene expression and other functional genomics data sets. Created in 2000 as a worldwide resource for gene expression studies, GEO has evolved with rapidly changing technologies and now accepts high-throughput data for many other applications, including those that examine genome methylation, chromatin structure, and genome-protein interactions. GEO supports community-derived reporting standards that specify provision of several critical study elements, including raw data, processed data, and descriptive metadata. The database not only provides access to data for tens of thousands of studies but also offers various Web-based tools and strategies that enable users to locate data relevant to their specific interests and to visualize and analyze the data. This chapter includes detailed descriptions of methods to query and download GEO data and to use the analysis and visualization tools. The GEO homepage is at http://www.ncbi.nlm.nih.gov/geo/.
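
As a rough illustration of programmatic querying, the sketch below builds request URLs for GEO without contacting the network. It is not taken from the chapter. It assumes the NCBI E-utilities ESearch endpoint with the GEO DataSets Entrez database (db=gds) and the GEO accession display page (acc.cgi). The function names, the search term, and the accession "GSE1" are placeholders chosen for this example.

```python
from urllib.parse import urlencode

# Assumed endpoints: NCBI E-utilities ESearch and the GEO accession display page.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"
GEO_ACC_URL = "https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi"


def build_geo_search_url(term, max_records=20):
    """Return an ESearch URL that searches GEO DataSets (db=gds) for `term`.

    Hypothetical helper: it only assembles the URL and performs no request.
    """
    params = {
        "db": "gds",            # Entrez database name for GEO DataSets
        "term": term,           # free-text or fielded Entrez query
        "retmax": max_records,  # maximum number of record IDs to return
        "retmode": "json",      # ask ESearch for a JSON response
    }
    return ESEARCH_URL + "?" + urlencode(params)


def build_geo_accession_url(accession):
    """Return the GEO display URL for an accession such as a GSE series.

    Hypothetical helper: the accession is passed through unchanged.
    """
    return GEO_ACC_URL + "?" + urlencode({"acc": accession})


if __name__ == "__main__":
    # Placeholder query and accession, used only to show the URL shapes.
    print(build_geo_search_url("methylation", max_records=5))
    print(build_geo_accession_url("GSE1"))
```

The resulting URLs can be opened in a browser or passed to any HTTP client. NCBI asks automated clients to respect its usage guidelines, so a real script would likely add rate limiting and an API key.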

Keywords: Data mining; Database; Functional genomics; Gene expression; High-throughput sequencing; Microarray.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Computational Biology / methods
  • Databases, Genetic*
  • Gene Expression Profiling / methods*
  • Gene Expression*
  • Genomics / methods*
  • Software
  • Web Browser