Metabolomics by numbers: acquiring and understanding global metabolite data

https://doi.org/10.1016/j.tibtech.2004.03.007Get rights and content

Abstract

In this postgenomic era, there is a specific need to assign function to orphan genes in order to validate potential targets for drug therapy and to discover new biomarkers of disease. Metabolomics is an emerging field that is complementary to the other ‘omics and proving to have unique advantages. As in transcriptomics or proteomics, a typical metabolic fingerprint or metabolomic experiment is likely to generate thousands of data points, of which only a handful might be needed to describe the problem adequately. Extracting the most meaningful elements of these data is thus key to generating useful new knowledge with mechanistic or explanatory power.

Section snippets

Measuring the metabolome

The ultimate starting point of a metabolomic experiment is to quantify all of the metabolites in a cellular system (i.e. the cell or tissue in a given state at a given point in time). Currently this is impossible, given the lack of simple automated analytical strategies that can effect this in a reproducible and robust way. The main challenges are the chemical complexity and heterogeneity of metabolites, the dynamic range of the measuring technique, the throughput of the measurements, and the

Technology platforms for metabolomics

Metabolites are chemical entities and can be analysed by the standard tools of chemical analysis such as molecular spectroscopy and MS. The resolution, sensitivity and selectivity of these technologies can be enhanced or modified by coupling them to gas chromatograpy (GC) or liquid chromatography (LC) steps. The technologies commonly exploited for different metabolomic strategies are shown in Figure 1. Generally, the technology platform of choice depends on the type of sample to be analysed.

Databases for metabolomics

In a recent study, Lyman and Varian estimated that in 2000 the world produced between 1 and 2 exabytes (1–2×1018 bytes) of ‘unique’ information per year (http://www.sims.berkeley.edu/how-much-info). This flood of data is roughly 250 megabytes for every man, woman and child on earth! IBM's estimates are that information within the life sciences doubles every 6 months (http://www.bio-itworld.com/champions/janet_perna.html); this data explosion comes from genomic sequencing, the ‘omics’ and

A paradigm shift from metabolic pathways to networks and neighbourhoods

There has been a shift from mental constructs involving metabolic pathways to those based on metabolic networks and neighbourhoods 37, 38, and many would argue that the ‘Boehringer’ metabolic pathways map needs to be updated both radically and conceptually. An excellent example of this is illustrated by the experiments of Willmitzer and colleagues [39] on the carbon sink in potatoes. The aim of these experiments was to increase the amount of starch in the tubers by the ‘rational’ overproduction

Hypothesis-generating strategies from metabolome data

Many of the pattern-recognition strategies currently pursued in metabolomics, and indeed in the analyses of all ‘omic data, are based on ‘unsupervised’ techniques [48] (Figure 2), such as hierarchical cluster analysis in which a ‘tree-like’ dendrogram (as commonly seen in taxonomic and phylogenetics) is produced. Clustering methods are used to assess, in a multivariate manner, how similar a set of samples are to one another on the basis of their metabolite profiles, although many of the methods

From metabolomics to systems biology

“When a thing was new, people said, ‘It is not true’. Later, when the truth became obvious, people said, ‘Anyway, it is not important.’ And when its importance could not be denied, people said, ‘Anyway, it is not new.” William James (1842–1920).

‘Systems biology’ describes a range of techniques, including the ‘omics and mathematical modelling, for understanding systems ‘as a whole’ 63, 64, and it is widely recognized that metabolomics will have a major part to play in its development [65].

Concluding remarks

The field of metabolomics is gaining increasing interest across all disciplines, including functional genomics, integrative and systems biology, pharmacogenomics, and (surrogate) biomarker discovery for drug discovery and therapy monitoring. As more researchers get ‘tooled up’ for metabolomics, the realization that it is easy to generate floods (or, more accurately, torrents!) of data will become apparent. Thus, in the new postgenomic era of biology, we shall need well-curated databases, very

Acknowledgements

R.G., S.V. and D.B.K. are most grateful to the UK BBSRC, UK EPRSC and UK NERC for supporting our work.

References (88)

  • M.B. Seasholtz et al.

    The parsimony principle applied to multivariate calibration

    Anal. Chim. Acta

    (1993)
  • D.B. Kell

    Genotype:phenotype mapping: genes as computer programs

    Trends Genet.

    (2002)
  • R. Brent

    Genomic biology

    Cell

    (2000)
  • S. Muggleton

    Inductive logic programming: issues, results and the challenge of learning language in logic

    Artif. Intell.

    (1999)
  • D.B. Kell et al.

    Metabolic control theory – its role in microbiology and biotechnology

    FEMS Microbiol. Rev.

    (1986)
  • R.D. Fleischmann

    Whole-genome random sequencing and assembly of Haemophilus influenzae Rd

    Science

    (1995)
  • S. Anderson

    Sequence and organization of the human mitochondrial genome

    Nature

    (1981)
  • F.R. Blattner

    The complete genome sequence of Escherichia coli K-12

    Science

    (1997)
  • J.C. Venter

    The sequence of the human genome

    Science

    (2001)
  • The International Human Genome Mapping Consortium (2001) A physical map of the human genome. Nature 409,...
  • O. Fiehn

    Metabolomics – the link between genotypes and phenotypes

    Plant Mol. Biol.

    (2002)
  • C.W.W. Beecher

    The human metabolome

  • D.A. Fell

    Understanding the Control of Metabolism

    (1996)
  • D.B. Kell et al.

    Snapshots of systems: metabolic control analysis and biotechnology in the post-genomic era

  • L.M. Raamsdonk

    A functional genomics strategy that uses metabolome data to reveal the phenotype of silent mutations

    Nat. Biotechnol.

    (2001)
  • E. Urbanczyk-Wochniak

    Parallel analysis of transcript and metabolic profiles: a new approach in systems biology

    EMBO Rep.

    (2003)
  • I.D. Wilson et al.

    Topics in xenobiochemistry: do metabolic pathways exist for xenobiotics? The micro-metabolism hypothesis

    Xenobiotica

    (2003)
  • M. Mann

    Analysis of proteins and proteomes by mass spectrometry

    Annu. Rev. Biochem.

    (2001)
  • D.B. Kell et al.

    Here is the evidence, now what is the hypothesis? The complementary roles of inductive and hypothesis-driven science in the post-genomic era

    BioEssays

    (2004)
  • J.C. Lindon

    So what's the deal with metabonomics? Metabonomics measures the fingerprint of biochemical perturbations caused by disease, drugs, and toxins

    Anal. Chem.

    (2003)
  • J.K. Nicholson et al.

    Understanding ‘global’ systems biology: metabonomics and the continuum of metabolism

    Nat. Rev. Drug Discov.

    (2003)
  • J.T. Brindle

    Rapid and noninvasive diagnosis of the presence and severity of coronary heart disease using 1H-NMR-based metabonomics

    Nat. Med.

    (2002)
  • R. Shellie

    Concepts and preliminary observations on the triple dimensional analysis of complex volatile samples by using GC×GC-TOF MS

    Anal. Chem.

    (2001)
  • Weckwerth, W. et al. (2001) Metabolomic characterization of transgenic potato plants using GC/TOF and LC/MS analysis...
  • A.L. Duran

    Metabolomics spectral formatting, alignment and conversion tools (MSFACTs)

    Bioinformatics

    (2003)
  • P.T. Ivanova

    Electrospray ionization mass spectrometry analysis of changes in phospholipids in RBL-2H3 mastocytoma cells during degranulation

    Proc. Natl. Acad. Sci. U. S. A.

    (2001)
  • J. Allen

    High-throughput classification of yeast mutants for functional genomics via metabolic footprinting

    Nat. Biotechnol.

    (2003)
  • A. Aharoni

    Nontargeted metabolome analysis by use of Fourier transform ion cyclotron mass spectrometry

    OMICS

    (2002)
  • W. Petrich

    Mid-infrared and Raman spectroscopy for medical diagnostics

    Appl. Spectrosc. Rev.

    (2001)
  • R. Goodacre et al.

    Correction of mass spectral drift using artificial neural networks

    Anal. Chem.

    (1996)
  • A. Brazma

    Minimum information about a microarray experiment (MIAME) – toward standards for microarray data

    Nat. Genet.

    (2001)
  • C.F. Taylor

    A systematic approach to modeling, capturing, and disseminating proteomics experimental data

    Nat. Biotechnol.

    (2003)
  • F. Hardy et al.

    Databases, data modelling and schemas

  • A.-L. Barabási et al.

    Network biology: understanding the cell's functional organization

    Nat. Rev. Genet.

    (2004)
  • Cited by (1051)

    View all citing articles on Scopus
    View full text