Professional Documents
Culture Documents
Fernie AR Trethewey RN Krotzky AJ Willmitzer L Met
Fernie AR Trethewey RN Krotzky AJ Willmitzer L Met
Box 1 | Metabolite profiling by mass spectrometry instrumentation are consequently also well
developed. Furthermore, despite limitations
m/z 245 73 in sensitivity and, as a result, severe restric-
18.229 min
b Malate
tions in metabolic coverage, the use of NMR
is clearly advantageous for certain biological
questions (see BOX 2).
m/z 304 245
18.249 min m/z
Challenges ahead. At present, even the combi-
GABA nation of a wide range of analytical tools
174
allows us to see only a portion of the total
304 metabolite complement of the cell (FIG. 1).
m/z 351
18.191 min This is largely due to the diversity of cellular
unknown metabolites. Escherichia coli are estimated to
contain ~750 metabolites14, whereas the total
a m/z number of metabolites that are present in the
plant kingdom has been estimated to be of
the order of hundreds of thousands15. Current
estimates vary greatly, but it seems likely that a
typical eukaryotic organism contains between
4,000 and 20,000 metabolites.
However, it is not merely a problem of
numbers — the diversity itself yields great
complexity because, unlike RNA or proteins,
which are composites of highly similar small
molecules, the physical and chemical proper-
ties of metabolites are highly divergent. This
4
Retention time (min)
50 divergence means that there is no single
extraction process that does not incur sub-
stantial loss to some of the cellular metabo-
Mass-spectrometry-based metabolite profiling intrinsically generates large and complex datasets.
lites, let alone a single analytical platform that
A typical gas-chromatography–mass-spectrometry (GC–MS) profile of a biological sample contains
can measure all metabolites. So, one main
mass-intensity data from 300–500 analytes in a file of around 20 megabytes. So, it has been a
considerable challenge to develop software and algorithms that allow the extraction and
problem for the analysis of metabolites is the
quantification of many individual component analytes. Good software has long been widely lack of a totally comprehensive approach. A
available for finding anticipated analytes within a chromatographic run and for providing a good second problem that needs to be addressed is
integration of the signal. However, there has been recent interest in the application of deconvolution the high proportion of unknown analytes
algorithms that provide an electronic separation of component peaks without prior knowledge of that is measured in metabolite profiling.
these components. The most widely used algorithm in this respect is AMDIS (‘automated mass Typically, in current GC–MS-based metabo-
spectral deconvolution and identification system’; REF. 48, see online links box), and recent studies lite profiling, a chemical structure has been
have looked specifically at deconvolution strategies in the context of metabolite-profiling unambiguously assigned in only 20–30% of
applications49. The figure shows a total ion chromatogram of the polar phase of an Arabidopsis the analytes detected16. The solutions to both
thaliana extract (panel a), and the deconvoluted spectra of a single peak of this chromatogram problems will not be rapid and will require
reveals three metabolite signals that can be clearly discriminated (panel b). As metabolite profiling successive iterative improvement. Sample
normally requires the assessment of sample and control data from multiple chromatograms, it is pre-processing, such as solid-phase extrac-
also necessary to develop software that ensures that the correct peak is identified and integrated in tion (for example, see REF. 17), and the full
each chromatogram50. In addition, high-throughput operations require software that ensures that integration of a wider range of chemistry-
the data generated by different machines are comparable. Although steps are being taken to tackle specific measurement platforms could
these issues (see REF. 51), progress will probably be made through a wider application of improve metabolite coverage.
chemometric methods52. GABA, γ-aminobutyric acid; m/z, ratio of mass to charge. Figure modified, In the longer term, new technologies
with permission, from Nature Biotechnology REF. 5 © (2000) Macmillan Magazines Ltd. might ameliorate the problem of limited
metabolite coverage. However, the level of
complexity and other factors, such as the
importance of isomers in biology, is an impor- validated and shown to provide a rich source dynamic measurement range that is required,
tant hindrance. In addition, although this of metabolic data on over 1,000 metabolites make the development of a single analytical
technology is widely discussed, there is, at pre- from Bacillus subtilis extracts13. platform that will provide a comprehensive
sent, only a single documented report, which The main alternative to mass-spectrome- analysis of the cellular metabolome a formi-
failed to provide a robust validation of the try-based approaches for metabolic profiling is dable challenge. The problem of chemical
methodology12. More data are available for provided by NMR approaches. These (and biochemical) identification of detected
CE–MS, which is a highly sensitive methodol- methods offer the advantage that, because analytes also represents an arduous task.
ogy that can detect low-abundance metabo- they have been used for years, they have Again, one approach to tackle this problem
lites and that provides good analyte separa- been subjected to intensive validation, and would be improved instrumentation and pro-
tion. CE–MS methods have been rigorously CHEMOMETRICS that are associated with NMR tocols. An obvious example of such an
t2
NMR. Such work will need to be supported
β-glucose 0.00
by the establishment of standard operating
α-glucose
procedures and database resources that are Cholesterol C18 –0.20
Lipid CH
available to the academic community. –0.40
a
Although measuring the presence or absence b –0.80 –0.40 0.00 0.40 0.60
Chemical shift (δ) t1
of a metabolite in a biological sample is in
some cases sufficient — for example, in drug Despite recent improvements in instrumentation, NMR technologies mostly have a lower
detection or in the SUBSTANTIAL EQUIVALENCE sensitivity compared with mass-spectrometry methodologies. However, computational analyses
17,18
TESTING of foodstuffs — quantitative analy- and the chemometric software associated with NMR are highly developed. Furthermore, NMR is
sis is often necessary. As is common practice non-invasive, which can be a particularly important advantage in certain situations. Although
in transcript and protein profiling, metabolite recent methodologies that couple NMR to liquid chromatography and solid-phase extraction
profiles are often expressed as ratio data com- columns to improve separation and sensitivity have been reported52,53, it is unlikely that NMR
pared with a control sample. However, there will ever attain the sensitivity of mass spectrometry. NMR is, however, important in the
are also many applications where absolute unequivocal determination of metabolite structures54 and, perhaps owing to its non-invasive
quantification is required. This is important nature, is more commonly used in mammalian systems than mass-spectrometry technologies.
to allow comparison between different sam- Perhaps the most powerful example so far of the use of NMR for metabolic profiling is provided
ple types; for example, those that originate by the work of the Nicholson group, who acquired and evaluated proton NMR (1H-NMR) spectra of
from different tissues within an organism. In human serum21. They were able to distinguish individuals with coronary heart disease from healthy
addition, absolute quantification is important individuals, and even to assess the severity of the disease. The 600-MHz 1H-NMR spectra of a
for understanding metabolic networks, as it is healthy patient (panel a) and a patient with severe artheriosclerosis (panel b) are shown. Partial least
necessary for the calculation of atomic bal- squares discriminant analysis of the obtained spectra shows the difference between healthy
(triangle) and diseased (square) individuals (panel c). This technique is therefore capable of rapid,
ances or for using kinetic properties of
non-invasive diagnosis of coronary heart disease, and could be used in population screening or in
enzymes to develop predictive models.
the testing of ameliorative drug treatments. It is currently undergoing clinical evaluation. t1 and t2
Irrespective of whether data are repre-
represent the combined variance observed across many metabolites (with t1 representing the most
sented as ratios or as absolute values, there variable metabolites). These values are used to discriminate between biological samples using
are significant chemical, biological and tech- analyses that are fundamentally similar to principal component analyses (see Glossary). LDL, low-
nical issues associated with achieving a vali- density lipoprotein; VLDL, very-low-density lipoprotein. Figure modified, with permission, from
dated quantitative dataset. Great care has to Nature Medicine REF. 21 © (2002) Macmillan Magazines Ltd.
be taken in experimental design and sam-
pling. The chemical and technical methods
that are used need to be fully appraised for then be extended by one dimension to look at Diagnostics. Metabolic profiles have been
the intrinsic variability of the system16,17 and, the correlative behaviour of individual metabo- widely used, in conjunction with statistical
ideally, the limits of detection and limits of lites across different samples4, and such correla- tools, for diagnosis. One of the first examples of
quantification of each metabolite should be tive behaviour can be built into networks23. the use of metabolite profiling in diagnostics
known. This process can be facilitated by the However, much recent interest has been was the estimation of modes of action of vari-
use of standard reference substances, and focused on grouping approaches for whole ous herbicides19. The authors carried out gas-
would ideally involve a full complement of profiles, and both SUPERVISED and UNSUPERVISED chromatography profiling of barley seeds that
isotopically labelled metabolites. Because methods have been used4,5,22,23. Working with had been treated with sublethal doses of vari-
of the interdisciplinary nature of these prob- such large data sets, a direct visualization of the ous herbicides, and compared the profiles
lems, they tend to be underestimated. It will results onto pathways is highly advantageous to obtained with those of untreated plants. This
be a challenge for this emerging technology support the interpretation of the data6–8 (see approach could discriminate between known
to establish standard operating procedures online links box). acetyl CoA carboxylase (ACC) and acetolactate
to avoid the publication of flawed datasets. synthase (ALS) inhibitors on the basis of a
Once quantitative datasets have been Applications of metabolite profiling visual comparison of metabolite profiles of
obtained, there is a wide range of data-analysis Improvements in metabolite-profiling tech- plants treated with these herbicides. This
strategies that can be pursued with metabolite nologies have already been effective in indicates that the mode of action of bioregula-
profiles4,16,19–22. The fundamental approach is, of addressing biological problems. Until recently, tors might be diagnosed by the analysis of
course, to compare the level of a metabolite the main applications have been in the area of reflective patterns of metabolite composition.
between an experimental and a control sample, diagnostics, but now the first applications of Recently, a more sophisticated analysis of the
and to use standard statistics to assess the signif- metabolite profiling to functional genomics mode of action has been carried out by Ott
icance of any differences. These approaches can and systems biology have been reported. and colleagues, who used a combination of
13. Soga, T. et al. Quantitative metabolome analysis using 28. Rashed, M. S. et al. Screening blood spots for inborn 47. Gygi, S. P. et al. Quantitative analysis of complex protein
capillary electrophoresis mass spectrometry. J. Proteome errors of metabolism by electrospray tandem mass mixtures using isotope-coded affinity tags. Nature
Res. 2, 488–494 (2003). spectrometry with a microplate batch process and a Biotech. 17, 994–999 (1999).
14. Nobeli, I., Krissinel, E. B. & Thornton, J. M. B. computer algorithm from automated flagging of abnormal 48. Stein, S. E. An integrated method for spectrum
A structure-based anatomy of the E. coli metabolome. profiles. Clin. Chem. 43, 1129–1141 (1997). extraction and compound identification from GC/MS
J. Mol. Biol. 334, 697–719 (2003). 29. Martzen, M. R. et al. A biochemical genomics approach data. J. Am. Soc. Mass Spectrom. 10, 770–781
15. Hall, R. et al. Plant metabolomics: the missing link in for identifying genes by the activity of their products. (1999).
functional genomics strategies. Plant Cell 14, 1437–1440 Science 286, 1153–1155 (1999). 49. Wagner, C., Sefkow, M. & Kopka, J. Construction and
(2002). 30. Trethewey, R. N., Krotzky, A. J. & Willmitzer, L. Metabolic application of a mass spectral and retention time index
16. Roessner-Tunali, U. et al. Metabolic profiling of transgenic profiling: a Rosetta stone for genomics? Curr. Opin. database generated from plant GC/EI-TOF-MS
tomato plants overexpressing hexokinase reveals that the Plant Biol. 2, 83–85 (1999). metabolite profiles. Phytochemistry 62, 887–900
influence of hexose phosphorylation diminishes during 31. Raamsdonk, L. M. et al. A functional genomics strategy (2003).
fruit development. Plant Physiol. 133, 84–99 (2003). that uses metabolome data to reveal the phenotype of 50. Frenzel, T., Miller, A. & Engel, K. H. A methodology for
17. Walles, M. et al. Verapamil drug metabolism studies by silent mutations. Nature Biotech. 19, 45–50 (2001). automated comparative analysis of metabolite profiling
automated in-tube solid phase microextraction. 32. Fehr, M., Lalonde, S., Lager, I., Wolff, M. W. & Frommer, data. Eur. Food Res. Technol. 216, 335–342 (2003).
J. Pharma. Biomed. Anal. 30, 307–319 (2002). W. B. In vivo imaging of the dynamics of glucose uptake 51. Duran, A. L., Yang, J., Wang, L. & Sumner, L. W.
18. Kok, E. J. & Kuiper, H. A. Comparative safety in the cytosol of COS-7 cells by fluorescent nanosensors. Metabolomics spectral formatting, alignment and
assessment for biotech crops. Trends Biotech. 21, J. Biol. Chem. 278, 19127–19133 (2003). conversion tools (MSFACTs). Bioinformatics 19,
438–444 (2003). 33. Barabasi, A. L. & Oltvai, Z. N. Network biology: 2283–2293 (2003).
19. Sauter, H., Lauer, M. & Fritsch, H. Metabolite profiling of understanding the cell’s functional organisation. 52. Waisim, M., Hassan, M. S. & Brereton, R. G. Evaluation
plants — a new diagnostic technique. Abstr. Pap. Am. Nature Rev. Genet. 5, 101–113 (2004). of chemometric methods for determining the number and
Chem. Soc. 195, 129 (1988). 34. Wagner, A. & Fell, D. A. The small world inside large position of components in high-performance liquid
20. Allen, J. et al. High-throughput classification of yeast metabolic networks. Proc. R. Soc. Lond. B 268, chromatography detected by diode array detector by
mutants for functional genomics using metabolic 1803–1810 (2001). diode array detector and on-flow 1H nuclear magnetic
footprinting. Nature Biotech. 21, 692–696 (2003). 35. Kamath, R. S. et al. Systematic functional analysis of the resonance spectroscopy. Analyst 128, 1082–1090
21. Brindle, J. T. et al. Rapid and noninvasive diagnosis of the Caenorhabditis elegans genome using RNAi. Nature 421, (2003).
presence and severity of coronary heart disease using 231–237 (2003). 53. Lindon, J. C. HPLC-NMR-MS: past, present and future.
1
H-NMR-based metabonomics. Nature Med. 8, 36. Kacser, H. & Burns, J. A. The control of flux. Symposia Drug Discov. Today 8, 1021–1022 (2003).
1439–1444 (2002). Soc. Exp. Biol. 28, 65–104 (1974). 54. Meiler, J. & Will, M. Genius: a genetic algorithm for
22. Huhman, D. V. & Sumner, L. W. Metabolic profiling of 37. Fernie, A. R. et al. Metabolic profiling at the genome level. automated structure elucidation from 13C NMR spectra.
saponins in Medicago sativa and Medicago trunculata Plant Animal Genome Abstr. XI, W307 (2003). J. Am. Chem. Soc. 124, 1868–1870 (2002).
using HPLC coupled to an electrospray 38. Kitano, H. Perspectives on systems biology. New
ion-trap mass spectrometer. Phytochemistry 59, Generation Comput. 18, 199–216 (2000). Acknowledgements
347–360 (2002). 39. Ideker, T., Galitski, T. & Hood, L. A new approach to The authors thank O. Schmitz for assistance in the preparation of
23. Kose, F., Weckwerth, W., Linke, T. & Fiehn, O. Visualizing decoding life: systems biology. Annu. Rev. Genomics figure 2. Software used in the preparation of figure 2 was provided
plant metabolomic correlation networks using clique- Hum. Genet. 2, 343–372 (2001). by OmniViz Inc.
metabolite matrices. Bioinformatics 17, 1198–1208 40. Edwards, J. S., Ibarra, R. U. & Palsson, B. O. In silico
(2001). predictions of Escherichia coli metabolic capabilities are Competing interests statement
24. Aranibar, N., Singh, B. K., Stockton, G. W. & Ott, K. H. consistent with experimental data. Nature Biotech. 19, The authors declare no competing financial interests.
Automated mode-of-action detection by metabolic 125–130 (1999).
profiling. Biochem. Biophys. Res. Commun. 286, 41. Baliga, N. S. et al. Coordinate regulation of energy
150–155 (2001). transduction modules in Halobacterium sp. analyzed by a Online links
25. Quackenbush, J. Computational analysis of microarray global systems approach. Proc. Natl Acad. Sci. USA 99,
data. Nature Rev. Genet. 2, 418–427 (2001). 14913–14918 (2002). FURTHER INFORMATION
26. Griffin, J. L. et al. NMR spectroscopy based 42. Davidson E. H. et al. A genomic regulatory network for AMDIS:
metabonomic studies on the comparative biochemistry development. Science 295, 1669–1678 (2002). http://chemdata.nist.gov/mass-spc/amdis/overview.html
of the kidney and urine of the bank vole (Clethrionomys 43. Nicholson, J. K. & Wilson I. D. Understanding ‘global’ BRENDA enzyme database: www.brenda.uni-koeln.de
glareolus), wood mouse (Apodemus sylvaticus), white systems biology: metabonomics and the continuum of GenMAPP: www.genmapp.org
toothed shrew (Crocidura suaveolens) and the laboratory metabolism. Nature Rev. Drug Discov. 2, 668–676 (2003). Kyoto Encyclopedia of Genes and Genomes:
rat. Comp. Biochem. Physiol. B 127, 357–367 (2000). 44. Weckwerth, W. Metabolomics in systems biology. Annu. http://www.genome.ad.jp/kegg/
27. Kaderbhai, N. N., Broadhurst, D. I., Ellis, D. I., Goodacre, R. Rev. Plant Biol. 54, 669–689 (2003). MapMan: http://gabi.rzpd.de/projects/MapMan
& Kell, D. B. Functional genomics via metabolic 45. Urbanczyk-Wochniak, E. et al. Parallel analysis of Max-Planck-Institut für Molekulare Pflanzenphysiologie:
footprinting: monitoring metabolite secretion by transcript and metabolic profiles: a new approach in www.mpimp-golm.mpg.de
Escherichia coli tryptophan metabolism mutants using systems biology. EMBO Reports 4, 989–993 (2003). Metabolon: http://www.metabolon.com
FT-IR and direct injection electrospray mass 46. Askenazi, M. et al. Integrating transcriptional and metabolite metanomics: http://www.metanomics.de
spectrometry. Comp. Funct. Genomics 4, 376–391 profiles to direct the engineering of Iovastatin-producing Paradigm Genetics: http://www.paradigmgenetics.com
(2003). fungal strains. Nature Biotech. 21, 150–156 (2003). Access to this interactive links box is free online.