Application of metabolomics to plant genotype discrimination using statistics 
and machine learning

Taylor, J.; King, R. D.; Altmann, T.; Fiehn, O.

Item

ITEM ACTIONSEXPORT

Add to Basket

Local TagsRelease HistoryDetailsSummary

Released

Conference Paper

Application of metabolomics to plant genotype discrimination using statistics and machine learning

MPS-Authors

/persons/resource/persons97050

Altmann, T.
Developmental Physiology and Genomics, Cooperative Research Groups, Max Planck Institute of Molecular Plant Physiology, Max Planck Society;

/persons/resource/persons97148

Fiehn, O.
Metabolomic Analysis, Department Willmitzer, Max Planck Institute of Molecular Plant Physiology, Max Planck Society;

External Resource

No external resources are shared

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

Fulltext (public)

There are no public fulltexts stored in PuRe

Supplementary Material (public)

There is no public supplementary material available

Citation

Taylor, J., King, R. D., Altmann, T., & Fiehn, O. (2002). Application of metabolomics to plant genotype discrimination using statistics and machine learning. In European Conference on Computational Biology (ECCB 2002) (pp. 241-248).

Cite as: https://hdl.handle.net/11858/00-001M-0000-0014-2E68-2

Abstract

Motivation: Metabolomics is a post genomic technology which seeks to provide a comprehensive profile of all the metabolites present in a biological sample. This complements the mRNA profiles provided by microarrays, and the protein profiles provided by proteomics. To test the power of metabolome analysis we selected the problem of discrimating between related genotypes of Arabidopsis. Specifically, the problem tackled was to discrimate between two background genotypes (Col0 and C24) and, more significantly, the offspring produced by the crossbreeding of these two lines, the progeny (whose genotypes would differ only in their maternally inherited mitichondia and chloroplasts). Overview: A gas chromotography-mass spectrometry (GCMS) profiling protocol was used to identify 433 metabolites in the samples. The metabolomic profiles were compared using descriptive statistics which indicated that key primary metabolites vary more than other metabolites. We then applied neural networks to discriminate between the genotypes. This showed clearly that the two background lines can be discrimated between each other and their progeny, and indicated that the two progeny lines can also be discriminated. We applied Euclidean hierarchical and Principal Component Analysis (PCA) to help understand the basis of genotype discrimination. PCA indicated that malic acid and citrate are the two most important metabolites for discriminating between the background lines, and glucose and fructose are two most important metabolites for discriminating between the crosses. These results are consistant with genotype differences in mitochondia and chloroplasts.