Back

Gene expression is a poor predictor of the metabolite abundance in cancer cells

Li, H.; Barbour, J. A.; Zhu, X.; Wong, J. W. H.

2021-12-20 bioinformatics
10.1101/2021.12.19.473333 bioRxiv
Show abstract

Metabolic reprogramming is a hallmark of cancer characterized by global changes in metabolite levels. However, compared with the study of gene expression, profiling of metabolites in cancer samples remains relatively understudied. We obtained metabolomic profiling and gene expression data from 454 human solid cancer cell lines across 24 cancer types from the Cancer Cell Line Encyclopedia (CCLE) database, to evaluate the feasibility of inferring metabolite levels from gene expression data. For each metabolite, we trained multivariable LASSO regression models to identify gene sets that are most predictive of the level of each metabolite profiled. Even when accounting for cell culture conditions or cell lineage in the model, few metabolites could be accurately predicted. In some cases, the inclusion of the upstream and downstream metabolites improved prediction accuracy, suggesting that gene expression is a poor predictor of steady-state metabolite levels. Our analysis uncovered a single robust relationship between the expression of nicotinamide N-methyltransferase (NNMT) and 1-methylnicotinamide (MNA), however, this relationship could only be validated in cancer samples with high purity, as NNMT is not expressed in immune cells. Together, our findings reveal the challenge of inferring metabolite levels from metabolic enzyme levels and suggest that direct metabolomic profiling is necessary to study metabolism in cancer.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Metabolites
50 papers in training set
Top 0.1%
23.7%
2
Cancer Research Communications
46 papers in training set
Top 0.1%
7.5%
3
Scientific Reports
3102 papers in training set
Top 21%
5.1%
4
Genome Medicine
154 papers in training set
Top 1%
4.5%
5
eLife
5422 papers in training set
Top 21%
4.2%
6
PLOS Computational Biology
1633 papers in training set
Top 9%
3.8%
7
Molecular Systems Biology
142 papers in training set
Top 0.2%
3.8%
50% of probability mass above
8
Cell Chemical Biology
81 papers in training set
Top 1.0%
2.7%
9
Nature Communications
4913 papers in training set
Top 45%
2.5%
10
Metabolic Engineering
68 papers in training set
Top 0.3%
2.2%
11
BMC Bioinformatics
383 papers in training set
Top 4%
2.0%
12
Bioinformatics
1061 papers in training set
Top 7%
1.8%
13
Cancer Research
116 papers in training set
Top 2%
1.6%
14
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.4%
15
iScience
1063 papers in training set
Top 19%
1.4%
16
Cell Systems
167 papers in training set
Top 9%
1.3%
17
Bioinformatics Advances
184 papers in training set
Top 3%
1.3%
18
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.0%
19
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
1.0%
20
npj Systems Biology and Applications
99 papers in training set
Top 2%
1.0%
21
Genome Research
409 papers in training set
Top 3%
1.0%
22
Cell Reports
1338 papers in training set
Top 30%
0.9%
23
Nucleic Acids Research
1128 papers in training set
Top 15%
0.9%
24
Cancers
200 papers in training set
Top 4%
0.8%
25
mSystems
361 papers in training set
Top 7%
0.8%
26
Cell Metabolism
49 papers in training set
Top 2%
0.8%
27
Cell Genomics
162 papers in training set
Top 6%
0.8%
28
Genome Biology
555 papers in training set
Top 7%
0.8%
29
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
30
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 6%
0.8%