Back

A novel Bayesian model for assessing intratumor heterogeneity of tumor infiltrating leukocytes with multi-region gene expression sequencing

Yang, P.; Hubert, S. M.; Futreal, P. A.; Song, X.; Zhang, J.; Lee, J. J.; Wistuba, I.; Yuan, Y.; Zhang, J.; Li, Z.

2023-10-29 bioinformatics
10.1101/2023.10.24.563820 bioRxiv
Show abstract

Intratumor heterogeneity (ITH) of tumor-infiltrated leukocytes (TILs) is an important phenomenon of cancer biology with potentially profound clinical impacts. Multiregion gene expression sequencing data provide a promising opportunity that allows for explorations of TILs and their intratumor heterogeneity for each subject. Although several existing methods are available to infer the proportions of TILs, considerable methodological gaps exist for evaluating intratumor heterogeneity of TILs with multi-region gene expression data. Here, we develop ICeITH, immune cell estimation reveals intratumor heterogeneity, a Bayesian hierarchical model that borrows cell type profiles as prior knowledge to decompose mixed bulk data while accounting for the within-subject correlations among tumor samples. ICeITH quantifies intratumor heterogeneity by the variability of targeted cellular compositions. Through extensive simulation studies, we demonstrate that ICeITH is more accurate in measuring relative cellular abundance and evaluating intratumor heterogeneity compared with existing methods. We also assess the ability of ICeITH to stratify patients by their intratumor heterogeneity score and associate the estimations with the survival outcomes. Finally, we apply ICeITH to two multi-region gene expression datasets from lung cancer studies to classify patients into different risk groups according to the ITH estimations of targeted TILs that shape either pro- or anti-tumor processes. In conclusion, ICeITH is a useful tool to evaluate intratumor heterogeneity of TILs from multi-region gene expression data.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.2%
2
Briefings in Bioinformatics
326 papers in training set
Top 0.3%
12.4%
3
Biometrics
22 papers in training set
Top 0.1%
10.3%
4
PLOS Computational Biology
1633 papers in training set
Top 3%
10.0%
5
BMC Bioinformatics
383 papers in training set
Top 2%
4.3%
50% of probability mass above
6
Frontiers in Genetics
197 papers in training set
Top 2%
3.9%
7
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 2%
2.8%
8
PLOS ONE
4510 papers in training set
Top 44%
2.7%
9
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.6%
10
Nucleic Acids Research
1128 papers in training set
Top 9%
2.1%
11
Genome Medicine
154 papers in training set
Top 4%
2.1%
12
Scientific Reports
3102 papers in training set
Top 59%
1.7%
13
The Annals of Applied Statistics
15 papers in training set
Top 0.1%
1.7%
14
npj Systems Biology and Applications
99 papers in training set
Top 1%
1.7%
15
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.7%
16
iScience
1063 papers in training set
Top 18%
1.5%
17
Nature Communications
4913 papers in training set
Top 55%
1.3%
18
Patterns
70 papers in training set
Top 1%
1.2%
19
Quantitative Biology
11 papers in training set
Top 0.4%
1.2%
20
Cell Reports Methods
141 papers in training set
Top 4%
0.9%
21
GigaScience
172 papers in training set
Top 3%
0.8%
22
PLOS Genetics
756 papers in training set
Top 14%
0.8%
23
Genome Biology
555 papers in training set
Top 7%
0.7%
24
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.7%
25
Cancer Research Communications
46 papers in training set
Top 1%
0.7%
26
Communications Biology
886 papers in training set
Top 24%
0.7%
27
Advanced Science
249 papers in training set
Top 20%
0.7%
28
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%
29
Frontiers in Immunology
586 papers in training set
Top 8%
0.7%
30
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.8%
0.6%