Back

clinTALL: machine learning-driven multimodal subtypeclassification and treatment outcome prediction in pediatric T-ALL

Stoiber, L.; Antic, Z.; Rebellato, S.; Fazio, G.; Rademacher, A.; Lenk, L.; Locatelli, F.; Balduzzi, A.; Cario, G.; Rizzari, C.; Cazzaniga, G.; Yu, J.; Bergmann, A. K.

2026-01-30 bioinformatics
10.64898/2026.01.28.701965 bioRxiv
Show abstract

BackgroundChildhood T-lineage acute lymphoblastic leukemia (T-ALL) is an aggressive hematologic malignancy with poor prognosis. Differently from B-cell precursor ALL, T-ALL lacks effective risk stratification strategies. A recent study has integrated whole genome and whole transcriptome data to define over 15 distinct molecular subtypes with prognostic significance. However, clinical translation of this knowledge remains challenging due to the complexity of interpreting high-dimensional multi-omics-based data. MethodsHere, we present clinTALL, a deep learning based multi-task pipeline for pediatric T-ALL subtype classification and treatment outcome estimation. The model integrates multimodal input data and uses a neural network architecture to generate a shared latent embedding for jointly learned multi-task prediction. The competing risk-based model was used to predict event-specific outcomes. The model was trained on a publicly available multimodal dataset comprising clinical, genomic and transcriptomic features of 1309 pediatric T-ALL samples. ResultsWe observed that the transcriptomic-only model achieved superior single modality results, with 92.2% accuracy for subtype prediction and a 65.9% concordance index (C-index) for event-free survival (EFS) in a cross-validation setup. Integrating all data modalities maintained high subtype classification accuracy (91.7%) and improved the overall concordance index for EFS estimation to 67.5%. The competing risk-based model enables accurate predictions of induction failure (C-index = 96.0%) and second malignant neoplasm (C-index = 62.1%). We validated molecular subtype predictions on an internal dataset of 120 pediatric T-ALL samples and obtained an accuracy of 81.8%. To facilitate the broad application of multi-omics based subtype prediction and treatment outcome inference, we provide clinTall as a Docker based application, allowing for user friendly access to the tool. The full source code of clinTALL is available on GitHub (https://github.com/UKWgenommedizin/clinTALL). ConclusionTogether, our machine learning-based framework allows for automated, accurate sub-type classification and treatment outcome inference using multimodal input data, advancing precision risk stratification for pediatric T-ALL.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Leukemia
39 papers in training set
Top 0.1%
22.4%
2
Bioinformatics
1061 papers in training set
Top 3%
10.1%
3
BMC Bioinformatics
383 papers in training set
Top 1%
8.4%
4
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.1%
6.3%
5
Nature Communications
4913 papers in training set
Top 29%
6.3%
50% of probability mass above
6
Bioinformatics Advances
184 papers in training set
Top 0.9%
4.3%
7
Genome Medicine
154 papers in training set
Top 2%
3.6%
8
Nucleic Acids Research
1128 papers in training set
Top 7%
3.1%
9
Briefings in Bioinformatics
326 papers in training set
Top 2%
2.7%
10
Scientific Reports
3102 papers in training set
Top 53%
1.9%
11
npj Precision Oncology
48 papers in training set
Top 0.6%
1.7%
12
Frontiers in Genetics
197 papers in training set
Top 5%
1.7%
13
PLOS ONE
4510 papers in training set
Top 54%
1.7%
14
eLife
5422 papers in training set
Top 43%
1.7%
15
BMC Medical Genomics
36 papers in training set
Top 1%
0.9%
16
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 5%
0.9%
17
Blood Cancer Journal
11 papers in training set
Top 0.3%
0.8%
18
Blood Advances
54 papers in training set
Top 1%
0.8%
19
Cancer Research Communications
46 papers in training set
Top 1%
0.8%
20
PLOS Computational Biology
1633 papers in training set
Top 25%
0.7%
21
Blood
67 papers in training set
Top 1%
0.7%
22
International Journal of Molecular Sciences
453 papers in training set
Top 17%
0.7%
23
Modern Pathology
21 papers in training set
Top 0.6%
0.6%
24
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.9%
0.6%
25
Molecular Oncology
50 papers in training set
Top 1%
0.6%
26
Clinical Cancer Research
58 papers in training set
Top 2%
0.6%