Back

Counterfactual prediction of treatment effects on irregular clinical data using Time-Aware G-Transformers

Hornak, G.; Heinolainen, A.; Solyomvari, K.; Silen, S.; Renkonen, R.; Koskinen, M.

2026-04-02 health informatics
10.64898/2026.04.01.26349920 medRxiv
Show abstract

Selecting an effective treatment relies on accurately anticipating patient's response to alternative interventions. However, forecasting longitudinal clinical trajectories remains difficult because electronic health records contain heterogeneous, irregularly sampled data over extended time periods. These issues are especially relevant for laboratory measurements, which are central for diagnostics, assessment of therapeutic responses, and tracking disease progression in routine clinical practice. However, existing deep learning methods for counterfactual prediction usually assume regularly sampled data, an assumption incompatible with the irregular, heterogeneous data-generation processes of real-world clinical practice. Here we present the Time-Aware G-Transformer, which integrates causal G-computation with time-aware attention to predict counterfactual outcomes on irregular data. By explicitly conditioning on the timing of future observations and encoding measurement patterns, the model captures temporal dynamics that previous methods overlook. Evaluated on synthetic tumor growth data and on 90,753 cancer patient trajectories from an academic medical center, our approach demonstrates superior long-horizon (> 1 day) prediction accuracy and uncertainty calibration compared to state-of-the-art baselines. These results demonstrate that embedding temporal relations directly into the attention mechanism enables robust integration of patient history data for evaluating potential treatment strategies in personalized medicine.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.1%
26.4%
2
Nature Machine Intelligence
61 papers in training set
Top 0.2%
8.6%
3
Nature Biomedical Engineering
42 papers in training set
Top 0.1%
8.6%
4
Scientific Reports
3102 papers in training set
Top 27%
4.4%
5
Patterns
70 papers in training set
Top 0.1%
4.4%
50% of probability mass above
6
Nature Communications
4913 papers in training set
Top 36%
4.0%
7
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.4%
4.0%
8
Science Advances
1098 papers in training set
Top 5%
3.7%
9
Journal of Biomedical Informatics
45 papers in training set
Top 0.6%
2.7%
10
Medical Image Analysis
33 papers in training set
Top 0.4%
2.5%
11
Communications Medicine
85 papers in training set
Top 0.1%
2.1%
12
Bioinformatics
1061 papers in training set
Top 6%
2.1%
13
Advanced Science
249 papers in training set
Top 9%
1.9%
14
Cell Reports Medicine
140 papers in training set
Top 4%
1.7%
15
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.4%
16
Med
38 papers in training set
Top 0.5%
1.0%
17
JMIR Medical Informatics
17 papers in training set
Top 1%
0.9%
18
PNAS Nexus
147 papers in training set
Top 1.0%
0.9%
19
Journal of Medical Internet Research
85 papers in training set
Top 4%
0.9%
20
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.7%
0.8%
21
eBioMedicine
130 papers in training set
Top 3%
0.8%
22
Communications Biology
886 papers in training set
Top 23%
0.8%
23
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
0.8%
24
PLOS ONE
4510 papers in training set
Top 67%
0.8%
25
European Respiratory Journal
54 papers in training set
Top 2%
0.8%
26
eLife
5422 papers in training set
Top 59%
0.7%
27
iScience
1063 papers in training set
Top 36%
0.7%
28
Briefings in Bioinformatics
326 papers in training set
Top 8%
0.5%
29
Journal of Infection
71 papers in training set
Top 4%
0.5%
30
PLOS Computational Biology
1633 papers in training set
Top 28%
0.5%