Back

Propensity-score matching with GAN-generated observations from electronic health records: simulation study and application to the evaluation of prone positioning in COVID-19 patients under mechanical ventilation

Bouvarel, B.; Glemain, B.; Carrat, F.; Lapidus, N.

2025-08-01 health informatics
10.1101/2025.07.31.25332504 medRxiv
Show abstract

BackgroundPropensity score (PS) methods are widely used in observational studies to estimate causal effects, but they often exclude patients due to a lack of comparable counterparts, leading to reduced power and potential bias. Generative adversarial networks (GANs) have shown promise in creating synthetic data, but their application to causal inference remains underexplored. Synthetic data could be used as plausible counterfactuals, potentially mitigating the issues of the PS methods. This study evaluates the integration of GAN-generated synthetic observations into propensity score matching (PSM) to improve the emulation of RCTs, using both simulated and real-world electronic health record (EHR) data. MethodsA simulation study was conducted using with predefined confounding structures to compare traditional PSM against two hybrid approaches incorporating GAN-generated synthetic patients to partially or fully match the original sample of patients. Treatment effects were estimated via logistic regression, and performance was assessed by bias, standard error, alpha risk, power, and confidence interval coverage. The methods were applied to a real-world dataset of mechanically ventilated COVID-19 patients to evaluate the impact of early prone positioning on 28-day mortality. ResultsIn simulations, GAN-generated patients permitted to match all patients in the original sample, whereas PSM dropped up to 60% of them. While synthetic augmentation improved sample size, unadjusted use of synthetic matches led to underestimated standard errors and inflated type I error. Down-weighting matched synthetic data improved error control but did not consistently outperform PSM in bias or power. In the real-world application (n=1399), treatment effect estimates for prone positioning were similar across all methods and did not reach statistical significance. ConclusionGAN-augmented propensity score matching can reduce sample loss. However, its current application in causal inference through PS matching remains limited. Synthetic data do not contribute independent information and must be cautiously integrated to avoid misleading precision. While promising, current GAN implementations require methodological refinements before routine use in causal inference.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
BMC Medical Research Methodology
43 papers in training set
Top 0.1%
33.1%
2
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.1%
19.5%
50% of probability mass above
3
PLOS ONE
4510 papers in training set
Top 33%
4.4%
4
Journal of Clinical Epidemiology
28 papers in training set
Top 0.1%
3.6%
5
JMIR Medical Informatics
17 papers in training set
Top 0.3%
3.6%
6
BMJ Open
554 papers in training set
Top 6%
3.6%
7
Journal of Biomedical Informatics
45 papers in training set
Top 0.4%
3.6%
8
BMJ
49 papers in training set
Top 0.4%
2.4%
9
Scientific Reports
3102 papers in training set
Top 53%
1.9%
10
Trials
25 papers in training set
Top 0.7%
1.8%
11
BMC Medicine
163 papers in training set
Top 3%
1.7%
12
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.2%
13
JAMIA Open
37 papers in training set
Top 1%
1.1%
14
European Respiratory Journal
54 papers in training set
Top 1%
1.0%
15
BMJ Health & Care Informatics
13 papers in training set
Top 0.7%
0.9%
16
Nature Communications
4913 papers in training set
Top 60%
0.9%
17
The Lancet Digital Health
25 papers in training set
Top 0.8%
0.9%
18
npj Digital Medicine
97 papers in training set
Top 3%
0.9%
19
JAMA Network Open
127 papers in training set
Top 4%
0.8%
20
Journal of Medical Internet Research
85 papers in training set
Top 4%
0.8%
21
PLOS Medicine
98 papers in training set
Top 5%
0.7%
22
Heliyon
146 papers in training set
Top 8%
0.6%
23
Frontiers in Pharmacology
100 papers in training set
Top 5%
0.6%
24
Clinical Cancer Research
58 papers in training set
Top 2%
0.6%