Benchmarking foundation models for improving confounding control in target trial emulation

Kleper, S. L.; Melamed, R. D.

2026-05-13 epidemiology

10.64898/2026.05.09.26352820 medRxiv

Show abstract

Machine learning models for causal inference aim to adjust for confounding factors that are associated with both an exposure and an outcome, creating a spurious biased association. But, these methods are rarely empirically evaluated to assess their success in mitigating such bias. Recent advances in knowledge representation, including both foundation models and knowledge graphs, could enrich these models, but rigorous evaluations are needed in order to assess their potential. Here, we ask whether enriching existing causal inference models with knowledge representations from foundation models can improve confounding control. Rather than using semi-simulated data to address this question, we focus on examples of real confounding: we emulate target randomized active comparator trials that are subject to confounding by indication. Our results can guide researchers aiming to develop or apply methods for discovering causal effects from observational data.

Matching journals

●Non-profit ◐University press ○Commercial

The top 6 journals account for 50% of the predicted probability mass.

Only show non-profit

BMC Medical Research Methodology

○ 47 papers in training set

Clinical Trials

○ 11 papers in training set

Statistics in Medicine

○ 40 papers in training set

● 5828 papers in training set

Journal of Clinical Epidemiology

○ 31 papers in training set

American Journal of Epidemiology

◐ 67 papers in training set

50% of probability mass above

● 5266 papers in training set

○ 29 papers in training set

Research Synthesis Methods

○ 20 papers in training set

Nature Communications

○ 5641 papers in training set

○ 32 papers in training set

PLOS Computational Biology

● 1863 papers in training set

International Journal of Epidemiology

◐ 88 papers in training set

Pharmacoepidemiology and Drug Safety

○ 18 papers in training set

◐ 23 papers in training set

Statistical Methods in Medical Research

○ 11 papers in training set

Value in Health

○ 11 papers in training set

npj Digital Medicine

○ 118 papers in training set

● 486 papers in training set

○ 176 papers in training set

Scientific Reports

○ 3612 papers in training set

Journal of Biomedical Informatics

○ 47 papers in training set

○ 116 papers in training set

Medical Decision Making

○ 12 papers in training set

Proceedings of the National Academy of Sciences

● 2444 papers in training set