Back

Sociodemographic Bias in Large Language Model Clinical Trial Screening

Soffer, S.; Omar, M.; efros, o.; Apakama, D. U.; Mudrik, A.; Freeman, R.; Nadkarni, G.; Klang, E.

2025-11-17 health informatics
10.1101/2025.11.15.25340177 medRxiv
Show abstract

BackgroundLarge language models (LLMs) are increasingly used in randomized clinical trial (RCT) screening, but their potential for sociodemographic bias remains unclear. ObjectiveTo determine whether LLM-based trial screening judgments vary with patient sociodemographic characteristics when clinical details and eligibility criteria are held constant. Design, Setting, and ParticipantsCross-sectional evaluation of Phase II-III RCT protocols from ClinicalTrials.gov (U.S. adult populations; 2023-2024). For each protocol, we created 15 physician-validated clinical vignettes rendered in 34 versions: one control (no identifiers) and 33 identity variants spanning gender, race/ethnicity, socioeconomic status, homelessness, unemployment, and sexual orientation. ExposuresIdentity labels applied to otherwise identical vignettes, evaluated by nine contemporary LLMs. Main Outcomes and MeasuresPrimary eligibility domain score (1-5 Likert scale) comparing identity variants versus control. Secondary: adherence, resources, risk-benefit, and trust/attitude domains. Mixed-effects models estimated adjusted mean differences with multiplicity-corrected P values; differences <.10 considered trivial. ResultsOf 69 protocols, 58 met inclusion criteria. Analysis of 5,324,400 model evaluations showed eligibility judgments were largely stable: most identity-related differences fell within {+/-}0.05 (transgender woman -.008 [95% CI -.04 to .02]; White male .036 [.01 to .07]). Only homelessness exceeded the trivial threshold (-.121 [-.15 to -.09], P<.001). Secondary domains revealed socioeconomic gradients, particularly for adherence (homeless -.595, P<.001) and resources (homeless -.715, P<.001), with smaller trust/attitude effects and negligible risk-benefit differences. Conclusions and RelevanceBias in LLM-assisted trial screening is conditional. Within fixed criteria, models reason consistently; outside them, they echo the inequities of their data. Responsible deployment in clinical research depends on preserving that boundary so that automation strengthens fairness in trial access rather than inheriting distortion.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Trials
25 papers in training set
Top 0.1%
33.5%
2
BMJ
49 papers in training set
Top 0.1%
8.5%
3
PLOS ONE
4510 papers in training set
Top 27%
6.5%
4
BMJ Open
554 papers in training set
Top 3%
6.4%
50% of probability mass above
5
BMC Medical Research Methodology
43 papers in training set
Top 0.2%
4.0%
6
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.6%
4.0%
7
Journal of Clinical Epidemiology
28 papers in training set
Top 0.1%
4.0%
8
JAMA Network Open
127 papers in training set
Top 0.8%
3.7%
9
Annals of Internal Medicine
27 papers in training set
Top 0.3%
2.1%
10
BMC Medicine
163 papers in training set
Top 3%
1.8%
11
Journal of General Internal Medicine
20 papers in training set
Top 0.5%
1.7%
12
Clinical and Translational Science
21 papers in training set
Top 0.5%
1.4%
13
JMIR Research Protocols
18 papers in training set
Top 0.9%
1.2%
14
JAMA
17 papers in training set
Top 0.2%
1.1%
15
JMIR Public Health and Surveillance
45 papers in training set
Top 3%
0.9%
16
Preventive Medicine Reports
14 papers in training set
Top 0.3%
0.9%
17
Pilot and Feasibility Studies
12 papers in training set
Top 0.5%
0.8%
18
eClinicalMedicine
55 papers in training set
Top 2%
0.8%
19
Scientific Reports
3102 papers in training set
Top 74%
0.8%
20
Journal of the American Heart Association
119 papers in training set
Top 4%
0.7%
21
PLOS Medicine
98 papers in training set
Top 5%
0.7%
22
FACETS
11 papers in training set
Top 0.4%
0.7%
23
eLife
5422 papers in training set
Top 61%
0.7%
24
PLOS Biology
408 papers in training set
Top 23%
0.7%
25
BMJ Mental Health
15 papers in training set
Top 0.5%
0.5%
26
JAMIA Open
37 papers in training set
Top 2%
0.5%
27
BMJ Health & Care Informatics
13 papers in training set
Top 1%
0.5%
28
Journal of Clinical and Translational Science
11 papers in training set
Top 0.6%
0.5%
29
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 7%
0.5%