Mitigating Automation Bias in Physician-LLM Diagnostic Reasoning Using Behavioral Nudges: A Randomized Controlled Trial

Qazi, I. A.; Ali, A.; Khawaja, A. U.; Akhtar, M. J.; Sheikh, A. Z.; Alizai, M. H.

2026-06-02 health informatics
10.64898/2026.06.01.26354596 medRxiv

As large language models (LLMs) enter clinical workflows, automation bias, the uncritical acceptance of automated output, poses a patient-safety risk. Optimal physician-AI collaboration requires trust calibration: matching scrutiny to LLM recommendation accuracy. We report a randomized trial evaluating a behavioral nudge to mitigate automation bias. Seventy-two AI-trained physicians were randomized to evaluate six vignettes alongside ChatGPT-5.1 recommendations, consulted at each physician's discretion; three contained deliberate, clinically significant errors. The treatment arm received a dual-component nudge: an anchoring cue reporting ChatGPT's benchmark accuracy to calibrate expectations, and a case-specific selective-attention cue consisting of a numeric accuracy rating and a color-coded traffic light, both derived from the mean of three distinct-family LLMs. The control group saw recommendations alone; blinded reviewers scored diagnostic reasoning against an expert rubric. The treatment group scored significantly higher than the control group (mean difference, 7.6 percentage points; 95% CI, 1.4-13.9; P=0.016), suggesting a scalable strategy to preserve clinical judgment in LLM-assisted care. ClinicalTrials.gov registration: NCT07328815.
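
The case-specific cue described in the abstract (a numeric rating plus a traffic light derived from the mean of three LLM ratings) could be sketched roughly as follows. This is only an illustration: the function name `traffic_light`, the 0-100 rating scale, and the cut-offs `GREEN_AT` and `AMBER_AT` are assumptions, since the abstract does not report how ratings were scaled or where the color boundaries were set.

```python
from statistics import fmean

# Hypothetical cut-offs on an assumed 0-100 scale; the abstract does not
# state the thresholds the authors used.
GREEN_AT = 80.0
AMBER_AT = 50.0


def traffic_light(ratings, green_at=GREEN_AT, amber_at=AMBER_AT):
    """Return (mean rating, color) for one LLM recommendation.

    `ratings` holds the accuracy ratings given by the rater models
    (three in the trial, one per model family).
    """
    if not ratings:
        raise ValueError("at least one rating is required")
    mean_rating = fmean(ratings)
    if mean_rating >= green_at:
        color = "green"
    elif mean_rating >= amber_at:
        color = "amber"
    else:
        color = "red"
    return mean_rating, color
```

For instance, `traffic_light([90, 85, 95])` averages the three ratings to 90.0 and, under the assumed cut-offs, maps that to a green light.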

Matching journals

The top 2 journals together account for 59.8% of the predicted probability mass, crossing the 50% threshold.

1. npj Digital Medicine (97 papers in training set; Top 0.1%): 37.4%
2. Journal of the American Medical Informatics Association (61 papers in training set; Top 0.1%): 22.4%
50% of probability mass above
3. Philosophical Transactions of the Royal Society B (51 papers in training set; Top 1%): 3.6%
4. The Lancet Digital Health (25 papers in training set; Top 0.2%): 2.9%
5. PLOS Digital Health (91 papers in training set; Top 1%): 2.1%
6. Nature Medicine (117 papers in training set; Top 2%): 2.1%
7. Annals of Internal Medicine (27 papers in training set; Top 0.4%): 1.8%
8. Med (38 papers in training set; Top 0.3%): 1.7%
9. Scientific Reports (3102 papers in training set; Top 59%): 1.7%
10. Nature Communications (4913 papers in training set; Top 54%): 1.5%
11. PLOS ONE (4510 papers in training set; Top 56%): 1.5%
12. BMJ Health & Care Informatics (13 papers in training set; Top 0.6%): 1.3%
13. JCO Clinical Cancer Informatics (18 papers in training set; Top 0.6%): 1.3%
14. Frontiers in Digital Health (20 papers in training set; Top 0.8%): 1.3%
15. Journal of Medical Internet Research (85 papers in training set; Top 4%): 0.8%
16. iScience (1063 papers in training set; Top 32%): 0.7%
17. Journal of Biomedical Informatics (45 papers in training set; Top 1%): 0.7%
18. Healthcare (16 papers in training set; Top 2%): 0.7%
19. Cell Reports Medicine (140 papers in training set; Top 9%): 0.7%
20. BMJ Open (554 papers in training set; Top 13%): 0.7%
21. eBioMedicine (130 papers in training set; Top 5%): 0.7%
22. Journal of Clinical Epidemiology (28 papers in training set; Top 0.6%): 0.7%
23. JAMA (17 papers in training set; Top 0.5%): 0.6%
24. BMJ (49 papers in training set; Top 1%): 0.6%
25. JAMA Network Open (127 papers in training set; Top 5%): 0.6%
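
The 50% cut-off marked in the list can be checked with a running sum over the ranked probabilities. The sketch below copies the five highest values from the list above; the helper name `journals_to_reach` is made up for illustration and is not part of the listing page.

```python
from itertools import accumulate

# Predicted probabilities (%) for the five top-ranked journals,
# copied from the list above.
top_probs = [37.4, 22.4, 3.6, 2.9, 2.1]


def journals_to_reach(probs, threshold):
    """Return how many top-ranked entries are needed for the cumulative
    probability to reach `threshold`, or None if it is never reached."""
    for count, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return count
    return None
```

With these values, `journals_to_reach(top_probs, 50.0)` returns 2, because the first entry alone (37.4%) falls short of 50% while the first two together exceed it.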