Back

Bias in respiratory diagnoses by Large Language Models (LLMs) in Low Middle Income Countries (LMICs)

Mouelhi, A.; Patel, K.; Kussad, S.; Ojha, S.; Prayle, A. P.; LMIC Medical AI Alignment Group,

2026-03-03 health informatics
10.64898/2026.03.02.26347405 medRxiv
Show abstract

IntroductionClinicians and patients are likely to increasingly use Large Language Models (LLMs) for diagnostic support. Use of LLMs mostly created in North America and Europe, could lead to a High-Income Country bias if used in Low- and Middle-Income Country (LMIC) healthcare settings. We aimed to explore if diagnostic suggestions made by LLMs are relevant in LMIC settings. MethodsFive short respiratory clinical vignettes were produced. For each vignette, a group of doctors from one of 5 countries (Ghana, India, Jordan and Brazil and the UK) independently gave the 4 most likely diagnoses. 4 LLMs (ChatGPT, Claude Sonnet, Google Gemini and Microsoft Copilot) were prompted with the same vignettes. The top 4 diagnoses for each case was requested. A Virtual Private Network (VPN) was used to access the LLM from each of the 4 countries, and in a second experiment the LLM was given the same vignettes but also informed of the country in which the case was based in the prompt. The diagnoses presented by the LLMs was compared with the doctors diagnoses for the LMICs and also compared to the UK. Results106 unique diagnoses were offered by 21 doctors, and 53 by LLMs with a VPN. The LLMs proposed fewer of the doctors diagnoses in LMICs versus in the UK - 50% (95% CI 32.6 to 67.4%) in the UK compared to 32.0% (95% CI 23.1 to 42.3%) in LMICs. This effect persisted when the LLM was informed of the location of the doctor in the prompt. Overall, LLMs performed worse in the LMIC setting (Chi-squared p = 0.028). ConclusionDoctors working in LMICs consider a wider range of diagnoses than LLMs, even when LLMs are queried from that country, or informed that they are in that country. LLMs appear to show a bias when considering likely diagnosis likely related to the epidemiology of high income countries.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Journal of Medical Internet Research
85 papers in training set
Top 0.3%
12.6%
2
International Journal of Medical Informatics
25 papers in training set
Top 0.1%
12.4%
3
PLOS ONE
4510 papers in training set
Top 22%
8.5%
4
PLOS Digital Health
91 papers in training set
Top 0.3%
6.4%
5
BMJ Open
554 papers in training set
Top 4%
4.9%
6
BMJ Health & Care Informatics
13 papers in training set
Top 0.1%
4.9%
7
JMIR Medical Informatics
17 papers in training set
Top 0.2%
4.2%
50% of probability mass above
8
BMC Health Services Research
42 papers in training set
Top 0.5%
4.0%
9
Frontiers in Digital Health
20 papers in training set
Top 0.2%
3.6%
10
Scientific Reports
3102 papers in training set
Top 47%
2.5%
11
Emergency Medicine Journal
20 papers in training set
Top 0.2%
2.1%
12
BMJ Open Quality
15 papers in training set
Top 0.4%
2.1%
13
BMJ Paediatrics Open
21 papers in training set
Top 0.4%
1.8%
14
European Respiratory Journal
54 papers in training set
Top 0.9%
1.7%
15
Wellcome Open Research
57 papers in training set
Top 1%
1.5%
16
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.3%
17
BMC Infectious Diseases
118 papers in training set
Top 3%
1.3%
18
British Journal of General Practice
22 papers in training set
Top 0.4%
1.1%
19
Public Health
34 papers in training set
Top 1%
1.0%
20
Cancer Medicine
24 papers in training set
Top 1%
0.9%
21
Frontiers in Public Health
140 papers in training set
Top 7%
0.9%
22
The Lancet Digital Health
25 papers in training set
Top 0.8%
0.9%
23
Journal of General Internal Medicine
20 papers in training set
Top 1.0%
0.8%
24
JMIR Formative Research
32 papers in training set
Top 2%
0.8%
25
Healthcare
16 papers in training set
Top 2%
0.8%
26
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.7%
27
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.7%
28
ERJ Open Research
44 papers in training set
Top 1%
0.5%
29
DIGITAL HEALTH
12 papers in training set
Top 0.9%
0.5%
30
JMIR Public Health and Surveillance
45 papers in training set
Top 5%
0.5%