Measuring the growth of infectious disease modelling publications and their impact on policymaking: a Large Language Model-assisted bibliometric review

Christen, P.; Ahmed, M. H. A.; Chua, B.; Chaowanasawat, P.; Chapman-Banks, E.; Ozkan, Y. A.; van Elsland, S. L.; Cori, A.; K C, S.; Whitaker, M.; Chadeau-Hyam, M.; Dabak, S. V.; Jit, M.

2025-06-12 infectious diseases
10.1101/2025.06.12.25328864 medRxiv

Background: Infectious disease modelling (IDM) is increasingly being used to understand disease transmission and inform public health policy. However, its growth and policy influence have never been quantified, possibly because of the volume of literature involved. The development of large language model (LLM)-assisted reviewing allowed us to quantify the expansion of IDM publications over time, trends in policy citations of IDM research, and regional disparities in research contributions and citations in policy documents.

Methods: An LLM-assisted bibliometric review was conducted using Embase, Medline, and Web of Science, identifying IDM publications from database inception to December 2024 using GPT-4o. Inclusion criteria encompassed peer-reviewed studies employing mathematical, statistical, or mechanistic models for infectious disease outcomes. LLM accuracy was iteratively refined by human review. We extracted publication metadata, geographic scope, and policy citations using Overton, a global database of policy documents. Growth trends were analysed using negative binomial regression models, and geographic disparities were assessed based on World Bank income classifications.

Results: A total of 33,255 IDM publications were identified over 44 years, with distinct growth phases. The LLM selection and data extraction achieved 98% and 100% accuracy respectively, compared with human search. Publication volume increased from the time of HIV/AIDS emergence, expanded steadily through multiple outbreaks (Ebola, SARS, H1N1, MERS, Zika), surged sharply just before the COVID-19 pandemic, and then declined after 2021. Recorded policy citations accounted for 1.7% of IDM publications, closely following the overall publication trend and peaking during periods of heightened public health attention. Policy citations largely reflected national research outputs, with notable cross-regional adoption of IDM evidence in some settings.

Conclusion: Strengthening the integration of IDM evidence into policymaking processes may require addressing geographic disparities in research output (and its recording in international databases), enhancing cross-regional collaboration, and improving mechanisms for policy uptake. Following the COVID-19 pandemic, policy citations declined despite continued growth in IDM literature, suggesting a potential lag or shift in policymaking priorities.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

| Rank | Journal | Papers in training set | Percentile | Predicted probability |
|---|---|---|---|---|
| 1 | BMC Infectious Diseases | 118 | Top 0.1% | 14.8% |
| 2 | The Lancet Global Health | 24 | Top 0.1% | 9.2% |
| 3 | PLOS ONE | 4510 | Top 22% | 8.5% |
| 4 | Clinical Infectious Diseases | 231 | Top 1% | 3.9% |
| 5 | Epidemics | 104 | Top 0.4% | 3.6% |
| 6 | BMJ Global Health | 98 | Top 0.9% | 3.6% |
| 7 | BMC Medicine | 163 | Top 1% | 3.6% |
| 8 | BMC Public Health | 147 | Top 2% | 3.3% |
| | 50% of probability mass above | | | |
| 9 | PLOS Medicine | 98 | Top 2% | 2.1% |
| 10 | Open Forum Infectious Diseases | 134 | Top 0.9% | 2.1% |
| 11 | Annals of Internal Medicine | 27 | Top 0.3% | 2.1% |
| 12 | The Lancet Infectious Diseases | 71 | Top 1% | 2.1% |
| 13 | The Journal of Infectious Diseases | 182 | Top 2% | 1.9% |
| 14 | Scientific Reports | 3102 | Top 58% | 1.7% |
| 15 | Clinical Microbiology and Infection | 60 | Top 0.6% | 1.7% |
| 16 | AIDS | 31 | Top 0.3% | 1.5% |
| 17 | The American Journal of Tropical Medicine and Hygiene | 60 | Top 3% | 1.5% |
| 18 | PLOS Global Public Health | 293 | Top 4% | 1.3% |
| 19 | Nature Communications | 4913 | Top 56% | 1.2% |
| 20 | American Journal of Epidemiology | 57 | Top 1.0% | 1.2% |
| 21 | JMIR Public Health and Surveillance | 45 | Top 2% | 1.2% |
| 22 | BMJ Open | 554 | Top 11% | 1.0% |
| 23 | JAMA Network Open | 127 | Top 3% | 1.0% |
| 24 | Vaccine | 189 | Top 2% | 0.9% |
| 25 | PLOS Biology | 408 | Top 18% | 0.8% |
| 26 | Emerging Infectious Diseases | 103 | Top 3% | 0.8% |
| 27 | Frontiers in Public Health | 140 | Top 8% | 0.8% |
| 28 | American Journal of Preventive Medicine | 11 | Top 0.5% | 0.8% |
| 29 | The Lancet | 16 | Top 0.8% | 0.7% |
| 30 | International Journal of Infectious Diseases | 126 | Top 4% | 0.7% |
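
The statement that the top 8 journals account for 50% of the predicted probability mass can be checked from the listed percentages. The following is a minimal sketch, not the site's own method: the function name `journals_to_reach` is hypothetical, and the inputs are the rounded percentages shown in the listing, so the result may differ slightly from a cutoff computed on unrounded values.

```python
from itertools import accumulate

# Predicted probabilities (%) for the ten highest-ranked journals,
# copied from the listing (rounded values as displayed).
probabilities = [14.8, 9.2, 8.5, 3.9, 3.6, 3.6, 3.6, 3.3, 2.1, 2.1]


def journals_to_reach(threshold, probs):
    """Return how many top-ranked entries are needed for the running
    total of `probs` to reach `threshold`, or None if it never does.
    (Hypothetical helper, introduced only for this illustration.)"""
    for count, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return count
    return None


top_k = journals_to_reach(50.0, probabilities)
cumulative = sum(probabilities[:top_k])
print(top_k, round(cumulative, 1))
```

With the displayed values, the running total first passes 50% at the eighth journal (BMC Public Health), which agrees with the cutoff marked in the listing.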