Back

Predicting highly pathogenic avian influenza H5N1 outbreak risk using extreme weather and bird migration data in machine learning models

Zou, W. W.; Carlton, E. J.; Grover, E. N.

2026-04-01 epidemiology
10.64898/2026.03.30.26349797 medRxiv
Show abstract

Background. Climate change is intensifying extreme weather events (EWEs) with potentially profound consequences for zoonotic disease dynamics, yet the mechanisms linking EWEs to highly pathogenic avian influenza (HPAI) H5N1 outbreaks remain poorly characterized. The ongoing H5N1 panzootic, responsible for infection in over 500 avian and mammalian species, as well as nearly 1000 human cases and 477 deaths worldwide, provides a critical opportunity to evaluate how climate conditions shape spillover risk at landscape scales. Methods. We compiled a county-month dataset of confirmed H5N1 detections across the contiguous United States from 2022 to 2024 and integrated it with satellite-derived climate metrics, storm event data, and wild bird activity data. We trained and validated a gradient boosting machine classifier to predict outbreak risk and characterize predictor relationships. Results. Our model achieved strong discriminative performance (AUC-ROC = 0.856; AUC-PR = 0.237, representing a 7-fold improvement over chance) and high recall (0.726), supporting its utility as an early warning tool. Human population and temperature-related variables were the most influential predictors: cold temperature shocks and prolonged low temperatures were consistently associated with elevated outbreak risk, likely through enhanced environmental viral persistence, wild bird habitat compression, and allostatic stress-driven immunosuppression in reservoir hosts. Among storm variables, high wind coverage elevated risk, potentially via aerosol dispersal of contaminated particulates, while tornado activity showed an inverse relationship, consistent with documented avoidant behavior in migratory birds. Wild bird reservoir density showed a strong positive monotonic relationship with outbreak risk. Conclusions. Our analyses demonstrate that routinely available environmental and infection data can be used to predict HPAI outbreak risk at fine spatiotemporal scales. These findings demonstrate the divergent roles of short- versus long-term environmental exposures in HPAI spillover dynamics, as well as the potential for machine learning-based surveillance tools to inform targeted biosecurity interventions and early warning systems.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 22%
8.4%
2
One Health
29 papers in training set
Top 0.1%
8.4%
3
Scientific Reports
3102 papers in training set
Top 12%
7.2%
4
Clinical Infectious Diseases
231 papers in training set
Top 0.8%
6.4%
5
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 11%
6.4%
6
PLOS Computational Biology
1633 papers in training set
Top 7%
4.8%
7
The Journal of Infectious Diseases
182 papers in training set
Top 0.6%
4.8%
8
PLOS ONE
4510 papers in training set
Top 34%
4.3%
50% of probability mass above
9
GeoHealth
10 papers in training set
Top 0.1%
4.0%
10
BMC Medicine
163 papers in training set
Top 1%
3.6%
11
Journal of Travel Medicine
18 papers in training set
Top 0.1%
3.1%
12
Nature Medicine
117 papers in training set
Top 2%
1.7%
13
Science Advances
1098 papers in training set
Top 20%
1.5%
14
The Lancet Microbe
43 papers in training set
Top 0.8%
1.3%
15
American Journal of Epidemiology
57 papers in training set
Top 0.9%
1.3%
16
eLife
5422 papers in training set
Top 47%
1.3%
17
Viruses
318 papers in training set
Top 3%
1.3%
18
eBioMedicine
130 papers in training set
Top 2%
1.3%
19
Science of The Total Environment
179 papers in training set
Top 4%
1.2%
20
Science Translational Medicine
111 papers in training set
Top 4%
1.2%
21
BMC Infectious Diseases
118 papers in training set
Top 4%
1.2%
22
Influenza and Other Respiratory Viruses
44 papers in training set
Top 0.3%
0.9%
23
Emerging Infectious Diseases
103 papers in training set
Top 2%
0.9%
24
Journal of The Royal Society Interface
189 papers in training set
Top 4%
0.9%
25
PNAS Nexus
147 papers in training set
Top 1%
0.9%
26
Epidemics
104 papers in training set
Top 2%
0.8%
27
PLOS Biology
408 papers in training set
Top 18%
0.8%
28
Communications Biology
886 papers in training set
Top 21%
0.8%
29
Journal of Infection
71 papers in training set
Top 3%
0.7%
30
BMC Public Health
147 papers in training set
Top 6%
0.7%