Back

Subtype Dynamics Reveal Horizon-Dependent Structure in Influenza Predictability

Mao, Y.; Lopman, B.; Koelle, K.; Lau, M. S.

2026-05-30 epidemiology
10.64898/2026.05.28.26354347 medRxiv
Show abstract

Accurate forecasting of seasonal influenza is critical for public health preparedness, and data-driven models are central to this effort. However, most approaches rely on aggregate indicators of influenza-like-illness (ILI), which can obscure heterogeneity and limit predictability at longer horizons. While subtype dynamics are well established, their role in data-driven forecasting remains incompletely understood. Here, we integrate subtype-resolved surveillance data into diverse data-driven frameworks using over a decade of U.S. surveillance records to evaluate and decompose predictive signal in influenza forecasting. Across pre- and post-COVID-19 periods, subtype-informed models consistently improve over baseline models trained on aggregate ILI alone, with the largest gains at longer horizons. Decomposition reveals a horizon-dependent reorganization of predictability: autoregressive persistence in recent aggregate incidence dominates at short horizons but declines with lead time, while predictive signal shifts toward subtype-derived structure. Within this structure, interaction-related features among co-circulating subtypes grow systematically with forecast horizon, indicating that longer-term predictability is driven increasingly by interaction structure rather than marginal subtype composition alone. Together, our results show that subtype information provides non-redundant predictive signal and extends the effective forecasting window of data-driven models. More broadly, our findings suggest that aggregation of heterogeneous subtype processes can obscure latent predictability, supporting subtype-resolved surveillance.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 2%
17.0%
2
Nature Communications
4913 papers in training set
Top 24%
8.2%
3
Science Translational Medicine
111 papers in training set
Top 0.2%
6.6%
4
Science
429 papers in training set
Top 5%
6.2%
5
Nature Human Behaviour
85 papers in training set
Top 0.7%
4.2%
6
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
7
Nature Medicine
117 papers in training set
Top 0.7%
3.9%
50% of probability mass above
8
Science Advances
1098 papers in training set
Top 7%
3.5%
9
PNAS Nexus
147 papers in training set
Top 0.1%
3.5%
10
Scientific Reports
3102 papers in training set
Top 42%
3.0%
11
Cell Genomics
162 papers in training set
Top 2%
3.0%
12
Nature Machine Intelligence
61 papers in training set
Top 1%
3.0%
13
npj Digital Medicine
97 papers in training set
Top 2%
2.0%
14
eLife
5422 papers in training set
Top 39%
1.8%
15
Epidemics
104 papers in training set
Top 0.9%
1.7%
16
Nature
575 papers in training set
Top 11%
1.7%
17
The Lancet Infectious Diseases
71 papers in training set
Top 2%
1.6%
18
Journal of The Royal Society Interface
189 papers in training set
Top 3%
1.3%
19
Nature Biotechnology
147 papers in training set
Top 6%
1.2%
20
American Journal of Epidemiology
57 papers in training set
Top 1%
1.2%
21
BMC Medicine
163 papers in training set
Top 5%
1.1%
22
Advanced Science
249 papers in training set
Top 15%
1.1%
23
Clinical Infectious Diseases
231 papers in training set
Top 4%
0.9%
24
PLOS ONE
4510 papers in training set
Top 64%
0.9%
25
Cell Systems
167 papers in training set
Top 11%
0.9%
26
The Journal of Infectious Diseases
182 papers in training set
Top 5%
0.7%
27
Communications Medicine
85 papers in training set
Top 1%
0.7%
28
eBioMedicine
130 papers in training set
Top 5%
0.7%
29
Nature Genetics
240 papers in training set
Top 8%
0.7%
30
International Journal of Epidemiology
74 papers in training set
Top 3%
0.7%