Back

Scalable deep-learning-based inference of time-varying transmission dynamics from outbreak phylogenies

XIE, R.; Zhukova, A.; Pena, P. G.; Iglesias, G.; Hu, S.; Wang, J.; Tsang, T. K.; Dhanasekaran, V.; Kraemer, M. U. G.; Pybus, O. G.; Gascuel, O.

2026-05-10 infectious diseases
10.64898/2026.05.07.26352673 medRxiv
Show abstract

Infectious disease dynamics can be inferred from pathogen genomic data using phylodynamic methods, but the applicability of many such approaches to large data sets is constrained by computational cost. Recent deep-learning approaches to phylodynamics have improved scalability, yet challenges remain when genetic divergence is limited during fast spreading outbreaks. To address this, we use pathogen-specific models to show that deep-learning models trained on outbreak-like phylogenies can accurately estimate the reproductive number (R) when both the birth-death model and the expected phylogenetic resolution are matched to the target pathogen, highlighting the importance of realistic training conditions. Focusing on three major respiratory pathogens of public health importance (SARS-CoV-2, seasonal human influenza virus, and respiratory syncytial virus (RSV)), we introduce PhyloRt, a scalable framework for estimating the time-varying reproductive number (Rt) from large outbreak phylogenies. PhyloRt decomposes large trees into overlapping subtrees and applies a hierarchical deep-learning-based inference strategy to classify subtrees as exhibiting constant or time-varying reproduction numbers, enabling identifiable and computationally efficient estimation of Rt as a piecewise-constant trajectory through time. Applications to SARS-CoV-2 and influenza outbreaks show that PhyloRt recovers transmission dynamics consistent with estimates derived from mathematical epidemiological and Bayesian phylodynamic analyses. Our work enables scalable and rapid estimation of time-varying transmission dynamics from very large-scale outbreak genomic data sets, supporting real-time genomic epidemiology of emerging pathogens. SignificanceEstimating changes in transmission dynamics over time is important for responding to infectious disease outbreaks. Current methods mostly rely on reported case data from epidemiological surveillance, which can be biased or incomplete due to variable testing capabilities, particularly in resource-limited settings. A complementary approach is to use viral genomes as an alternative data source. However, inferences from genomic data can be computationally intensive and have mainly been applied retrospectively. We present PhyloRt, a scalable deep-learning-based phylodynamic framework that enables fast inference of the time-varying reproductive number (Rt) from large outbreak phylogenies. Our approach is widely applicable and provides a practical approach to monitoring epidemic dynamics, complementing traditional surveillance and supporting timely public health decision-making.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Nature Computational Science
50 papers in training set
Top 0.1%
22.5%
2
Nature Communications
4913 papers in training set
Top 11%
14.4%
3
Cell Systems
167 papers in training set
Top 2%
7.2%
4
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 11%
6.3%
50% of probability mass above
5
Science
429 papers in training set
Top 7%
4.3%
6
PLOS Computational Biology
1633 papers in training set
Top 10%
3.6%
7
Science Translational Medicine
111 papers in training set
Top 0.9%
3.6%
8
Nature Medicine
117 papers in training set
Top 1%
3.1%
9
Nature Biotechnology
147 papers in training set
Top 3%
2.6%
10
Nature
575 papers in training set
Top 10%
1.8%
11
Cell Reports
1338 papers in training set
Top 26%
1.5%
12
The Lancet Infectious Diseases
71 papers in training set
Top 2%
1.5%
13
Nature Genetics
240 papers in training set
Top 5%
1.5%
14
Science Advances
1098 papers in training set
Top 20%
1.5%
15
eLife
5422 papers in training set
Top 47%
1.3%
16
Epidemics
104 papers in training set
Top 1%
1.3%
17
Scientific Reports
3102 papers in training set
Top 66%
1.2%
18
Cell Reports Methods
141 papers in training set
Top 3%
1.2%
19
Nature Microbiology
133 papers in training set
Top 4%
0.9%
20
Bioinformatics
1061 papers in training set
Top 10%
0.7%
21
American Journal of Epidemiology
57 papers in training set
Top 2%
0.7%
22
Patterns
70 papers in training set
Top 3%
0.7%
23
Emerging Infectious Diseases
103 papers in training set
Top 3%
0.7%
24
Virus Evolution
140 papers in training set
Top 1%
0.7%