Back

Federated analysis of incubation period distributions using individual-level observed data and heterogeneous summary statistics

Morgenstern, C.; Khurana, M. P.; Naidoo, T.; Rawson, T.; Cori, A.; Duchene, D. A.; Ferguson, N. M.; Kraemer, M. U. G.; Bhatt, S.

2026-06-02 epidemiology
10.64898/2026.06.01.26354607 medRxiv
Show abstract

The incubation period, the interval between pathogen exposure and symptom onset, is a critical epidemiological parameter for follow-up policy and outbreak response, yet individual-level exposure data remain scarce, especially early in outbreaks. For most priority pathogens, only summary statistics are available because sharing of individual-level data can be sensitive. Here we introduce a Bayesian hierarchical framework that jointly models individual-level observations and published summary statistics under a unified federated analysis framework. Simulation studies demonstrate that the method accurately recovers incubation period distributions across a range of data availability scenarios, generally outperforming approaches that use published summary statistics alone. Applying the framework to 18 pathogens, including 10 priority pathogens classified to have outbreak potential by the World Health Organization, we find substantial between-study heterogeneity in incubation period estimates, including by outbreak country for SARS-CoV-1, variants of concern for COVID-19, and exposure setting for typhoid fever. These estimates, together with the curated dataset and modelling framework in our associated R package ddsynth, provide a reproducible foundation for improved incubation period estimation and synthesis across pathogens of epidemic concern. Our framework enables robust and rapid estimation of incubation periods during new outbreaks.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 5%
18.7%
2
PLOS Computational Biology
1633 papers in training set
Top 2%
14.7%
3
Epidemics
104 papers in training set
Top 0.1%
10.1%
4
Scientific Reports
3102 papers in training set
Top 27%
4.3%
5
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 20%
3.6%
50% of probability mass above
6
American Journal of Epidemiology
57 papers in training set
Top 0.3%
3.6%
7
PLOS ONE
4510 papers in training set
Top 42%
3.1%
8
Patterns
70 papers in training set
Top 0.4%
2.7%
9
International Journal of Epidemiology
74 papers in training set
Top 0.8%
2.7%
10
Nature Computational Science
50 papers in training set
Top 0.3%
2.6%
11
Science Translational Medicine
111 papers in training set
Top 2%
2.5%
12
eLife
5422 papers in training set
Top 34%
2.4%
13
PLOS Genetics
756 papers in training set
Top 8%
1.7%
14
Nature Biotechnology
147 papers in training set
Top 5%
1.7%
15
Nature Genetics
240 papers in training set
Top 5%
1.7%
16
Science Advances
1098 papers in training set
Top 22%
1.3%
17
Epidemiology
26 papers in training set
Top 0.4%
1.0%
18
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
19
Science
429 papers in training set
Top 19%
0.8%
20
PLOS Biology
408 papers in training set
Top 18%
0.8%
21
Nature Medicine
117 papers in training set
Top 5%
0.8%
22
Communications Biology
886 papers in training set
Top 24%
0.7%
23
Journal of The Royal Society Interface
189 papers in training set
Top 5%
0.7%
24
Statistics in Medicine
34 papers in training set
Top 0.3%
0.7%
25
npj Digital Medicine
97 papers in training set
Top 4%
0.7%
26
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 6%
0.7%
27
Nature Machine Intelligence
61 papers in training set
Top 4%
0.6%
28
BMC Bioinformatics
383 papers in training set
Top 8%
0.6%