Back

On real-time calibrated prediction for complex model-based decision support in pandemics: Part 2

McKinley, T. J.; Williamson, D. B.; Xiong, X.; Salter, J. M.; Challen, R.; Danon, L.; Youngman, B. D.; McNeall, D.

2025-05-16 infectious diseases
10.1101/2025.05.16.25327744 medRxiv
Show abstract

Calibration of complex stochastic infectious disease models is challenging. These often have high-dimensional input and output spaces, with the models exhibiting complex, non-linear dynamics. Coupled with a paucity of necessary data, this results in a large number of non-ignorable hidden states that must be handled by the inference routine. Likelihood-based approaches to this missing data problem are very flexible, but challenging to scale, due to having to monitor and update these hidden states. Methods based on simulating the hidden states directly from the model-of-interest have an advantage that they are often more straightforward to code, and thus are easier to implement and adapt in real-time. However, these often require evaluating very large numbers of simulations, rendering them infeasible for many large-scale problems. We present a framework for using emulation-based methods to calibrate a large-scale, stochastic, age-structured, spatial meta-population model of COVID-19 transmission in England and Wales. By embedding a model discrepancy process into the simulation model, and combining this with particle filtering, we show that it is possible to calibrate complex models to high-dimensional data by emulating the log-likelihood surface instead of individual data points. The use of embedded model discrepancy also helps to alleviate other key challenges, such as the introduction of infection across space and time. We conclude with a discussion of major challenges remaining and key areas for future work.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 0.7%
22.4%
2
Epidemics
104 papers in training set
Top 0.1%
14.3%
3
Nature Computational Science
50 papers in training set
Top 0.1%
6.3%
4
PLOS ONE
4510 papers in training set
Top 28%
6.3%
5
Journal of The Royal Society Interface
189 papers in training set
Top 0.7%
4.8%
50% of probability mass above
6
Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
12 papers in training set
Top 0.1%
4.8%
7
Infectious Disease Modelling
50 papers in training set
Top 0.4%
3.6%
8
Biostatistics
21 papers in training set
Top 0.1%
3.6%
9
Scientific Reports
3102 papers in training set
Top 50%
2.1%
10
Bioinformatics
1061 papers in training set
Top 7%
1.9%
11
Nature Communications
4913 papers in training set
Top 52%
1.7%
12
Biometrics
22 papers in training set
Top 0.1%
1.7%
13
Statistics in Medicine
34 papers in training set
Top 0.2%
1.7%
14
Bulletin of Mathematical Biology
84 papers in training set
Top 1%
1.5%
15
Medical Decision Making
10 papers in training set
Top 0.2%
1.3%
16
BMC Bioinformatics
383 papers in training set
Top 5%
1.2%
17
Royal Society Open Science
193 papers in training set
Top 3%
1.2%
18
eLife
5422 papers in training set
Top 52%
0.9%
19
Biology Methods and Protocols
53 papers in training set
Top 2%
0.9%
20
Medical Image Analysis
33 papers in training set
Top 0.9%
0.9%
21
Wellcome Open Research
57 papers in training set
Top 3%
0.6%