Back

Synthetic-data augmented calibration for expert-informed rare disease models

Yang, H.; Rachel, T.; Litwin, T.; Karakioulaki, M.; Reimer-Taschenbrecker, A.; Timmer, J.; Has, C.; Binder, H.; Hess, M.

2026-05-20 bioinformatics
10.64898/2026.05.18.725833 bioRxiv
Show abstract

Clinical data for rare diseases are sparse, noisy, and heterogeneous, complicating calibration of ordinary differential equation (ODE) models. Thus, we introduce a noise-robust calibration in latent space that combines expertderived ODEs with learned latent representations. Our approach leverages synthetic ODE trajectories, augmenting our scarce observations to train a model-specific autoencoder representation and imputer. During calibration, observed and ODE-generated trajectories are compared in latent space, and ODE parameters are updated by minimizing their latent distance. In a controlled ABCDE simulation model, the imputer outperformed a carry-forward baseline for moderate parameter shifts, parameter recovery remained stable under random missingness, calibration remained robust to additional noise variables despite reduced downstream identifiability, and distinct dynamics formed visually separable latent trajectories. On a custom developed ODE model for real Epidermolysis Bullosa patients, the calibrated phenomenological model reproduced patient-level trajectories from sparse observations. Thus, we conclude that our latent-space calibration approach supports rare-disease modeling.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.3%
14.8%
2
PLOS Computational Biology
1633 papers in training set
Top 2%
12.4%
3
Nature Machine Intelligence
61 papers in training set
Top 0.3%
8.3%
4
Scientific Reports
3102 papers in training set
Top 27%
4.3%
5
Nature Communications
4913 papers in training set
Top 37%
4.0%
6
Journal of The Royal Society Interface
189 papers in training set
Top 1.0%
4.0%
7
Bioinformatics
1061 papers in training set
Top 5%
3.6%
50% of probability mass above
8
Communications Biology
886 papers in training set
Top 3%
2.7%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 24%
2.7%
10
Advanced Science
249 papers in training set
Top 7%
2.6%
11
eLife
5422 papers in training set
Top 33%
2.5%
12
PLOS ONE
4510 papers in training set
Top 47%
2.1%
13
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
1.9%
14
Patterns
70 papers in training set
Top 0.9%
1.7%
15
npj Systems Biology and Applications
99 papers in training set
Top 1%
1.7%
16
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.7%
17
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.7%
18
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.0%
19
Physical Review X
23 papers in training set
Top 0.4%
0.9%
20
Genome Medicine
154 papers in training set
Top 7%
0.9%
21
Biophysical Journal
545 papers in training set
Top 5%
0.8%
22
BioData Mining
15 papers in training set
Top 0.7%
0.8%
23
Science Advances
1098 papers in training set
Top 30%
0.8%
24
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.8%
25
Frontiers in Computational Neuroscience
53 papers in training set
Top 2%
0.7%
26
BMC Bioinformatics
383 papers in training set
Top 8%
0.6%
27
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.6%
28
Journal of Biomedical Informatics
45 papers in training set
Top 2%
0.6%
29
Nature Medicine
117 papers in training set
Top 7%
0.5%
30
European Journal of Human Genetics
49 papers in training set
Top 2%
0.5%