Back

Physiology-informed regularization enables training of universal differential equation systems for biological applications

de Rooij, M.; Erdos, B.; van Riel, N.; O'Donovan, S.

2024-06-01 systems biology
10.1101/2024.05.28.596164 bioRxiv
Show abstract

Systems biology tackles the challenge of understanding the high complexity in the internal regulation of homeostasis in the human body through mathematical modelling. These models can aid in the discovery of disease mechanisms and potential drug targets. However, on one hand the development and validation of knowledge-based mechanistic models is time-consuming and does not scale well with increasing features in medical data. On the other hand, more data-driven approaches such as machine learning models require large volumes of data to produce generalizable models. The integration of neural networks and mechanistic models, forming universal differential equation (UDE) models, enables the automated learning of unknown model terms with less data than the neural network alone. Nevertheless, estimating parameters for these hybrid models remains difficult with sparse data and limited sampling durations that are common in biological applications. In this work, we propose the use of physiology-informed regularization, penalizing biologically implausible model behavior to guide the UDE towards more physiologically plausible regions of the solution space. In a simulation study we show that physiology-informed regularization not only results in a more accurate forecasting of model behaviour, but also supports training with less data. We also applied this technique to learn a representation of the rate of glucose appearance in the glucose minimal model using meal response data measured in healthy people. In that case, the inclusion of regularization reduces variability between UDE-embedded neural networks that were trained from different initial parameter guesses. Author summarySystems biology concerns the modelling and analysis of biological processes, by viewing these as interconnected systems. Modelling is typically done either using mechanistic differential equations that are derived from experiments and known biology, or using machine learning on large biological datasets. While mathematical modelling from biological experiments can provide useful insights with limited data, building and validating these models takes a long time and often requires highly invasive measurements in humans. Efforts to combine this classical technique with machine learning have resulted in a framework termed universal differential equations, where the model equations contain a neural network to describe unknown biological interactions. While these methods have shown success in numerous fields, applications in biology are more challenging due to limited data-availability, high data sparsity. In this work, we have introduced physiology-informed regularization to overcome these instabilities and to constrain the model to biologically plausible behavior. Our results show that by using physiology-informed regularization, we can accurately predict future unseen observations in a simulated example, with much more limited data than a similar model without regularization. Additionally, we show an application of this technique on human data, applying a neural network to learn the appearance of glucose in the blood plasma after a meal.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 0.6%
22.9%
2
Bioinformatics
1061 papers in training set
Top 1%
18.6%
3
BMC Bioinformatics
383 papers in training set
Top 1%
7.3%
4
npj Systems Biology and Applications
99 papers in training set
Top 0.2%
6.9%
50% of probability mass above
5
IFAC-PapersOnLine
12 papers in training set
Top 0.1%
4.4%
6
Journal of The Royal Society Interface
189 papers in training set
Top 1%
3.6%
7
Journal of Theoretical Biology
144 papers in training set
Top 0.5%
2.6%
8
Bulletin of Mathematical Biology
84 papers in training set
Top 0.9%
1.9%
9
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
10
Frontiers in Physiology
93 papers in training set
Top 3%
1.7%
11
Physical Biology
43 papers in training set
Top 1%
1.5%
12
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.4%
13
Scientific Reports
3102 papers in training set
Top 64%
1.4%
14
Mathematical Biosciences
42 papers in training set
Top 0.8%
1.2%
15
PLOS ONE
4510 papers in training set
Top 60%
1.2%
16
Journal of the Royal Society Interface
18 papers in training set
Top 0.1%
1.1%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
1.0%
18
Molecular Biology of the Cell
272 papers in training set
Top 2%
0.8%
19
Frontiers in Genetics
197 papers in training set
Top 9%
0.8%
20
iScience
1063 papers in training set
Top 34%
0.7%
21
Journal of Mathematical Biology
37 papers in training set
Top 0.4%
0.7%
22
Biology Methods and Protocols
53 papers in training set
Top 3%
0.7%
23
Biophysical Journal
545 papers in training set
Top 6%
0.7%
24
Communications Biology
886 papers in training set
Top 32%
0.5%
25
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 1%
0.5%