Back

Integrating Metabolic Networks into Hybrid Bioprocess Models

Gotsmy, M.; Guillen-Gosalbez, G.

2026-04-24 bioengineering
10.64898/2026.04.22.720062 bioRxiv
Show abstract

The optimization and control of bioprocesses require robust in silico models that can accurately capture the complex and dynamic behavior of living cells. While hybrid models that combine machine learning with mechanistic equations have emerged as a powerful tools, they often require relatively large datasets and might yield inconsistent predictions that violate the stoichiometry of metabolism. In this study, we introduce FBA-Hyb, a multi-scale hybrid modeling framework that tightly integrates genome-scale metabolic networks via flux balance analysis (FBA) into its architecture. In our FBA-Hyb framework, artificial neural networks predict key FBA inputs (substrate uptake rates and cellular objectives) while a surrogate FBA module translates them into the metabolic fluxes that govern the bioprocess. A key novelty is that the FBA optimization step is replaced by a surrogate generated with symbolic regression, which encapsulates the FBA model into a compact analytical expression. This allows easy backpropagation through the integration of the neural controlled differential equationbased FBA-Hyb bioprocess model. We validated FBA-Hyb against a standard hybrid model (Std-Hyb) using two Escherichia coli fedbatch case studies. In the first study, FBA-Hyb achieved a 42 % average improvement in predictive accuracy (R2) during a leave-one-process-out cross validation. Crucially, FBA-Hyb maintains strict stoichiometric feasibility even during extrapolation. Meanwhile, an alternative approach based on standard architectures leads to stoichiometrically inconsistent solutions in 22 % of the cases analyzed. In the second case study, we demonstrate how FBA-Hyb effectively simulates unmeasured chemical species and discovers a metabolic shift in sulfate-limited regimes during bioprocessing. By providing a modular, biologically consistent, and computationally efficient architecture, FBA-Hyb offers a robust foundation for the next generation of bioprocess models and sustainable process optimization. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=81 SRC="FIGDIR/small/720062v1_ufig1.gif" ALT="Figure 1"> View larger version (28K): org.highwire.dtl.DTLVardef@16f011eorg.highwire.dtl.DTLVardef@b25b5borg.highwire.dtl.DTLVardef@18bd178org.highwire.dtl.DTLVardef@65274e_HPS_FORMAT_FIGEXP M_FIG C_FIG HighlightsO_LIFBA-Hyb integrates flux balance analysis (FBA) into hybrid bioprocess models. C_LIO_LISymbolic regression discovers a simple closed-form FBA surrogate model. C_LIO_LIThe FBA surrogate ensures accurate reaction stoichiometry. C_LIO_LIA neural network predicting the FBA objective keeps the model flexible. C_LIO_LIFBA-Hyb has superior capabilities and accuracy compared to the current standard. C_LI

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Metabolic Engineering
68 papers in training set
Top 0.1%
18.5%
2
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.1%
12.3%
3
Biotechnology and Bioengineering
49 papers in training set
Top 0.1%
10.0%
4
Nature Communications
4913 papers in training set
Top 23%
8.4%
5
PLOS Computational Biology
1633 papers in training set
Top 6%
6.3%
50% of probability mass above
6
ACS Synthetic Biology
256 papers in training set
Top 0.7%
6.3%
7
npj Systems Biology and Applications
99 papers in training set
Top 0.5%
3.6%
8
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 0.6%
3.6%
9
Chemical Engineering Journal
10 papers in training set
Top 0.2%
2.3%
10
Advanced Science
249 papers in training set
Top 9%
2.1%
11
IFAC-PapersOnLine
12 papers in training set
Top 0.1%
1.3%
12
PLOS ONE
4510 papers in training set
Top 58%
1.3%
13
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 38%
1.2%
14
Scientific Reports
3102 papers in training set
Top 66%
1.2%
15
mSystems
361 papers in training set
Top 6%
1.2%
16
Metabolic Engineering Communications
20 papers in training set
Top 0.2%
1.1%
17
iScience
1063 papers in training set
Top 24%
0.9%
18
Bioinformatics
1061 papers in training set
Top 9%
0.9%
19
Network Neuroscience
116 papers in training set
Top 1%
0.8%
20
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.8%
21
Water Research
74 papers in training set
Top 1%
0.8%
22
eLife
5422 papers in training set
Top 56%
0.8%
23
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
24
Journal of The Royal Society Interface
189 papers in training set
Top 5%
0.7%
25
Environmental Science & Technology
64 papers in training set
Top 2%
0.7%
26
Nature Machine Intelligence
61 papers in training set
Top 4%
0.7%
27
Cell Systems
167 papers in training set
Top 14%
0.6%