Back

Multi-Objective Bayesian Optimization for Data-Efficient Bioprocess Development

Ma, E.; Morrissey, J.; Duan, S.; Lu, Z.; Ranpura, S.; Arora, S.; Dabek, A.; Liu, C.; Gheorghe, A.-G.; Fong, L. K. W.; Sani, M.; Vrljicak, P.; Demirhan, D.; Betenbaugh, M.

2026-02-04 bioengineering
10.64898/2026.02.02.703372 bioRxiv
Show abstract

Process optimization for Chinese hamster ovary (CHO) cell culture remains a challenge in biopharmaceutical development because multiple interacting parameters jointly influence productivity and product quality attributes. Traditional design-of-experiments (DoE) methods, while systematic, become impractically expensive when extended across multiple parameters and clones. To address this challenge, we developed a multi-objective Bayesian Optimization (BO) framework that identifies optimal process conditions efficiently in grouped recommendations, which is well suited for experimental workflows in bioprocess development. The model integrates continuous variables such as pH, DO, temperature, and feed rate with categorical identifiers to enable knowledge transfer across clones and scales, optimizing titer, glycan profile, and charge variants. We validated the framework through in-silico benchmarks on analytic functions, retrospective cross-validation on historical CHO datasets, and forward experimental validation in small-scale bioreactors. Across these tests, our algorithm consistently outperformed Latin Hypercube Sampling (LHS) and Random Search baselines, achieving superior performance under a limited experimental budget. The framework improved titer by up to 37% under single-objective optimization. In the multi-objective setting, it increased titer by 25% while simultaneously reducing overall glycan-profile error by a factor of seven, demonstrating the ability to optimize multiple biologically coupled objectives simultaneously. Through comprehensive in-silico and experimental validation, this study establishes a framework that enables adaptive, AI-guided process development and improves decision-making across multiple objectives, clones, and scales while minimizing experimental runs in process development and optimization workflows.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Metabolic Engineering
68 papers in training set
Top 0.1%
18.7%
2
ACS Synthetic Biology
256 papers in training set
Top 0.3%
14.8%
3
PLOS Computational Biology
1633 papers in training set
Top 4%
8.5%
4
Nature Communications
4913 papers in training set
Top 32%
4.9%
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1.0%
4.4%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 39%
3.6%
7
Advanced Science
249 papers in training set
Top 5%
3.6%
8
Scientific Reports
3102 papers in training set
Top 41%
3.1%
9
Cell Systems
167 papers in training set
Top 5%
2.8%
10
Biotechnology and Bioengineering
49 papers in training set
Top 0.3%
2.6%
11
Computers in Biology and Medicine
120 papers in training set
Top 1%
2.5%
12
Communications Biology
886 papers in training set
Top 6%
1.9%
13
Analytical Chemistry
205 papers in training set
Top 1%
1.7%
14
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 2%
1.5%
15
Water Research
74 papers in training set
Top 1%
1.3%
16
mSystems
361 papers in training set
Top 5%
1.3%
17
BMC Genomics
328 papers in training set
Top 4%
1.0%
18
Chemical Engineering Journal
10 papers in training set
Top 0.4%
1.0%
19
Journal of Chemical Information and Modeling
207 papers in training set
Top 3%
0.8%
20
eLife
5422 papers in training set
Top 55%
0.8%
21
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.8%
22
Journal of The Royal Society Interface
189 papers in training set
Top 4%
0.8%
23
Metabolic Engineering Communications
20 papers in training set
Top 0.3%
0.8%
24
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 45%
0.7%
25
iScience
1063 papers in training set
Top 34%
0.7%
26
Environmental Science & Technology
64 papers in training set
Top 2%
0.7%
27
Nature Machine Intelligence
61 papers in training set
Top 4%
0.6%
28
Frontiers in Microbiology
375 papers in training set
Top 10%
0.6%
29
npj Digital Medicine
97 papers in training set
Top 4%
0.6%
30
BMC Bioinformatics
383 papers in training set
Top 8%
0.5%