
Agentic AI Integrated with Scientific Knowledge: Laboratory Validation in Systems Biology

Brunnsaker, D.; Gower, A. H.; Naval, P.; Bjurström, E. Y.; Kronström, F.; Tiukova, I. A.; King, R. D.

2025-08-18 systems biology
10.1101/2025.06.24.661378 bioRxiv

Automation is transforming scientific discovery by enabling systematic exploration of complex hypotheses. Large language models (LLMs) perform well across diverse tasks and promise to accelerate research, but often struggle to interact with logical structures. Here we present a framework integrating LLM-based agents with laboratory automation, guided by a logical scaffold incorporating symbolic relational learning, structured vocabularies, and experimental constraints. This integration reduces output incoherence and improves reliability in automated workflows. We couple this AI-driven approach to automated cell-culture and metabolomics platforms, enabling hypothesis validation and refinement, yielding a flexible system for scientific discovery. We validate the system in Saccharomyces cerevisiae, identifying novel interactions, including glutamate-induced synergistic growth inhibition in spermine-treated cells and aminoadipate's partial rescue of formic-acid stress. All hypotheses, experiments, and data are captured in a graph database employing controlled vocabularies. Existing ontologies are extended, and a novel representation of scientific hypotheses is presented using description logics. This work enables a more reliable, machine-driven discovery process in systems biology.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1. Molecular Systems Biology: 142 papers in training set, Top 0.1%, 26.0%
2. Bioinformatics: 1061 papers in training set, Top 2%, 12.5%
3. Nature Methods: 336 papers in training set, Top 1%, 8.4%
4. Bioinformatics Advances: 184 papers in training set, Top 0.3%, 6.8%
50% of probability mass above
5. BMC Bioinformatics: 383 papers in training set, Top 2%, 4.9%
6. Cell Systems: 167 papers in training set, Top 3%, 4.3%
7. Nature Communications: 4913 papers in training set, Top 39%, 3.6%
8. Nucleic Acids Research: 1128 papers in training set, Top 6%, 3.6%
9. npj Systems Biology and Applications: 99 papers in training set, Top 0.6%, 3.1%
10. Genome Medicine: 154 papers in training set, Top 3%, 2.7%
11. PLOS Computational Biology: 1633 papers in training set, Top 12%, 2.6%
12. iScience: 1063 papers in training set, Top 10%, 2.1%
13. PLOS ONE: 4510 papers in training set, Top 55%, 1.7%
14. Frontiers in Molecular Biosciences: 100 papers in training set, Top 2%, 1.3%
15. Cell Reports Methods: 141 papers in training set, Top 4%, 0.9%
16. Nature: 575 papers in training set, Top 14%, 0.9%
17. ACS Synthetic Biology: 256 papers in training set, Top 2%, 0.9%
18. Scientific Reports: 3102 papers in training set, Top 73%, 0.8%
19. eLife: 5422 papers in training set, Top 55%, 0.8%
20. Genome Biology: 555 papers in training set, Top 7%, 0.7%
21. Computational and Structural Biotechnology Journal: 216 papers in training set, Top 9%, 0.7%
22. Proceedings of the National Academy of Sciences: 2130 papers in training set, Top 44%, 0.7%
23. Briefings in Bioinformatics: 326 papers in training set, Top 7%, 0.6%
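
The 50% cutoff stated above the list can be checked from the listed probabilities. The short Python sketch below sums them in rank order and reports how many top-ranked journals are needed to reach the threshold. It assumes the cutoff is defined as the smallest number of journals whose cumulative probability reaches 50%; the page does not spell out its rule, and the function name `journals_to_reach` is hypothetical.

```python
# Predicted probabilities (%) for the ten top-ranked journals,
# copied from the list above in rank order.
probabilities = [26.0, 12.5, 8.4, 6.8, 4.9, 4.3, 3.6, 3.6, 3.1, 2.7]


def journals_to_reach(probs, threshold):
    """Return (count, cumulative mass) for the smallest prefix of
    `probs` whose running sum reaches `threshold`.

    Assumption: this is how the cutoff is defined; the page itself
    does not state its rule. Returns (None, total) if never reached.
    """
    total = 0.0
    for count, p in enumerate(probs, start=1):
        total += p
        if total >= threshold:
            return count, total
    return None, total


count, mass = journals_to_reach(probabilities, 50.0)
print(count, round(mass, 1))
```

Under that assumption, the top three journals sum to 46.9% and the fourth brings the total to 53.7%, which agrees with the cutoff being placed after rank 4.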