Back

ToxMCP: Guardrailed, Auditable Agentic Workflows for Computational Toxicology via the Model Context Protocol

Djidrovski, I.

2026-02-09 pharmacology and toxicology
10.64898/2026.02.06.703989 bioRxiv
Show abstract

Computational toxicology increasingly relies on evidence, high-throughput screening, predictive (Q)SAR, adverse outcome pathways (AOPs), physiologically based kinetic (PBK/PBPK) models, and exposure databases to support integrated approaches to testing and assessment (IATA). Yet the practical workflow remains fragmented across heterogeneous tools, data formats, and licensing regimes. Large language models (LLMs) can lower the interface barrier, but free-text interaction alone is insufficient for regulatory-grade science: it is difficult to audit, difficult to reproduce, and prone to overconfident errors. Here we introduce ToxMCP, a collection of Model Context Protocol (MCP) servers designed as a guardrailed, federated integration layer for reproducible computational toxicology. ToxMCP wraps toxicology-relevant capabilities, including chemical identity and regulatory context (EPA CompTox), rapid ADMET profiling (ADMETlab 3.0), mechanistic pathway retrieval and structuring (AOP knowledge services), quantitative read-across workflows (OECD QSAR Toolbox), and mechanistic PBPK simulation (Open Systems Pharmacology Suite), as typed tools with explicit inputs/outputs, provenance bundles, and policy hooks (e.g., applicability domain checks, critical-action confirmation, and role-based access control). We demonstrate how natural-language risk questions can be compiled into auditable tool invocations, returning mechanistic metrics such as tissue AUC/Cmax, sensitivity curves, and conservative points of departure. We further outline an evaluation protocol for measuring computational reproducibility, task throughput, and scientific utility across multi-tool toxicology tasks. ToxMCP reframes LLMs for toxicology from conversational summarizers into accountable orchestrators of established scientific kernels, enabling faster iteration while preserving the evidentiary structure expected in regulatory and academic settings. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=110 SRC="FIGDIR/small/703989v1_ufig1.gif" ALT="Figure 1"> View larger version (52K): org.highwire.dtl.DTLVardef@1b8ccceorg.highwire.dtl.DTLVardef@18e0703org.highwire.dtl.DTLVardef@16e87feorg.highwire.dtl.DTLVardef@1a24f13_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Environmental Health Perspectives
17 papers in training set
Top 0.1%
9.5%
2
Environmental Science & Technology
64 papers in training set
Top 0.4%
7.4%
3
PLOS ONE
4510 papers in training set
Top 23%
7.4%
4
Toxicological Sciences
38 papers in training set
Top 0.1%
7.0%
5
Nature Communications
4913 papers in training set
Top 27%
6.6%
6
Archives of Toxicology
14 papers in training set
Top 0.1%
5.0%
7
Patterns
70 papers in training set
Top 0.1%
5.0%
8
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1.0%
4.5%
50% of probability mass above
9
Bioinformatics
1061 papers in training set
Top 5%
4.3%
10
eLife
5422 papers in training set
Top 21%
4.1%
11
PLOS Computational Biology
1633 papers in training set
Top 13%
2.2%
12
Frontiers in Pharmacology
100 papers in training set
Top 2%
1.8%
13
Nucleic Acids Research
1128 papers in training set
Top 10%
1.8%
14
Nature Protocols
30 papers in training set
Top 0.1%
1.8%
15
Environment International
42 papers in training set
Top 0.8%
1.5%
16
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.1%
17
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.1%
18
iScience
1063 papers in training set
Top 25%
0.9%
19
Science of The Total Environment
179 papers in training set
Top 4%
0.9%
20
Clinical and Translational Science
21 papers in training set
Top 0.8%
0.9%
21
Scientific Reports
3102 papers in training set
Top 70%
0.9%
22
Advanced Science
249 papers in training set
Top 17%
0.8%
23
MethodsX
14 papers in training set
Top 0.3%
0.8%
24
Cell Reports Methods
141 papers in training set
Top 4%
0.8%
25
npj Digital Medicine
97 papers in training set
Top 3%
0.8%
26
The Lancet Digital Health
25 papers in training set
Top 1%
0.7%
27
Interface Focus
14 papers in training set
Top 0.3%
0.7%
28
Scientific Data
174 papers in training set
Top 3%
0.7%
29
RSC Advances
18 papers in training set
Top 2%
0.7%
30
Chemosphere
15 papers in training set
Top 0.7%
0.5%