Back

An AI-agent-orchestrated grey-box Transformer framework for sparse pharmacokinetic curve reconstruction and pharmacometric model initialization

Chen, J.; Wang, J.; Du, S.; Chen, Y.; Li, K.; Song, J.; Liu, D.

2026-05-27 pharmacology and toxicology
10.64898/2026.05.23.727373 bioRxiv
Show abstract

Clinical pharmacokinetic (PK) modelling is constrained by sparse sampling, limited general-isability of single-drug models, and labour-intensive workflows, making it difficult to infer complete drug exposure from limited concentration observations. We present the Pharmacokinetic Foundation Model (PKFM), a grey-box Transformer framework pre-trained across 32 drugs that reconstructs concentration-time profiles from sparse concentration observations, dosing events, molecular descriptors, and physiological covariates while preserving output interpretability. In representative oral PK curves, three sparse input points recovered the principal absorption-elimination trajectory, achieving coefficient of determination (R2) = 0.992 for Midazolam oral and R2 = 0.990 for Verapamil oral. Using reconstructed curves in NONMEM (nonlinear mixed-effects modelling) improved covariance stability and individual prediction accuracy. Contrastive-learning embeddings supported Top-10 physiologically based pharmacokinetic (PBPK) candidate retrieval, with 75.6% of observations within the 2-fold range. A pharmacometrics-informed AI Agent (PM Agent) outperformed general-purpose programming tools in stability and pairwise win rate on a standardised modelling benchmark, with each run requiring human pharmaco-metrician confirmation before downstream use. These results support cross-drug pre-trained PK models as an information-completion layer for sparse PK evidence and a structured scaffold for the modelling workflow; clinical or regulatory use requires prospective validation, broader external benchmarking, and independent expert assessment.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 6%
18.4%
2
Nature Machine Intelligence
61 papers in training set
Top 0.2%
10.0%
3
PLOS Computational Biology
1633 papers in training set
Top 4%
8.3%
4
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.1%
6.2%
5
eBioMedicine
130 papers in training set
Top 0.1%
4.8%
6
Bioinformatics
1061 papers in training set
Top 5%
3.9%
50% of probability mass above
7
PLOS ONE
4510 papers in training set
Top 38%
3.6%
8
npj Digital Medicine
97 papers in training set
Top 1%
3.5%
9
Communications Biology
886 papers in training set
Top 5%
2.1%
10
Scientific Reports
3102 papers in training set
Top 50%
2.1%
11
Frontiers in Pharmacology
100 papers in training set
Top 2%
2.1%
12
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.1%
13
Advanced Science
249 papers in training set
Top 10%
1.9%
14
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 30%
1.9%
15
Science Translational Medicine
111 papers in training set
Top 3%
1.7%
16
Science Advances
1098 papers in training set
Top 18%
1.7%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.6%
18
Clinical and Translational Science
21 papers in training set
Top 0.5%
1.6%
19
iScience
1063 papers in training set
Top 20%
1.3%
20
Genome Medicine
154 papers in training set
Top 6%
1.2%
21
Nucleic Acids Research
1128 papers in training set
Top 14%
1.1%
22
eLife
5422 papers in training set
Top 51%
1.1%
23
Journal of Cheminformatics
25 papers in training set
Top 0.5%
0.9%
24
Journal of Chemical Information and Modeling
207 papers in training set
Top 3%
0.9%
25
Patterns
70 papers in training set
Top 2%
0.8%
26
European Respiratory Journal
54 papers in training set
Top 2%
0.8%
27
Nature Biomedical Engineering
42 papers in training set
Top 2%
0.7%