MetaReact: A Reaction-Aware Transformer for End-to-End Prediction of Drug Metabolism

Wang, Y.; Rao, J.; Zhang, W.; Shi, Y.; Zeng, C.; Cui, R.; Wang, Y.; Xiong, J.; Li, X.; Zheng, M.

2026-03-18 biochemistry
10.64898/2026.03.14.711529 bioRxiv

Accurate prediction of drug metabolites and enzyme selectivity is essential for rational drug design and safety assessment. However, existing computational approaches are often limited to specific enzyme families or reaction types and lack the capacity to model enzyme-subtype specificity or prioritize major metabolites. Here, we present MetaReact, an end-to-end, generalizable Transformer-based model that unifies the prediction of metabolic enzymes, metabolites, and sites of metabolism (SOM). By integrating the structure-aware encoding ReactSeq and chemistry reaction-based pretraining, MetaReact consistently outperforms state-of-the-art methods across multiple benchmarks under three settings: enzyme-agnostic, enzyme-completion, and enzyme-conditioned. Notably, it achieves 60% Top-3 accuracy in identifying major metabolites, along with superior CYP450 enzyme-subtype prediction and SOM recognition. Case studies validate its applicability to complex natural products, synthetic cannabinoids, and clinical candidates, facilitating toxicity assessment and molecular optimization. This scalable, rule-free solution advances human metabolism modeling, with potential for computational pharmacokinetics and early drug discovery.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1. Nature Communications | 4913 papers in training set | Top 18% | 10.0%
2. Journal of Chemical Information and Modeling | 207 papers in training set | Top 0.6% | 10.0%
3. PLOS ONE | 4510 papers in training set | Top 29% | 6.3%
4. Journal of Cheminformatics | 25 papers in training set | Top 0.1% | 6.2%
5. PLOS Computational Biology | 1633 papers in training set | Top 7% | 4.8%
6. Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 21% | 3.5%
7. Bioinformatics | 1061 papers in training set | Top 5% | 3.5%
8. Nature Machine Intelligence | 61 papers in training set | Top 1.0% | 3.5%
9. Metabolites | 50 papers in training set | Top 0.2% | 3.5%
50% of probability mass above
10. eLife | 5422 papers in training set | Top 34% | 2.3%
11. Briefings in Bioinformatics | 326 papers in training set | Top 3% | 2.1%
12. Advanced Science | 249 papers in training set | Top 10% | 1.9%
13. Communications Biology | 886 papers in training set | Top 7% | 1.9%
14. Chemical Science | 71 papers in training set | Top 0.9% | 1.8%
15. Communications Chemistry | 39 papers in training set | Top 0.2% | 1.8%
16. Nucleic Acids Research | 1128 papers in training set | Top 11% | 1.7%
17. Computational and Structural Biotechnology Journal | 216 papers in training set | Top 5% | 1.7%
18. Cell Systems | 167 papers in training set | Top 8% | 1.5%
19. Nature Chemical Biology | 104 papers in training set | Top 2% | 1.5%
20. Scientific Reports | 3102 papers in training set | Top 62% | 1.5%
21. Patterns | 70 papers in training set | Top 1% | 1.3%
22. iScience | 1063 papers in training set | Top 20% | 1.3%
23. Journal of Medicinal Chemistry | 68 papers in training set | Top 1.0% | 0.9%
24. Nature | 575 papers in training set | Top 14% | 0.9%
25. Science | 429 papers in training set | Top 18% | 0.9%
26. Cell Reports Medicine | 140 papers in training set | Top 7% | 0.8%
27. npj Digital Medicine | 97 papers in training set | Top 3% | 0.8%
28. Nature Methods | 336 papers in training set | Top 6% | 0.8%
29. Frontiers in Molecular Biosciences | 100 papers in training set | Top 4% | 0.8%
30. Acta Pharmaceutica Sinica B | 11 papers in training set | Top 1.0% | 0.7%
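
The stated 50% cutoff can be checked against the listed probabilities with a running sum. The sketch below is only an illustration: the function name `journals_to_reach` is hypothetical, and the values are the rounded percentages shown in the list, so the result holds only up to that rounding.

```python
from itertools import accumulate

# Rounded probabilities (percent) of the ten highest-ranked journals,
# copied from the list above.
top_probs = [10.0, 10.0, 6.3, 6.2, 4.8, 3.5, 3.5, 3.5, 3.5, 2.3]


def journals_to_reach(probs, threshold):
    """Return how many leading entries are needed for the running sum
    to reach `threshold`, or None if it is never reached."""
    for count, running in enumerate(accumulate(probs), start=1):
        if running >= threshold:
            return count
    return None


print(journals_to_reach(top_probs, 50.0))  # → 9
```

The first eight entries sum to roughly 47.8%, and the ninth brings the total to roughly 51.3%, which is consistent with the cutoff falling after rank 9.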