Drug-Target Interaction Prediction with PIGLET

Carpenter, K. A.; Altman, R. B.

2026-02-18 bioinformatics
10.64898/2026.02.18.706530 bioRxiv
Drug-target interaction (DTI) prediction is a key task for computer-aided drug development that has been widely approached by deep learning models. Despite extremely high reported performance, these models have yet to find widespread success in accelerating real-world drug discovery. In contrast with the most common approach of creating embeddings from one-dimensional or three-dimensional representations of the input drug and input target, we create a novel graph transformer method for DTI prediction that operates on a proteome-wide knowledge graph of binding pocket similarity, protein-protein interactions, drug similarity, and known binding relationships. We benchmark our method, named PIGLET, against existing DTI prediction models on the Human dataset. We assess performance with two different splitting strategies: the frequently reported random split, and a novel, more rigorous drug-based split. All models perform similarly well on the random split, and PIGLET outperforms all models on the drug-based split. We highlight the utility of PIGLET through a real-world drug discovery case study.
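
The abstract does not define the drug-based split. As a minimal sketch, assuming it means that no drug appears in both the training and test sets, the idea could look like the following. The function name `drug_based_split`, the `(drug_id, target_id, label)` tuple layout, and the toy data are all hypothetical and are not taken from the paper.

```python
import random


def drug_based_split(pairs, test_fraction=0.2, seed=0):
    """Split (drug_id, target_id, label) tuples so that train and test share no drug.

    Assumption: this is one plausible reading of a "drug-based split";
    the paper's actual procedure may differ.
    """
    # Collect the unique drugs and shuffle them reproducibly.
    drugs = sorted({drug for drug, _, _ in pairs})
    rng = random.Random(seed)
    rng.shuffle(drugs)

    # Hold out a fraction of the drugs (at least one) for testing.
    n_test = max(1, int(len(drugs) * test_fraction))
    test_drugs = set(drugs[:n_test])

    # Every pair follows its drug into train or test.
    train = [p for p in pairs if p[0] not in test_drugs]
    test = [p for p in pairs if p[0] in test_drugs]
    return train, test


# Toy interaction records, invented for illustration only.
pairs = [
    ("D1", "T1", 1), ("D1", "T2", 0), ("D2", "T1", 1),
    ("D3", "T3", 0), ("D4", "T2", 1), ("D5", "T3", 0),
]
train, test = drug_based_split(pairs, test_fraction=0.4)
```

A random split would instead shuffle the pairs themselves, so the same drug could appear on both sides. Under the assumption above, that would make the random split the easier of the two benchmarks.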

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

Rank | Journal | Papers in training set | Percentile | Predicted probability
1 | Bioinformatics | 1061 | Top 2% | 18.4%
2 | Cell Systems | 167 | Top 2% | 7.1%
3 | Nature Methods | 336 | Top 2% | 6.7%
4 | Journal of Chemical Information and Modeling | 207 | Top 0.8% | 6.7%
5 | Bioinformatics Advances | 184 | Top 0.7% | 4.8%
6 | Nature Communications | 4913 | Top 33% | 4.8%
7 | Journal of Cheminformatics | 25 | Top 0.1% | 4.3%
50% of probability mass above
8 | Proceedings of the National Academy of Sciences | 2130 | Top 18% | 3.9%
9 | Briefings in Bioinformatics | 326 | Top 2% | 3.9%
10 | Nature Machine Intelligence | 61 | Top 1.0% | 3.5%
11 | Scientific Reports | 3102 | Top 42% | 3.0%
12 | Nucleic Acids Research | 1128 | Top 8% | 2.6%
13 | Advanced Science | 249 | Top 9% | 2.1%
14 | PLOS Computational Biology | 1633 | Top 15% | 1.9%
15 | Nature Biotechnology | 147 | Top 4% | 1.8%
16 | Communications Biology | 886 | Top 11% | 1.5%
17 | IEEE Transactions on Computational Biology and Bioinformatics | 17 | Top 0.3% | 1.3%
18 | eLife | 5422 | Top 49% | 1.2%
19 | Patterns | 70 | Top 2% | 1.2%
20 | PLOS ONE | 4510 | Top 65% | 0.9%
21 | BMC Bioinformatics | 383 | Top 6% | 0.9%
22 | Genome Medicine | 154 | Top 7% | 0.9%
23 | Chemical Science | 71 | Top 2% | 0.7%
24 | NAR Genomics and Bioinformatics | 214 | Top 4% | 0.7%
25 | iScience | 1063 | Top 33% | 0.7%
26 | Computational and Structural Biotechnology Journal | 216 | Top 10% | 0.7%
27 | Genome Research | 409 | Top 5% | 0.6%
28 | Nature Computational Science | 50 | Top 2% | 0.6%
29 | Nature Biomedical Engineering | 42 | Top 2% | 0.6%
30 | Biophysical Journal | 545 | Top 6% | 0.6%
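
The statement that the top 7 journals account for 50% of the predicted probability mass can be checked with a running sum over the listed percentages. The sketch below uses the first ten values from the listing above. The helper name `journals_to_reach` is hypothetical, and the percentages are rounded to one decimal place as displayed, so the total is approximate.

```python
def journals_to_reach(probabilities, threshold=50.0):
    """Return (rank, cumulative mass) at which the running sum first reaches threshold.

    Returns None if the threshold is never reached.
    """
    total = 0.0
    for rank, p in enumerate(probabilities, start=1):
        total += p
        if total >= threshold:
            return rank, total
    return None


# Predicted probabilities (in percent) for ranks 1-10, copied from the listing.
probs = [18.4, 7.1, 6.7, 6.7, 4.8, 4.8, 4.3, 3.9, 3.9, 3.5]
rank, mass = journals_to_reach(probs)
```

Ranks 1 through 6 sum to about 48.5%, so the 50% threshold is first crossed at rank 7, with a cumulative mass of about 52.8%.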