Back

Artemis: Harnessing Knowledge Graphs for Next-Generation Drug Target Prioritization

Kiselev, V. Y.; Ainscow, E.

2026-01-29 bioinformatics
10.64898/2026.01.27.701959 bioRxiv
Show abstract

Knowledge graphs (KGs) have become an important asset in biomedical research and drug discovery by enabling the structured integration of heterogeneous biological knowledge. When combined with machine learning (ML), KGs support the identification of novel drug-target relationships, but existing approaches are often KG-centric, relying primarily on graph structure and embeddings while overlooking disease-specific biological and clinical context. Moreover, many high-impact applications depend on proprietary KG infrastructures, limiting accessibility for the broader research community. Here, we introduce Artemis, a practical and generalisable machine-learning framework for indication-aware target prioritisation that integrates public biomedical KGs with clinical evidence from the ChEMBL database. Artemis derives graph-based representations of clinically validated drug targets from multiple publicly available KGs and augments them with disease-relevant clinical features from ChEMBL. This hybrid feature space is used to train supervised ML models across seven disease indications, with performance assessed via cross-validation and guided parameter optimisation. The framework is further evaluated on emerging breast cancer targets reported at the San Antonio Breast Cancer Symposium 2024, demonstrating its ability to prioritise novel candidates. Overall, this work demonstrates that publicly available KGs can be used for actionable, translational target discovery when coupled with clinical data. Artemis provides an accessible, scalable, and cost-efficient alternative to proprietary KG platforms. Thereby offering a practical solution for researchers seeking to prioritise therapeutic targets in real-world drug discovery settings. Key PointsO_LIKG applications can support the identification of novel drug-target relationships but rely primarily on graph structure while overlooking disease-specific biological and clinical context. C_LIO_LIArtemis performs indication-aware target prioritisation that integrates public biomedical KGs with clinical evidence from the ChEMBL database. C_LIO_LIArtemis is evaluated on emerging breast cancer targets reported at the San Antonio Breast Cancer Symposium 2024, demonstrating its ability to prioritise novel candidates. C_LIO_LIArtemis provides an accessible, scalable, and cost-efficient alternative to proprietary KG platforms offering a practical solution for researchers seeking to prioritise therapeutic targets in real-world drug discovery settings. C_LI

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 0.5%
38.0%
2
Bioinformatics Advances
184 papers in training set
Top 0.1%
12.4%
50% of probability mass above
3
Journal of Cheminformatics
25 papers in training set
Top 0.1%
6.4%
4
BMC Bioinformatics
383 papers in training set
Top 2%
4.2%
5
Journal of Chemical Information and Modeling
207 papers in training set
Top 1%
3.1%
6
Scientific Reports
3102 papers in training set
Top 44%
2.8%
7
PLOS ONE
4510 papers in training set
Top 52%
1.8%
8
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
9
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.5%
10
Patterns
70 papers in training set
Top 1%
1.5%
11
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
12
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 36%
1.3%
13
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.4%
1.2%
14
Nature Communications
4913 papers in training set
Top 56%
1.2%
15
GigaScience
172 papers in training set
Top 2%
1.2%
16
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.2%
17
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
1.0%
18
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
1.0%
19
iScience
1063 papers in training set
Top 26%
0.9%
20
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.9%
21
Metabolites
50 papers in training set
Top 0.9%
0.9%
22
Advanced Science
249 papers in training set
Top 19%
0.8%
23
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
24
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.8%
0.8%
25
Frontiers in Genetics
197 papers in training set
Top 12%
0.5%
26
BMC Medical Informatics and Decision Making
39 papers in training set
Top 3%
0.5%
27
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.8%
0.5%