
Large Language Model-Driven Prioritization of Alzheimer's Disease Drug Targets Across Multidimensional Criteria

Adaszewski, S.; Schindler, T.

2025-12-29 health informatics
10.64898/2025.12.28.25343106 medRxiv

Large language models (LLMs) offer new opportunities to synthesize the vast and heterogeneous biomedical literature, yet their potential to support drug target prioritization in complex diseases such as Alzheimer's disease (AD) remains largely unexplored. Here, we introduce an LLM-driven framework that evaluates and ranks AD therapeutic targets across six criteria central to pharmaceutical decision-making: biological confidence, technical feasibility, clinical developability, patient impact, competitive landscape, and safety assessment. Using Gemini 2.5 Pro augmented with real-time web search, we performed large-scale pairwise comparative evaluations and pointwise scoring across a focused set of 522 AD-associated targets with high-quality chemical probes, a tractable subset enriched for clinically advanced targets. We implemented a novel pairwise QuickSort-based ranking procedure that leverages the LLM as a comparative oracle, and benchmarked its performance against pointwise scoring across 16 replicate runs per criterion. Retrieval-augmented LLM reasoning substantially improved early enrichment of clinically validated AD targets, outperforming LLM-only prompting and approaching the performance of the OpenTargets association benchmark. Pairwise comparative reasoning outperformed pointwise scoring on five of six criteria, yielding higher stability, stronger inter-criterion structure, and markedly improved normalized gain metrics. Multi-objective integration using Pareto fronts and utopia-point scoring further enhanced consensus and robustness, producing holistic rankings that nearly matched the strongest individual criteria while exhibiting superior cross-category coherence. Challenges remained in assessing competitiveness and safety, two domains with sparse or inconsistent literature representation, which highlights areas where hybrid models integrating structured datasets may be required.
Together, these results demonstrate that retrieval-augmented LLMs, when combined with structured comparative prompting and multi-criteria integration, can approximate expert-level reasoning and meaningfully enrich target prioritization pipelines for AD. This framework provides a scalable, interpretable, and biologically grounded approach for early-stage drug discovery, with broad applicability to other complex diseases.

[Graphical Abstract: Figure 1]
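
The pairwise QuickSort-based ranking mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `quicksort_rank`, the `prefer` callback, the placeholder target names, and the toy scores are all assumptions made for illustration. In the described framework the comparison would be answered by an LLM for one criterion at a time; here a deterministic stand-in oracle is used so the example runs offline.

```python
import random
from typing import Callable, List, Optional, TypeVar

T = TypeVar("T")


def quicksort_rank(
    items: List[T],
    prefer: Callable[[T, T], bool],
    rng: Optional[random.Random] = None,
) -> List[T]:
    """Order items best-first using only pairwise calls to `prefer`.

    `prefer(a, b)` must return True when `a` should rank above `b`.
    It is a hypothetical stand-in for an LLM comparison on one criterion.
    """
    if rng is None:
        rng = random.Random(0)  # fixed seed so pivot choices are reproducible
    if len(items) <= 1:
        return list(items)

    # Pick a random pivot and compare every other item against it once.
    pivot_index = rng.randrange(len(items))
    pivot = items[pivot_index]
    rest = items[:pivot_index] + items[pivot_index + 1:]

    better: List[T] = []
    worse: List[T] = []
    for item in rest:
        if prefer(item, pivot):
            better.append(item)
        else:
            worse.append(item)

    # Recurse on each side; the pivot sits between the two partitions.
    return (
        quicksort_rank(better, prefer, rng)
        + [pivot]
        + quicksort_rank(worse, prefer, rng)
    )


# Toy oracle: placeholder targets with made-up scores (not data from the paper).
toy_scores = {"TARGET_A": 0.7, "TARGET_B": 0.1, "TARGET_C": 0.9, "TARGET_D": 0.4}


def toy_prefer(a: str, b: str) -> bool:
    return toy_scores[a] > toy_scores[b]


ranked = quicksort_rank(list(toy_scores), toy_prefer)
print(ranked)
```

With a consistent oracle the result is a full best-first ordering obtained from roughly n log n comparisons on average, rather than all n(n-1)/2 pairs. A real LLM oracle can give inconsistent answers, which is presumably one reason the abstract reports 16 replicate runs per criterion.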

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1. Advanced Science: 249 papers in training set; Top 0.8%; 12.8%
2. Patterns: 70 papers in training set; Top 0.1%; 9.2%
3. Computers in Biology and Medicine: 120 papers in training set; Top 0.2%; 7.2%
4. npj Digital Medicine: 97 papers in training set; Top 0.7%; 6.9%
5. Bioinformatics: 1061 papers in training set; Top 4%; 6.4%
6. Artificial Intelligence in the Life Sciences: 11 papers in training set; Top 0.1%; 4.9%
7. Briefings in Bioinformatics: 326 papers in training set; Top 2%; 4.0%

50% of probability mass above

8. Computational and Structural Biotechnology Journal: 216 papers in training set; Top 2%; 3.6%
9. Journal of Chemical Information and Modeling: 207 papers in training set; Top 1%; 3.6%
10. eBioMedicine: 130 papers in training set; Top 0.3%; 3.6%
11. Nature Communications: 4913 papers in training set; Top 43%; 3.1%
12. Scientific Reports: 3102 papers in training set; Top 46%; 2.5%
13. iScience: 1063 papers in training set; Top 14%; 1.7%
14. PLOS ONE: 4510 papers in training set; Top 53%; 1.7%
15. Communications Biology: 886 papers in training set; Top 11%; 1.5%
16. Chemical Science: 71 papers in training set; Top 1%; 1.3%
17. NAR Genomics and Bioinformatics: 214 papers in training set; Top 2%; 1.3%
18. Nature Machine Intelligence: 61 papers in training set; Top 3%; 1.2%
19. Cell Reports Medicine: 140 papers in training set; Top 6%; 1.0%
20. The Lancet Digital Health: 25 papers in training set; Top 1%; 0.8%
21. PLOS Computational Biology: 1633 papers in training set; Top 24%; 0.8%
22. Bioinformatics Advances: 184 papers in training set; Top 5%; 0.7%
23. Alzheimer's Research & Therapy: 52 papers in training set; Top 2%; 0.7%
24. Heliyon: 146 papers in training set; Top 7%; 0.7%
25. Nature Biomedical Engineering: 42 papers in training set; Top 3%; 0.6%
26. IEEE Transactions on Computational Biology and Bioinformatics: 17 papers in training set; Top 0.9%; 0.5%
27. Journal of Allergy and Clinical Immunology: 25 papers in training set; Top 1%; 0.5%
28. European Respiratory Journal: 54 papers in training set; Top 3%; 0.5%
29. NeuroImage: 813 papers in training set; Top 7%; 0.5%
30. Nature Protocols: 30 papers in training set; Top 0.4%; 0.5%