From GWAS to drug: A framework for drug candidate prioritisation using a gene expression signature matching approach

Chauquet, S.; Jiang, J.-C.; Barker, L. F.; Hunter, Z. L.; Singh, G.; Wray, N. R.; McRae, A. F.; Shah, S.

2026-04-24 genetic and genomic medicine
10.64898/2026.04.22.26349470 medRxiv

Drug targets supported by human genetic evidence have significantly higher approval rates, making genome-wide association studies (GWAS) a valuable resource for drug candidate prioritisation. Transcriptome-wide association study (TWAS) signature-matching is an emerging in silico approach that integrates GWAS data with expression quantitative trait loci (eQTL) to generate a disease gene expression signature, which is then compared against drug perturbation databases such as the Connectivity Map (CMap). Despite its recent adoption, there is no consensus on the optimal methodology. Here, we systematically benchmark key parameters, including TWAS method, eQTL tissue model, similarity metric, gene set size, and CMap cell line, using LDL cholesterol, familial combined hyperlipidemia, and asthma as proof-of-concept traits. We demonstrate that while TWAS signature-matching can successfully prioritise known first-line treatments, performance is highly sensitive to parameter choice; for instance, the choice of cell line used for drug signatures can, on its own, dramatically alter drug prioritisation. Based on these findings, we propose a best-practice framework for robust, genetically informed drug prioritisation using TWAS signature-matching.
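To make the signature-matching idea concrete, the sketch below scores each drug by how strongly its perturbation signature anti-correlates with a disease signature over the genes they share. It is an illustration only, not the authors' pipeline: the Spearman correlation is one assumed choice of similarity metric among those the abstract says were benchmarked, and the gene names, drug names, z-scores, and function names (`spearman`, `prioritise`) are all hypothetical.

```python
from statistics import mean


def rank(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input is constant."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / (vx * vy) ** 0.5


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(rank(x), rank(y))


def prioritise(disease_sig, drug_sigs, min_genes=3):
    """Sort drugs from most negative to most positive correlation.

    A strongly negative score means the drug signature tends to reverse
    the disease signature (assumed here to be the desirable direction).
    """
    scores = {}
    for drug, sig in drug_sigs.items():
        genes = sorted(set(disease_sig) & set(sig))
        if len(genes) < min_genes:
            continue  # too few shared genes to score
        scores[drug] = spearman(
            [disease_sig[g] for g in genes],
            [sig[g] for g in genes],
        )
    return sorted(scores.items(), key=lambda item: item[1])


# Hypothetical toy data: TWAS z-scores and drug-induced expression changes.
disease_sig = {"GENE_A": 3.1, "GENE_B": -2.4, "GENE_C": 1.2, "GENE_D": -0.5}
drug_sigs = {
    "drug_mimicking": {"GENE_A": 1.5, "GENE_B": -1.1, "GENE_C": 0.4, "GENE_D": -0.2},
    "drug_reversing": {"GENE_A": -2.0, "GENE_B": 1.8, "GENE_C": -0.7, "GENE_D": 0.3},
}
ranked = prioritise(disease_sig, drug_sigs)
print(ranked)
```

In this toy example the drug whose signature opposes the disease signature is ranked first. In practice, the parameters named in the abstract (TWAS method, eQTL tissue model, similarity metric, gene set size, and CMap cell line) would each change the inputs or the scoring step.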

Matching journals

The top 7 journals account for 50% of the predicted probability mass.
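The 50% figure can be checked by summing the listed probabilities in rank order: the first six journals reach 48.8%, and the seventh brings the total to 52.7%. A minimal sketch of that check follows, using the eight highest listed values; the function name `journals_to_reach` is hypothetical.

```python
# Predicted probabilities (%) of the eight highest-ranked journals, in rank order.
probabilities = [17.2, 9.9, 7.1, 6.2, 4.2, 4.2, 3.9, 3.2]


def journals_to_reach(probs, threshold=50.0):
    """Return (count, cumulative total) once the running sum reaches threshold."""
    total = 0.0
    for count, p in enumerate(probs, start=1):
        total += p
        if total >= threshold:
            return count, total
    return None  # threshold never reached with the values given


count, total = journals_to_reach(probabilities)
print(count, round(total, 1))
```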

Rank | Journal | Papers in training set | Percentile | Predicted probability
1 | Genome Medicine | 154 | Top 0.2% | 17.2%
2 | Briefings in Bioinformatics | 326 | Top 0.4% | 9.9%
3 | The American Journal of Human Genetics | 206 | Top 0.6% | 7.1%
4 | Nature Communications | 4913 | Top 30% | 6.2%
5 | Cell Systems | 167 | Top 3% | 4.2%
6 | Cell Genomics | 162 | Top 1.0% | 4.2%
7 | Bioinformatics | 1061 | Top 5% | 3.9%
50% of probability mass above
8 | Nature Genetics | 240 | Top 3% | 3.2%
9 | Scientific Reports | 3102 | Top 44% | 2.7%
10 | iScience | 1063 | Top 8% | 2.6%
11 | npj Digital Medicine | 97 | Top 2% | 2.0%
12 | Nature Biomedical Engineering | 42 | Top 0.6% | 2.0%
13 | PLOS Computational Biology | 1633 | Top 15% | 1.9%
14 | Bioinformatics Advances | 184 | Top 3% | 1.6%
15 | Nucleic Acids Research | 1128 | Top 12% | 1.5%
16 | Genomics, Proteomics & Bioinformatics | 171 | Top 4% | 1.5%
17 | eLife | 5422 | Top 46% | 1.5%
18 | Journal of the American Medical Informatics Association | 61 | Top 1% | 1.3%
19 | PLOS ONE | 4510 | Top 59% | 1.3%
20 | Communications Biology | 886 | Top 15% | 1.2%
21 | Nature Machine Intelligence | 61 | Top 3% | 1.2%
22 | BMC Medical Genomics | 36 | Top 0.8% | 1.2%
23 | Frontiers in Pharmacology | 100 | Top 4% | 0.9%
24 | Proceedings of the National Academy of Sciences | 2130 | Top 42% | 0.9%
25 | Human Genetics and Genomics Advances | 70 | Top 0.7% | 0.8%
26 | Genetic Epidemiology | 46 | Top 0.9% | 0.7%
27 | Frontiers in Genetics | 197 | Top 11% | 0.7%
28 | Genome Biology | 555 | Top 9% | 0.6%
29 | Patterns | 70 | Top 3% | 0.6%