Back

The Gene Expression Landscape of Disease Genes

Garcia-Gonzalez, J.; Garcia-Gonzalez, S.; Liou, L.; O'Reilly, P. F.

2024-06-21 genetic and genomic medicine
10.1101/2024.06.20.24309121 medRxiv
Show abstract

Fine-mapping and gene-prioritisation techniques applied to the latest Genome-Wide Association Study (GWAS) results have prioritised hundreds of genes as causally associated with disease. Here we leverage these recently compiled lists of high-confidence causal genes to interrogate where in the body disease genes operate, which, in previous studies, has mostly been investigated by testing for enrichment of GWAS signal among genes with cell/tissue specific expression. By integrating GWAS summary statistics, gene prioritisation results, and RNA-seq data from 46 tissues and 204 cell types, we directly analyse the gene expression of putative disease genes across the body in relation to 11 major diseases and cancers. In tissues and cell types with established disease relevance, disease genes show higher and more specific gene expression compared to control genes. However, we also detect elevated expression in tissues and cell types without previous links to the corresponding disease. While some of these results may be explained by cell types that span multiple tissues, such as macrophages in brain, blood, lung and spleen in relation to Alzheimers disease (P-values < 10-3), the cause for others is unclear and warrants further investigation. To support functional follow-up studies of disease genes, we identify technical and biological factors influencing their expression, and highlight tissues in which higher expression is associated with increased odds of inclusion in drug development programs. We provide our systematic testing framework as an open-source, publicly available tool that can be utilised to offer novel insights into the genes, tissues and cell types involved in any disease, with the potential for informing drug development and delivery strategies.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.1%
22.2%
2
Scientific Reports
3102 papers in training set
Top 20%
6.2%
3
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
4.8%
4
Nature Communications
4913 papers in training set
Top 33%
4.8%
5
Nature Biomedical Engineering
42 papers in training set
Top 0.3%
3.6%
6
Nature Genetics
240 papers in training set
Top 2%
3.5%
7
The American Journal of Human Genetics
206 papers in training set
Top 1%
3.5%
8
Cell Genomics
162 papers in training set
Top 1%
3.5%
50% of probability mass above
9
Cell Systems
167 papers in training set
Top 6%
2.0%
10
iScience
1063 papers in training set
Top 11%
2.0%
11
Alzheimer's Research & Therapy
52 papers in training set
Top 1%
1.9%
12
Brain
154 papers in training set
Top 3%
1.9%
13
Frontiers in Pharmacology
100 papers in training set
Top 2%
1.7%
14
PLOS Computational Biology
1633 papers in training set
Top 17%
1.7%
15
PLOS ONE
4510 papers in training set
Top 55%
1.7%
16
Genome Biology
555 papers in training set
Top 4%
1.7%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 35%
1.5%
18
npj Parkinson's Disease
89 papers in training set
Top 0.8%
1.3%
19
npj Digital Medicine
97 papers in training set
Top 3%
1.2%
20
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.1%
21
BMC Genomics
328 papers in training set
Top 4%
1.1%
22
Brain Communications
147 papers in training set
Top 3%
0.9%
23
Communications Medicine
85 papers in training set
Top 0.8%
0.9%
24
Bioinformatics
1061 papers in training set
Top 9%
0.9%
25
Cell Reports Medicine
140 papers in training set
Top 7%
0.9%
26
Communications Biology
886 papers in training set
Top 22%
0.8%
27
Human Genetics and Genomics Advances
70 papers in training set
Top 0.7%
0.8%
28
Molecular Systems Biology
142 papers in training set
Top 1%
0.8%
29
eLife
5422 papers in training set
Top 58%
0.7%
30
eBioMedicine
130 papers in training set
Top 4%
0.7%