Back

Mapping disease loci to biological processes via joint pleiotropic and epigenomic partitioning

Kerner, G.; Kamitaki, N.; Strober, B.; Price, A. L.

2025-05-06 genetic and genomic medicine
10.1101/2025.05.05.25327017 medRxiv
Show abstract

Genome-wide association studies (GWAS) have identified thousands of disease-associated loci, yet their interpretation remains limited by the heterogeneity of underlying biological processes. We propose Joint Pleiotropic and Epigenomic Partitioning (J-PEP), a clustering framework that integrates pleiotropic SNP effects on auxiliary traits and tissue-specific epigenomic data to partition disease-associated loci into biologically distinct clusters. To benchmark J-PEP against existing methods, we introduce a metric--Pleiotropic and Epigenomic Prediction Accuracy (PEPA)--that evaluates how well the clusters predict SNP-to-trait and SNP-to-tissue associations using off-chromosome data, avoiding overfitting. Applying J-PEP to GWAS summary statistics for 165 diseases/traits (average N=290K), we attained 16-30% higher PEPA than pleiotropic or epigenomic partitioning approaches with larger improvements for well-powered traits, consistent with simulations; these gains arise from J-PEPs tendency to upweight correlated structure--signals present in both auxiliary trait and tissue data--thereby emphasizing shared components. For type 2 diabetes (T2D), J-PEP identified clusters refining canonical pathological processes while revealing underexplored immune and developmental signals. For hypertension (HTN), J-PEP identified stromal and adrenal-endocrine processes that were not identified in prior analyses. For neutrophil count, J-PEP identified hematopoietic, hepatic-inflammatory, and neuroimmune processes, expanding biological interpretation beyond classical immune regulation. Notably, integrating single-cell chromatin accessibility data refined bulk-based clusters, enhancing cell-type resolution and specificity. For T2D, single-cell data refined a bulk endocrine cluster to pancreatic islet {beta}-cells, consistent with established {beta}-cell dysfunction in insulin deficiency; for HTN, single-cell data refined a bulk endocrine cluster to adrenal cortex cells, consistent with a GO enrichment for neutrophil-mediated inflammation that implicates feedback between aldosterone production in the adrenal gland and local immune signaling. In conclusion, J-PEP provides a principled framework for partitioning GWAS loci into interpretable, tissue-informed clusters that provide biological insights on complex disease.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Nature Genetics
240 papers in training set
Top 0.1%
39.7%
2
The American Journal of Human Genetics
206 papers in training set
Top 0.5%
9.2%
3
Nature Communications
4913 papers in training set
Top 22%
8.5%
50% of probability mass above
4
Cell Genomics
162 papers in training set
Top 0.4%
7.2%
5
Genome Medicine
154 papers in training set
Top 0.8%
6.9%
6
Cell Systems
167 papers in training set
Top 4%
3.3%
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 32%
1.7%
8
Developmental Cell
168 papers in training set
Top 8%
1.7%
9
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
10
Genome Biology
555 papers in training set
Top 5%
1.2%
11
Scientific Reports
3102 papers in training set
Top 70%
0.9%
12
Human Genetics and Genomics Advances
70 papers in training set
Top 0.6%
0.9%
13
Bioinformatics
1061 papers in training set
Top 9%
0.8%
14
Science Advances
1098 papers in training set
Top 28%
0.8%
15
eLife
5422 papers in training set
Top 57%
0.8%
16
Communications Biology
886 papers in training set
Top 23%
0.8%
17
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
18
Nature Human Behaviour
85 papers in training set
Top 4%
0.8%
19
iScience
1063 papers in training set
Top 34%
0.7%
20
Journal of Clinical Investigation
164 papers in training set
Top 8%
0.6%
21
PLOS Genetics
756 papers in training set
Top 17%
0.6%
22
Cell
370 papers in training set
Top 19%
0.5%
23
Nature Neuroscience
216 papers in training set
Top 7%
0.5%
24
Nature Biomedical Engineering
42 papers in training set
Top 3%
0.5%
25
Briefings in Bioinformatics
326 papers in training set
Top 8%
0.5%