Back

Benchmarking Bayesian colocalization methods in validating Mendelian randomization-based target discoveries from circulating proteins for cardiometabolic diseases

Zhang, W.; Yoshiji, S.; Sladek, R.; Dupuis, J.; Lu, T.

2024-10-17 genetics
10.1101/2024.10.14.617627 bioRxiv
Show abstract

BackgroundMendelian randomization (MR) is an important tool for identifying potential biomarkers and drug targets. Colocalization analysis is crucial for validating MR findings and guarding against potential confounding due to linkage disequilibrium. We aim to systematically benchmark the performance of four Bayesian colocalization methods in validating MR-based target discoveries from circulating proteins for cardiometabolic diseases. ResultsWe conducted MR analyses to assess the associations between circulating levels of 1,535 proteins and five cardiometabolic traits, followed by colocalization analyses using coloc, coloc+SuSiE, PWCoCo and SharePro. All methods demonstrated well-controlled false discoveries in the colocalization analysis of 611 pairs of circulating proteins and cardiometabolic traits with a nominal p-value > 0.9 in MR. SharePro demonstrated the highest frequency in supporting 160 (79.6%) of the 201 Bonferroni-significant protein-trait associations identified by MR, compared to coloc (supporting 40.3% of these associations), coloc+SuSiE (46.8%), and PWCoCo (45.8%), and was robust to varying prior colocalization probabilities. Moreover, protein-trait associations identified by MR and supported by SharePro were more likely to agree with significant gene-level associations based on rare variants detected in exome-wide association studies and implicate known drug targets for cardiometabolic diseases. Eight protein-trait associations were exclusively supported by SharePro and did not demonstrate a high risk of horizontal pleiotropy, suggesting potential cardiometabolic biomarkers or drug targets, such as HSF1 and HAVCR2. ConclusionsSharePro most often supports high-confidence associations identified through MR for cardiometabolic diseases. Combining multiple lines of evidence using different methods may substantially increase the yield of biomarker and drug target discovery programs.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
International Journal of Epidemiology
74 papers in training set
Top 0.1%
32.4%
2
PLOS ONE
4510 papers in training set
Top 29%
6.3%
3
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.8%
4
The American Journal of Human Genetics
206 papers in training set
Top 1%
4.2%
5
BMC Bioinformatics
383 papers in training set
Top 2%
4.1%
50% of probability mass above
6
Nature Communications
4913 papers in training set
Top 38%
3.9%
7
Bioinformatics
1061 papers in training set
Top 5%
3.6%
8
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.8%
3.5%
9
PLOS Genetics
756 papers in training set
Top 5%
3.5%
10
Genetic Epidemiology
46 papers in training set
Top 0.2%
3.2%
11
Genome Medicine
154 papers in training set
Top 4%
1.8%
12
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
13
Scientific Reports
3102 papers in training set
Top 60%
1.7%
14
American Journal of Epidemiology
57 papers in training set
Top 0.8%
1.7%
15
Genetics in Medicine
69 papers in training set
Top 0.7%
1.7%
16
Communications Biology
886 papers in training set
Top 9%
1.7%
17
Human Molecular Genetics
130 papers in training set
Top 2%
1.3%
18
Nucleic Acids Research
1128 papers in training set
Top 13%
1.3%
19
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.9%
1.2%
20
Frontiers in Genetics
197 papers in training set
Top 7%
1.2%
21
BMC Medicine
163 papers in training set
Top 6%
0.9%
22
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
23
Journal of Biomedical Informatics
45 papers in training set
Top 1%
0.8%
24
Human Brain Mapping
295 papers in training set
Top 4%
0.8%
25
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 6%
0.8%
26
Database
51 papers in training set
Top 0.9%
0.8%
27
Human Genetics
25 papers in training set
Top 0.4%
0.7%
28
Cell Genomics
162 papers in training set
Top 7%
0.7%
29
European Journal of Epidemiology
40 papers in training set
Top 0.9%
0.6%