Back

Benchmarking Heritability Estimation Strategies Across 86 Configurations and Their Downstream Effect on Polygenic Risk Score Performance

Muneeb, M.; Ascher, D.

2026-04-02 bioinformatics
10.64898/2026.04.02.716079 bioRxiv
Show abstract

ObjectiveSNP heritability estimates vary substantially across estimation strategies, yet the downstream consequences for polygenic risk score (PRS) construction remain poorly characterised. We systematically benchmarked heritability estimation configurations and assessed their propagation into downstream PRS performance. MethodsWe benchmarked 86 heritability-estimation configurations spanning six tool families (GEMMA, GCTA, LDAK, DPR, LDSC, SumHer) and ten method groups across 10 UK Biobank phenotypes, yielding 844 configuration-level estimates. Each estimate was propagated into GCTA-SBLUP and LDpred2-lassosum2 PRS frameworks and evaluated across five cross-validation folds using null, PRS-only, and full models. Eleven binary analytical contrasts were tested using Mann-Whitney U tests to identify drivers of heritability variability. ResultsHeritability ranged from -0.862 to 2.735 (mean = 0.134, SD = 0.284), with 133 of 844 estimates (15.8%) negative and concentrated in unconstrained estimation regimes. Ten of eleven analytical contrasts significantly affected heritability magnitude, with algorithm choice and GRM standardisation showing the largest effects. Despite this upstream variability, downstream PRS test performance was only weakly coupled to heritability magnitude: pooled Pearson correlations between h2 and test AUC were r = -0.023 for GCTA-SBLUP and r = +0.014 for LDpred2-lassosum2 (both non-significant). ConclusionSNP heritability is best interpreted as a configuration-sensitive modelling parameter rather than a universally stable scalar input. Heritability estimates should always be reported alongside their full estimation specification, and downstream PRS performance is comparatively robust to moderate variation in the heritability input. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=80 SRC="FIGDIR/small/716079v1_ufig1.gif" ALT="Figure 1"> View larger version (27K): org.highwire.dtl.DTLVardef@112929borg.highwire.dtl.DTLVardef@573c36org.highwire.dtl.DTLVardef@132170borg.highwire.dtl.DTLVardef@1871363_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.3%
2
GigaScience
172 papers in training set
Top 0.1%
8.4%
3
BMC Bioinformatics
383 papers in training set
Top 1%
7.1%
4
PLOS ONE
4510 papers in training set
Top 25%
6.8%
5
Bioinformatics Advances
184 papers in training set
Top 0.7%
4.8%
6
PLOS Computational Biology
1633 papers in training set
Top 8%
4.3%
7
Frontiers in Genetics
197 papers in training set
Top 2%
3.6%
8
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.6%
50% of probability mass above
9
BMC Medical Research Methodology
43 papers in training set
Top 0.3%
3.2%
10
BioData Mining
15 papers in training set
Top 0.1%
3.2%
11
PeerJ
261 papers in training set
Top 3%
3.2%
12
Scientific Reports
3102 papers in training set
Top 44%
2.7%
13
Nature Communications
4913 papers in training set
Top 45%
2.4%
14
European Journal of Human Genetics
49 papers in training set
Top 0.4%
2.3%
15
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.1%
16
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.8%
17
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.2%
18
BMC Genomics
328 papers in training set
Top 4%
1.1%
19
PLOS Genetics
756 papers in training set
Top 12%
1.1%
20
Communications Biology
886 papers in training set
Top 19%
0.9%
21
BMC Biology
248 papers in training set
Top 4%
0.7%
22
Database
51 papers in training set
Top 1.0%
0.7%
23
BMC Medical Genomics
36 papers in training set
Top 1%
0.7%
24
The Lancet Digital Health
25 papers in training set
Top 1%
0.7%
25
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.7%
26
F1000Research
79 papers in training set
Top 5%
0.7%
27
JMIR mHealth and uHealth
10 papers in training set
Top 0.5%
0.6%
28
Computers in Biology and Medicine
120 papers in training set
Top 6%
0.6%
29
Nucleic Acids Research
1128 papers in training set
Top 20%
0.6%