Back

Calibrated Prediction Intervals for Polygenic Scores: Updated Comparisons, Contextual Calibration, and Data Normalization

Chang, X.; Hou, S.; Zhou, X.

2026-05-19 genetic and genomic medicine
10.64898/2026.05.15.26353336 medRxiv
Show abstract

Calibrated prediction intervals for polygenic scores (PGS) are essential for communicating individual-level uncertainty in genomic medicine. We present updated comparisons of two methods for constructing such intervals: CalPred, a parametric approach, and PredInterval, a non-parametric approach. Our results show that both methods can achieve calibrated coverage, although CalPred additionally requires a sufficiently large calibration set. The two methods also exhibit complementary trade-offs with respect to dataset size and risk identification. We further show that contextual calibration, as introduced in Hou et al. and followed in Shi et al., is most naturally achieved through appropriate phenotype normalization and data preprocessing. Apparent miscalibration can arise from inadequate normalization or from providing contextual information to some methods but not others. In UK Biobank, standard GWAS phenotype normalization procedures are sufficient to achieve contextual calibration for traits analyzed. In the extreme simulations of Hou et al. and Shi et al., supplying contextual covariates to PredInterval restores contextual calibration without normalization, and appropriate normalization can achieve contextual calibration without supplying covariates, while also substantially improving upstream tasks including association power and PGS accuracy. Together, these results underscore the central role of phenotype normalization and data preprocessing in GWAS analyses, including reliable uncertainty quantification for PGS.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
The American Journal of Human Genetics
206 papers in training set
Top 0.1%
23.0%
2
Nature Genetics
240 papers in training set
Top 0.2%
19.1%
3
Bioinformatics
1061 papers in training set
Top 4%
6.9%
4
Cell Systems
167 papers in training set
Top 2%
6.5%
50% of probability mass above
5
PLOS Genetics
756 papers in training set
Top 3%
4.9%
6
PLOS Computational Biology
1633 papers in training set
Top 8%
4.4%
7
Cell Genomics
162 papers in training set
Top 1%
3.7%
8
Nature Communications
4913 papers in training set
Top 39%
3.7%
9
Genome Biology
555 papers in training set
Top 3%
2.4%
10
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
2.1%
11
Human Genetics and Genomics Advances
70 papers in training set
Top 0.2%
1.9%
12
GENETICS
189 papers in training set
Top 0.5%
1.9%
13
Genetic Epidemiology
46 papers in training set
Top 0.4%
1.7%
14
Genome Medicine
154 papers in training set
Top 4%
1.7%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 32%
1.7%
16
Nature Human Behaviour
85 papers in training set
Top 3%
1.5%
17
Frontiers in Genetics
197 papers in training set
Top 6%
1.2%
18
Nucleic Acids Research
1128 papers in training set
Top 16%
0.8%
19
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
20
European Journal of Human Genetics
49 papers in training set
Top 2%
0.7%
21
Human Molecular Genetics
130 papers in training set
Top 4%
0.7%
22
eLife
5422 papers in training set
Top 63%
0.5%
23
BMC Genomics
328 papers in training set
Top 7%
0.5%
24
Scientific Reports
3102 papers in training set
Top 79%
0.5%