Back

A Deep Learning-based Genome-wide Polygenic Risk Score for Common Diseases Identifies Individuals with Risk

Peng, J.; Li, J.; Han, R.; Wang, Y.; Han, L.; Peng, J.; Wang, T.; Hao, J.; Shang, X.; Wei, Z.

2021-11-21 genetic and genomic medicine
10.1101/2021.11.17.21265352 medRxiv
Show abstract

Identifying individuals at high risk in the population is a key public health need. For many common diseases, individual susceptibility may be influenced by genetic variation. Recently, the clinical potential of polygenic risk score (PRS) has attracted widespread attention. However, the performance of traditional methods is limited in fitting capabilities of the linear model and unable to capture the interaction information between single nucleotide polymorphisms (SNPs). To fill this gap, a novel deep-learning-based model named DeepPRS is developed for scoring the risk of common diseases with genome-wide genotype data. Using the UK Biobank dataset, the evaluation shows that DeepPRS performs better than the other two existing state-of-art methods on Alzheimers disease, inflammatory bowel disease, type 2 diabetes and breast cancer. Since DeepPRS does not only rely on the addictive effect of risk SNPs, DeepPRS has the chance to identify high-risk individuals even with few known risk SNPs.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Briefings in Bioinformatics
326 papers in training set
Top 0.2%
14.5%
2
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.1%
7.2%
3
Scientific Reports
3102 papers in training set
Top 17%
6.4%
4
Frontiers in Genetics
197 papers in training set
Top 0.8%
6.4%
5
PLOS Computational Biology
1633 papers in training set
Top 7%
4.9%
6
Bioinformatics
1061 papers in training set
Top 5%
4.4%
7
Communications Biology
886 papers in training set
Top 1%
3.7%
8
Genome Medicine
154 papers in training set
Top 2%
3.6%
50% of probability mass above
9
Journal of Biomedical Informatics
45 papers in training set
Top 0.5%
2.8%
10
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 2%
2.8%
11
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.9%
12
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.7%
13
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.7%
14
Genetic Epidemiology
46 papers in training set
Top 0.5%
1.5%
15
Journal of Personalized Medicine
28 papers in training set
Top 0.5%
1.5%
16
PLOS ONE
4510 papers in training set
Top 57%
1.5%
17
Nature Communications
4913 papers in training set
Top 55%
1.3%
18
Human Genetics
25 papers in training set
Top 0.2%
1.3%
19
International Journal of Molecular Sciences
453 papers in training set
Top 13%
0.9%
20
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.2%
0.8%
21
BMC Medical Genomics
36 papers in training set
Top 1%
0.8%
22
iScience
1063 papers in training set
Top 29%
0.8%
23
European Journal of Human Genetics
49 papers in training set
Top 1%
0.8%
24
Heliyon
146 papers in training set
Top 6%
0.8%
25
Frontiers in Human Neuroscience
67 papers in training set
Top 3%
0.8%
26
PLOS Genetics
756 papers in training set
Top 15%
0.8%
27
Nucleic Acids Research
1128 papers in training set
Top 17%
0.8%
28
Medical Image Analysis
33 papers in training set
Top 1%
0.7%
29
Human Genetics and Genomics Advances
70 papers in training set
Top 0.9%
0.7%
30
Human Molecular Genetics
130 papers in training set
Top 4%
0.7%