Back

Predicting Alzheimer's Trajectory: A Multi-PRS Machine Learning Approach for Early Diagnosis and Progression Forecasting

Mustaq, M.; Ahmed, N.; Mahbub, S.; Li, C.; Miyaoka, Y.; TCW, J.; Andrews, S.; Bayzid, M. S.

2023-11-29 health informatics
10.1101/2023.11.28.23299110 medRxiv
Show abstract

INTRODUCTIONPredicting the early onset of dementia due to Alzheimers Disease (AD) has major implications for timely clinical management and outcomes. Current diagnostic methods, reliant on invasive and costly procedures, underscore the need for scalable and innovative approaches. To date, considerable effort has been dedicated to developing machine learning (ML) based approaches using different combinations of medical, demographic, cognitive, and clinical data, achieving varying levels of accuracy. However, they often lack the scalability required for large-scale screening and fail to identify underlying risk factors for AD progression. Polygenic risk scores (PRS) have shown promise in predicting disease risk from genetic data. Here, we aim to leverage ML techniques to develop a multi-PRS model that captures both genetic and non-genetic risk factors to diagnose and predict the progression of AD in different stages in older adults. METHODSWe trained and tested ML-based multi-PRS models, integrating genetically predicted clinical, behavioral, psychiatric, and lifestyle risk factors to predict the diagnosis of AD as well as the progression between different cognitive stages. We developed an automatic feature selection pipeline that identifies the relevant traits that predict AD. We also analyzed the interpretability of our pro-posed ML models and the selected features. Leveraging data from the Alzheimers Disease Neuroimaging Initiative (ADNI), Religious Orders Study and Memory and Aging Project (ROSMAP), and the IEU OpenGWAS Project, our study presents the first known end-to-end ML-based multi-PRS model for AD. RESULTSRelevant features were selected from an initial set of 53 polygenic risk scores computed for 1567 patients in the ADNI and 1642 patients in the ROSMAP dataset. The proposed multi-PRS ML method produced AUROC scores of 77% on ADNI and 72% on ROSMAP for predicting the diagnosis of AD, substantially surpassing the performance of the uni-variate PRS models. Our models also showed promise in predicting transitions between various cognitive stages (65%-75% AUROC scores). Moreover, the features identified by our automated feature selection pipeline are closely aligned with the widely recognized potentially modifiable risk factors for AD. DISCUSSIONMulti-PRS-based machine learning models can identify risk factors and construct predictive models for early Alzheimers disease (AD) diagnosis. This approach offers an automated mechanism to harness genetic data for AD diagnosis and prognosis, enhancing our understanding of the role of various traits in AD development and progression. It will facilitate the implementation of preventive measures at an early stage, thereby contributing to more effective interventions and improved patient outcomes.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Alzheimer's & Dementia: Diagnosis, Assessment & Disease Monitoring
38 papers in training set
Top 0.1%
9.9%
2
Frontiers in Aging Neuroscience
67 papers in training set
Top 0.3%
9.9%
3
Alzheimer's Research & Therapy
52 papers in training set
Top 0.2%
8.2%
4
The Journal of Prevention of Alzheimer's Disease
10 papers in training set
Top 0.1%
8.2%
5
Neurobiology of Aging
95 papers in training set
Top 0.4%
6.7%
6
NeuroImage: Clinical
132 papers in training set
Top 1%
4.2%
7
GeroScience
97 papers in training set
Top 0.6%
3.5%
50% of probability mass above
8
Scientific Reports
3102 papers in training set
Top 45%
2.7%
9
PLOS ONE
4510 papers in training set
Top 46%
2.4%
10
npj Digital Medicine
97 papers in training set
Top 2%
2.3%
11
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.2%
2.3%
12
Journal of Alzheimer’s Disease
39 papers in training set
Top 0.5%
2.0%
13
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.7%
14
Alzheimer's & Dementia: Translational Research & Clinical Interventions
16 papers in training set
Top 0.4%
1.7%
15
Journal of Alzheimer's Disease
43 papers in training set
Top 0.8%
1.7%
16
Human Brain Mapping
295 papers in training set
Top 3%
1.6%
17
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.6%
18
Bioinformatics
1061 papers in training set
Top 8%
1.5%
19
Age and Ageing
27 papers in training set
Top 0.3%
1.3%
20
Artificial Intelligence in Medicine
15 papers in training set
Top 0.4%
1.3%
21
Annals of Neurology
57 papers in training set
Top 2%
1.2%
22
NeuroImage
813 papers in training set
Top 5%
1.1%
23
Journal of Biomedical Informatics
45 papers in training set
Top 1%
0.9%
24
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
0.9%
25
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.2%
0.8%
26
Frontiers in Digital Health
20 papers in training set
Top 1%
0.8%
27
Biology Methods and Protocols
53 papers in training set
Top 2%
0.8%
28
Medical Image Analysis
33 papers in training set
Top 1%
0.7%
29
Alzheimer's & Dementia
143 papers in training set
Top 3%
0.7%
30
Translational Psychiatry
219 papers in training set
Top 4%
0.7%