Back

Personalized Data-Driven Robust Machine Learning Models to Differentiate Parkinson's Disease Patients Using Heterogeneous Risk Factors

Iluppangama, M.; Abeywardana, D.; Tsokos, C.

2025-12-19 neurology
10.64898/2025.12.18.25342612 medRxiv
Show abstract

Parkinsons Disease (PD) is the most prevalent neurodegenerative disorder after Alzheimers, yet its diagnosis largely relies on subjective clinical assessments. Thus, this study proposes a systematic, data-driven approach to accurately classify PD patients using heterogeneous risk factors along with efficient machine learning. Six machine learning algorithms, Support Vector Machine(SVM), Random Forest(RF), Extreme Gradient Boosting(XGBoost), Logistic Regression(LR), K-Nearest Neighbour (KNN), and Decision Tree(DT) were utilized and evaluated their performances to identify the most robust and efficient model with high discrimination power. SVM model outper-formed all other machine learning models, and it has been identified as the highest-quality model to classify PD patients from others with at least 96% accuracy. Further-more, Feature importance was analyzed using SHAP to enhance the interpretability of the proposed model. This study contributes to the integration of artificial intelligence in the healthcare domain, emphasizing the value of data-driven classification modeling techniques in supporting healthcare professionals with accurate, personalized, and actionable insights for high-risk patients. Together, these approaches enhance the precision of early detection of PD, paving the way for more informed clinical decision-making and improved patient care.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Computers in Biology and Medicine
120 papers in training set
Top 0.1%
22.8%
2
Sensors
39 papers in training set
Top 0.3%
4.9%
3
PLOS ONE
4510 papers in training set
Top 31%
4.9%
4
Scientific Reports
3102 papers in training set
Top 27%
4.4%
5
Frontiers in Aging Neuroscience
67 papers in training set
Top 0.8%
4.4%
6
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.7%
4.0%
7
BioMed Research International
25 papers in training set
Top 0.5%
4.0%
8
IEEE Access
31 papers in training set
Top 0.1%
3.6%
50% of probability mass above
9
Journal of NeuroEngineering and Rehabilitation
28 papers in training set
Top 0.4%
2.1%
10
BMC Neurology
12 papers in training set
Top 0.4%
1.7%
11
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
12
npj Digital Medicine
97 papers in training set
Top 2%
1.7%
13
Biomedicines
66 papers in training set
Top 1.0%
1.7%
14
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.7%
15
npj Parkinson's Disease
89 papers in training set
Top 0.8%
1.5%
16
Frontiers in Digital Health
20 papers in training set
Top 0.8%
1.3%
17
Artificial Intelligence in Medicine
15 papers in training set
Top 0.4%
1.3%
18
Frontiers in Neuroscience
223 papers in training set
Top 5%
1.2%
19
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.5%
1.0%
20
Alzheimer's Research & Therapy
52 papers in training set
Top 1%
1.0%
21
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.0%
22
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.8%
23
Parkinsonism & Related Disorders
21 papers in training set
Top 0.4%
0.8%
24
Applied Sciences
24 papers in training set
Top 0.9%
0.8%
25
Frontiers in Human Neuroscience
67 papers in training set
Top 3%
0.8%
26
Cognitive Neurodynamics
15 papers in training set
Top 0.4%
0.8%
27
PLOS Digital Health
91 papers in training set
Top 3%
0.7%
28
International Journal of Molecular Sciences
453 papers in training set
Top 16%
0.7%
29
Advanced Science
249 papers in training set
Top 20%
0.7%
30
Frontiers in Neurology
91 papers in training set
Top 5%
0.7%