Back

High-Performance Classification of Mpox Symptoms Using Support Vector Classifier and Quadratic Discriminant Analysis

Okoli, S. C.; Ligali, F. C.; Olufemi, M.; Oyebola, K.

2026-02-22 infectious diseases
10.64898/2026.02.12.26346046 medRxiv
Show abstract

BackgroundRecent global outbreaks of Mpox have posed significant diagnostic challenges, particularly in resource-limited settings. Conventional diagnostic methods are often inaccessible due to cost, logistical constraints, or lack of trained personnel. These limitations highlight the urgent need for alternative, scalable diagnostic strategies. This study explored the application of machine learning (ML) classifiers trained on clinical symptom data as a rapid, cost-effective tool for Mpox detection. MethodsAn open-access dataset of clinical symptoms from suspected Mpox cases was used to train and evaluate five supervised ML algorithms: Extra Trees, Quadratic Discriminant Analysis (QDA), Decision Trees, Perceptron, and Support Vector Classifier (SVC). Prior to training, data preprocessing steps, including normalization and handling of missing values, were performed after which model training was carried out using a stratified 80:20 train-test split. Performance was assessed using accuracy, recall, area under the receiver operating characteristic curve (ROC-AUC), and F1-score metrics. Subsequently, feature importance was analyzed using permutation-based techniques to determine the contribution of each clinical symptom to model predictions. ResultsAmong the five evaluated models, SVC, QDA, and Perceptron achieved superior and identical performance metrics, with accuracy, ROC-AUC, and F1-score values of 97.7%, and a recall of 95.5%. Each of these models correctly identified 44 true positive cases with zero false positives. In addition, QDA and SVC produced the lowest number of false negatives (2) and the highest number of true negatives (42), indicating robust discriminatory power. Feature importance analysis identified skin rash as the most predictive clinical feature, with a permutation importance score of 0.12. ConclusionsThese findings demonstrate the strong potential of machine learning classifiers for detecting Mpox based on clinical features. Incorporating these models into healthcare systems could significantly enhance early case detection, improve clinical decision-making, and bolster disease surveillance. Future research should focus on prospective validation of these ML classifiers in real-world clinical environments.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
BMC Infectious Diseases
118 papers in training set
Top 0.1%
12.5%
2
Scientific Reports
3102 papers in training set
Top 10%
8.4%
3
PLOS ONE
4510 papers in training set
Top 25%
6.8%
4
PLOS Neglected Tropical Diseases
378 papers in training set
Top 2%
4.8%
5
Journal of Clinical Microbiology
120 papers in training set
Top 0.5%
4.0%
6
Clinical Infectious Diseases
231 papers in training set
Top 1%
3.7%
7
Frontiers in Public Health
140 papers in training set
Top 2%
3.6%
8
Journal of Infection
71 papers in training set
Top 0.5%
3.6%
9
Journal of Medical Internet Research
85 papers in training set
Top 2%
3.1%
50% of probability mass above
10
Epidemiology and Infection
84 papers in training set
Top 0.7%
3.1%
11
Open Forum Infectious Diseases
134 papers in training set
Top 0.7%
2.6%
12
PeerJ
261 papers in training set
Top 7%
1.7%
13
BMC Medical Research Methodology
43 papers in training set
Top 0.6%
1.7%
14
BMC Medicine
163 papers in training set
Top 3%
1.7%
15
Clinical Chemistry
22 papers in training set
Top 0.4%
1.5%
16
PLOS Global Public Health
293 papers in training set
Top 4%
1.5%
17
The Journal of Infectious Diseases
182 papers in training set
Top 3%
1.2%
18
The American Journal of Tropical Medicine and Hygiene
60 papers in training set
Top 3%
1.2%
19
Tropical Medicine & International Health
15 papers in training set
Top 0.5%
1.1%
20
International Journal of Medical Informatics
25 papers in training set
Top 1%
0.9%
21
Biology Methods and Protocols
53 papers in training set
Top 2%
0.9%
22
Infection Control & Hospital Epidemiology
17 papers in training set
Top 0.3%
0.9%
23
JMIR Public Health and Surveillance
45 papers in training set
Top 3%
0.9%
24
Diagnostics
48 papers in training set
Top 2%
0.9%
25
EClinicalMedicine
21 papers in training set
Top 0.8%
0.8%
26
Journal of Clinical Virology
62 papers in training set
Top 0.8%
0.7%
27
The Lancet Microbe
43 papers in training set
Top 1%
0.7%
28
Heliyon
146 papers in training set
Top 7%
0.7%
29
One Health
29 papers in training set
Top 1%
0.7%
30
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 6%
0.7%