Back

Screening for Rheumatic Heart Disease in Asymptomatic Children using Machine Learning from Electrocardiograms

Chuma, A. T.; Wang, c.; Asmare, M. h.; Varon, C.; Voigt, J.-U.; Kassie, D. M.; Zuhlke, L.; Vanrumste, B.

2026-05-05 health informatics
10.64898/2026.05.03.26352326 medRxiv
Show abstract

Early detection of Rheumatic Heart Disease (RHD) is essential in reducing its associated mortality and late complications. In resource-limited settings, automated detection using low-cost electrocardiogram (ECG) sensors can enhance prevention efforts. However, its effectiveness as a potential RHD screening tool in at-risk populations remains unexplored. This study aimed to investigate the utility of machine learning for classifying RHD in a cohort screened for RHD using low-cost ECG devices. The ECGs were collected from 611 at-risk schoolchildren using KardiaMobile, where 47 were confirmed RHD and 564 were healthy. First, the ECG fiducial points were annotated using a publicly available prominence-based delineator. Then, temporal, frequency, wavelet, and visibility graph-based features were extracted from six-leads and fed to the XGBoost classifier. A 10-fold cross-validation was used at different prediction score thresholds to obtain target sensitivity (Se) for screening RHD. Single-lead evaluation on Lead-II showed an F1-score of 60.9%, a Se of 59.6% and a positive-predictive-value (PPV) of 62.2%. However, using multiple leads improved the results, with an F1-score of 62.8%, a Se of 59.6% and a PPV of 66.7%. The best model performance was achieved by adjusting the threshold to 0.6 with Se and PPV of 66% and 51%, respectively. Error analysis revealed that T-wave and STT changes, as well as non-rheumatic mitral valve cases were among the false positive cases. Machine learning can enhance early detection by leveraging relevant ECG features and adjustable target sensitivity based on screening priorities and resource capacity. Measurements can be obtained without chest contact, using only the fingers and knees, thereby enabling use by non-clinical staff. This approach provides a scalable and cost-effective solution for RHD screening in high-prevalence regions.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS Digital Health
91 papers in training set
Top 0.1%
14.7%
2
Scientific Reports
3102 papers in training set
Top 2%
14.3%
3
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.2%
6.4%
4
PLOS ONE
4510 papers in training set
Top 27%
6.4%
5
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 0.7%
4.8%
6
Computers in Biology and Medicine
120 papers in training set
Top 0.6%
4.2%
50% of probability mass above
7
Physiological Measurement
12 papers in training set
Top 0.1%
3.6%
8
Frontiers in Medicine
113 papers in training set
Top 2%
3.1%
9
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.1%
3.1%
10
BMC Cardiovascular Disorders
14 papers in training set
Top 0.8%
2.1%
11
npj Digital Medicine
97 papers in training set
Top 2%
1.9%
12
European Respiratory Journal
54 papers in training set
Top 0.8%
1.9%
13
IEEE Transactions on Biomedical Engineering
38 papers in training set
Top 0.4%
1.9%
14
Frontiers in Physiology
93 papers in training set
Top 3%
1.7%
15
Journal of the American Heart Association
119 papers in training set
Top 3%
1.3%
16
JAMIA Open
37 papers in training set
Top 1%
1.3%
17
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.3%
18
JMIR Formative Research
32 papers in training set
Top 1%
1.2%
19
Diagnostics
48 papers in training set
Top 1%
1.2%
20
Journal of Personalized Medicine
28 papers in training set
Top 0.6%
1.2%
21
BMJ Health & Care Informatics
13 papers in training set
Top 0.7%
0.9%
22
iScience
1063 papers in training set
Top 26%
0.9%
23
JMIR Medical Informatics
17 papers in training set
Top 1%
0.9%
24
PLOS Computational Biology
1633 papers in training set
Top 22%
0.9%
25
European Heart Journal - Digital Health
15 papers in training set
Top 0.5%
0.9%
26
PLOS Neglected Tropical Diseases
378 papers in training set
Top 5%
0.8%
27
Frontiers in Digital Health
20 papers in training set
Top 1%
0.7%
28
JMIR Public Health and Surveillance
45 papers in training set
Top 4%
0.7%
29
NeuroImage: Clinical
132 papers in training set
Top 4%
0.6%