Back

Assessing the Impact of Downsampled ECGs and Alternative Loss Functions in Multi-Label Classification of 12-Lead ECGs

Singstad, B.-J.; Muten, E. M.

2022-11-18 cardiovascular medicine
10.1101/2022.11.16.22282373 medRxiv
Show abstract

AO_SCPLOWBSTRACTC_SCPLOWThe electrocardiogram (ECG) is an almost universally accessible diagnostic tool for heart disease. An ECG is measured by using an electrocardiograph, and todays electrocardiographs use built-in software to interpret the ECGs automatically after they are recorded. However, these algorithms show limited performance, and therefore clinicians usually have to manually interpret the ECG, regardless of whether an algorithm has interpreted the ECG or not. Manual interpretation of the ECG can be time-consuming and require specific skills. Therefore, a better algorithm is clearly needed to make correct ECG interpretations more accessible and time efficient. Algorithms based on artificial intelligence have shown promising performance in many fields, including ECG interpretation, over the last few years and might represent an alternative to manual ECG interpretation. In this study, we used a dataset with 88253 12-lead ECGs from multiple databases, annotated with SNOMED-CT codes by medical experts. We employed a supervised convolutional neural network with an Inception architecture to classify 30 of the most frequent annotated diagnoses in the dataset. Each patient could have more than one diagnosis, which makes this a multi-label classification. We compared the Inception models performance while applying different preprocessing methods on the ECGs and different model settings during 10-folded cross-validation. We compared the models classification performance using binary cross-entropy (BCE) loss and double soft F1 loss. Furthermore, we compared the classification performance when downsampling the original sampling rate of the input ECG. Finally, we trained 30 interpretable linear models to provide class activation maps to explain the relative importance of each sample in the ECG with respect to the 30 diagnoses considered in this study. Due to the heavily imbalanced class distribution in our dataset, we placed the most emphasis on the F1 score when evaluating the performance of the models. Our results show that the best performance in terms of F1-score was seen when the Inception model used double soft F1 as the loss function and ECGs downsampled to 75Hz. This model achieved an F1 score of 0.420 {+/-} 0.017, accuracy = 0.954 {+/-} 0.002, and an AUROC score of 0.832 {+/-} 0.019. An aggregation of the generated saliency maps, achieved using Local Interpretable Model-Agnostic Explanations (LIME), showed that the Inception model paid the most attention to the limb leads and the augmented leads and less importance to the precordial leads. One of the more significant contributions that emerge from this study is the use of aggregated saliency maps to obtain ECG lead importance for different diagnoses. In addition, we emphasized the relevance of evaluating different loss functions, and in this specific case, we found double soft F1 loss to be slightly better than BCE. Finally, we found it somewhat surprising that downsampling the ECG led to higher performance compared to the original 500Hz sampling rate. These findings contribute in several ways to our understanding of the artificial intelligence-based interpretation of ECGs, but further studies should be carried out to validate these findings in other datasets from other patient cohorts.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Physiological Measurement
12 papers in training set
Top 0.1%
14.1%
2
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.1%
9.9%
3
Frontiers in Physiology
93 papers in training set
Top 0.3%
9.0%
4
Computers in Biology and Medicine
120 papers in training set
Top 0.2%
8.3%
5
Scientific Reports
3102 papers in training set
Top 13%
7.0%
6
PLOS ONE
4510 papers in training set
Top 37%
3.9%
50% of probability mass above
7
Biology Methods and Protocols
53 papers in training set
Top 0.3%
3.6%
8
IEEE Access
31 papers in training set
Top 0.1%
3.5%
9
Sensors
39 papers in training set
Top 0.5%
3.5%
10
Medical Image Analysis
33 papers in training set
Top 0.4%
3.5%
11
Journal of NeuroEngineering and Rehabilitation
28 papers in training set
Top 0.3%
3.0%
12
iScience
1063 papers in training set
Top 8%
2.7%
13
European Heart Journal - Digital Health
15 papers in training set
Top 0.2%
2.6%
14
Biomedical Signal Processing and Control
18 papers in training set
Top 0.3%
1.7%
15
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 2%
1.6%
16
Diagnostics
48 papers in training set
Top 1%
1.5%
17
npj Digital Medicine
97 papers in training set
Top 3%
1.2%
18
IEEE Transactions on Biomedical Engineering
38 papers in training set
Top 0.8%
0.9%
19
PLOS Digital Health
91 papers in training set
Top 2%
0.9%
20
JACC: Clinical Electrophysiology
11 papers in training set
Top 0.3%
0.9%
21
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.8%
22
PLOS Neglected Tropical Diseases
378 papers in training set
Top 5%
0.8%
23
Cureus
67 papers in training set
Top 5%
0.7%
24
MethodsX
14 papers in training set
Top 0.5%
0.7%
25
BMC Genomics
328 papers in training set
Top 6%
0.7%
26
Annals of Biomedical Engineering
34 papers in training set
Top 1%
0.7%
27
BMC Medical Informatics and Decision Making
39 papers in training set
Top 3%
0.7%
28
European Journal of Human Genetics
49 papers in training set
Top 2%
0.6%
29
Frontiers in Neuroscience
223 papers in training set
Top 9%
0.6%
30
Biomedicines
66 papers in training set
Top 4%
0.6%