Back
#1
11.5%
Top 14%
11.5%
#1
10.7%
Top 0.9%
7.6%
Top 3%
7.6%
Top 2%
7.6%
Top 73%
6.2%
Top 4%
5.8%
Top 3%
5.8%
Top 4%
4.9%
Top 1%
3.8%
Top 3%
1.9%
Top 2%
1.5%
Top 4%
1.3%
Top 7%
1.1%
Top 1%
1.1%
Top 0.8%
0.9%
Top 4%
0.7%
Top 5%
0.7%
Top 5%
0.7%
Top 8%
0.7%
Training machine learning models on patient level data segregation is crucial in practical clinical applications
2020-04-25
health informatics
Title + abstract only
View on medRxiv
Show abstract
This article discusses the effect of segregation of histopathology images data into three sets; training set for training machine learning model, validation set for model selection and test set for testing model performance. We found that one must be cautious when segregating histological images data (slides) into training, validation and test sets because subtle mishandling of data can introduce data leakage and gives illusively good results on the test set. We performed this study on gene muta...
Predicted journal destinations
1
Computers in Biology and Medicine
39 training papers
2
Scientific Reports
701 training papers
3
BMC Medical Informatics and Decision Making
36 training papers
4
Journal of Biomedical Informatics
37 training papers
5
Journal of the American Medical Informatics Association
53 training papers
6
PLOS Digital Health
88 training papers
7
PLOS ONE
1737 training papers
8
npj Digital Medicine
85 training papers
9
JAMIA Open
35 training papers
10
Journal of Medical Internet Research
81 training papers
11
International Journal of Medical Informatics
25 training papers
12
JMIR Medical Informatics
16 training papers
13
Communications Medicine
63 training papers
14
BMC Medical Research Methodology
41 training papers
15
Heliyon
57 training papers
16
Patterns
15 training papers
17
Journal of Personalized Medicine
17 training papers
18
Bioinformatics
24 training papers
19
Scientific Data
30 training papers
20
Frontiers in Digital Health
18 training papers
21
JMIR Formative Research
31 training papers