Back

Development and Validation of Machine Learning Algorithms to Classify Lower Urinary Tract Symptoms

Dallas, K. B.; Chiang, J. N.; Caron, A. T.; Anger, J. T.; Kaufman, M. R.; Ackerman, A. L.

2022-12-27 urology
10.1101/2022.12.25.22283168 medRxiv
Show abstract

ObjectiveLower urinary tract symptoms (LUTS), such as urinary urgency, frequency, and incontinence, affect the majority of the population, causing substantial morbidity, yet few receive effective care. Sizeable symptomatic overlap between LUTS categories leads to high rates of misdiagnosis. To improve diagnostic accuracy, we sought to employ machine learning approaches to LUTS categorization to generate diagnostic groupings based on patient-reported clinical data, creating a novel tool for diagnosis of patients with voiding complaints. MethodsQuestionnaire responses in a Development Dataset of 514 female subjects were used for model development, identifying 4 major clusters and 9 specific phenotypes of LUTS using agglomerative hierarchical clustering. Each cluster and phenotype was assigned a clinical identity consistent with recognized causes of voiding dysfunction by the consensus of two urologic specialists. Then, a random forest classifier was trained to assign unseen patients into these phenotypes. That model was then applied to a Validation Dataset of 571 additional subjects to confirm the diagnostic algorithm. ResultsThis data-driven, hierarchical clustering approach captured overlapping symptoms inherent in typical patients, recognizing common uncomplicated diagnoses (i.e., overactive bladder) as well as several underrecognized diagnostic categories (i.e., myofascial pelvic pain). A diagnostic algorithm derived by supervised machine learning to assign unseen subjects into these phenotypes demonstrated good reproducibillty of the phenotypes and their symptomatic patterns in an independent Validation Dataset. ConclusionsWe describe the generation of a machine learning algorithm relying only on validated, patient-reported symptoms for diagnostic classification. Given a growing physician shortage and increasing challenges for patients accessing specialist care, this type of digital technology holds great potential to improve the recognition and diagnosis of functional urologic conditions.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
PLOS Digital Health
91 papers in training set
Top 0.1%
43.8%
2
PLOS ONE
4510 papers in training set
Top 20%
9.3%
50% of probability mass above
3
Scientific Reports
3102 papers in training set
Top 11%
7.9%
4
Frontiers in Medicine
113 papers in training set
Top 2%
2.6%
5
npj Digital Medicine
97 papers in training set
Top 2%
2.3%
6
Cureus
67 papers in training set
Top 2%
2.3%
7
International Journal of Medical Informatics
25 papers in training set
Top 0.6%
2.1%
8
Diagnostics
48 papers in training set
Top 0.8%
2.0%
9
BMJ Open
554 papers in training set
Top 8%
1.9%
10
JMIR Medical Informatics
17 papers in training set
Top 0.6%
1.8%
11
Journal of Clinical Medicine
91 papers in training set
Top 4%
1.6%
12
Science Translational Medicine
111 papers in training set
Top 3%
1.5%
13
American Journal of Infection Control
12 papers in training set
Top 0.2%
1.4%
14
BMC Medical Research Methodology
43 papers in training set
Top 0.8%
1.4%
15
Heliyon
146 papers in training set
Top 3%
1.2%
16
eLife
5422 papers in training set
Top 49%
1.2%
17
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 5%
0.8%
18
Frontiers in Digital Health
20 papers in training set
Top 2%
0.7%
19
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.7%
20
Analytical Chemistry
205 papers in training set
Top 3%
0.6%
21
PLOS Biology
408 papers in training set
Top 24%
0.6%
22
BMC Medical Informatics and Decision Making
39 papers in training set
Top 3%
0.5%