Machine Learning Analysis of User Sentiments in Tinnitus Management Apps
Yousaf, M. N.; Anwar, M. N.; Naveed, N.; Haider, U.
Show abstract
BackgroundTinnitus affects a substantial proportion of the global population and can severely disrupt sleep, mood, and daily functioning, yet the quality of mobile health apps designed for tinnitus management remains highly variable. Traditional evaluation methods, including clinical trials, expert rating scales, and small-scale surveys, rarely capture large-scale, feature-level feedback from real-world users, leaving a gap in understanding which app characteristics drive sustained engagement and satisfaction. MethodsThis study analysed 342,520 English-language reviews from 84 tinnitus-related apps on iOS and Android collected between 2015 and 2025. A pipeline first applied VADER-based preprocessing and sentiment assignment, then trained a graph neural network aspect-based sentiment analysis (GNN-ABSA) model operating on sentence-level dependency graphs to infer feature-level sentiment for domains such as sound therapy, sleep support, pricing, advertisements, stability, and user interface. ResultsThe GNN-ABSA model achieved an accuracy of 84.4% and a macro F1 score of 0.829 on unseen aspect-level test data, indicating stable performance across sentiment classes. Therapeutic features like sound masking and sleep support were associated with predominantly positive sentiment, whereas pricing, advertisements, background playback, and technical stability attracted more neutral or negative feedback over the ten-year period. ConclusionsLarge-scale, graph-based feature-level sentiment analysis provides a user-cantered perspective that complements clinical trials and expert app quality ratings, offering actionable guidance for developers seeking to prioritize design improvements, supporting clinicians in recommending suitable apps to patients, and informing the design of more explainable and user-driven digital health tools. Trial RegistrationNot applicable. This study analysed publicly available app store reviews and did not involve human participants.
Matching journals
The top 4 journals account for 50% of the predicted probability mass.