Design and Rationale of the My Heart Counts Cardiovascular Health Study: a Large-Scale, Fully Digital Biobank, and Randomized Trial of Large Language Model-Driven Coaching of Physical Activity
Schmiedmayer, P.; Johnson, A.; Schuetz, N.; Kollmer, L.; Goldschmidt, P.; Delgado-SanMartin, J.; Zhang, K.; Mantena, S. D.; Tolas, A.; Montalvo, S.; Raimrez Posada, M.; O'Sullivan, J. W.; Oppezzo, M.; King, A. C.; Rodriguez, F.; Ashley, E.; Lawrie, A.; Kim, D. S.
Show abstract
BackgroundCardiovascular disease remains the leading cause of global morbidity and mortality. The original My Heart Counts smartphone application demonstrated the feasibility of large-scale, fully digital recruitment and trial conduct, but was limited by platform exclusivity and the need for human experts to create text-based behavioral interventions. MethodsThe next-generation My Heart Counts smartphone application is a prospective, observational cohort study with an embedded randomized crossover trial, evaluating personalized text-based coaching prompts, available in both English and Spanish. All study and trial operations will be conducted via the My Heart Counts smartphone application, re-designed using the open-source Stanford Spezi framework to support iOS, with a planned Android release in 2027. The target enrollment is N=15,000 adults across the United States and United Kingdom. The study establishes a comprehensive digital biobank by synthesizing passive mobile health data (steps, flights climbed, heart rate, sleep, workouts), raw sensor data (e.g., accelerometry), longitudinal clinical surveys, active tasks (6-minute walk test and 12-minute Cooper run test), electrocardiograms (ECG), and electronic health record (EHR) data integrated via HL7 FHIR protocols. The embedded trial evaluates the effect of text-based coaching prompts generated by a large language model (LLM) grounded in the Transtheoretical Model of Change on daily physical activity, as compared to generic prompts. Planned AnalysisThe primary endpoint of the randomized crossover trial is change in daily step count between LLM-driven and generic text-based intervention arms, analyzed using mixed-effects models. Secondary endpoints include change in mean active minutes and calorie burn over each intervention week. Other analyses include the changes in submaximal (6-minute walk test) and maximal (Cooper 12-minute run test) cardiorespiratory fitness, changes to sensor-derived biomarkers (e.g., sleep quality, resting heart rate, and heart rate variability), and association of sensor-derived biomarkers with EHR-confirmed clinical outcomes. ConclusionsBy utilizing autonomous, LLM-driven coaching, modular software design, and cross-platform accessibility, our smartphone application-based study will provide a scalable model for inclusive and decentralized preventive care of patients with cardiovascular disease. Trial StatusRecruitment commenced in March 2026 and is ongoing.
Matching journals
The top 4 journals account for 50% of the predicted probability mass.