Back

Grounding Language Models in Behavioral Science to Scale Physical Activity Interventions for Hispanic/Latinx Populations

Mantena, S. D.; Johnson, A.; Schuetz, N.; Tolas, A.; Montalvo, S.; Delgado-SanMartin, J.; Ramirez Posada, M.; Du, L.; Zhang, S.; Huynh, A. D.; Oppezzo, M.; King, A. C.; Schmiedmayer, P.; Lawrie, A.; Rodriguez, F.; Ashley, E.; Kim, D. S.

2026-05-28 cardiovascular medicine
10.64898/2026.05.26.26354165 medRxiv
Show abstract

Objective: Hispanic/Latinx populations in the U.S. experience higher rates of chronic disease linked to physical inactivity, yet digital health interventions remain largely inaccessible to more than 16 million Hispanic/Latinx adults with limited English proficiency. While large language models (LLMs) offer scalable personalization, their use in non-English behavioral coaching is unexplored. This study introduces MHC-Coach-ES, a Spanish-language LLM fine-tuned on the Transtheoretical Model (TTM) of behavior change. Materials and Methods: We fine-tuned Llama 3-70B-Instruct using a two-stage pipeline. First, the model was adapted to Spanish health and motivational language using a 2.21-million-token corpus. Second, it was instruction-tuned on 3,268 translated human written messages to align the model with the Transtheoretical Model (TTM) of Behavioral Change. We compared MHC-Coach-ES with Llama 3-70B-Instruct and translated human-expert messages using a forced-choice preference survey (N = 77) and blinded expert review (N = 2). Results: Spanish-speaking participants significantly preferred MHC-Coach-ES messages over translated human-expert messages (81% preference, P<0.001). Linguistic analysis showed that MHC-Coach-ES produced more temporally anchored messages than the base model (65% vs. 20%), while maintaining readability. In blinded evaluation, clinical experts rated MHC-Coach-ES higher for alignment with Transtheoretical Model stages than human-expert messages (4.83 vs. 4.38 out of 5). The base model also outperformed translated expert messages across preference and expert ratings. Conclusions: Generative AI can operationalize behavioral science frameworks in Spanish, offering a scalable approach to reducing health disparities. The strong performance of both MHC-Coach-ES and the base model highlights the promise of generative and personalized approaches over translation-based localization for theory-driven behavioral interventions.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.3%
17.6%
2
DIGITAL HEALTH
12 papers in training set
Top 0.1%
9.2%
3
PLOS ONE
4510 papers in training set
Top 21%
8.5%
4
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.4%
7.2%
5
Healthcare
16 papers in training set
Top 0.1%
6.8%
6
American Journal of Preventive Medicine
11 papers in training set
Top 0.1%
4.9%
50% of probability mass above
7
PLOS Digital Health
91 papers in training set
Top 0.6%
4.0%
8
Scientific Reports
3102 papers in training set
Top 36%
3.6%
9
JMIR Research Protocols
18 papers in training set
Top 0.3%
3.1%
10
Nature Human Behaviour
85 papers in training set
Top 1%
2.7%
11
The Lancet Digital Health
25 papers in training set
Top 0.3%
2.1%
12
Epidemiology
26 papers in training set
Top 0.2%
1.9%
13
Journal of NeuroEngineering and Rehabilitation
28 papers in training set
Top 0.5%
1.8%
14
BMJ Health & Care Informatics
13 papers in training set
Top 0.5%
1.5%
15
iScience
1063 papers in training set
Top 19%
1.3%
16
Canadian Medical Association Journal
15 papers in training set
Top 0.2%
1.3%
17
JMIR Formative Research
32 papers in training set
Top 1%
1.2%
18
Nature Communications
4913 papers in training set
Top 57%
1.2%
19
JAMA Network Open
127 papers in training set
Top 3%
1.0%
20
eLife
5422 papers in training set
Top 53%
0.9%
21
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%
22
PeerJ
261 papers in training set
Top 16%
0.7%
23
Journal of Translational Medicine
46 papers in training set
Top 3%
0.7%
24
Nature Medicine
117 papers in training set
Top 6%
0.6%
25
Vaccine
189 papers in training set
Top 2%
0.5%