Back

Self-Reported Symptoms Enable Four-Phase Menstrual Cycle Classification with Hormonally Validated Labels

Specht, B.; Tayeb, Z. Z.; Garbaya, S.; Khadraoui, D.; EL-Khozondar, M.; Schneider, R.

2026-04-01 health informatics
10.64898/2026.03.31.26349766 medRxiv
Show abstract

Accurate inference of physiological state across the menstrual cycle has important applications in reproductive health and in understanding symptom dynamics, yet most non-hormonal approaches rely on wearable sensors or calendar-based tracking. Whether self-reported symptoms alone can support prospective, cross-subject phase classification remains unresolved. Here, we introduce a hybrid modelling framework that combines a gradient-boosted classifier with a Hidden Semi-Markov Model to infer four menstrual cycle phases (menstrual, follicular, fertile, and luteal) from self-reported data. The classifier captures non-linear symptom patterns, while the temporal model imposes biologically grounded constraints, including cyclic ordering and realistic phase durations. In a leave-one-subject-out evaluation using hormonally annotated data from 41 participants, the model achieved 67.6\% accuracy and a macro F1 score of 0.662. Features reflecting short-term symptom variability were more informative than absolute symptom levels, indicating that within-person fluctuation provides a more generalisable signal of cycle phase than symptom intensity alone. These findings demonstrate the feasibility of low-burden, device-free menstrual health monitoring, establish symptom dynamics as a basis for scalable digital biomarkers, and expand access to tracking in resource-constrained settings and populations underserved by wearable-based approaches.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.1%
26.5%
2
Scientific Reports
3102 papers in training set
Top 5%
10.7%
3
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.1%
10.4%
4
Nature Communications
4913 papers in training set
Top 28%
6.5%
50% of probability mass above
5
Advanced Science
249 papers in training set
Top 5%
3.7%
6
Science Advances
1098 papers in training set
Top 5%
3.7%
7
Nature Biomedical Engineering
42 papers in training set
Top 0.4%
2.8%
8
PLOS ONE
4510 papers in training set
Top 46%
2.4%
9
Journal of Medical Internet Research
85 papers in training set
Top 2%
2.1%
10
PLOS Digital Health
91 papers in training set
Top 1%
1.9%
11
PNAS Nexus
147 papers in training set
Top 0.2%
1.8%
12
iScience
1063 papers in training set
Top 14%
1.7%
13
Communications Medicine
85 papers in training set
Top 0.2%
1.7%
14
Frontiers in Digital Health
20 papers in training set
Top 0.6%
1.7%
15
BMC Medicine
163 papers in training set
Top 4%
1.4%
16
Communications Biology
886 papers in training set
Top 12%
1.4%
17
PLOS Computational Biology
1633 papers in training set
Top 21%
1.0%
18
IEEE Transactions on Biomedical Engineering
38 papers in training set
Top 0.8%
0.8%
19
JAMIA Open
37 papers in training set
Top 1%
0.8%
20
eLife
5422 papers in training set
Top 55%
0.8%
21
eBioMedicine
130 papers in training set
Top 3%
0.8%
22
Journal of Biomedical Informatics
45 papers in training set
Top 1%
0.8%
23
Human Reproduction
18 papers in training set
Top 0.4%
0.7%
24
Sensors
39 papers in training set
Top 2%
0.7%
25
BMC Pregnancy and Childbirth
20 papers in training set
Top 0.8%
0.7%
26
Epidemics
104 papers in training set
Top 2%
0.7%
27
PLOS Biology
408 papers in training set
Top 22%
0.7%
28
Science Translational Medicine
111 papers in training set
Top 8%
0.5%
29
Aging
69 papers in training set
Top 4%
0.5%
30
JMIR Public Health and Surveillance
45 papers in training set
Top 5%
0.5%