Back

Characterization of menopause onset and associated disease risks using large-scale electronic health records

Thakkar, N.; Patil, R.; Levy-Gantt, R.; Hswen, Y.; Agrawal, M.; Zou, J.; Chen, I. Y.

2026-05-12 obstetrics and gynecology
10.64898/2026.05.08.26352769 medRxiv
Show abstract

Menopause affects over one billion women worldwide, yet remains poorly characterized at scale. We apply an ICD-10-based phenotyping algorithm to electronic health records (EHR) from an academic medical center (n=33,444 women aged 35-64) and a safety-net hospital system (n=7,041), yielding one of the most racially and socioeconomically diverse menopause cohorts in the literature. Structured EHR fields underrepresent symptom burden: only 38.8% of patients had any documented symptom via natural language processing, despite an estimated prevalence of 90%. Adverse pregnancy outcomes were associated with earlier menopause onset after adjustment ({beta}=-1.21 years, p=8.7x10-45). Menopausal women showed elevated risk for osteoporosis (hazard ratio of 12.40), rheumatoid arthritis (HR of 2.43), and mental and behavioral disorders (HR 2.38) relative to age-matched men, with divergence at menopause onset. We show that large-scale EHR can characterize menopause at a scale and diversity that prospective enrollment has not achieved.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.3%
17.9%
2
Nature Communications
4913 papers in training set
Top 8%
17.6%
3
Nature Medicine
117 papers in training set
Top 0.1%
17.6%
50% of probability mass above
4
Science Advances
1098 papers in training set
Top 0.1%
12.0%
5
eLife
5422 papers in training set
Top 19%
4.7%
6
Scientific Reports
3102 papers in training set
Top 39%
3.5%
7
Science Translational Medicine
111 papers in training set
Top 2%
1.8%
8
Nature Human Behaviour
85 papers in training set
Top 2%
1.6%
9
Clinical Cancer Research
58 papers in training set
Top 1%
1.6%
10
The Lancet Digital Health
25 papers in training set
Top 0.5%
1.6%
11
Science
429 papers in training set
Top 16%
1.4%
12
Cell Reports
1338 papers in training set
Top 27%
1.4%
13
PLOS ONE
4510 papers in training set
Top 63%
0.9%
14
Nature Genetics
240 papers in training set
Top 7%
0.9%
15
PLOS Medicine
98 papers in training set
Top 4%
0.9%
16
Communications Biology
886 papers in training set
Top 20%
0.9%
17
The Journal of Clinical Endocrinology & Metabolism
35 papers in training set
Top 1%
0.7%
18
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.7%
19
Cell Reports Medicine
140 papers in training set
Top 9%
0.7%
20
Advanced Science
249 papers in training set
Top 23%
0.6%
21
Human Genetics and Genomics Advances
70 papers in training set
Top 1%
0.6%
22
Cell Genomics
162 papers in training set
Top 8%
0.6%
23
BMC Medicine
163 papers in training set
Top 8%
0.6%
24
JCI Insight
241 papers in training set
Top 9%
0.6%