Longitudinal Prediction of BMI using Explainable AI: Integrating Polygenic Scores, Maternal, Early-Life and Familial Factors

Chen, F.; Melton, P.; Vinsen, K.; Mori, T. A.; Beilin, L.; Huang, R.-C.

2025-07-11 epidemiology
10.1101/2025.07.07.25331071 medRxiv
Background/Objectives: This study aimed to predict body mass index (BMI) trajectories from childhood to early adulthood using explainable artificial intelligence, integrating polygenic scores (PGS), maternal, early-life, and familial factors to identify key predictors of obesity risk and inform prevention strategies.

Subjects/Methods: We analysed longitudinal data from the Raine Study Gen2 cohort, recruiting 2 868 participants. This observational study, without randomization or case-control design, collected BMI measurements at ages 8, 10, 14, 17, 20, 23, and 27 years. We applied Kolmogorov-Arnold Networks (KAN) alongside conventional machine learning models, integrating epidemiological variables (maternal and paternal anthropometrics, parental education, early-life skinfold measurements) with seven BMI-related PGS. The analysis spanned from childhood to early adulthood, with no intervention administered.

Results: The KAN model, combining epidemiological and PGS data, achieved predictive performance with R² ranging from 0.81 at age 8 to 0.34 at age 27. BMI z-score at age 5 was the dominant predictor in early years, with PGS influence increasing post-adolescence. Maternal and paternal anthropometric measures, parental education, and early-life skinfold measurements were significant contributors. The interpretable KAN model revealed the dynamic interplay of genetic and environmental factors, with early-life BMI z-score and PGS emerging as key drivers of BMI trajectories across life stages.

Conclusions: These findings highlight the dynamic interplay of genetic and environmental factors across life stages, underscoring the potential of early-life BMI as a biomarker for obesity risk. Our interpretable model offers actionable insights for targeted obesity prevention strategies.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1. International Journal of Obesity; 25 papers in training set; Top 0.1%; 29.3%
2. Scientific Reports; 3102 papers in training set; Top 5%; 10.7%
3. PLOS ONE; 4510 papers in training set; Top 20%; 8.9%
4. International Journal of Epidemiology; 74 papers in training set; Top 0.4%; 4.6%
(50% of probability mass above)
5. BMC Medicine; 163 papers in training set; Top 1.0%; 4.2%
6. American Journal of Epidemiology; 57 papers in training set; Top 0.4%; 2.9%
7. Journal of Clinical Medicine; 91 papers in training set; Top 2%; 2.5%
8. BMC Public Health; 147 papers in training set; Top 3%; 2.0%
9. Diabetologia; 36 papers in training set; Top 0.5%; 1.8%
10. PLOS Medicine; 98 papers in training set; Top 3%; 1.6%
11. International Journal of Environmental Research and Public Health; 124 papers in training set; Top 4%; 1.6%
12. PeerJ; 261 papers in training set; Top 9%; 1.3%
13. eBioMedicine; 130 papers in training set; Top 3%; 1.0%
14. Diabetes, Obesity and Metabolism; 17 papers in training set; Top 0.4%; 0.9%
15. Bioinformatics Advances; 184 papers in training set; Top 4%; 0.8%
16. Metabolism; 14 papers in training set; Top 0.4%; 0.8%
17. Nature Communications; 4913 papers in training set; Top 60%; 0.8%
18. BMC Bioinformatics; 383 papers in training set; Top 7%; 0.8%
19. Obesity; 19 papers in training set; Top 0.7%; 0.7%
20. BMJ Nutrition, Prevention & Health; 10 papers in training set; Top 0.5%; 0.7%
21. Frontiers in Genetics; 197 papers in training set; Top 11%; 0.7%
22. PLOS Computational Biology; 1633 papers in training set; Top 26%; 0.7%
23. BMJ Open; 554 papers in training set; Top 13%; 0.7%
24. BMC Medical Research Methodology; 43 papers in training set; Top 2%; 0.5%
25. International Journal of Behavioral Nutrition and Physical Activity; 15 papers in training set; Top 0.6%; 0.5%
26. Nutrients; 64 papers in training set; Top 2%; 0.5%