Back

Satellite imagery encodes features predictive of regional mortality and life expectancy

Mitsuyama, Y.; Saito, K.; Kurimoto, S.; Walston, S. L.; Takita, H.; Ueda, D.

2026-05-19 public and global health
10.64898/2026.05.17.26353439 medRxiv
Show abstract

Background Increasingly accessible satellite imagery provides scalable measures of the built and natural environment relevant to population health. However, whether such imagery can capture subnational variation in mortality and life expectancy remains unclear. We therefore assessed its predictive value for regional mortality and life expectancy across OECD regions. Methods We conducted an ecological, cross-sectional prediction study using 2023 data from OECD Territorial Level 3 (TL3) regions. Annual cloud-masked composites from the Harmonized Landsat and Sentinel-2 collection were processed in the Google Earth Engine, tiled at 224 x 224 pixels, and encoded with the pretrained Prithvi foundation model to derive region-level satellite embeddings. For each outcome, we trained LightGBM regressors for a country-only baseline, a satellite-only model, a combined model (country + satellite), and a final contextual model that additionally included prespecified socioeconomic and environmental covariates. Performance was evaluated using 10-fold outer cross-validation with held-out test folds; R2 was the primary metric. Results The analytic sample comprised 2,414 OECD TL3 regions across 38 countries, for which 939,959 satellite image tiles were processed. In paired bootstrap comparisons, adding satellite features to country indicators improved predictive performance for all outcomes, with incremental R2 ranging from 0.097 to 0.233. The final contextual model achieved R2 values of 0.78 (95% CI, 0.74-0.81) for crude mortality, 0.87 (0.84-0.89) for age-adjusted mortality, 0.86 (0.82-0.88) for infant mortality, and 0.76 (0.69-0.84) for life expectancy. In SHAP analyses, the aggregated satellite image effect consistently ranked among the top predictors across outcomes. Conclusion Satellite imagery captures subnational environmental heterogeneity relevant to regional mortality and life expectancy beyond country identity alone. Earth observation may therefore provide a scalable, complementary data source for characterizing geographic disparities in population health.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Global Change Biology
69 papers in training set
Top 0.1%
14.4%
2
Nature Communications
4913 papers in training set
Top 18%
10.1%
3
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 14%
4.9%
4
Scientific Data
174 papers in training set
Top 0.4%
4.2%
5
Nature Medicine
117 papers in training set
Top 0.6%
4.2%
6
Scientific Reports
3102 papers in training set
Top 31%
4.0%
7
PLOS ONE
4510 papers in training set
Top 37%
3.7%
8
Environmental Research Letters
15 papers in training set
Top 0.2%
2.6%
9
The Lancet Digital Health
25 papers in training set
Top 0.2%
2.4%
50% of probability mass above
10
eBioMedicine
130 papers in training set
Top 0.8%
2.1%
11
eLife
5422 papers in training set
Top 38%
1.9%
12
American Journal of Epidemiology
57 papers in training set
Top 0.6%
1.9%
13
npj Digital Medicine
97 papers in training set
Top 2%
1.9%
14
Science of The Total Environment
179 papers in training set
Top 3%
1.7%
15
BMC Medicine
163 papers in training set
Top 3%
1.7%
16
BMJ Global Health
98 papers in training set
Top 2%
1.7%
17
Data in Brief
13 papers in training set
Top 0.1%
1.5%
18
International Journal of Epidemiology
74 papers in training set
Top 2%
1.5%
19
GeoHealth
10 papers in training set
Top 0.4%
1.3%
20
JMIR Public Health and Surveillance
45 papers in training set
Top 2%
1.2%
21
The Lancet Global Health
24 papers in training set
Top 0.8%
1.1%
22
The Innovation
12 papers in training set
Top 0.6%
1.1%
23
One Health
29 papers in training set
Top 1%
0.9%
24
BMJ Open
554 papers in training set
Top 11%
0.9%
25
Communications Earth & Environment
14 papers in training set
Top 0.7%
0.9%
26
Nature Human Behaviour
85 papers in training set
Top 4%
0.9%
27
Patterns
70 papers in training set
Top 2%
0.8%
28
PLOS Global Public Health
293 papers in training set
Top 5%
0.8%
29
BMC Public Health
147 papers in training set
Top 6%
0.7%
30
Science
429 papers in training set
Top 20%
0.7%