Back

Population-level migration modeling of North American birds through data integration with BirdFlow

Chen, Y.; Slager, D. L.; Plunkett, E.; Fuentes, M.; Deng, Y.; Mackenzie, S. A.; Berrigan, L. E.; Fink, D.; Sheldon, D.; Van Doren, B. M.; Dokter, A. M.

2025-10-02 ecology
10.1101/2025.09.30.679621 bioRxiv
Show abstract

Accurate information on the population-level movements of migratory animals is highly valuable for migration research and critical for designing effective conservation strategies for a changing world. However, population-wide movement information is lacking for most migratory species due to the effort and expense needed to collect data across species ranges. BirdFlow is a probabilistic modeling framework that infers population-level movements from weekly species distribution maps produced by the participatory science project eBird. Producing accurate species-specific BirdFlow models has required model tuning using high-resolution individual tracking data, which is not available for most migratory species. Here, we introduce a new model tuning framework that eliminates this reliance on tracking data and generalizes BirdFlow to hundreds of migratory species. This framework allows us to tune and validate BirdFlow models using a combination of data sources, including GPS tracks, banding recoveries, and radio telemetry data from the Motus Wildlife Tracking System. We investigate the generalizability of this approach by (1) investigating predictive performance compared to null models; (2) validating the biological plausibility of BirdFlow models by comparing movement properties such as route straightness, number of stopovers, and migration speed between model-generated routes and real movement tracks; and (3) comparing the performance of models tuned on species-specific movement data to models tuned using hyperparameters transferred from other species. Our results show that the BirdFlow modeling framework achieves biologically realistic performance, even for prediction horizons of thousands of kilometers and several months. When species-specific data are unavailable, models can still be tuned using data from other phylogenetically adjacent species to achieve improved performance. Alongside this study, we release 153 tuned BirdFlow models, representing the first collection of large-scale population-level movement forecasting models and offering a foundation for more accurate predictions for applications in conservation, disease surveillance, aviation, and public outreach.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Movement Ecology
18 papers in training set
Top 0.1%
39.1%
2
Methods in Ecology and Evolution
160 papers in training set
Top 0.1%
23.3%
50% of probability mass above
3
PLOS ONE
4510 papers in training set
Top 33%
4.5%
4
Remote Sensing in Ecology and Conservation
10 papers in training set
Top 0.1%
4.5%
5
Nature Communications
4913 papers in training set
Top 43%
2.8%
6
Scientific Reports
3102 papers in training set
Top 52%
2.0%
7
eLife
5422 papers in training set
Top 46%
1.4%
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.3%
9
Ecological Applications
28 papers in training set
Top 0.4%
1.3%
10
PLOS Computational Biology
1633 papers in training set
Top 19%
1.3%
11
Ecology and Evolution
232 papers in training set
Top 3%
1.3%
12
Journal of Animal Ecology
63 papers in training set
Top 0.8%
0.9%
13
Journal of The Royal Society Interface
189 papers in training set
Top 4%
0.9%
14
Journal of Applied Ecology
35 papers in training set
Top 0.6%
0.9%
15
iScience
1063 papers in training set
Top 28%
0.8%
16
Ecology Letters
121 papers in training set
Top 1%
0.8%
17
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 6%
0.8%
18
Ecography
50 papers in training set
Top 1%
0.8%
19
Science Advances
1098 papers in training set
Top 28%
0.8%
20
Frontiers in Ecology and Evolution
60 papers in training set
Top 4%
0.7%
21
Diversity and Distributions
26 papers in training set
Top 0.5%
0.5%
22
Environmental Research Letters
15 papers in training set
Top 0.7%
0.5%
23
Communications Biology
886 papers in training set
Top 31%
0.5%
24
Epidemics
104 papers in training set
Top 2%
0.5%
25
Ecological Informatics
29 papers in training set
Top 0.9%
0.5%