Back

Machine learning applied to otolith microchemical data to discriminate stock of origin in salmon

Makhlouf, B.; Schindler, D.; Staneva, V.

2025-12-02 ecology
10.64898/2025.12.01.691230 bioRxiv
Show abstract

A key challenge in ecosystem-based fisheries management is sustainably harvesting co-occurring species and populations that differ in their vulnerability to fishing. This challenge is exemplified in western Alaska Chinook salmon, where recent population declines have led to closures of in-river subsistence fisheries, but identifying the cause of these declines is limited by an inability to identify the river of origin for marine-caught fish in offshore fisheries that target more abundant species and populations. This problem is particularly acute for estimating the impacts of bycatch on individual salmon populations which have demographic independence but no genetic differentiation from which stock assignments can be made. Here we use machine learning approaches to assess the efficacy of using the microchemical history preserved in fish otoliths to assign individuals to their river of origin in western Alaska. We tested the classification ability of three machine learning algorithms (Random Forest, K-Nearest Neighbors, and Support Vector Machines) combined with two time series smoothing techniques (Moving Average and Generalized Additive Models) to classify Chinook salmon to their river of origin using otolith time series data. Model accuracy ranged from 71.9% to 92.5%, with optimal performance achieved by Random Forest applied to GAM-smoothed data. Watershed-specific performance ranged from 86.8% to 93.7%, with most misclassifications occurring between spatially proximate Kuskokwim and Nushagak rivers. Raw predicted probabilities from classification algorithms were calibrated to reflect true class probabilities, enabling the incorporation of model results into decision analyses with explicit consideration of misclassification risk tolerance. The success of these models offers immediate utility for estimating marine mortality impacts across the regions three major river systems as well as an opportunity to understand commercial fisheries impacts on individual populations at substantially finer spatial scales than had been previously possible.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Canadian Journal of Fisheries and Aquatic Sciences
14 papers in training set
Top 0.1%
32.5%
2
PLOS ONE
4510 papers in training set
Top 11%
17.3%
3
Scientific Reports
3102 papers in training set
Top 20%
6.2%
50% of probability mass above
4
Ecosphere
53 papers in training set
Top 0.1%
4.1%
5
Ecological Informatics
29 papers in training set
Top 0.2%
3.8%
6
Aquatic Conservation: Marine and Freshwater Ecosystems
12 papers in training set
Top 0.1%
3.5%
7
Frontiers in Ecology and Evolution
60 papers in training set
Top 2%
1.9%
8
Science of The Total Environment
179 papers in training set
Top 3%
1.8%
9
Conservation Letters
11 papers in training set
Top 0.2%
1.7%
10
PeerJ
261 papers in training set
Top 8%
1.7%
11
Limnology and Oceanography: Methods
11 papers in training set
Top 0.2%
1.3%
12
Methods in Ecology and Evolution
160 papers in training set
Top 2%
1.2%
13
Biological Conservation
43 papers in training set
Top 0.6%
1.2%
14
Global Ecology and Conservation
25 papers in training set
Top 0.9%
0.9%
15
PLOS Computational Biology
1633 papers in training set
Top 22%
0.9%
16
Frontiers in Marine Science
55 papers in training set
Top 1.0%
0.9%
17
Ecological Applications
28 papers in training set
Top 0.7%
0.7%
18
Global Change Biology
69 papers in training set
Top 2%
0.7%
19
eLife
5422 papers in training set
Top 60%
0.7%
20
FACETS
11 papers in training set
Top 0.3%
0.7%
21
Environmental Research Letters
15 papers in training set
Top 0.7%
0.6%
22
Communications Earth & Environment
14 papers in training set
Top 1%
0.6%
23
Movement Ecology
18 papers in training set
Top 0.6%
0.6%