
Large-scale automated detection of gray whales off California in panchromatic and multispectral satellite imagery.

HOUEGNIGAN, L.; Cuesta Lazaro, E.

2026-04-19 bioinformatics
10.64898/2026.04.15.718679 bioRxiv

Increasing human activities along the US west coast are of concern for cetacean populations, particularly for a number of large whale species that are recovering from overexploitation during the era of commercial whaling. New rapid monitoring tools, such as satellite imagery analysis powered by recent advances in artificial intelligence, have the potential to provide additional broad-scale and near real-time capacities for survey and monitoring. This paper investigates and demonstrates the feasibility of automatic detection of gray whales in sub-meter satellite imagery off the coast of California, USA. Observations and statistical analysis of regional imagery allowed not only an assessment of their detectability but also the development of robust signal processing and machine learning-based solutions for automated detection. To that end, a regional dataset of 221 gray whales was created using signal processing to inform a deep-learning-based detection framework. Twenty large neural network architectures (19 convolutional neural networks and 1 transformer network) were evaluated as feature extraction backbones, each followed by a support vector machine for classification. The best architecture generally achieved satisfactory performance, with an average balanced accuracy reaching up to 99.90%. It is also demonstrated that panchromatic imagery, in spite of the lesser amount of information it provides, can be used to perform detection with a relatively high accuracy of 87.05%, allowing wider spatial and temporal coverage. Large-scale deployment of the best performing models over a broad range of regional satellite imagery resulted in the detection of 3353 gray whales, as well as opportunistic detections of humpback, blue and fin whales, in imagery dating from December 28th, 2009 to March 26th, 2023. It also provided meaningful data points concerning the migration routes of gray whales within the Channel Islands and Southern California Bight. The large number of high-confidence detections indicates the capacity for a large-scale monitoring approach to support state and federal conservation policies such as gear mitigation, vessel speed reduction programs, or shipping lane redefinition, an approach that could also be expanded to other areas and other species.
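The abstract reports performance as balanced accuracy but does not define it. A minimal sketch, assuming the standard binary definition (the mean of sensitivity and specificity), shows why the metric suits a task where whales are rare relative to background. The function name and the toy labels are illustrative assumptions, not data or code from the paper.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels.

    Assumed convention (not from the paper): 1 = whale, 0 = background.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    sensitivity = tp / (tp + fn)  # fraction of whales that were detected
    specificity = tn / (tn + fp)  # fraction of background correctly rejected
    return (sensitivity + specificity) / 2


# Toy example: 2 whales among 10 image chips, one whale missed.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 0]

plain_accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
score = balanced_accuracy(y_true, y_pred)
```

In this toy case plain accuracy is 0.9 even though half of the whales were missed, while balanced accuracy is 0.75 because each class contributes equally.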

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1. PLOS ONE | 4510 papers in training set | Top 5% | 23.8%
2. PeerJ | 261 papers in training set | Top 0.3% | 7.2%
3. Animals | 20 papers in training set | Top 0.1% | 7.2%
4. Scientific Data | 174 papers in training set | Top 0.2% | 6.7%
5. Scientific Reports | 3102 papers in training set | Top 15% | 6.7%
(50% of probability mass above)
6. Ecological Informatics | 29 papers in training set | Top 0.1% | 4.5%
7. Gigabyte | 60 papers in training set | Top 0.2% | 4.5%
8. Frontiers in Physiology | 93 papers in training set | Top 3% | 1.4%
9. Frontiers in Neuroscience | 223 papers in training set | Top 5% | 1.3%
10. Ecological Indicators | 20 papers in training set | Top 0.3% | 1.3%
11. Sensors | 39 papers in training set | Top 1% | 1.2%
12. Royal Society Open Science | 193 papers in training set | Top 3% | 1.0%
13. International Journal of Environmental Research and Public Health | 124 papers in training set | Top 6% | 0.9%
14. Applied Sciences | 24 papers in training set | Top 0.6% | 0.9%
15. Science of The Total Environment | 179 papers in training set | Top 4% | 0.9%
16. Computational and Structural Biotechnology Journal | 216 papers in training set | Top 8% | 0.8%
17. Journal of Experimental Biology | 249 papers in training set | Top 2% | 0.8%
18. Biology Methods and Protocols | 53 papers in training set | Top 2% | 0.8%
19. Frontiers in Plant Science | 240 papers in training set | Top 5% | 0.8%
20. Peer Community Journal | 254 papers in training set | Top 3% | 0.8%
21. Limnology and Oceanography: Methods | 11 papers in training set | Top 0.4% | 0.8%
22. GigaScience | 172 papers in training set | Top 3% | 0.8%
23. Frontiers in Bioengineering and Biotechnology | 88 papers in training set | Top 3% | 0.7%
24. Frontiers in Human Neuroscience | 67 papers in training set | Top 3% | 0.5%
25. Communications Biology | 886 papers in training set | Top 31% | 0.5%
26. Nature Communications | 4913 papers in training set | Top 66% | 0.5%
27. PLOS Computational Biology | 1633 papers in training set | Top 28% | 0.5%
28. Viruses | 318 papers in training set | Top 6% | 0.5%
29. Bioengineering | 24 papers in training set | Top 2% | 0.5%
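
The position of the 50% marker in the ranking can be checked by accumulating the listed probabilities in rank order. The sketch below uses the percentages shown for ranks 1 to 7; the function name `rank_reaching` is a hypothetical helper introduced here for illustration.

```python
# Predicted probabilities (percent) for ranks 1 to 7, copied from the ranking.
probs = [23.8, 7.2, 7.2, 6.7, 6.7, 4.5, 4.5]


def rank_reaching(probabilities, threshold):
    """Return (rank, cumulative mass) at the first rank whose running total
    meets the threshold, or None if the threshold is never reached."""
    total = 0.0
    for rank, p in enumerate(probabilities, start=1):
        total += p
        if total >= threshold:
            return rank, total
    return None


result = rank_reaching(probs, 50.0)
```

The running total is about 44.9% after four journals and about 51.6% after five, which agrees with the marker placed after rank 5.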