Back

Towards a general Detector of terrestrial Arthropods in Natural backgrounds

Remy, E.; Carlier, A.; Massol, E.; Kacimi, R.; Chaine, A. S.; Cauchoix, M.

2026-05-08 ecology
10.64898/2026.05.06.723207 bioRxiv
Show abstract

Widespread arthropod declines pose risks to ecosystem functioning and agriculture. Assessing this decline or potential remediation implies the need for standardized and scalable population monitoring. Image-based methods, including camera traps and citizen science programs, are increasingly used, but the volume of data collected requires automated analysis. Robust arthropod detection is essential for individual counting or fine-grained classification, yet current datasets and algorithms do not address the vast morphological diversity across arthropod species and often overlook the variety of photographic contexts, such as differences in background, lighting, and image composition, in which arthropods are captured. To address this gap, we developed an arthropod detection dataset, covering all terrestrial families present in France with available validated images on the iNaturalist platform (749 families). To achieve this, we employed an iterative workflow in which a YOLOv11 model pre-annotated images -- using one representative species per family-- followed by manual correction and model retraining. Repeating this process progressively reduced annotation effort and improved model accuracy. The final outcome consists of a publicly available curated detection dataset and a robust arthropod detector for natural background scenes. The detector achieves an F1-score of 0.91, demonstrating strong performance despite substantial interspecific morphological variation and heterogeneity in photographic contexts. We further demonstrated the taxonomical universality of the model showing high F1-score and IoU averaged at the class (0.79, 0.85) and order level (0.82, 0.86) and also a good detection generalizability (F1-score>0.90, IoU>0.83) on species, genera and families never encountered by the model during training. Finally, we show how this model can be improved to generalize to new datasets using data augmentation, complementary training data or fine-tuning and increase detection of small objects. In particular, we report performance of the improved models on three use cases largely used in non lethal insect monitoring: (i) diurnal pollinator monitoring through citizen science or (ii) flower and nocturnal insects monitoring through smartphone time-lapse of a UV-illuminated white panel. These results mark an important step toward automated analysis of arthropod images in natural contexts, from both large-scale automated monitoring approaches or from citizen science monitoring programs.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Methods in Ecology and Evolution
160 papers in training set
Top 0.3%
14.2%
2
Nature Communications
4913 papers in training set
Top 15%
12.2%
3
Remote Sensing in Ecology and Conservation
10 papers in training set
Top 0.1%
10.3%
4
PLOS ONE
4510 papers in training set
Top 19%
10.0%
5
Peer Community Journal
254 papers in training set
Top 0.3%
6.7%
50% of probability mass above
6
PLOS Computational Biology
1633 papers in training set
Top 7%
4.8%
7
Scientific Reports
3102 papers in training set
Top 38%
3.5%
8
Scientific Data
174 papers in training set
Top 0.5%
3.5%
9
eLife
5422 papers in training set
Top 26%
3.5%
10
Ecological Informatics
29 papers in training set
Top 0.2%
2.7%
11
BMC Biology
248 papers in training set
Top 0.5%
2.6%
12
Communications Biology
886 papers in training set
Top 4%
2.4%
13
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.3%
14
Ecology and Evolution
232 papers in training set
Top 3%
1.2%
15
iScience
1063 papers in training set
Top 22%
1.2%
16
PLOS Biology
408 papers in training set
Top 16%
0.9%
17
Applications in Plant Sciences
21 papers in training set
Top 0.2%
0.9%
18
Royal Society Open Science
193 papers in training set
Top 4%
0.9%
19
Nature
575 papers in training set
Top 14%
0.9%
20
Journal of Applied Ecology
35 papers in training set
Top 0.7%
0.8%
21
Science
429 papers in training set
Top 19%
0.8%
22
Sensors
39 papers in training set
Top 2%
0.8%
23
Nature Methods
336 papers in training set
Top 7%
0.7%
24
Patterns
70 papers in training set
Top 3%
0.7%
25
Ecological Applications
28 papers in training set
Top 0.8%
0.6%
26
Science Advances
1098 papers in training set
Top 34%
0.6%