Back

Automated community ecology using deep learning: a case study of planktonic foraminifera

Hsiang, A. Y.; Hull, P. M.

2022-11-01 ecology
10.1101/2022.10.31.514514 bioRxiv
Show abstract

The development of deep learning methods using convolutional neural networks (CNNs) has revolutionised the field of computer vision in recent years. The automation of taxonomic identification using CNNs leads naturally to the use of such technology for rapidly generating large organismal datasets in order to study the evolutionary and ecological dynamics of biological communities across time and space. While CNNs have been used to train machine learning classifiers that can identify organisms to the species level for several groups, this vision of automated community ecology has yet to be thoroughly tested or fulfilled. Here, we present a case study of automated community ecology using a large dataset of Atlantic planktonic foraminifera for which the generation of species labels and morphometric measurements was completely automated. We compare standard community diversity metrics between the fully automated dataset and a "traditional" dataset with human-identified specimens. We show that there is high congruence between the results, and that machine classifications help avoid biases that can result in the inference of misleading biodiversity patterns. Our study demonstrates the viability and potential of fully automated community ecology and sets the stage for a new era of ecological and evolutionary inquiry driven by artificial intelligence.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Methods in Ecology and Evolution
160 papers in training set
Top 0.1%
22.9%
2
Ecological Informatics
29 papers in training set
Top 0.1%
18.9%
3
Scientific Reports
3102 papers in training set
Top 13%
6.9%
4
Molecular Ecology Resources
161 papers in training set
Top 0.2%
6.9%
50% of probability mass above
5
PLOS ONE
4510 papers in training set
Top 27%
6.5%
6
PLOS Computational Biology
1633 papers in training set
Top 8%
4.0%
7
Ecology and Evolution
232 papers in training set
Top 0.9%
3.6%
8
Peer Community Journal
254 papers in training set
Top 1%
2.4%
9
Ecography
50 papers in training set
Top 0.6%
1.8%
10
PeerJ
261 papers in training set
Top 8%
1.5%
11
BMC Ecology and Evolution
49 papers in training set
Top 1%
1.4%
12
Limnology and Oceanography: Methods
11 papers in training set
Top 0.2%
1.4%
13
Frontiers in Ecology and Evolution
60 papers in training set
Top 3%
1.2%
14
Ecological Indicators
20 papers in training set
Top 0.4%
1.1%
15
Ecology Letters
121 papers in training set
Top 1%
0.9%
16
Frontiers in Marine Science
55 papers in training set
Top 1%
0.8%
17
eLife
5422 papers in training set
Top 59%
0.7%
18
Systematic Biology
121 papers in training set
Top 0.4%
0.7%
19
Nature Communications
4913 papers in training set
Top 65%
0.7%
20
Scientific Data
174 papers in training set
Top 3%
0.7%
21
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
22
Fungal Genetics and Biology
14 papers in training set
Top 0.3%
0.7%
23
iScience
1063 papers in training set
Top 37%
0.7%
24
Water Research
74 papers in training set
Top 2%
0.7%
25
Systematic Entomology
11 papers in training set
Top 0.1%
0.7%
26
Global Ecology and Biogeography
41 papers in training set
Top 0.8%
0.5%
27
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 7%
0.5%
28
Metabarcoding and Metagenomics
12 papers in training set
Top 0.1%
0.5%