Back

A supervised learning algorithm to evaluate occurrence records in virtual species

Rios, R.; Noguera-Urbano, E. A.; Espinosa, J.; Ochoa, J. M.

2021-09-06 ecology
10.1101/2021.09.06.459158 bioRxiv
Show abstract

Digital and open access of occurrence data have encouraged the development of tools to improve biodiversity conservation and management. In this study, we proposed a methodology to evaluate point-occurrence records based on expert knowledge. We firstly generated virtual data to test our methodology without confounding factors by simulating geographical distributions, virtual sampling, and expert checking of occurrence records. We used a set of non-linear bioclimatic variables and principal component analysis (PCA) to define a duality function between niche and biotope spaces. Subsequently, a supervised-learning model was fit to classify records between true and doubtful presence based on the virtual expert checking. We then tested our methodology using three virtual species and 10-fold cross validation. Also, we evaluated the prediction performance of the supervise model compared with the virtual observer using a virtual external database of occurrence data.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Ecological Informatics
29 papers in training set
Top 0.1%
22.3%
2
Methods in Ecology and Evolution
160 papers in training set
Top 0.4%
10.0%
3
PLOS ONE
4510 papers in training set
Top 22%
8.3%
4
Ecography
50 papers in training set
Top 0.1%
8.3%
5
Scientific Reports
3102 papers in training set
Top 25%
4.8%
50% of probability mass above
6
Ecological Indicators
20 papers in training set
Top 0.1%
4.8%
7
Global Ecology and Conservation
25 papers in training set
Top 0.3%
3.6%
8
Ecology and Evolution
232 papers in training set
Top 1%
3.6%
9
Science of The Total Environment
179 papers in training set
Top 2%
2.3%
10
PLOS Computational Biology
1633 papers in training set
Top 13%
2.3%
11
Global Ecology and Biogeography
41 papers in training set
Top 0.3%
1.9%
12
Frontiers in Ecology and Evolution
60 papers in training set
Top 2%
1.7%
13
PeerJ
261 papers in training set
Top 8%
1.5%
14
eLife
5422 papers in training set
Top 45%
1.5%
15
Journal of Biogeography
37 papers in training set
Top 0.2%
1.5%
16
Diversity and Distributions
26 papers in training set
Top 0.2%
1.3%
17
Ecological Modelling
24 papers in training set
Top 0.5%
0.9%
18
Royal Society Open Science
193 papers in training set
Top 4%
0.9%
19
Remote Sensing in Ecology and Conservation
10 papers in training set
Top 0.2%
0.9%
20
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 6%
0.8%
21
Nature Communications
4913 papers in training set
Top 63%
0.7%
22
Landscape Ecology
12 papers in training set
Top 0.4%
0.7%
23
Systematic Entomology
11 papers in training set
Top 0.1%
0.7%
24
Peer Community Journal
254 papers in training set
Top 4%
0.7%
25
Molecular Ecology Resources
161 papers in training set
Top 1%
0.6%
26
New Phytologist
309 papers in training set
Top 5%
0.6%
27
Communications Earth & Environment
14 papers in training set
Top 1%
0.6%