Back

Sky islands of Southwest China. II: Unraveling hidden species diversity of talpid moles using phylogenomics and skull-based deep learning

He, K.; LI, A.; Martinez, Q.; Wang, X.; Chen, Z.; He, S.; Xie, S.; Zeng, Z.; Wang, K.; Ye, Z.; Ruan, H.; Liu, S.; Lu, Q.; Zheng, X.; Luo, J.; Song, W.; Schwermann, A.; Yu, H.; Yu, W.; Springer, M.; Liu, S.; Li, S.; Tu, F.; Cao, Z.; Campbell, K. L.

2025-03-11 zoology
10.1101/2025.03.06.641773 bioRxiv
Show abstract

The sky islands of Southwest China, characterized by dramatic topographical and climatic variations, are prominent hotspots of biodiversity and endemism. Organisms inhabiting middle-to-high elevation habitats in this region are geographically isolated within distinct mountain chains, which over geological time have been subjected to isolation-by-distance and isolation-by-environment. These processes have led to profound allopatric diversification and strong phylogeographic structuring, resulting in a plethora of genetically distinct cryptic species, as is becoming increasingly evident for many small mammal families. However, morphological conservatism can pose significant challenges in delineating these clades within species complexes. In this study, we leverage artificial intelligence technologies to unravel the hidden species diversity of moles (family Talpidae) in Southwest Chinas sky islands. We first employed ultraconserved elements (UCEs) to investigate the evolutionary history of talpid moles, conducted molecular species delimitation using mitochondrial and multi-locus genes, and utilized both traditional and geometric morphometrics to examine their morphological disparity. To address the challenges of morphology based cryptic species identification, we developed a deep learning Hierarchical Identification of Species NETwork (HIS-NET) to create an image-based model that analyzes four different views of the skull/mandible to distinguish genera and species hierarchically. HIS-NET not only achieved expert-level accuracy in species identification but also effectively distinguished between cryptic and known species, aiding in the identification of key morphological variation intervals. Our results support the recognition of allopatrically distributed taxa in Euroscaptor and Parascaptor as full species, thereby confirming that species diversity in this region remains underestimated. Beyond advancing our understanding of speciation in this unique and fragile ecosystem, our study serves as a proof-of-concept, demonstrating the power of deep learning in unraveling hidden biodiversity within this and other species complexes.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Advanced Science
249 papers in training set
Top 0.1%
35.7%
2
Journal of Systematics and Evolution
11 papers in training set
Top 0.1%
10.5%
3
PLOS ONE
4510 papers in training set
Top 30%
5.1%
50% of probability mass above
4
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
4.5%
5
eLife
5422 papers in training set
Top 19%
4.5%
6
iScience
1063 papers in training set
Top 4%
3.7%
7
Scientific Reports
3102 papers in training set
Top 40%
3.4%
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 27%
2.2%
9
National Science Review
22 papers in training set
Top 0.7%
2.0%
10
Science China Life Sciences
26 papers in training set
Top 0.7%
2.0%
11
Frontiers in Genetics
197 papers in training set
Top 4%
2.0%
12
Journal of Genetics and Genomics
36 papers in training set
Top 1%
1.5%
13
Nature Communications
4913 papers in training set
Top 54%
1.4%
14
Science of The Total Environment
179 papers in training set
Top 4%
1.2%
15
Science
429 papers in training set
Top 18%
0.9%
16
Science Bulletin
22 papers in training set
Top 0.7%
0.8%
17
BMC Ecology and Evolution
49 papers in training set
Top 2%
0.8%
18
Science Advances
1098 papers in training set
Top 28%
0.8%
19
Frontiers in Public Health
140 papers in training set
Top 8%
0.7%
20
Genome Medicine
154 papers in training set
Top 8%
0.7%
21
Methods in Ecology and Evolution
160 papers in training set
Top 2%
0.7%
22
American Journal of Biological Anthropology
11 papers in training set
Top 0.3%
0.7%
23
Forensic Science International: Genetics
24 papers in training set
Top 0.1%
0.7%
24
Journal of Molecular Cell Biology
21 papers in training set
Top 0.7%
0.7%
25
Genome Research
409 papers in training set
Top 5%
0.7%
26
Current Biology
596 papers in training set
Top 16%
0.5%
27
Nature
575 papers in training set
Top 17%
0.5%