CCycDB: an integrative knowledgebase to fingerprint microbially mediated carbon cycling processes
Zhou, J.; Qian, L.; Ji, M.; Ma, K.; Yu, X.; Chen, J.; Lin, L.; Gong, X.; He, Z.; Wang, J.; Tu, Q.
Microorganisms play essential roles in mediating the biogeochemical cycling of carbon across Earth's ecosystems. Understanding the processes and underlying mechanisms of microbially mediated carbon cycling is therefore critical for advancing global ecology and climate change research. To comprehensively depict these complex biogeochemical processes, we developed CCycDB, a knowledge-based functional gene database, to accurately fingerprint microbially mediated carbon cycling pathways and gene families, particularly from shotgun metagenomes. CCycDB comprises 4,676 gene families classified into six major functional categories, further structured into 45 level-1 and 188 level-2 sub-categories, encompassing a total of 10,991,724 high-quality reference sequences. Validation using both synthetic and real-world datasets demonstrated that CCycDB outperforms existing orthology databases in terms of accuracy, coverage and specificity. By directly targeting carbon-cycling functional gene families, CCycDB provides promising routines to reconstruct both functional gene and taxonomic profiles associated with microbially mediated carbon cycling. Application of CCycDB to shotgun metagenomes from diverse and complex ecosystems revealed pronounced habitat-specific differences in carbon cycling processes and their associated microbial taxa. Collectively, CCycDB provides a powerful and reliable tool for profiling carbon cycling processes from both functional and taxonomic perspectives in complex ecosystems. CCycDB is accessible at https://ccycdb.github.io/.
Impact Statement
Microbially mediated carbon cycling processes are the most complex biogeochemical processes in Earth's biosphere, playing profound regulatory roles in global climate change. A key bottleneck in linking microbial communities to global change is the lack of integrated tools for comprehensive carbon cycle profiling. Here, we present CCycDB, a tool that serves a dual purpose: it is a reference database for obtaining functional gene and taxonomic profiles, and it functions as a customized routine for efficiently aligning sequences and querying associated functional information. CCycDB enables researchers to accurately link microbial community dynamics to carbon cycling and transformation pathways, thereby advancing integrated global change studies with microbes and ecological research via complex metagenomic datasets.
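As an illustration of how gene-family hits might be rolled up into the hierarchical categories described in the abstract, the following is a minimal Python sketch. It is not CCycDB's own routine: the function name `profile`, the `HIERARCHY` mapping, and the category labels assigned to each gene are hypothetical placeholders, and the input is assumed to be one best-hit gene family per read from whatever aligner is used.

```python
from collections import Counter

# Hypothetical mapping: gene family -> (category, level-1, level-2).
# These labels are illustrative placeholders, NOT CCycDB's actual hierarchy.
HIERARCHY = {
    "mcrA": ("Methane metabolism", "Methanogenesis", "Methyl-CoM reduction"),
    "pmoA": ("Methane metabolism", "Methane oxidation", "Particulate MMO"),
    "rbcL": ("Carbon fixation", "Calvin cycle", "RuBisCO"),
}


def profile(hits, hierarchy=HIERARCHY):
    """Count reads per gene family and per hierarchy level.

    hits: iterable of (read_id, gene_family) pairs, one best hit per read
          (assumed input format; adapt to the aligner's actual output).
    Returns a dict of Counters keyed by "gene", "category", "level1", "level2".
    Reads whose gene family is absent from the mapping are skipped.
    """
    counts = {key: Counter() for key in ("gene", "category", "level1", "level2")}
    for _read_id, gene in hits:
        if gene not in hierarchy:
            continue  # hit does not belong to a known gene family
        category, level1, level2 = hierarchy[gene]
        counts["gene"][gene] += 1
        counts["category"][category] += 1
        counts["level1"][level1] += 1
        counts["level2"][level2] += 1
    return counts


if __name__ == "__main__":
    example_hits = [("r1", "mcrA"), ("r2", "mcrA"), ("r3", "pmoA"), ("r4", "rbcL")]
    for level, counter in profile(example_hits).items():
        print(level, dict(counter))
```

Raw counts like these would normally be normalized (for example by sequencing depth) before comparing habitats; that step is omitted here because the abstract does not specify one.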