Back

MTB-KB: A Curated Knowledgebase of Mycobacterium tuberculosis Related Studies

Li, P.; Li, C.; Zhu, R.; Sun, W.; Zhou, H.; Fan, Z.; Yue, L.; Zhang, S.; Jiang, X.; Luo, Q.; Han, J.; Huang, H.; Shen, A.; Bahetibieke, T.; Wang, J.; Zhang, W.; Wen, H.; Niu, H.; Bu, C.; Zhang, Z.; Xiao, J.; Gao, R.; Chen, F.

2026-04-10 bioinformatics
10.64898/2026.04.07.716833 bioRxiv
Show abstract

Tuberculosis (TB), caused by Mycobacterium tuberculosis (MTB), has regained its position as the worlds leading killer among infectious diseases. Despite extensive research progress across epidemiology, diagnosis, drug development, treatment regimens, vaccines, drug resistance, virulence factors, and immune mechanisms, MTB-related knowledge remains fragmented across thousands of publications, limiting its effective use. To address this gap, we present MTB-KB, a literature-curated knowledgebase that systematically integrates high-impact findings from eight major sections of TB research. The current release contains 75,170 associations from 1,246 publications, covering 18,439 entities standardized using authoritative databases and WHO-endorsed classifications. A central feature is the interactive knowledge graph, which links cross-section associations to reveal and infer MTB-host interactions, treatment strategies, and vaccine development opportunities. MTB-KB also provides a user-friendly interface with browsing, advanced search, and statistical visualization. Overall, by consolidating dispersed MTB knowledge into a structured and accessible platform, MTB-KB provides a valuable resource for researchers, clinicians, and policymakers, supporting both basic and clinical TB research, enabling evidence-based TB prevention, diagnosis, and treatment, and contributing to global elimination efforts. MTB-KB is accessible at https://ngdc.cncb.ac.cn/mtbkb/.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 0.5%
10.5%
2
Scientific Data
174 papers in training set
Top 0.1%
10.2%
3
Nucleic Acids Research
1128 papers in training set
Top 2%
7.2%
4
Bioinformatics
1061 papers in training set
Top 4%
6.4%
5
Frontiers in Microbiology
375 papers in training set
Top 2%
4.9%
6
Clinical Infectious Diseases
231 papers in training set
Top 1%
4.2%
7
Scientific Reports
3102 papers in training set
Top 31%
4.0%
8
PLOS ONE
4510 papers in training set
Top 39%
3.6%
50% of probability mass above
9
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 1%
3.6%
10
Database
51 papers in training set
Top 0.2%
3.1%
11
Genome Medicine
154 papers in training set
Top 3%
3.1%
12
PLOS Computational Biology
1633 papers in training set
Top 12%
2.6%
13
Frontiers in Medicine
113 papers in training set
Top 3%
1.8%
14
Communications Biology
886 papers in training set
Top 8%
1.7%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.7%
16
Advanced Science
249 papers in training set
Top 11%
1.7%
17
Nature Communications
4913 papers in training set
Top 52%
1.7%
18
Antimicrobial Agents and Chemotherapy
167 papers in training set
Top 1%
1.3%
19
Tuberculosis
11 papers in training set
Top 0.2%
1.2%
20
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
21
BMC Bioinformatics
383 papers in training set
Top 6%
0.9%
22
Microbiology Spectrum
435 papers in training set
Top 5%
0.8%
23
iScience
1063 papers in training set
Top 32%
0.8%
24
eLife
5422 papers in training set
Top 57%
0.8%
25
The Lancet Microbe
43 papers in training set
Top 1%
0.8%
26
European Respiratory Journal
54 papers in training set
Top 2%
0.7%
27
The Lancet Infectious Diseases
71 papers in training set
Top 3%
0.7%
28
mSystems
361 papers in training set
Top 8%
0.7%
29
BMC Genomics
328 papers in training set
Top 7%
0.6%
30
PLOS Pathogens
721 papers in training set
Top 10%
0.6%