Back

Feature-based in-silico model to predict the Mycobacterium tuberculosis bedaquiline phenotype associated with Rv0678 variants

Quispe Rojas, W.; de Diego Fuertes, M.; Rennie, V.; Riviere, E.; Safarpour, M.; Van Rie, A.

2026-02-14 bioinformatics
10.64898/2026.02.12.705543 bioRxiv
Show abstract

Bedaquiline resistance is emerging globally and threatens the effectiveness of the novel short all-oral regimens for rifampicin-resistant tuberculosis. Following a systematic literature review, we quantified 13 sequence, biochemical, and structural features of 62 Rv0678 missense variants reported in 136 Mycobacterium tuberculosis isolates. Using rigorous machine learning methods, we show that the strongest contributing features were the evolutionary conservation score and the shortest atomic distance to key functional sites. The final 5-feature model had good performance (ROC-AUC 0.826) and classified the bedaquiline phenotype with high accuracy [sensitivity 87.1% (95% CI, 78.3-92.6) and specificity 88.2% (95% CI, 76.6-94.5)]. Performance was lower in external validation, likely due to the measurement error introduced when using diverse phenotypic methods. missense variants on the mmpR5 protein structure and function. Integrating the five-feature in-silico in variant interpretation software could improve the prediction of the effect of Rv0678 variants and guide clinical management of rifampicin-resistant tuberculosis.

Matching journals

The top 12 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 6%
6.2%
2
Nature Communications
4913 papers in training set
Top 30%
6.2%
3
Scientific Reports
3102 papers in training set
Top 20%
6.2%
4
Communications Biology
886 papers in training set
Top 0.4%
6.2%
5
Frontiers in Medicine
113 papers in training set
Top 2%
3.5%
6
Tuberculosis
11 papers in training set
Top 0.1%
3.5%
7
eLife
5422 papers in training set
Top 27%
3.5%
8
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.5%
9
Microbiology Spectrum
435 papers in training set
Top 1%
3.0%
10
PLOS ONE
4510 papers in training set
Top 43%
3.0%
11
Antimicrobial Agents and Chemotherapy
167 papers in training set
Top 0.7%
3.0%
12
Genome Medicine
154 papers in training set
Top 3%
2.7%
50% of probability mass above
13
Clinical Infectious Diseases
231 papers in training set
Top 2%
2.0%
14
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
1.9%
15
American Journal of Respiratory and Critical Care Medicine
39 papers in training set
Top 0.4%
1.9%
16
Briefings in Bioinformatics
326 papers in training set
Top 3%
1.9%
17
Frontiers in Immunology
586 papers in training set
Top 4%
1.8%
18
Bioinformatics
1061 papers in training set
Top 7%
1.7%
19
Journal of Clinical Microbiology
120 papers in training set
Top 1%
1.7%
20
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.7%
21
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 3%
1.6%
22
PLOS Pathogens
721 papers in training set
Top 7%
1.3%
23
Genomics
60 papers in training set
Top 2%
1.2%
24
Journal of Infection
71 papers in training set
Top 2%
1.2%
25
The Lancet Microbe
43 papers in training set
Top 0.9%
1.2%
26
Nature Machine Intelligence
61 papers in training set
Top 3%
1.1%
27
BMC Bioinformatics
383 papers in training set
Top 6%
0.9%
28
eBioMedicine
130 papers in training set
Top 3%
0.9%
29
Advanced Science
249 papers in training set
Top 17%
0.9%
30
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.9%