Back

Automated bioinformatic pipeline for unbiased detection of tuberculosis transmission clusters: Real-time impact and retrospective insights

Genestet, C.; Testard, Q.; Ben-Hassen, G.; Bardel, C.; Vallee, M.; Bourg, C.; Bahuaud, O.; Joannard, B.; Tatai, C.; Barabotti, S.; Ader, F.; Dananche, C.; Hodille, E.; Dumitrescu, O.

2026-03-19 epidemiology
10.64898/2026.03.16.26348245 medRxiv
Show abstract

Background[RP1.1] In low-burden countries such as France, whole-genome sequencing (WGS) is increasingly integrated into routine tuberculosis (TB) surveillance to improve case management and transmission monitoring. However, applying WGS to all TB cases generates large volumes of data, requiring automated tools for timely interpretation and outbreak response. Methods Since November 2016, all clinical M. tuberculosis isolates diagnosed in eight hospitals from three cities of Auvergne-Rhone-Alpes in France have undergone WGS. In July 2023, an automated pipeline for anti-TB drug resistance prediction and unbiased detection of transmission clusters based on SNP distances was implemented. Epidemiological, microbiological and clinical data were collected, with contact duration classified as household, frequent, or occasional. Index cases were stratified by their level of extra-household transmission (EHT), and statistical analyses were performed to identify associated factors. Findings Among 1,152 TB patients diagnosed between 2016 and 2025, 75 clusters involving 247 patients (21.4%) were identified. WGS reliably detected resistance to first-line anti-TB drugs, leveraging the WHO mutation catalogue. Routine WGS enabled real-time alerts for TB control centres, leading to expanded field investigations, including community spillover, nosocomial transmissions, and school outbreak. Classical indicators of contagiousness (smear results, cavitary disease) were not associated with EHT level. Instead, lower TB severity indices and longer duration of symptoms were linked to higher EHT level. Interpretation Systematic WGS supports timely identification of drug resistance and transmission events and provides new insights into contagiousness factors. The automated pipeline enables direct interpretation by clinical microbiologists, facilitating real-time public health action. In this study, we demonstrate how, with the appropriate pipeline, WGS offered a time- and cost-effective solution for routine TB management. Funding This work was supported by SHAPE-Med@Lyon, a French government grant managed by the French National Research Agency under the France 2030 program (reference ANR-22-EXES-0012).

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Clinical Infectious Diseases
231 papers in training set
Top 0.2%
12.7%
2
European Respiratory Journal
54 papers in training set
Top 0.1%
10.1%
3
EBioMedicine
39 papers in training set
Top 0.1%
6.4%
4
Nature Communications
4913 papers in training set
Top 32%
4.9%
5
The Journal of Infectious Diseases
182 papers in training set
Top 1%
3.7%
6
Journal of Infection
71 papers in training set
Top 0.5%
3.6%
7
Genome Medicine
154 papers in training set
Top 2%
3.6%
8
The Lancet Microbe
43 papers in training set
Top 0.3%
3.3%
9
Thorax
32 papers in training set
Top 0.3%
2.6%
50% of probability mass above
10
Microbial Genomics
204 papers in training set
Top 0.8%
2.4%
11
Scientific Reports
3102 papers in training set
Top 50%
2.1%
12
Journal of Clinical Microbiology
120 papers in training set
Top 0.8%
2.1%
13
Open Forum Infectious Diseases
134 papers in training set
Top 1%
1.9%
14
The Lancet Infectious Diseases
71 papers in training set
Top 1%
1.9%
15
Frontiers in Medicine
113 papers in training set
Top 3%
1.9%
16
eLife
5422 papers in training set
Top 40%
1.8%
17
PLOS Medicine
98 papers in training set
Top 3%
1.5%
18
BMC Infectious Diseases
118 papers in training set
Top 3%
1.3%
19
eBioMedicine
130 papers in training set
Top 2%
1.2%
20
International Journal of Epidemiology
74 papers in training set
Top 2%
1.2%
21
Epidemiology and Infection
84 papers in training set
Top 2%
1.2%
22
PLOS Biology
408 papers in training set
Top 15%
1.1%
23
PLOS Global Public Health
293 papers in training set
Top 5%
1.0%
24
PLOS ONE
4510 papers in training set
Top 64%
0.9%
25
The Lancet Digital Health
25 papers in training set
Top 0.8%
0.9%
26
Communications Medicine
85 papers in training set
Top 0.7%
0.9%
27
The Lancet Global Health
24 papers in training set
Top 1%
0.8%
28
BMC Medicine
163 papers in training set
Top 6%
0.8%
29
JAC-Antimicrobial Resistance
13 papers in training set
Top 0.4%
0.8%
30
Clinical Chemistry
22 papers in training set
Top 0.8%
0.7%