Back

Integrating patient movement and pathogen genomics to support hospital infection prevention with PathoPath: a method development study

Sajib, M. S.; Tanmoy, A. M.; Kanon, N.; Jui, A. B.; Islam, M. S.; Dola, N. Z.; Hossain, M. M.; Mobarak, R.; Shahidullah, M.; Hoque, M.; Ahmed, A. N. U.; Holmes, A. H.; Saha, S. K.; Saha, S.; Wan, Y.; Hooda, Y.

2026-06-05 infectious diseases
10.64898/2026.06.03.26354630 medRxiv
Show abstract

Background Healthcare-associated infections pose a major burden to neonatal health worldwide and remain difficult to track in low-resource hospitals because patient movement data and pathogen genomic data are rarely integrated into actionable transmission models. Existing approaches are often restricted to specific settings, highly structured electronic health records (EHRs), or analyses focused on either patient movements or pathogen characteristics alone. To address this gap, we developed PathoPath, an open-source integrative modelling platform, and evaluated its utility in a high burden paediatric hospital in Dhaka, Bangladesh. Methods PathoPath is an open-source R package that combines electronic health records with whole genome sequencing data to generate contact networks from direct and indirect contacts using minimal structured inputs. We retrospectively applied PathoPath to 373 cases of Klebsiella pneumoniae species complex (KpSC) infection identified in 2021 at the largest paediatric referral hospital in Dhaka, Bangladesh. Ward level patient movement trajectories were used to reconstruct contact networks, and genomic data from isolates from children <60 days were integrated to identify probable dissemination of bacterial clones and antimicrobial resistance plasmids. Findings PathoPath identified 750 direct contacts among 317 patients, forming 25 connected components, with the largest including 93 patients. KpSC infections were identified across 21 of 37 wards, with the neonatal intensive care unit accounting for 77.9% of all cases. Integration of genomic and network data distinguished sustained clustering of ST147 from multiple probable inter-clonal dissemination events involving IncFII plasmids carrying blaNDM-5 and/or blaOXA-181 within ST16. Four dominant sequence types accounted for 65.6% of sequenced isolates, and carbapenemase genes were detected in 95.8%. Interpretation PathoPath reconstructs hospital-wide contact networks and integrates them with pathogen genomics to map probable dissemination of pathogens and antimicrobial resistance using minimal structured clinical data. It could support more targeted infection prevention and control in hospitals where granular digital records are not available.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Microbial Genomics
204 papers in training set
Top 0.2%
10.1%
2
PLOS Computational Biology
1633 papers in training set
Top 4%
8.4%
3
Nature Communications
4913 papers in training set
Top 23%
8.4%
4
The Journal of Infectious Diseases
182 papers in training set
Top 0.7%
4.8%
5
Epidemics
104 papers in training set
Top 0.3%
4.8%
6
Genome Medicine
154 papers in training set
Top 2%
3.8%
7
Clinical Infectious Diseases
231 papers in training set
Top 1%
3.6%
8
The Lancet Microbe
43 papers in training set
Top 0.2%
3.6%
9
PLOS ONE
4510 papers in training set
Top 39%
3.6%
50% of probability mass above
10
Scientific Reports
3102 papers in training set
Top 41%
3.1%
11
The Lancet Infectious Diseases
71 papers in training set
Top 1%
2.6%
12
The Lancet Digital Health
25 papers in training set
Top 0.3%
2.1%
13
eLife
5422 papers in training set
Top 35%
2.1%
14
Science
429 papers in training set
Top 12%
2.1%
15
PLOS Medicine
98 papers in training set
Top 2%
2.1%
16
BMC Medicine
163 papers in training set
Top 4%
1.7%
17
Wellcome Open Research
57 papers in training set
Top 1%
1.3%
18
BMC Infectious Diseases
118 papers in training set
Top 3%
1.3%
19
Journal of Infection
71 papers in training set
Top 2%
1.3%
20
Eurosurveillance
80 papers in training set
Top 0.9%
1.3%
21
PLOS Global Public Health
293 papers in training set
Top 4%
1.2%
22
Canadian Medical Association Journal
15 papers in training set
Top 0.2%
1.1%
23
Nature Computational Science
50 papers in training set
Top 1%
0.9%
24
Infection Control & Hospital Epidemiology
17 papers in training set
Top 0.3%
0.9%
25
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 5%
0.9%
26
International Journal of Epidemiology
74 papers in training set
Top 2%
0.8%
27
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.8%
28
Nature Genetics
240 papers in training set
Top 8%
0.7%
29
Nature Medicine
117 papers in training set
Top 5%
0.7%
30
PLOS Pathogens
721 papers in training set
Top 9%
0.7%