Back

Flex-It: A global standardised genotyping framework for Shigella flexneri

Hawkey, J.; Nodari, C. S.; Iqbal, Z.; Hunt, M.; Wick, R. R.; Chong, C. E.; Jenkins, C.; Howden, B. P.; Holt, K.; Weill, F.-X.; Baker, K. S.; Ingle, D. J.

2026-04-20 microbiology
10.64898/2026.04.17.719127 bioRxiv
Show abstract

Shigella flexneri is the leading causative agent of shigellosis globally. The public health threat posed by S. flexneri is compounded by its emergence as a sexually transmissible infection, importance of international travel in driving dissemination, and the increasing prevalence of antimicrobial resistance (AMR). A rapid and robust computational method is needed to enhance genomic surveillance and systematically explore features of the population structure of this WHO priority pathogen, which is scalable and readily implementable across jurisdictions, particularly as vaccine development efforts are underway. Here, we present Flex-It, a genomic framework and genotyping scheme implemented in Mykrobe for S. flexneri serotypes 1-5, X & Y, compatible with previous approaches used to describe S. flexneris population structure. To develop Flex-It, we curated a retrospective dataset of 5,819 publicly available S. flexneri genomes. We characterised the global population structure for S. flexneri, exploring geographical and temporal traits, and showed the granular diversity of AMR and serotype profiles. We applied Flex-It to >13,000 genomes routinely generated by public health laboratories from Australia, the UK and the USA across a ten-year period. We found significant genotype diversity in all three locations, with the emergence of genotypes with converged resistance to all major drugs currently used for treatment. Flex-It provides an open-source, novel genotyping method that rapidly characterises S. flexneri and its ciprofloxacin resistance determinants in <1 minute from both short and long whole-genome sequencing reads. Flex-It provides the community with a standardised nomenclature to monitor the emergence and spread of S. flexneri lineages.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.1%
21.7%
2
Microbial Genomics
204 papers in training set
Top 0.1%
18.0%
3
Nature Communications
4913 papers in training set
Top 15%
11.9%
50% of probability mass above
4
Journal of Clinical Microbiology
120 papers in training set
Top 0.3%
6.6%
5
Nucleic Acids Research
1128 papers in training set
Top 3%
6.2%
6
PLOS Computational Biology
1633 papers in training set
Top 12%
2.6%
7
Nature Microbiology
133 papers in training set
Top 2%
1.8%
8
GigaScience
172 papers in training set
Top 1%
1.8%
9
Genome Research
409 papers in training set
Top 2%
1.7%
10
Genome Biology
555 papers in training set
Top 5%
1.6%
11
mSystems
361 papers in training set
Top 5%
1.4%
12
PLOS Biology
408 papers in training set
Top 12%
1.4%
13
PLOS ONE
4510 papers in training set
Top 57%
1.4%
14
Journal of Infection
71 papers in training set
Top 2%
1.3%
15
PLOS Pathogens
721 papers in training set
Top 7%
1.3%
16
Microbiome
139 papers in training set
Top 2%
1.3%
17
Scientific Reports
3102 papers in training set
Top 65%
1.3%
18
mBio
750 papers in training set
Top 9%
1.2%
19
Cell Reports Medicine
140 papers in training set
Top 6%
0.9%
20
mSphere
281 papers in training set
Top 5%
0.9%
21
eLife
5422 papers in training set
Top 57%
0.8%
22
The Lancet Infectious Diseases
71 papers in training set
Top 3%
0.7%
23
Molecular Biology and Evolution
488 papers in training set
Top 5%
0.6%
24
Bioinformatics Advances
184 papers in training set
Top 5%
0.6%
25
Communications Biology
886 papers in training set
Top 30%
0.6%