Back

Salmonella Genomic Markers for Risk to Food Safety

Waters, E. V.; Hill, C.; Orzechowska, B.; Cook, R.; Jorgensen, F.; Chattaway, M. A.; Langridge, G. C.

2026-03-30 genomics
10.64898/2026.03.27.714810 bioRxiv
Show abstract

Foodborne non-typhoidal Salmonella remains a major public health concern, yet routine surveillance recovers large numbers of isolates from food that are not associated with human illness. Studies have shown foodborne isolates can be genetically linked to clinical cases, highlighting a critical challenge for risk assessment and outbreak prioritisation. This study aimed to determine whether genomic markers can distinguish foodborne Salmonella strains with an increased likelihood of causing infection. Whole-genome sequencing data from over 900 Salmonella isolates recovered from food and the environment through UK Health Security Agency surveillance were analysed using hierarchical clustering to define genetically related groups. These clusters were expanded using the global EnteroBase database to provide broader epidemiological context. Genome-wide association analyses identified genetic markers associated with clusters containing clinical isolates, including phage-associated regions. A highly conserved 7 kb marker identified in S. Agona demonstrated strong predictive performance at a global scale, with high sensitivity and specificity for infection-associated lineages and strict serovar restriction. Comparative genomic analysis revealed that all markers localised to a shared chromosomal hotspot corresponding to a prophage integration site. The 7 kb risk-associated marker formed part of a larger prophage closely related to the well-characterised S. Typhimurium Fels-2 phage, which encodes a DNA invertase linked to phase variation, a mechanism known to promote phenotypic heterogeneity and host adaptation. As these S. Agona isolates are monophasic, our findings indicate that our genome-wide association approach has rediscovered this DNA invertase known to contribute to infection risk but in a different serovar via an alternative regulatory mechanism. Overall, this work demonstrates the potential to move beyond treating all foodborne Salmonella isolates as equivalent hazards, towards a genomics-informed framework for risk stratification. This approach provides a foundation for improved risk-based decision-making, enhance outbreak investigations and enable earlier prioritisation of public health responses during Salmonella surveillance and control. Author summaryFoodborne Salmonella infections remain a major public health concern, but not all strains pose the same risk to human health. Here we investigated whether genetic differences could explain why some foodborne strains are more likely to cause human infection. We analysed over 900 genomes from food and environmental sources, grouping closely related strains before placing them in a global context using EnteroBase. By combining pangenome and genome-wide association analyses, we identified distinct lineages within several serovars that differed in their association with human cases. In Salmonella Agona, all clinical isolates belonged to a single lineage carrying a highly conserved 7 kb marker that was absent from low-risk strains. This marker demonstrated strong sensitivity and specificity across global datasets and was located within a prophage closely related to the well-characterised Fels-2 phage. This region encodes a DNA invertase previously linked to phase variation, a mechanism that promotes bacterial adaptability. Our findings indicate that infection risk can be structured at the lineage level and influenced by mobile genomic elements, particularly prophages, that enhance environmental persistence and host adaptation. This work advances genomic surveillance from retrospective linkage towards mechanistic and predictive risk assessment, with direct relevance for supporting risk-based decision-making during outbreak investigations.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Microbial Genomics
204 papers in training set
Top 0.1%
33.6%
2
Evolutionary Applications
91 papers in training set
Top 0.1%
8.6%
3
mSystems
361 papers in training set
Top 1%
6.5%
4
International Journal of Food Microbiology
11 papers in training set
Top 0.1%
4.2%
50% of probability mass above
5
Applied and Environmental Microbiology
301 papers in training set
Top 0.8%
3.7%
6
Nature Communications
4913 papers in training set
Top 42%
3.1%
7
PLOS Pathogens
721 papers in training set
Top 5%
2.4%
8
Molecular Ecology
304 papers in training set
Top 2%
2.4%
9
PLOS Genetics
756 papers in training set
Top 7%
2.1%
10
Animal Microbiome
26 papers in training set
Top 0.1%
1.8%
11
Environmental Microbiology
119 papers in training set
Top 2%
1.8%
12
BMC Genomics
328 papers in training set
Top 2%
1.7%
13
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
14
mBio
750 papers in training set
Top 8%
1.5%
15
Scientific Reports
3102 papers in training set
Top 66%
1.2%
16
Microbiology
57 papers in training set
Top 0.8%
1.2%
17
Frontiers in Genetics
197 papers in training set
Top 7%
1.2%
18
Genome Medicine
154 papers in training set
Top 6%
1.1%
19
Environmental Science & Technology
64 papers in training set
Top 2%
1.0%
20
Viruses
318 papers in training set
Top 4%
0.9%
21
PLOS ONE
4510 papers in training set
Top 66%
0.8%
22
mSphere
281 papers in training set
Top 6%
0.8%
23
eLife
5422 papers in training set
Top 57%
0.8%
24
Frontiers in Microbiology
375 papers in training set
Top 9%
0.8%
25
Genomics
60 papers in training set
Top 2%
0.8%
26
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 6%
0.7%
27
Microbiome
139 papers in training set
Top 4%
0.5%
28
The ISME Journal
194 papers in training set
Top 3%
0.5%
29
The Lancet Microbe
43 papers in training set
Top 2%
0.5%
30
npj Biofilms and Microbiomes
56 papers in training set
Top 2%
0.5%