Back

Accessory genes define species-specific pathways to antibiotic resistance.

Dillon, L.; Dimonaco, N. J.; Creevey, C. J.

2023-08-14 bioinformatics
10.1101/2023.08.09.552647 bioRxiv
Show abstract

BackgroundThe rise of antimicrobial resistance (AMR) is a growing concern globally and a deeper understanding of AMR gene carriage vs usage is vital for future responses to reduce the spread of AMR. Identification of AMR phenotype by laboratory-based assays are often hindered by difficulties in establishing cultures. This issue could be resolved by rapid computational assessment of an organisms genome, however, AMR gene finder tools are not intended to infer AMR phenotype which is likely to be a product of multiple gene interactions. MethodsTo understand the importance of multi-gene interactions to the relationship between AMR genotype and AMR phenotype, we applied machine learning approaches to 16,950 genomes from microbial isolates representing 28 different genera with 1.2 million corresponding laboratory-determined MICs for 23 different antibiotics. We then elucidated the genomic paths to phenotypic antimicrobial resistance with the aim of allowing for the development of rapid determination of AMR phenotype from genomes or even whole microbiomes. ResultsThe application of machine learning models resulted in a >1.5-fold increase in average prediction accuracy of AMR phenotype across the 23 antibiotic models. Interpretation of these models revealed 528 distinct genomic pathways to phenotypic resistance, many of which were species-specific and involved genes which have not previously been associated with AMR phenotype. This is the first study to demonstrate the utility of machine learning models in the prediction of AMR phenotype for a wide range of clinically relevant organisms and antibiotics. This could be applied as a rapid and affordable alternative to culture-based techniques, estimating taxonomy in addition to AMR phenotype, and providing real-time monitoring of multi-drug resistant pathogens. Availability and implementationO_ST_ABSContactC_ST_ABSldillon05@qub.ac.uk View supplementary information at this linkhttps://osf.io/cj4bq/?view_only=c0ee87b7609543b688953089be4c376f See Code Availability for scripts used.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Microbial Genomics
204 papers in training set
Top 0.1%
22.3%
2
The Lancet Microbe
43 papers in training set
Top 0.1%
17.3%
3
Nature Communications
4913 papers in training set
Top 25%
7.1%
4
Genome Medicine
154 papers in training set
Top 0.8%
6.7%
50% of probability mass above
5
Bioinformatics
1061 papers in training set
Top 4%
6.3%
6
PLOS Computational Biology
1633 papers in training set
Top 9%
3.6%
7
mSystems
361 papers in training set
Top 4%
2.4%
8
mSphere
281 papers in training set
Top 2%
2.3%
9
Nucleic Acids Research
1128 papers in training set
Top 10%
1.9%
10
Journal of Clinical Microbiology
120 papers in training set
Top 1%
1.7%
11
Scientific Reports
3102 papers in training set
Top 59%
1.7%
12
Nature Microbiology
133 papers in training set
Top 3%
1.5%
13
Wellcome Open Research
57 papers in training set
Top 1%
1.5%
14
The Journal of Infectious Diseases
182 papers in training set
Top 3%
1.3%
15
BMC Bioinformatics
383 papers in training set
Top 5%
1.3%
16
PLOS ONE
4510 papers in training set
Top 64%
0.9%
17
F1000Research
79 papers in training set
Top 4%
0.8%
18
eLife
5422 papers in training set
Top 58%
0.7%
19
PeerJ
261 papers in training set
Top 15%
0.7%
20
Microbiome
139 papers in training set
Top 3%
0.7%
21
mBio
750 papers in training set
Top 12%
0.7%
22
JAC-Antimicrobial Resistance
13 papers in training set
Top 0.5%
0.7%
23
Briefings in Bioinformatics
326 papers in training set
Top 8%
0.6%
24
BMC Microbiology
35 papers in training set
Top 2%
0.6%
25
PLOS Biology
408 papers in training set
Top 23%
0.6%