Back

TattleTail: A Pyocin Prediction Tool

Pais, R. G.; Chen, W.; Leptihn, S.; Hua, X.; Loh, B.

2026-03-27 bioinformatics
10.64898/2026.03.25.712926 bioRxiv
Show abstract

Tailocins are phage tail-like bacteriocins (PTLBs) thought to be remnants of prophages that have lost the ability to package viral genomes while retaining the ability to kill closely-related bacterial strains, thereby mediating bacterial competition. Tailocins produced by Pseudomonas aeruginosa are referred to as pyocins. Apart from their contribution to ecological fitness, they also have the potential to be harnessed as highly-specific antimicrobials to treat antibiotic resistant bacterial infections. Although pyocins lack the genetic components to package viral genomes, pyocin-encoding gene clusters share a high degree of genetic homology to phage tail genes, attributed to their shared ancestry. This poses a significant annotation-based challenge, as current prophage prediction tools, which rely on phage homology for prediction, can misclassify pyocins or tailocins as prophages. Pyocins unknowingly being misannotated as prophages is not only a bioinformatic issue, but can certainly confound experiments examining bacterial competition and prophage induction, if the experimental setup is based on this unintentional misannotation. In this study, we present "TattleTail", the first version of a bioinformatic tool designed to accurately identify tailocins in genome sequences, with a focus on identifying phage tail-derived pyocin-encoding gene clusters in P. aeruginosa in its first iteration. The tool leverages conserved pyocin gene cluster markers and accounts for the absence of canonical phage features, such as capsid, terminase and integrase genes, thereby distinguishing pyocins from intact and cryptic prophages. Validation in P. aeruginosa and non-P. aeruginosa genomes confirmed the presence of pyocin regions in all P. aeruginosa genomes, while none were detected in any non-P. aeruginosa genomes. Notably, TattleTail enabled the identification of representative pyocin-encoding gene clusters in clinical P. aeruginosa isolates. The identified pyocins in the clinical isolates were induced using mitomycin C, visualized via transmission electron microscopy, processed via tangential flow filtration and demonstrated bactericidal activity, thereby confirming TattleTail predictions. TattleTail aims to complement existing prophage prediction tools during genomic analyses involving phage-derived elements in bacterial genomes, allowing more accurate identification of these elements facilitated by robust discrimination between prophages and tailocins.

Matching journals

The top 11 journals account for 50% of the predicted probability mass.

1
BMC Genomics
328 papers in training set
Top 0.2%
8.4%
2
Microbiome
139 papers in training set
Top 0.5%
6.3%
3
Viruses
318 papers in training set
Top 1%
4.8%
4
Bioinformatics
1061 papers in training set
Top 4%
4.8%
5
BMC Bioinformatics
383 papers in training set
Top 2%
4.8%
6
Frontiers in Microbiology
375 papers in training set
Top 2%
4.3%
7
PLOS ONE
4510 papers in training set
Top 34%
4.3%
8
mSystems
361 papers in training set
Top 2%
4.2%
9
Scientific Reports
3102 papers in training set
Top 31%
3.9%
10
Microbial Genomics
204 papers in training set
Top 0.6%
3.6%
11
Nucleic Acids Research
1128 papers in training set
Top 7%
2.9%
50% of probability mass above
12
Antibiotics
32 papers in training set
Top 0.5%
2.6%
13
PLOS Computational Biology
1633 papers in training set
Top 13%
2.1%
14
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.1%
15
ACS Synthetic Biology
256 papers in training set
Top 1%
2.1%
16
Microbiology Spectrum
435 papers in training set
Top 2%
1.9%
17
PLOS Pathogens
721 papers in training set
Top 6%
1.7%
18
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 3%
1.7%
19
Nature Communications
4913 papers in training set
Top 52%
1.7%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.7%
21
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.7%
22
Microbiological Research
19 papers in training set
Top 0.3%
1.5%
23
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.3%
24
Genome Biology
555 papers in training set
Top 6%
1.2%
25
Advanced Science
249 papers in training set
Top 15%
0.9%
26
Genome Medicine
154 papers in training set
Top 7%
0.9%
27
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
28
Microorganisms
101 papers in training set
Top 2%
0.8%
29
Genes
126 papers in training set
Top 3%
0.7%
30
BMC Biology
248 papers in training set
Top 4%
0.7%