Back

Jaccard Index Network Analysis for pangenome analysis

Penil-Celis, A.; Redondo-Salvo, S.; Tagg, K. A.; Webb, H. E. E.; Garcillan-Barcia, M. P.; de la Cruz, F.

2025-08-24 bioinformatics
10.1101/2025.08.20.669834 bioRxiv
Show abstract

ii.Summary/AbstractJaccard Index Network Analysis (JINA) is a comprehensive workflow designed to explore bacterial genome relationships through an integrated network-based approach. This workflow combines existing tools such as Jaccard Index (1), Gephi (2) and Pangraph (3). By integrating these methodologies into a unified framework, JINA enables efficient visualization and stratification of genomic data, facilitating the identification of meaningful patterns, groups, and associations within bacterial populations. The use of JINA ensures precision in capturing genomic variation including single nucleotide polymorphisms, insertions and deletions. While JINA does not implement Gephi, BLAST, and PanGraph directly in a single software, it guides their coordinated use to analyze and interpret genomic data effectively.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 7%
22.5%
2
Bioinformatics
1061 papers in training set
Top 3%
10.4%
3
Nucleic Acids Research
1128 papers in training set
Top 2%
8.4%
4
Microbiology Resource Announcements
22 papers in training set
Top 0.1%
6.4%
5
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.1%
4.3%
50% of probability mass above
6
BMC Bioinformatics
383 papers in training set
Top 2%
4.0%
7
Scientific Reports
3102 papers in training set
Top 37%
3.6%
8
mSystems
361 papers in training set
Top 3%
2.7%
9
mSphere
281 papers in training set
Top 2%
2.1%
10
PeerJ
261 papers in training set
Top 6%
1.9%
11
F1000Research
79 papers in training set
Top 1%
1.9%
12
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.9%
13
Microbial Genomics
204 papers in training set
Top 1.0%
1.9%
14
Bioinformatics Advances
184 papers in training set
Top 3%
1.8%
15
Database
51 papers in training set
Top 0.4%
1.8%
16
BMC Genomics
328 papers in training set
Top 3%
1.5%
17
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
1.3%
18
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.2%
19
BMC Research Notes
29 papers in training set
Top 0.2%
1.2%
20
Frontiers in Microbiology
375 papers in training set
Top 7%
1.2%
21
Frontiers in Genetics
197 papers in training set
Top 7%
1.1%
22
International Journal of Molecular Sciences
453 papers in training set
Top 14%
0.8%
23
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
24
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.7%
25
Viruses
318 papers in training set
Top 5%
0.7%
26
Access Microbiology
22 papers in training set
Top 0.7%
0.7%
27
Microbiology Spectrum
435 papers in training set
Top 5%
0.7%