Back

Synolog: A Scalable Synteny-Based Framework for Genome Architecture Characterization

Madrigal, G.; Catchen, J. M.

2026-04-10 bioinformatics
10.64898/2026.04.07.717040 bioRxiv
Show abstract

Detailing the genomic architecture across multiple organisms has been a task performed for decades. The continuing growth of genomic datasets not only serves as a resource for studying genome evolution but warrants the availability of scalable and user-friendly software for processing these datasets. Here, we present Synolog, a bioinformatic toolkit that can automatically identify orthologs for both protein-coding and non-coding genes, synteny clusters across two or more genomes, as well as retrogenes, and segmental duplications. Applying Synolog, we illustrate cases of local gene expansions in ecologically disparate turtle species, identify synteny clusters across hundreds of millions of years of metazoan evolution, and reconstruct chromosome-level assemblies in teleosts using the inferred synteny clusters; all using its integrated visual features. In parallel, we compare our orthogroup method to that of commonly used software and note the tradeoffs of making inferences solely based on sequence similarity versus a synteny-based approach.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1%
19.5%
2
Molecular Biology and Evolution
488 papers in training set
Top 0.3%
10.4%
3
BMC Bioinformatics
383 papers in training set
Top 0.9%
10.1%
4
Bioinformatics Advances
184 papers in training set
Top 0.3%
8.2%
5
Genome Research
409 papers in training set
Top 0.5%
4.9%
50% of probability mass above
6
Genome Biology
555 papers in training set
Top 1%
4.9%
7
PLOS Computational Biology
1633 papers in training set
Top 7%
4.9%
8
Nature Biotechnology
147 papers in training set
Top 2%
3.6%
9
Nucleic Acids Research
1128 papers in training set
Top 6%
3.6%
10
Nature Communications
4913 papers in training set
Top 42%
3.1%
11
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.7%
12
Nature Methods
336 papers in training set
Top 4%
1.9%
13
Methods in Ecology and Evolution
160 papers in training set
Top 1%
1.8%
14
PLOS ONE
4510 papers in training set
Top 54%
1.7%
15
Genome Biology and Evolution
280 papers in training set
Top 1%
1.7%
16
Genetics
225 papers in training set
Top 4%
0.8%
17
GigaScience
172 papers in training set
Top 3%
0.8%
18
Cell Systems
167 papers in training set
Top 11%
0.8%
19
Molecular Ecology Resources
161 papers in training set
Top 1%
0.7%
20
BMC Genomics
328 papers in training set
Top 6%
0.7%
21
Nature Genetics
240 papers in training set
Top 8%
0.6%
22
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 47%
0.6%