Back

On the Comparison of LGT networks and Tree-based Networks

Marchand, B.; Tahiri, N.; Tremblay-Savard, O.; Lafond, M.

2026-04-01 bioinformatics
10.1101/2025.11.20.689557 bioRxiv
Show abstract

Phylogenetic networks are widespread representations of evolutionary histories for taxa that undergo hybridization or Lateral-Gene Transfer (LGT) events. There are now many tools to reconstruct such networks, but no clearly established metric to compare them. Such metrics are needed, for example, to evaluate predictions against a simulated ground truth. Despite years of effort in developing metrics, known dissimilarity measures either do not distinguish all pairs of different networks, or are extremely difficult to compute. Since it appears challenging, if not impossible, to create the ideal metric for all classes of networks, it may be relevant to design them for specialized applications. In this article, we introduce a metric on LGT networks, which consist of trees with additional arcs that represent lateral gene transfer events. Our metric is based on edit operations, namely the addition/removal of transfer arcs, and the contraction/expansion of arcs of the base tree, allowing it to connect the space of all LGT networks. We show that it is linear-time computable if the order of transfers along a branch is unconstrained but NP-hard otherwise, in which case we provide a fixed-parameter tractable (FPT) algorithm in the level. We implemented our algorithms and demonstrate their applicability on three numerical experiments. Full online versionhttps://www.biorxiv.org/content/10.1101/2025.11.20.689557

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1%
23.1%
2
Systematic Biology
121 papers in training set
Top 0.1%
10.3%
3
Molecular Biology and Evolution
488 papers in training set
Top 0.5%
8.6%
4
Methods in Ecology and Evolution
160 papers in training set
Top 0.5%
7.0%
5
Nature Communications
4913 papers in training set
Top 32%
5.0%
50% of probability mass above
6
Peer Community Journal
254 papers in training set
Top 0.7%
4.1%
7
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
8
BMC Bioinformatics
383 papers in training set
Top 3%
3.7%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 19%
3.7%
10
Genome Research
409 papers in training set
Top 1%
3.1%
11
Journal of Computational Biology
37 papers in training set
Top 0.1%
2.7%
12
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
13
Ecology Letters
121 papers in training set
Top 0.8%
1.5%
14
Nature Methods
336 papers in training set
Top 4%
1.5%
15
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.4%
16
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.4%
17
Scientific Reports
3102 papers in training set
Top 70%
0.9%
18
Cell Systems
167 papers in training set
Top 10%
0.9%
19
Communications Biology
886 papers in training set
Top 18%
0.9%
20
Nature Biotechnology
147 papers in training set
Top 7%
0.8%
21
Algorithms for Molecular Biology
15 papers in training set
Top 0.1%
0.8%
22
Molecular Ecology Resources
161 papers in training set
Top 1%
0.7%
23
iScience
1063 papers in training set
Top 40%
0.5%
24
Genome Biology
555 papers in training set
Top 9%
0.5%
25
Nature Computational Science
50 papers in training set
Top 2%
0.5%
26
PLOS ONE
4510 papers in training set
Top 73%
0.5%
27
BMC Ecology and Evolution
49 papers in training set
Top 2%
0.5%