Back

On the consistency of duplication, loss, and deep coalescence gene tree parsimony costs under the multispecies coalescent

Sapoval, N.; Nakhleh, L.

2026-02-20 bioinformatics
10.64898/2026.02.20.707019 bioRxiv
Show abstract

Gene tree parsimony (GTP) is a common approach for efficient reconciliation of multiple discordant gene tree phylogenies for the inference of a single species tree. However, despite the popularity of GTP methods due to their low computational costs, prior work has shown that some commonly employed parsimony costs are statistically inconsistent under the multispecies coalescent process. Furthermore, a fine-grained analysis of the inconsistency has indicated potentially complimentary behavior of duplication and deep coalescence costs for symmetric and asymmetric species trees. In this work, we prove inconsistency of GTP estimators for all linear combinations of duplication, loss and deep coalescence scores. We also explore empirical implications of this result evaluating inference results of several GTP cost schemes under varying levels of incomplete lineage sorting.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 0.5%
23.6%
2
Systematic Biology
121 papers in training set
Top 0.1%
15.4%
3
Bioinformatics
1061 papers in training set
Top 3%
10.6%
4
Journal of Computational Biology
37 papers in training set
Top 0.1%
6.7%
50% of probability mass above
5
BMC Bioinformatics
383 papers in training set
Top 2%
5.1%
6
Journal of Theoretical Biology
144 papers in training set
Top 0.2%
4.5%
7
Scientific Reports
3102 papers in training set
Top 47%
2.5%
8
Bulletin of Mathematical Biology
84 papers in training set
Top 1%
1.8%
9
Genetics
225 papers in training set
Top 2%
1.8%
10
PLOS ONE
4510 papers in training set
Top 52%
1.8%
11
PeerJ
261 papers in training set
Top 8%
1.6%
12
Theoretical Population Biology
47 papers in training set
Top 0.1%
1.4%
13
PLOS Genetics
756 papers in training set
Top 12%
1.2%
14
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.0%
15
Methods in Ecology and Evolution
160 papers in training set
Top 2%
0.9%
16
Biostatistics
21 papers in training set
Top 0.1%
0.9%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 42%
0.8%
18
Genome Research
409 papers in training set
Top 4%
0.8%
19
Biometrics
22 papers in training set
Top 0.1%
0.8%
20
Peer Community Journal
254 papers in training set
Top 3%
0.8%
21
BMC Ecology and Evolution
49 papers in training set
Top 2%
0.7%
22
Bioinformatics Advances
184 papers in training set
Top 5%
0.7%
23
BMC Genomics
328 papers in training set
Top 6%
0.7%
24
Molecular Biology and Evolution
488 papers in training set
Top 5%
0.5%
25
Nature Communications
4913 papers in training set
Top 66%
0.5%