
Terraces in Gene Tree Reconciliation-Based Species Tree Inference

Sanderson, M.; McMahon, M. M.; Steel, M.

2020-04-18 evolutionary biology
10.1101/2020.04.17.047092 bioRxiv

Abstract

Terraces in phylogenetic tree space are sets of trees with identical optimality scores for a given data set, arising from missing data. These were first described for multilocus phylogenetic data sets in the context of maximum parsimony inference and maximum likelihood inference under certain model assumptions. Here we show how the mathematical properties that lead to terraces extend to gene tree - species tree problems in which the gene trees are incomplete. Inference of species trees from either sets of gene family trees subject to duplication and loss, or allele trees subject to incomplete lineage sorting, can exhibit terraces in their solution space. First, we show conditions that lead to a new kind of terrace, which stems from subtree operations that appear in reconciliation problems for incomplete trees. Then we characterize when terraces of both types can occur when the optimality criterion for tree search is based on duplication, loss or deep coalescence scores. Finally, we examine the impact of assumptions about the causes of losses: whether they are due to imperfect sampling or true evolutionary deletion.
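To make the notion of a terrace concrete, the following is a minimal sketch and not the authors' method. It illustrates only the original multilocus idea mentioned in the abstract: two different trees can induce the same subtree on every locus's sampled taxa, so any score computed purely from those induced subtrees cannot tell them apart. The rooted nested-tuple encoding, the taxon names A to E, the locus coverage sets, and the function names `restrict`, `clades`, and `same_induced_subtrees` are all assumptions made for illustration.

```python
def restrict(tree, keep):
    """Restrict a rooted tree (nested tuples of leaf names) to the leaves in
    `keep`, suppressing nodes left with a single child. Returns None if no
    leaf of the tree is in `keep`."""
    if isinstance(tree, str):
        return tree if tree in keep else None
    kept = [sub for sub in (restrict(child, keep) for child in tree)
            if sub is not None]
    if not kept:
        return None
    if len(kept) == 1:
        return kept[0]
    return tuple(kept)


def clades(tree):
    """Canonical form of a rooted tree: the set of leaf sets, one per node."""
    found = set()

    def walk(node):
        if isinstance(node, str):
            leaves = frozenset([node])
        else:
            leaves = frozenset().union(*(walk(child) for child in node))
        found.add(leaves)
        return leaves

    if tree is not None:
        walk(tree)
    return frozenset(found)


def same_induced_subtrees(t1, t2, loci):
    """True if t1 and t2 induce identical subtrees on every locus taxon set."""
    return all(clades(restrict(t1, taxa)) == clades(restrict(t2, taxa))
               for taxa in loci)


# Hypothetical data: taxon E is missing from locus 1; C and D from locus 2.
loci = [{"A", "B", "C", "D"}, {"A", "B", "E"}]

T1 = ((("A", "B"), "E"), ("C", "D"))
T2 = (("A", "B"), (("C", "D"), "E"))
T3 = (("A", "E"), ("B", ("C", "D")))

print(same_induced_subtrees(T1, T2, loci))
print(same_induced_subtrees(T1, T3, loci))
```

In this toy case T1 and T2 differ only in where E attaches, and no single locus samples E together with C or D, so the two trees agree on every locus. T3 disagrees with T1 on both loci. The reconciliation-based terraces described in the abstract involve further conditions that this sketch does not model.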

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1. Systematic Biology: 121 papers in training set, Top 0.1%, 39.0%
2. Molecular Biology and Evolution: 488 papers in training set, Top 0.7%, 6.8%
3. PLOS Computational Biology: 1633 papers in training set, Top 5%, 6.8%

50% of probability mass above

4. Proceedings of the National Academy of Sciences: 2130 papers in training set, Top 12%, 6.3%
5. Bulletin of Mathematical Biology: 84 papers in training set, Top 0.4%, 4.3%
6. Nature Communications: 4913 papers in training set, Top 40%, 3.6%
7. Genetics: 225 papers in training set, Top 1%, 3.2%
8. Peer Community Journal: 254 papers in training set, Top 1%, 2.7%
9. Science: 429 papers in training set, Top 13%, 1.9%
10. Journal of Computational Biology: 37 papers in training set, Top 0.2%, 1.7%
11. Theoretical Population Biology: 47 papers in training set, Top 0.1%, 1.7%
12. Evolution: 199 papers in training set, Top 1%, 1.5%
13. GENETICS: 189 papers in training set, Top 0.7%, 1.5%
14. PLOS ONE: 4510 papers in training set, Top 59%, 1.3%
15. eLife: 5422 papers in training set, Top 49%, 1.2%
16. Journal of Mathematical Biology: 37 papers in training set, Top 0.2%, 1.2%
17. Bioinformatics: 1061 papers in training set, Top 9%, 0.8%
18. Genome Research: 409 papers in training set, Top 4%, 0.7%
19. Communications Biology: 886 papers in training set, Top 27%, 0.7%
20. PLOS Genetics: 756 papers in training set, Top 16%, 0.7%
21. Virus Evolution: 140 papers in training set, Top 1%, 0.7%
22. Scientific Reports: 3102 papers in training set, Top 78%, 0.6%
23. Journal of Theoretical Biology: 144 papers in training set, Top 2%, 0.6%