Comparing Methods for Species Tree Estimation With Gene Duplication and Loss

Willson, J.; Roddur, M.; Warnow, T.

2021-02-07 evolutionary biology
10.1101/2021.02.05.429947 bioRxiv

Species tree inference from gene trees is an important part of biological research. One confounding factor in estimating species trees is gene duplication and loss, which can lead to gene trees with multiple copies of the same gene. In recent years, several new methods have been developed to address this problem, and they substantially improve on earlier methods; however, the best-performing methods (ASTRAL-Pro, ASTRID-multi, and FastMulRFS) have not yet been directly compared. In this study, we compare ASTRAL-Pro, ASTRID-multi, and FastMulRFS under a wide variety of conditions. Our study shows that while all three have very good accuracy, and nearly the same accuracy under many conditions, ASTRAL-Pro and ASTRID-multi are more reliably accurate than FastMulRFS, and ASTRID-multi is often faster than ASTRAL-Pro. The datasets generated for this study are freely available in the Illinois Data Bank at https://databank.illinois.edu/datasets/IDB-2418574

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1. Bioinformatics; 1061 papers in training set; Top 2%; 13.9%
2. Journal of Computational Biology; 37 papers in training set; Top 0.1%; 9.8%
3. Systematic Biology; 121 papers in training set; Top 0.1%; 8.1%
4. PLOS ONE; 4510 papers in training set; Top 23%; 8.0%
5. PLOS Computational Biology; 1633 papers in training set; Top 6%; 6.2%
6. Bioinformatics Advances; 184 papers in training set; Top 0.6%; 6.1%
50% of probability mass above
7. Genome Research; 409 papers in training set; Top 0.6%; 4.7%
8. Molecular Biology and Evolution; 488 papers in training set; Top 1%; 3.6%
9. PeerJ; 261 papers in training set; Top 3%; 3.0%
10. Scientific Reports; 3102 papers in training set; Top 45%; 2.6%
11. Methods in Ecology and Evolution; 160 papers in training set; Top 1%; 2.3%
12. BMC Bioinformatics; 383 papers in training set; Top 4%; 1.8%
13. PLOS Genetics; 756 papers in training set; Top 8%; 1.8%
14. BMC Ecology and Evolution; 49 papers in training set; Top 0.9%; 1.7%
15. BMC Genomics; 328 papers in training set; Top 3%; 1.6%
16. Applications in Plant Sciences; 21 papers in training set; Top 0.2%; 1.6%
17. Genetics; 225 papers in training set; Top 2%; 1.6%
18. Developmental Biology; 134 papers in training set; Top 2%; 1.3%
19. Journal of Molecular Evolution; 21 papers in training set; Top 0.3%; 0.9%
20. Bulletin of Mathematical Biology; 84 papers in training set; Top 2%; 0.9%
21. Proceedings of the National Academy of Sciences; 2130 papers in training set; Top 42%; 0.9%
22. NAR Genomics and Bioinformatics; 214 papers in training set; Top 4%; 0.7%
23. Genome Biology and Evolution; 280 papers in training set; Top 2%; 0.7%
24. Journal of Theoretical Biology; 144 papers in training set; Top 2%; 0.7%
25. iScience; 1063 papers in training set; Top 36%; 0.7%
26. Journal of Systematics and Evolution; 11 papers in training set; Top 0.3%; 0.7%
27. Ecology and Evolution; 232 papers in training set; Top 5%; 0.6%
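
The statement that the top 6 journals account for 50% of the predicted probability mass can be checked with a running total over the listed percentages. The following is a minimal sketch, not part of the original page: the helper name journals_to_reach is hypothetical, and only the first eight probabilities are copied in from the list above.

```python
from itertools import accumulate

# Predicted probabilities (percent) for the eight top-ranked journals,
# copied from the list above. Lower-ranked entries are omitted for brevity.
top_probs = [13.9, 9.8, 8.1, 8.0, 6.2, 6.1, 4.7, 3.6]


def journals_to_reach(probs, threshold):
    """Return how many top-ranked entries are needed for the running
    total to reach the threshold, or None if it is never reached.

    Hypothetical helper written for this illustration only.
    """
    for count, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return count
    return None


print(journals_to_reach(top_probs, 50.0))  # prints 6
```

The first five entries sum to roughly 46%, which is below the threshold, and adding the sixth brings the total to roughly 52%, so the marker after rank 6 is consistent with the listed values.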