
Assembling a fully-dated complete tree of life

Duke, J. D.; Guo, J.; Forest, F.; Gumbs, R.; McTavish, E. J.; Rosindell, J.

2026-03-05 evolutionary biology
10.64898/2026.03.05.709771 bioRxiv

Time-scaled phylogenetic trees summarising evolutionary relationships are fundamental to many analyses in biology, from diversification rate estimation to conservation prioritisation. The most comprehensive available summary of these relationships, the Open Tree of Life, synthesises information from over two thousand studies into a supertree covering the full range of global biodiversity, but its use in downstream analyses is limited by the lack of divergence times. Previous work has mapped dates from Open Tree's database of trees to certain nodes in the supertree, but for the majority of nodes no date is available. While algorithms exist to interpolate missing dates in a tree, we found that their time and memory requirements scaled quadratically with the number of nodes, which made it computationally infeasible to run them on the entire tree. In this work, we describe novel date interpolation algorithms that scale linearly with the number of nodes. These enabled us to produce a distribution of fully-dated trees containing 2.3 million extant described species, greatly expanding the scope of feasible phylogenetic analyses. We illustrate the utility of these trees by computing the most robust estimate yet of the phylogenetic diversity of the complete tree of life, incorporating both topological and temporal uncertainty.
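The abstract does not spell out the interpolation algorithms, so the following is only a minimal sketch of how missing node dates can be filled in with a constant number of passes over the tree, and it should not be read as the authors' method. It assumes a rooted tree given as a hypothetical `parent` mapping, a dated root, undated tips fixed at age 0, and mutually consistent known dates (every dated node is older than its dated descendants). Each undated node is placed between its parent's age and the oldest dated node beneath it, spaced evenly by its height above the nearest dated descendants.

```python
from collections import defaultdict


def interpolate_dates(parent, known_ages):
    """Fill in missing node ages in time linear in the number of nodes.

    parent:     dict mapping each node to its parent (the root maps to None).
    known_ages: dict mapping dated nodes to ages; must include the root.
                Undated tips are assumed to be extant (age 0).
    Returns a dict with an age for every node.
    """
    children = defaultdict(list)
    root = None
    for node, par in parent.items():
        if par is None:
            root = node
        else:
            children[par].append(node)
    if root is None or root not in known_ages:
        raise ValueError("the root must be present and dated")

    # Iterative preorder traversal: every parent precedes its descendants.
    order, stack = [], [root]
    while stack:
        node = stack.pop()
        order.append(node)
        stack.extend(children[node])

    ages = dict(known_ages)
    for node in order:
        if not children[node]:
            ages.setdefault(node, 0.0)  # assumption: undated tips are extant

    # Bottom-up pass: oldest dated descendant (floor) and number of
    # undated levels above the nearest dated descendants (height).
    floor, height = {}, {}
    for node in reversed(order):
        if node in ages:
            floor[node], height[node] = ages[node], 0
        else:
            floor[node] = max(floor[c] for c in children[node])
            height[node] = 1 + max(height[c] for c in children[node])

    # Top-down pass: space each undated node between its parent and its floor.
    for node in order:
        if node not in ages:
            upper = ages[parent[node]]
            h = height[node]
            ages[node] = floor[node] + (upper - floor[node]) * h / (h + 1)
    return ages
```

Both passes visit each node once, so time and memory grow linearly with tree size. The sketch ignores topological and dating uncertainty, which the abstract says the authors' distribution of trees incorporates.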

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

| Rank | Journal | Papers in training set | Percentile | Predicted probability |
|---|---|---|---|---|
| 1 | Systematic Biology | 121 | Top 0.1% | 22.1% |
| 2 | Science | 429 | Top 1% | 14.4% |
| 3 | Proceedings of the National Academy of Sciences | 2130 | Top 9% | 7.0% |
| 4 | Molecular Biology and Evolution | 488 | Top 0.7% | 6.7% |

50% of probability mass above

| Rank | Journal | Papers in training set | Percentile | Predicted probability |
|---|---|---|---|---|
| 5 | Nature Communications | 4913 | Top 30% | 6.2% |
| 6 | Methods in Ecology and Evolution | 160 | Top 0.7% | 4.2% |
| 7 | PLOS Biology | 408 | Top 3% | 3.9% |
| 8 | Nature Ecology & Evolution | 113 | Top 2% | 2.7% |
| 9 | Bioinformatics | 1061 | Top 6% | 2.3% |
| 10 | eLife | 5422 | Top 36% | 2.0% |
| 11 | Current Biology | 596 | Top 8% | 1.9% |
| 12 | Nature | 575 | Top 12% | 1.5% |
| 13 | Science Advances | 1098 | Top 20% | 1.5% |
| 14 | PLOS Computational Biology | 1633 | Top 20% | 1.2% |
| 15 | Proceedings of the Royal Society B: Biological Sciences | 341 | Top 5% | 1.1% |
| 16 | Nature Genetics | 240 | Top 6% | 0.9% |
| 17 | Philosophical Transactions of the Royal Society B | 51 | Top 5% | 0.9% |
| 18 | Scientific Reports | 3102 | Top 72% | 0.9% |
| 19 | Virus Evolution | 140 | Top 1% | 0.9% |
| 20 | Nature Computational Science | 50 | Top 2% | 0.8% |
| 21 | Peer Community Journal | 254 | Top 4% | 0.8% |
| 22 | Genome Biology and Evolution | 280 | Top 2% | 0.7% |
| 23 | Cell | 370 | Top 17% | 0.7% |
| 24 | New Phytologist | 309 | Top 5% | 0.7% |
| 25 | Nature Plants | 84 | Top 2% | 0.6% |
| 26 | PLOS ONE | 4510 | Top 72% | 0.6% |
| 27 | Journal of Computational Biology | 37 | Top 0.8% | 0.6% |
| 28 | PLOS Global Public Health | 293 | Top 6% | 0.6% |
| 29 | BMC Bioinformatics | 383 | Top 8% | 0.6% |