How many characters are needed to reconstruct a phylogeny?

Capobianco, A.

2025-09-28 evolutionary biology
10.1101/2025.09.26.678777 bioRxiv

Despite increased recent attention towards Bayesian phylogenetics and its applications in understanding macroevolutionary processes, it remains unclear how many discrete characters are needed to accurately estimate tree topologies in a Bayesian framework. This could be particularly relevant for morphological datasets used in phylogenetics, as they usually consist of a few dozen to a few hundred characters, orders of magnitude fewer than most molecular datasets. I designed a simulation study in the software RevBayes to explore how the number of sampled discrete characters affects the accuracy and precision of Bayesian phylogenetic estimates, under various setups differing in number of taxa, average number of state changes per character (i.e., tree length), and number of states per character. Results indicate that between 100 and 500 variable characters are necessary to reach sufficient accuracy and precision of phylogenetic estimates for trees with as few as 20 tips. All other parameters being equal, multistate characters produce slightly more accurate estimates than binary characters, and more labile characters produce more accurate estimates for trees with more than 50 tips. The results of this study highlight the continuing need for global research efforts geared towards the characterization and digitization of interspecific morphological diversity in both extant and extinct taxa.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1. Molecular Phylogenetics and Evolution | 61 papers in training set | Top 0.1% | 18.0%
2. Systematic Biology | 121 papers in training set | Top 0.1% | 16.9%
3. BMC Ecology and Evolution | 49 papers in training set | Top 0.1% | 9.7%
4. PeerJ | 261 papers in training set | Top 0.3% | 7.9%

50% of probability mass above

5. PLOS ONE | 4510 papers in training set | Top 35% | 4.2%
6. Methods in Ecology and Evolution | 160 papers in training set | Top 0.7% | 4.2%
7. Systematic Entomology | 11 papers in training set | Top 0.1% | 4.2%
8. Scientific Reports | 3102 papers in training set | Top 52% | 2.0%
9. Journal of Systematics and Evolution | 11 papers in training set | Top 0.1% | 2.0%
10. Ecology and Evolution | 232 papers in training set | Top 3% | 1.6%
11. Molecular Ecology Resources | 161 papers in training set | Top 0.6% | 1.6%
12. Frontiers in Ecology and Evolution | 60 papers in training set | Top 2% | 1.4%
13. Peer Community Journal | 254 papers in training set | Top 2% | 1.3%
14. PLOS Biology | 408 papers in training set | Top 14% | 1.2%
15. Philosophical Transactions of the Royal Society B | 51 papers in training set | Top 4% | 1.2%
16. Infection, Genetics and Evolution | 43 papers in training set | Top 0.6% | 1.2%
17. Ecological Informatics | 29 papers in training set | Top 0.6% | 0.9%
18. eLife | 5422 papers in training set | Top 54% | 0.9%
19. Royal Society Open Science | 193 papers in training set | Top 4% | 0.9%
20. Journal of Evolutionary Biology | 98 papers in training set | Top 1% | 0.7%
21. Journal of Computational Biology | 37 papers in training set | Top 0.7% | 0.7%
22. Zoological Journal of the Linnean Society | 14 papers in training set | Top 0.3% | 0.6%
23. Molecular Biology and Evolution | 488 papers in training set | Top 5% | 0.6%
24. Journal of Molecular Evolution | 21 papers in training set | Top 0.5% | 0.6%
25. New Phytologist | 309 papers in training set | Top 5% | 0.6%