Back

Trait evolution with incomplete lineage sorting and gene flow: the Gaussian Coalescent model

Ane, C.; Bastide, P.

2026-03-11 evolutionary biology
10.64898/2026.03.10.710880 bioRxiv
Show abstract

Most phylogenetic comparative methods use a species-level phylogeny, ignoring the effect of incomplete lineage sorting (ILS) and hemiplasy on the traits of interest. We consider here a trait controlled additively by one or more unknown loci. Their gene trees may differ from the species phylogeny due to ILS, as modeled by the coalescent process. If the species phylogeny is a network, this process also accounts for gene flow, admixture or hybridization. Our model allows for polymorphism in the ancestral population at the root of the species phylogeny, and predicts heritable within-population variation due to ILS. Even if each locus evolves according to a Brownian motion, the joint distribution of all trait measurements is not generally Gaussian due to ILS. We provide a Gaussian approximation, named the Gaussian Coalescent, and show how to compute its variance matrix efficiently using a single traversal of the species phylogeny. In simulations, this model is much more accurate than the model ignoring ILS. In simulations and on a data set of tomato floral traits, it is favored over the standard Brownian motion model with extra within-population variance. The GC model opens new avenues for various phylogenetic comparative methods, accounting for hemiplasy and gene flow simultaneously. It is implemented in phylolm v2.7.0 and in PhyloTraits v1.2.0.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Systematic Biology
121 papers in training set
Top 0.1%
21.5%
2
Molecular Biology and Evolution
488 papers in training set
Top 0.2%
16.7%
3
Methods in Ecology and Evolution
160 papers in training set
Top 0.4%
9.6%
4
Bioinformatics
1061 papers in training set
Top 3%
9.6%
50% of probability mass above
5
Genetics
225 papers in training set
Top 1%
4.1%
6
PLOS Computational Biology
1633 papers in training set
Top 9%
3.8%
7
Genome Research
409 papers in training set
Top 1%
3.5%
8
Peer Community Journal
254 papers in training set
Top 1.0%
3.4%
9
Molecular Ecology Resources
161 papers in training set
Top 0.4%
2.6%
10
Virus Evolution
140 papers in training set
Top 0.5%
2.6%
11
PLOS ONE
4510 papers in training set
Top 53%
1.7%
12
PLOS Genetics
756 papers in training set
Top 9%
1.6%
13
Genome Biology and Evolution
280 papers in training set
Top 1%
1.4%
14
Nature Communications
4913 papers in training set
Top 55%
1.3%
15
Journal of Computational Biology
37 papers in training set
Top 0.3%
1.3%
16
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 39%
1.2%
17
GENETICS
189 papers in training set
Top 1%
0.9%
18
Nature Genetics
240 papers in training set
Top 7%
0.8%
19
BMC Ecology and Evolution
49 papers in training set
Top 2%
0.7%
20
BMC Genomics
328 papers in training set
Top 6%
0.7%
21
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
22
Genome Biology
555 papers in training set
Top 8%
0.7%