Back

Gene-level complexity explains genome-wide variation in the distribution of fitness effects

Yildirim, B.; James, J.

2026-04-09 evolutionary biology
10.64898/2026.04.08.717178 bioRxiv
Show abstract

The distribution of fitness effects (DFE) -- describing how harmful, neutral, or beneficial new mutations are -- is central to understanding how populations evolve. Although the DFE varies across genomes and species, it remains unclear which aspects of genomic organization drive this variation. Here, we inferred gene-level selective constraints across the genomes of Mus musculus castaneus, Drosophila melanogaster and Saccharomyces cerevisiae using a combination of population genetics and machine learning trained on diverse gene features. Many gene features contributed to selective constraint, with conservation, gene structure, and expression being the most informative. These constraints delineated gene classes with distinct DFEs. Genes with higher connectivity and expression -- features reflecting how many traits a gene influences -- experienced stronger and less dispersed deleterious effects, and the rate of adaptation peaked at intermediate levels of selective constraint. When compared in a Fishers geometric model (FGM) framework, this variation was consistent with predictions based on complexity considered at the gene level rather than at the organism level, whereas between-species comparisons alone were less consistent with FGM. Our results suggest gene-level complexity, captured by genomic feature proxies, better explains DFE variation than organism-level labels and highlight the value of modeling the combined effects of gene features when linking genomic architecture to fitness landscape and patterns of molecular evolution.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Ecology & Evolution
113 papers in training set
Top 0.1%
17.4%
2
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 2%
17.4%
3
Molecular Biology and Evolution
488 papers in training set
Top 0.7%
6.4%
4
Evolution Letters
71 papers in training set
Top 0.4%
6.0%
5
eLife
5422 papers in training set
Top 16%
5.9%
50% of probability mass above
6
Nature Communications
4913 papers in training set
Top 31%
5.9%
7
Science
429 papers in training set
Top 8%
3.9%
8
PLOS Computational Biology
1633 papers in training set
Top 9%
3.9%
9
Evolution
199 papers in training set
Top 0.9%
3.4%
10
Science Advances
1098 papers in training set
Top 8%
3.4%
11
PLOS Genetics
756 papers in training set
Top 5%
3.4%
12
PLOS Biology
408 papers in training set
Top 6%
2.3%
13
Genome Biology and Evolution
280 papers in training set
Top 0.8%
1.9%
14
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 2%
1.9%
15
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 4%
1.7%
16
Molecular Ecology
304 papers in training set
Top 3%
1.6%
17
Cell Systems
167 papers in training set
Top 8%
1.6%
18
The American Journal of Human Genetics
206 papers in training set
Top 3%
1.2%
19
Current Biology
596 papers in training set
Top 12%
1.0%
20
BMC Biology
248 papers in training set
Top 3%
0.9%
21
GENETICS
189 papers in training set
Top 1%
0.8%
22
New Phytologist
309 papers in training set
Top 5%
0.7%
23
The American Naturalist
114 papers in training set
Top 2%
0.7%