Back

Phenotypic diversity of yeasts curated in the 5th edition of The Yeasts: A data-driven visualization approach

Seike, T.; Ide, M.; Yamamoto, M.; Yurimoto, H.; Shiraishi, K.

2026-02-16 microbiology
10.64898/2026.02.15.706042 bioRxiv
Show abstract

Yeasts have long served as key experimental systems in genetics, cell biology, and fermentation research; however, these studies have largely focused on a limited number of model species. In contrast, yeasts are a fungal group with remarkable physiological and ecological diversity. To provide a global overview of yeast diversity, we compiled and visualized phenotypic information for approximately 1,300 yeast species documented in The Yeasts (5th edition), including carbon utilization profiles, fermentation capacity, growth temperature ranges, and reported isolation sources. Taxonomic reconciliation revealed extensive reannotation, with approximately 44% of the species undergoing name changes and recognized genera increasing from 143 to 233. Integrative analyses revealed pronounced phylogenetic structuring of metabolic breadth. Many basidiomycetous yeasts, particularly Agaricomycotina, exhibited broader, generalist-like carbon utilization, whereas ascomycetous yeasts, especially Saccharomycotina, more frequently displayed sugar-centered, specialist-like narrower profiles; model yeasts such as Saccharomyces cerevisiae and Schizosaccharomyces pombe fell at the narrow end of this spectrum. Fewer than half of all species fermented glucose, a trait largely confined to Saccharomycotina. In addition, nearly one-fifth of species failed to grow at 30 {degrees}C, the standard laboratory temperature. By reconstructing and visualizing decades of dispersed taxonomic knowledge accumulated in The Yeasts, this study reframes yeasts not merely as laboratory model organisms but as metabolically diverse fungi whose phenotypic diversity reflects diverse ecological contexts. The analytical framework presented here provides a foundation for integrating standardized quantitative phenotypes and newly described species and offers a starting point for exploring the latent ecological and metabolic potential of yeast diversity.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
mSphere
281 papers in training set
Top 0.4%
8.2%
2
mBio
750 papers in training set
Top 2%
8.0%
3
PLOS Biology
408 papers in training set
Top 0.8%
6.6%
4
eLife
5422 papers in training set
Top 15%
6.2%
5
mSystems
361 papers in training set
Top 2%
6.2%
6
Frontiers in Microbiology
375 papers in training set
Top 1%
6.1%
7
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
4.4%
8
Nucleic Acids Research
1128 papers in training set
Top 5%
4.2%
9
PLOS Computational Biology
1633 papers in training set
Top 10%
3.5%
50% of probability mass above
10
Nature Communications
4913 papers in training set
Top 41%
3.5%
11
Yeast
15 papers in training set
Top 0.1%
2.8%
12
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 25%
2.5%
13
ISME Communications
103 papers in training set
Top 0.9%
2.0%
14
Scientific Reports
3102 papers in training set
Top 55%
1.8%
15
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 3%
1.7%
16
Microbiome
139 papers in training set
Top 2%
1.6%
17
Current Biology
596 papers in training set
Top 10%
1.6%
18
PLOS Genetics
756 papers in training set
Top 10%
1.4%
19
Nature Microbiology
133 papers in training set
Top 3%
1.3%
20
Molecular Systems Biology
142 papers in training set
Top 0.9%
1.3%
21
Advanced Science
249 papers in training set
Top 15%
1.2%
22
Cell Systems
167 papers in training set
Top 10%
0.9%
23
BMC Biology
248 papers in training set
Top 3%
0.9%
24
Communications Biology
886 papers in training set
Top 20%
0.9%
25
Microbiological Research
19 papers in training set
Top 0.6%
0.9%
26
Microbiology Spectrum
435 papers in training set
Top 5%
0.8%
27
Environmental Microbiology
119 papers in training set
Top 3%
0.8%
28
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.8%
29
PLOS ONE
4510 papers in training set
Top 69%
0.7%
30
Cell Host & Microbe
113 papers in training set
Top 5%
0.7%