Back

Deciphering the genetic basis of phytoplankton traits through genome-wide association studies

Maupetit, A.; Segura, V.; Pajot, A.; Nicolau, E.; Bougaran, G.; Lacour, T.; Berard, J. B.; Charrier, A.; Schreiber, N.; Robert, E.; Saint-Jean, B.; Carrier, G.

2026-02-27 genetics
10.64898/2026.02.27.708454 bioRxiv
Show abstract

Recently, an inventory of genes in phytoplankton was conducted through expeditions such as TARA Oceans. Approximately 1.5 million genes were identified, of which at least three-quarters have unknown function. Presently, a several research programmes are engaged in the sequencing of marine biodiversity, resulting in a rapid expansion of genomic databases. Access to the genomic sequences of these organisms will soon be readily accessible to the scientific community. Although analysing this data is promising, the characterization of genes or genomes, on the other hand, is progressing very slowly and remains a major challenge for scientists. The aim of this study was to use GWAS approaches to decipher genomic loci without a priori assumptions. The microalga Tisochrysis lutea was selected as a case study due to its economic importance and the extensive knowledge accumulated over the years. Particular attention was paid to pigment and lipid metabolism due to their high commercial value. To implement the GWAS approach, a collection of algal lineages was established (100 lineages) from available polyclonal strains (15 strains). This collection was then phenotyped under two different culture conditions. Of the 31 phenotypic traits investigated, 18 met the requirements for GWAS analysis. Concurrently, each algal lineage was genotyped by whole genome sequencing to inventory all genetic polymorphisms. A mixed model was applied, revealing 13 significant associations between phenotypic traits and alleles. These associations highlight previously unsuspected genomic loci that play a major role in pigment or lipid content. Genes identified at these loci may have a direct or indirect role in these metabolic pathways. Nevertheless, elucidating the molecular mechanisms of the associated genes remains limited without the implementation of functional approaches. Despite the complexity of the process, we conclude that the GWAS approach was effective for deciphering phytoplankton genomes, particularly for quantitative traits of interest. Ideally, this approach should be combined with other functional methods to progressively decode marine genomes.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Aquaculture
29 papers in training set
Top 0.1%
10.1%
2
Frontiers in Marine Science
55 papers in training set
Top 0.1%
10.1%
3
BMC Genomics
328 papers in training set
Top 0.2%
8.4%
4
G3 Genes|Genomes|Genetics
351 papers in training set
Top 0.3%
6.3%
5
PeerJ
261 papers in training set
Top 1%
4.3%
6
Journal of Phycology
11 papers in training set
Top 0.1%
4.3%
7
PLOS ONE
4510 papers in training set
Top 35%
4.2%
8
Frontiers in Genetics
197 papers in training set
Top 2%
3.6%
50% of probability mass above
9
Scientific Reports
3102 papers in training set
Top 37%
3.6%
10
Algal Research
20 papers in training set
Top 0.1%
2.1%
11
Frontiers in Microbiology
375 papers in training set
Top 5%
1.9%
12
Journal of Fish Biology
14 papers in training set
Top 0.1%
1.8%
13
Molecular Ecology Resources
161 papers in training set
Top 0.6%
1.7%
14
Genome Biology and Evolution
280 papers in training set
Top 1%
1.7%
15
Molecular Ecology
304 papers in training set
Top 3%
1.7%
16
Hydrobiologia
11 papers in training set
Top 0.1%
1.7%
17
Biology
43 papers in training set
Top 1%
1.1%
18
Ecology and Evolution
232 papers in training set
Top 3%
1.1%
19
FACETS
11 papers in training set
Top 0.2%
0.9%
20
Limnology and Oceanography
26 papers in training set
Top 0.2%
0.9%
21
Frontiers in Ecology and Evolution
60 papers in training set
Top 4%
0.8%
22
Metabarcoding and Metagenomics
12 papers in training set
Top 0.1%
0.8%
23
Evolutionary Applications
91 papers in training set
Top 1%
0.8%
24
Environmental DNA
49 papers in training set
Top 0.3%
0.7%
25
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
26
Bioinformatics
1061 papers in training set
Top 9%
0.7%
27
Microorganisms
101 papers in training set
Top 3%
0.7%
28
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 7%
0.7%
29
Journal of Experimental Zoology Part B: Molecular and Developmental Evolution
22 papers in training set
Top 0.8%
0.6%
30
BMC Ecology and Evolution
49 papers in training set
Top 2%
0.6%