Back

Re-calibration of flow cytometry standards for plant genome size estimation

Soni, A.; Henry, R. J.

2024-11-12 molecular biology
10.1101/2024.11.11.623134 bioRxiv
Show abstract

The evolution of long-read sequencing technologies has advanced the development of genome assemblies that are now frequently presented as gap-free or nearly gapless, reflecting substantial improvements in sequencing accuracy and completeness. However, discrepancies between the genome size estimates derived from genome assemblies and flow cytometry create ambiguity regarding the accuracy of these complementary approaches. Accurate genome size estimation via flow cytometry relies on use of an internal standard with a genome of known size. Historically, the genome size of these standards was often calibrated against incomplete genome assemblies or non-plant genomes, such as the human male genome, which was previously considered to be 7 pg but is now known to be around 6.15 pg after 20 years of advancements. Calibrating plant references against non-plant standards is not recommended due to differential staining properties. Therefore, we recalibrated the size of five plant genomes commonly used as reference standards in flow cytometry, by utilizing a recent gapless, telomere-to-telomere (T2T) genome assembly of Nipponbare rice. Our results indicate a significant overestimation of around 20% in previous flow cytometry-based estimates for Pisum sativum and Nicotiana benthamiana, around 10% for Arabidopsis thaliana, and less than 5% for Sorghum bicolor, and Gossypium hirsutum. The close alignment of the recalibrated GS estimates to the reference genome assemblies and recalculated estimates from different studies confirms their suitability as reference standards for more accurate measurement of plant genome size.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
The Plant Journal
197 papers in training set
Top 0.1%
26.2%
2
Plant Methods
39 papers in training set
Top 0.1%
19.9%
3
PLOS ONE
4510 papers in training set
Top 26%
6.5%
50% of probability mass above
4
Frontiers in Plant Science
240 papers in training set
Top 1%
6.5%
5
The Plant Genome
53 papers in training set
Top 0.2%
4.1%
6
Scientific Reports
3102 papers in training set
Top 30%
4.1%
7
GigaScience
172 papers in training set
Top 0.6%
3.1%
8
Plant Biotechnology Journal
56 papers in training set
Top 0.4%
2.5%
9
Molecular Ecology Resources
161 papers in training set
Top 0.6%
1.7%
10
Biology Methods and Protocols
53 papers in training set
Top 1%
1.5%
11
International Journal of Molecular Sciences
453 papers in training set
Top 9%
1.4%
12
BMC Bioinformatics
383 papers in training set
Top 5%
1.4%
13
BMC Genomics
328 papers in training set
Top 3%
1.4%
14
Scientific Data
174 papers in training set
Top 1%
1.3%
15
Nature Communications
4913 papers in training set
Top 56%
1.3%
16
Plants
39 papers in training set
Top 1%
0.9%
17
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
0.8%
18
BMC Plant Biology
47 papers in training set
Top 0.8%
0.8%
19
Genome Biology
555 papers in training set
Top 7%
0.8%
20
Analytical Biochemistry
26 papers in training set
Top 0.2%
0.8%
21
Microorganisms
101 papers in training set
Top 3%
0.7%
22
Gigabyte
60 papers in training set
Top 2%
0.5%
23
Bioinformatics
1061 papers in training set
Top 11%
0.5%