Back

Precision-Based Filtering Facilitates Integration of Conventional and Single-Nucleus Transcriptomes to Identify Time- and Temperature-Sensitive Cell Populations

Seluzicki, A.; Lee, T. A.; Hartwick, N. T.; Michael, T. P.; Ecker, J. R.; Chory, J.

2026-01-31 bioinformatics
10.64898/2026.01.28.702137 bioRxiv
Show abstract

Transcriptome analysis via RNA sequencing (RNAseq) has become a ubiquitous method of molecular characterization from whole organisms, dissected tissues, and single cells. These experiments have provided an extraordinary volume of data describing the molecular states and responses to many conditions. However, standard approaches to RNAseq analysis commonly use expression level filters that eliminate potentially useful data in the service of decreasing noise. Here we describe the implementation of a coefficient of variation-based filter for RNAseq gene expression data. This filter prioritizes consistent data across replicates, allowing lowly-expressed genes with low-variation measurements to be retained for downstream analysis. We show that, in our Arabidopsis RNAseq data set, this filter allows for the inclusion of many more transcription factors than even a low-stringency expression level filter. We find that these lowly-expressed genes mark specific cell clusters in our single-nucleus (sn)RNAseq dataset. We further characterize communities of co-expressed genes, sampled across the day at two growth temperatures, in relation to snRNAseq cell clusters, finding evidence for a highly photosynthetic cell population, and a cell state marked by high cell division and translation. These methods can be expanded to RNAseq analysis in many systems, facilitating the construction of more detailed models of tissue-specific gene regulatory networks.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Nucleic Acids Research
1128 papers in training set
Top 1%
12.1%
2
The Plant Cell
141 papers in training set
Top 0.3%
9.9%
3
The Plant Journal
197 papers in training set
Top 0.7%
6.7%
4
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.2%
6.7%
5
Genetics
225 papers in training set
Top 0.9%
4.8%
6
Nature Communications
4913 papers in training set
Top 37%
3.9%
7
Scientific Reports
3102 papers in training set
Top 39%
3.5%
8
PLOS ONE
4510 papers in training set
Top 41%
3.5%
50% of probability mass above
9
PLOS Computational Biology
1633 papers in training set
Top 10%
3.5%
10
Bioinformatics
1061 papers in training set
Top 6%
3.5%
11
Molecular Systems Biology
142 papers in training set
Top 0.3%
2.8%
12
Plant Physiology
217 papers in training set
Top 2%
2.7%
13
Molecular Biology of the Cell
272 papers in training set
Top 0.9%
2.5%
14
Cell Systems
167 papers in training set
Top 5%
2.5%
15
Genome Research
409 papers in training set
Top 2%
2.0%
16
Genome Biology
555 papers in training set
Top 4%
1.7%
17
Nature Methods
336 papers in training set
Top 4%
1.7%
18
eLife
5422 papers in training set
Top 48%
1.3%
19
Plant Direct
81 papers in training set
Top 1%
1.3%
20
Development
440 papers in training set
Top 2%
1.3%
21
Molecular Biology and Evolution
488 papers in training set
Top 3%
1.2%
22
Nature Genetics
240 papers in training set
Top 6%
1.1%
23
BMC Bioinformatics
383 papers in training set
Top 6%
1.1%
24
New Phytologist
309 papers in training set
Top 4%
1.1%
25
Molecular Plant
36 papers in training set
Top 1%
0.9%
26
BMC Genomics
328 papers in training set
Top 5%
0.9%
27
iScience
1063 papers in training set
Top 34%
0.7%
28
Journal of Cell Science
353 papers in training set
Top 2%
0.7%
29
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 45%
0.7%
30
in silico Plants
24 papers in training set
Top 0.4%
0.6%