Back

Unlocking Multi-Sample Differential Expression for Spatial Transcriptomics Data with TESSERA

Constantine, F.; Laszik, Z.; Dudoit, S.; Purdom, E.

2026-04-30 bioinformatics
10.64898/2026.04.27.720955 bioRxiv
Show abstract

Spatial transcriptomics allows the unprecedented examination of gene expression levels at the resolution of spatially-situated single cells in a high-throughput manner. As the technology is adopted more broadly, studies frequently collect data from multiple tissue samples, which leads to unique challenges that traditional spatial statistical methods are not equipped to handle. In particular, factors that differ across samples, such as different coordinate systems, different numbers and types of cells, different underlying tissue architectures, among others, preclude the application of traditional methods to our new setting. In this work, we propose a novel method, TESSERA, based on a spatial generalized linear model, for analyzing multi-sample spatial transcriptomics count data. Importantly, we provide a mathematical and computational framework for efficient and scalable model fitting and statistical inference to accompany the specification of our model. Our method for fitting the model enables the estimation of a common set of fixed effects across samples. This allows us to address a variety of differential expression questions, such as identification of which genes are differentially expressed between conditions (e.g., diseases, treatments), while accounting for spatial correlation between cells within a sample. We benchmark our proposed method on simulated data and apply it to a spatial transcriptomics dataset of human kidney samples. We find that our method provides a hitherto nonexistent extension to the multi-sample setting while remaining competitive with or outperforming existing algorithms in the single-sample setting.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1%
22.1%
2
Cell Systems
167 papers in training set
Top 1%
9.0%
3
Nature Communications
4913 papers in training set
Top 36%
4.2%
4
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.9%
5
Advanced Science
249 papers in training set
Top 5%
3.9%
6
PLOS ONE
4510 papers in training set
Top 40%
3.5%
7
Genome Biology
555 papers in training set
Top 3%
3.5%
50% of probability mass above
8
Nature Methods
336 papers in training set
Top 3%
3.5%
9
Scientific Reports
3102 papers in training set
Top 38%
3.5%
10
Nature Biotechnology
147 papers in training set
Top 3%
2.8%
11
PLOS Computational Biology
1633 papers in training set
Top 12%
2.5%
12
Nucleic Acids Research
1128 papers in training set
Top 8%
2.3%
13
Biometrics
22 papers in training set
Top 0.1%
1.8%
14
Patterns
70 papers in training set
Top 0.7%
1.8%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 31%
1.7%
16
BMC Bioinformatics
383 papers in training set
Top 4%
1.7%
17
Nature Computational Science
50 papers in training set
Top 0.7%
1.7%
18
Biostatistics
21 papers in training set
Top 0.1%
1.6%
19
Bioinformatics Advances
184 papers in training set
Top 3%
1.6%
20
Communications Biology
886 papers in training set
Top 15%
1.2%
21
The Annals of Applied Statistics
15 papers in training set
Top 0.1%
1.2%
22
Genome Research
409 papers in training set
Top 3%
1.2%
23
iScience
1063 papers in training set
Top 23%
1.1%
24
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
25
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.8%
26
GigaScience
172 papers in training set
Top 3%
0.7%
27
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.7%