Back

TopOmics: Topic Modelling for All Omics

Sanguinetti, G.; El Kazwini, N.; Caretti, F.

2026-05-29 bioinformatics
10.64898/2026.05.26.727810 bioRxiv
Show abstract

AO_SCPLOWBSTRACTC_SCPLOWTopic models have emerged as a popular paradigm to analyse and interpret complex single-cell and spatial data. Yet, current implementations are usually data-type specific and rely on different modelling and estimation approaches, hindering usability and interoperability. In this work we introduce TopOmics, a library to perform efficient and flexible topic modeling with any combination of -omics data at scale. The framework leverages standard libraries of the Python ecosystem, guaranteeing seamless integration with existing pipelines, and shows competitive performance against state-of-the-art methods while preserving interpretability. We provide several examples of TopOmics on diverse data sets, including a novel topic model for spatial multi-omic data, and an analysis of a very large VisiumHD data set.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genome Biology
555 papers in training set
Top 0.1%
18.4%
2
Bioinformatics
1061 papers in training set
Top 2%
17.3%
3
Nature Biotechnology
147 papers in training set
Top 0.3%
14.5%
50% of probability mass above
4
Nature Communications
4913 papers in training set
Top 23%
8.3%
5
Nucleic Acids Research
1128 papers in training set
Top 2%
7.1%
6
Nature Methods
336 papers in training set
Top 2%
4.2%
7
Genome Medicine
154 papers in training set
Top 4%
1.9%
8
Genome Research
409 papers in training set
Top 2%
1.8%
9
GigaScience
172 papers in training set
Top 1%
1.7%
10
BMC Bioinformatics
383 papers in training set
Top 5%
1.7%
11
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.7%
12
Cell Systems
167 papers in training set
Top 8%
1.6%
13
Peer Community Journal
254 papers in training set
Top 2%
1.3%
14
Advanced Science
249 papers in training set
Top 15%
1.2%
15
PLOS Computational Biology
1633 papers in training set
Top 20%
1.2%
16
Microbiome
139 papers in training set
Top 2%
1.2%
17
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 5%
0.9%
18
PLOS ONE
4510 papers in training set
Top 64%
0.9%
19
Plant Communications
35 papers in training set
Top 1%
0.9%
20
Bioinformatics Advances
184 papers in training set
Top 4%
0.9%
21
iScience
1063 papers in training set
Top 29%
0.8%
22
Nature
575 papers in training set
Top 16%
0.7%
23
Molecular Systems Biology
142 papers in training set
Top 2%
0.7%
24
Cell Reports Methods
141 papers in training set
Top 6%
0.6%
25
Heliyon
146 papers in training set
Top 8%
0.6%