Back

MIMIQ: Fast mutual information calculation and significance testing for single-cell RNA sequencing analysis

O'Hanlon, D.; Garcia Busto, S.; Perez Carrasco, R.

2026-04-13 bioinformatics
10.64898/2026.04.10.717770 bioRxiv
Show abstract

Mutual information is a fundamental quantity in information theory that describes the non-linear dependency between two variables, and has numerous applications within bioinformatics and beyond. However, its exploitation is hampered by a trade-off between computational intensity and accuracy. Here we present an adaptive binning approach to computing the pairwise mutual information, optimized for small integer counts such as those observed in single-cell RNA sequencing. By assuming a sampling distribution such as the negative binomial, a {chi}2 test statistic for hypothesis testing can be computed simultaneously via a copula transformation. Using these quantities, we show how gene rewiring of CD4+ naive T-cells during SARS-CoV-2 infection can be studied using a single-cell sequencing dataset of healthy and COVID-19 donors.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nucleic Acids Research
1128 papers in training set
Top 1.0%
14.1%
2
Bioinformatics
1061 papers in training set
Top 2%
14.1%
3
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.1%
10.3%
4
Nature Communications
4913 papers in training set
Top 23%
8.3%
5
BMC Bioinformatics
383 papers in training set
Top 2%
4.8%
50% of probability mass above
6
PLOS Computational Biology
1633 papers in training set
Top 7%
4.8%
7
Cell Systems
167 papers in training set
Top 3%
4.2%
8
Nature Biotechnology
147 papers in training set
Top 2%
4.2%
9
Communications Biology
886 papers in training set
Top 1.0%
4.2%
10
Genome Research
409 papers in training set
Top 1%
3.5%
11
Genome Biology
555 papers in training set
Top 3%
2.8%
12
Nature Methods
336 papers in training set
Top 4%
2.4%
13
Bioinformatics Advances
184 papers in training set
Top 3%
1.9%
14
Scientific Reports
3102 papers in training set
Top 59%
1.7%
15
iScience
1063 papers in training set
Top 15%
1.7%
16
Cell Reports Methods
141 papers in training set
Top 3%
1.6%
17
PLOS ONE
4510 papers in training set
Top 57%
1.5%
18
Molecular Biology and Evolution
488 papers in training set
Top 3%
1.3%
19
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 10%
0.7%
21
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 47%
0.7%
22
eLife
5422 papers in training set
Top 62%
0.6%
23
Nature Computational Science
50 papers in training set
Top 2%
0.6%