Back

A Bayesian Informative Shrinkage Approach for Large-scale Multiple Hypothesis Testing (BISHOT): with Applications in Differential Analysis of Omics Data

Su, Y.; Clark, M. E. J. Z.; Wang, C.

2025-09-16 genetics
10.1101/2025.09.11.675690 bioRxiv
Show abstract

A major goal of many omics studies is to identify differential features, e.g. differentially expressed genes, between experimental groups. When performing differential analysis for a given dataset, relevant information from another platform or species is often available. Incorporating such prior information can help identify features that show consistent differential patterns across platforms or species, which are more likely to reflect shared biological processes, and thereby enhance the robustness and generalizability of the findings. However, existing differential analysis methods typically analyze only the data from the current study and do not leverage prior knowledge about the magnitude or direction of changes from other platforms or species. We address this challenge, and the associated multiple testing problem, using a Bayesian framework that enables the incorporation of prior knowledge obtained from different platforms or species. We propose a new test statistic, Bayesian Credible Ratio (BCR), based on a heteroscedastic global local shrinkage prior, and a new multiple testing criterion, sign-adjusted FDR (SFDR), that emphasize information regarding the direction of the differentially features. We prove that BCR achieves the largest count of sign-based true positives among all legitimate SFDR-controlling methods. Simulation results offer numerical evidence of its advantage compared to an empirical Bayesian method. The approach is demonstrated through the analysis of RNAseq and single-cell RNAseq datasets.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1%
22.0%
2
BMC Bioinformatics
383 papers in training set
Top 0.4%
17.1%
3
PLOS ONE
4510 papers in training set
Top 24%
7.0%
4
Frontiers in Genetics
197 papers in training set
Top 1.0%
6.2%
50% of probability mass above
5
PLOS Computational Biology
1633 papers in training set
Top 6%
6.2%
6
Biometrics
22 papers in training set
Top 0.1%
4.7%
7
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.2%
8
Genetic Epidemiology
46 papers in training set
Top 0.2%
3.9%
9
BMC Genomics
328 papers in training set
Top 1.0%
3.5%
10
Scientific Reports
3102 papers in training set
Top 52%
2.0%
11
Communications Biology
886 papers in training set
Top 6%
2.0%
12
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
1.8%
13
Nucleic Acids Research
1128 papers in training set
Top 11%
1.7%
14
Statistics in Medicine
34 papers in training set
Top 0.2%
1.2%
15
Journal of Computational Biology
37 papers in training set
Top 0.5%
0.9%
16
Bioinformatics Advances
184 papers in training set
Top 4%
0.9%
17
Applied Sciences
24 papers in training set
Top 0.9%
0.8%
18
Nature Communications
4913 papers in training set
Top 63%
0.7%
19
Genetics
225 papers in training set
Top 5%
0.7%
20
G3 Genes|Genomes|Genetics
351 papers in training set
Top 3%
0.7%
21
iScience
1063 papers in training set
Top 36%
0.7%
22
Computational and Structural Biotechnology Journal
216 papers in training set
Top 11%
0.6%
23
Biostatistics
21 papers in training set
Top 0.2%
0.6%