Back

The Rayleigh Quotient and Contrastive Principal Component Analysis II

Jackson, K. C.; Carilli, M. T.; Pachter, L.

2026-04-10 bioinformatics
10.64898/2026.04.08.717236 bioRxiv
Show abstract

Contrastive principal component analysis (PCA) methods are effective approaches to dimensionality reduction where variance of a target dataset is maximized while variance of a background dataset is minimized. We previously described how contrastive PCA problems can be written as solutions to generalized eigenvalue problems that maximize particular instantiations of the Rayleigh quotient. Here, we discuss two extensions of contrastive PCA: we use kernel weighting from spatial PCA (k-{rho}PCA) to contrast spatial and non-spatial axes of variation, and separately solve the Rayleigh quotient in the space of basis function coefficients (f-{rho}PCA) to find modes of variation in functional data. Together, these extensions expand the scope of contrastive PCA while unifying disparate fields of spatial and functional methods within a single conceptual and mathematical framework. We showcase the utility of these extensions with several examples drawn from genomics, analyzing gene expression in cancer and immune response to vaccination.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 0.5%
23.1%
2
Bioinformatics
1061 papers in training set
Top 3%
10.7%
3
PLOS ONE
4510 papers in training set
Top 24%
7.0%
4
Biostatistics
21 papers in training set
Top 0.1%
5.0%
5
Biometrics
22 papers in training set
Top 0.1%
3.7%
6
Statistics in Medicine
34 papers in training set
Top 0.1%
3.7%
50% of probability mass above
7
The Annals of Applied Statistics
15 papers in training set
Top 0.1%
3.7%
8
BMC Bioinformatics
383 papers in training set
Top 3%
3.2%
9
Bioinformatics Advances
184 papers in training set
Top 2%
2.8%
10
Frontiers in Genetics
197 papers in training set
Top 3%
2.5%
11
Cell Systems
167 papers in training set
Top 7%
1.7%
12
Journal of Computational Biology
37 papers in training set
Top 0.2%
1.7%
13
Scientific Reports
3102 papers in training set
Top 63%
1.4%
14
Nucleic Acids Research
1128 papers in training set
Top 13%
1.3%
15
eLife
5422 papers in training set
Top 48%
1.3%
16
Nature Communications
4913 papers in training set
Top 56%
1.3%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 39%
1.1%
18
PLOS Genetics
756 papers in training set
Top 13%
0.8%
19
Frontiers in Bioinformatics
45 papers in training set
Top 0.8%
0.8%
20
Genetics
225 papers in training set
Top 4%
0.8%
21
Physical Biology
43 papers in training set
Top 2%
0.8%
22
Physical Review E
95 papers in training set
Top 1%
0.7%
23
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
24
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
25
PeerJ
261 papers in training set
Top 17%
0.7%
26
Communications Biology
886 papers in training set
Top 28%
0.7%
27
NeuroImage
813 papers in training set
Top 6%
0.7%
28
GigaScience
172 papers in training set
Top 4%
0.5%
29
BioData Mining
15 papers in training set
Top 1%
0.5%
30
Human Brain Mapping
295 papers in training set
Top 5%
0.5%