Back

The MICA rs2596542 locus is not an island: component-aware haplotype decomposition reveals a composite MHC tag with distinct regulatory axes

Ichikawa, Y.

2026-05-18 genomics
10.64898/2026.05.15.725353 bioRxiv
Show abstract

Cross-population reversal of signed linkage disequilibrium (LD), or the "flip-flop" phenomenon, can arise when a tag SNP captures different extended haplotype backgrounds across populations. The MICA hepatocellular carcinoma susceptibility variant rs2596542 exemplifies this problem in the MHC, where signed LD reverses between Japanese and European populations but the relevant regulatory backgrounds are obscured by haplotypic complexity. We analyzed 7,303 biallelic SNVs surrounding rs2596542 across 26 populations using carrier-set topology classification followed by non-negative matrix factorization of carrier haplotypes. This identified two regulatory axes. Axis I, represented by components c4/c6, was population-stable and MICA-regulatory, with coherent MICA cis-eQTL enrichment and depletion for signed-LD reversal. Axis II, represented by component c5, was enriched for signed-LD reversal and showed an HLA-B{uparrow}/HLA-C{downarrow} expression signature with no MICA overlap across six GTEx tissues. In an independent Japanese HCC cohort (LIRI-JP, n = 122), Axis II-associated HLA-C downregulation remained after adjustment for clinical covariates, immune infiltration, and HLA-A expression. The previously proposed cross-population tag rs2244546 mapped to a population-stable component rather than Axis II. A parallel reanalysis of the COMT Val158Met flip-flop locus reproduced the signed-LD pattern reported by Lin et al1. and showed population-specific latent backgrounds among Val carriers. These results show that carrier-set topology combined with NMF can decompose composite marker alleles into functionally interpretable regulatory haplotype subspaces.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Cell Genomics
162 papers in training set
Top 0.1%
14.4%
2
Nature Communications
4913 papers in training set
Top 14%
12.3%
3
Nature Genetics
240 papers in training set
Top 0.7%
9.9%
4
Science
429 papers in training set
Top 7%
4.8%
5
Nature
575 papers in training set
Top 6%
4.2%
6
Genome Medicine
154 papers in training set
Top 2%
3.9%
7
Cell
370 papers in training set
Top 5%
3.9%
50% of probability mass above
8
Genome Biology
555 papers in training set
Top 3%
3.5%
9
Molecular Cell
308 papers in training set
Top 4%
3.5%
10
eLife
5422 papers in training set
Top 30%
3.0%
11
Nature Biotechnology
147 papers in training set
Top 3%
3.0%
12
Cell Reports
1338 papers in training set
Top 18%
2.8%
13
Science Advances
1098 papers in training set
Top 12%
2.3%
14
Nucleic Acids Research
1128 papers in training set
Top 9%
2.0%
15
Communications Biology
886 papers in training set
Top 7%
1.8%
16
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.6%
17
Nature Neuroscience
216 papers in training set
Top 4%
1.6%
18
Genome Research
409 papers in training set
Top 3%
1.3%
19
Molecular Systems Biology
142 papers in training set
Top 1%
1.2%
20
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
21
Nature Methods
336 papers in training set
Top 6%
0.8%
22
The EMBO Journal
267 papers in training set
Top 5%
0.7%
23
Cell Systems
167 papers in training set
Top 13%
0.7%
24
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 47%
0.7%
25
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
26
Nature Cell Biology
99 papers in training set
Top 5%
0.7%
27
Nature Machine Intelligence
61 papers in training set
Top 4%
0.6%
28
Nature Medicine
117 papers in training set
Top 6%
0.6%
29
Scientific Reports
3102 papers in training set
Top 78%
0.6%
30
Nature Plants
84 papers in training set
Top 2%
0.6%