Long-Read Haplotype Phasing Resolves Allelic Configuration as a Missing Layer of Precision Oncology

Vo, J. N.; Wu, Y.-M.; Wang, R.; Pham, T.; Cao, X.; Yeung, S.; Park, M.; Kleyman-Smith, Y.; Teo, G. C.; Wu, A.; Li, A.; Estill, J.; Kunju, L. P.; Yang, C.; Robinson, D. R.; Chinnaiyan, A. M.

2026-05-05 oncology
10.64898/2026.05.05.26351600 medRxiv
Conventional short-read sequencing cannot determine whether co-occurring variants within a cancer gene reside on the same allele (cis) or on opposing alleles (trans), a distinction with direct biological and therapeutic consequences. Trans configurations confirm biallelic tumor suppressor inactivation and inform therapy selection, while cis configurations generate compound oncogenic alleles with enhanced activity. We analyzed 768 patients with prostate, breast, or ovarian cancers in the PROBLEM cohort, using mutational signatures to nominate cryptic genomic instability cases where the causative biallelic event was not apparent from short-read sequencing. Long-read nanopore sequencing resolved 32 of 46 cryptic cases (69.6%), leveraging its unique advantages in direct methylation detection, long insertion resolution, and complex structural variant characterization, confirming trans biallelic inactivation in all resolved tumor suppressor cases. Systematic analysis of 4,496 MiOncoSeq samples identified 17,519 multi-hit gene pairs, of which 78.7% exceeded the 500 bp short-read phasing limit. Long-read phasing further revealed recurrent compound cis oncogenic alleles in NOTCH1, PIK3CA, PDGFRB, and KIT with functionally synergistic activity. Haplotype phasing resolves a systematically overlooked gap in cancer variant interpretation and warrants broader integration into precision oncology workflows.

Statement of Significance

Short-read sequencing cannot resolve whether co-occurring variants within a cancer gene are cis or trans, a distinction critical for clinical interpretation. Long-read nanopore sequencing addresses this gap through direct haplotype phasing, methylation detection, and complex structural variant resolution, confirming biallelic tumor suppressor inactivation and revealing compound cis oncogenic alleles with enhanced activity.
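The cis/trans distinction and the 500 bp limit mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the read representation (one `(has_alt_a, has_alt_b)` tuple per read spanning both sites), the function names, and the `min_reads` threshold are all hypothetical choices made for illustration. Only the 500 bp figure comes from the abstract.

```python
from collections import Counter
from typing import Iterable, Tuple

# Short-read phasing limit quoted in the abstract.
SHORT_READ_PHASING_LIMIT_BP = 500


def within_short_read_limit(pos_a: int, pos_b: int,
                            limit: int = SHORT_READ_PHASING_LIMIT_BP) -> bool:
    """Return True if two variant positions lie within the phasing limit."""
    return abs(pos_a - pos_b) <= limit


def call_configuration(reads: Iterable[Tuple[bool, bool]],
                       min_reads: int = 3) -> str:
    """Classify a variant pair as 'cis', 'trans', or 'unresolved'.

    Each read is (has_alt_a, has_alt_b) and is assumed to span both sites.
    min_reads is a hypothetical support threshold, not a published value.
    """
    counts = Counter()
    for has_a, has_b in reads:
        if has_a and has_b:
            counts["cis"] += 1      # both variants on one molecule
        elif has_a != has_b:
            counts["trans"] += 1    # exactly one variant on this molecule
        # Reads with neither variant are not counted: they occur in the cis
        # case (the other haplotype) and also come from normal-cell admixture.
    cis, trans = counts["cis"], counts["trans"]
    if max(cis, trans) < min_reads or cis == trans:
        return "unresolved"
    return "cis" if cis > trans else "trans"


# Usage: a pair 4,600 bp apart is beyond the short-read limit, so it would
# need reads long enough to cover both sites.
cis_reads = [(True, True)] * 5 + [(False, False)] * 4
trans_reads = [(True, False)] * 4 + [(False, True)] * 3
print(within_short_read_limit(1_000, 5_600))
print(call_configuration(cis_reads), call_configuration(trans_reads))
```

Real phasing works on aligned reads and has to handle sequencing errors, copy-number changes, and subclonality, none of which this toy classifier models.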

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1 | Clinical Cancer Research | 58 papers in training set | Top 0.1% | 12.4%
2 | Nature Communications | 4913 papers in training set | Top 14% | 12.3%
3 | Nature Cancer | 35 papers in training set | Top 0.1% | 10.0%
4 | Cancer Discovery | 61 papers in training set | Top 0.1% | 9.1%
5 | Cancer Cell | 38 papers in training set | Top 0.2% | 6.3%
50% of probability mass above
6 | Cell Reports Medicine | 140 papers in training set | Top 1% | 3.7%
7 | npj Precision Oncology | 48 papers in training set | Top 0.2% | 3.2%
8 | Nature Genetics | 240 papers in training set | Top 3% | 2.9%
9 | Cancer Research | 116 papers in training set | Top 1% | 2.6%
10 | Nature Medicine | 117 papers in training set | Top 1% | 2.4%
11 | Med | 38 papers in training set | Top 0.1% | 2.3%
12 | Annals of Oncology | 13 papers in training set | Top 0.4% | 2.1%
13 | Cell Genomics | 162 papers in training set | Top 3% | 1.8%
14 | Journal of Clinical Investigation | 164 papers in training set | Top 3% | 1.7%
15 | The American Journal of Human Genetics | 206 papers in training set | Top 2% | 1.7%
16 | Nature | 575 papers in training set | Top 11% | 1.7%
17 | Science | 429 papers in training set | Top 14% | 1.7%
18 | JCO Clinical Cancer Informatics | 18 papers in training set | Top 0.5% | 1.5%
19 | Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 37% | 1.3%
20 | European Journal of Cancer | 10 papers in training set | Top 0.4% | 0.9%
21 | JCO Precision Oncology | 14 papers in training set | Top 0.4% | 0.8%
22 | JNCI: Journal of the National Cancer Institute | 16 papers in training set | Top 0.6% | 0.8%
23 | Molecular Cancer | 14 papers in training set | Top 0.9% | 0.8%
24 | Journal for ImmunoTherapy of Cancer | 64 papers in training set | Top 1.0% | 0.8%
25 | Scientific Reports | 3102 papers in training set | Top 73% | 0.8%
26 | eLife | 5422 papers in training set | Top 56% | 0.8%
27 | Cell Reports | 1338 papers in training set | Top 33% | 0.7%
28 | Leukemia | 39 papers in training set | Top 0.8% | 0.7%
29 | EMBO Molecular Medicine | 85 papers in training set | Top 5% | 0.6%
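
The statement that the top 5 journals account for 50% of the predicted probability mass can be checked by accumulating the listed percentages in rank order. The sketch below is only an arithmetic check on the figures shown in the list; the function name `journals_to_reach` is hypothetical and says nothing about how the site computes its cutoff.

```python
def journals_to_reach(probabilities, target=50.0):
    """Return (rank, cumulative %) at which the running total first reaches target.

    Returns None if the target is never reached.
    """
    total = 0.0
    for rank, p in enumerate(probabilities, start=1):
        total += p
        if total >= target:
            return rank, total
    return None


# Predicted probabilities (%) for the first six listed journals.
listed = [12.4, 12.3, 10.0, 9.1, 6.3, 3.7]
rank, cumulative = journals_to_reach(listed)
print(rank, round(cumulative, 1))
```

With the listed values, the running total first reaches the 50% target at the fifth journal.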