Back

RNAseq analysis reveals the recurrent loss of heterozygosity in lung cancer and associated transcription patterns

Gumerov, R.; He, W.; Luong, P.; Dall Olio, F.; Vassetzky, Y. S.; SCHWAGER, A.

2026-01-19 cancer biology
10.64898/2026.01.19.698133 bioRxiv
Show abstract

A key limitation in cancer transcriptomics is the lack of accompanying genomic profiling such as whole-genome sequencing (WGS) or copy number alteration (CNA) data. Here we address this by showing that RNA-seq alone can be used to infer chromosomal aberrations and identify biologically meaningful patterns in lung cancer. Through a large-scale meta-analysis of publicly available RNA-seq datasets from non-small cell lung cancer (NSCLC), small cell lung cancer (SCLC), and matched controls, we reconstructed large scale CNA profiles and identified deletions in 3p, 9p, and 17p as the most frequent genomic events. Validation against paired WGS data confirmed a high degree of accuracy for RNA-seq-based inference. Our analyses revealed that while deletion-associated transcriptional heterogeneity exists, approximately 25% of differentially expressed genes were shared across all three deletion classes, indicating a conserved oncogenic program in lung cancer. Enrichment analysis linked these shared genes to pathways governing cell division, DNA replication, and extracellular matrix organization, while deletion-specific effects reflected disruption of tumor suppressor pathways, notably p53 signaling in 17p-deleted tumors, leading to deregulation of NOS2 and PLOD2. Integrating gene-level expression data, we identified both shared and deletion-specific biomarkers: TPX2 was consistently overexpressed across all deletion groups, while PTPRZ1 and CLDN9 were uniquely associated with del3p and del9p, respectively. Experimental validation in lung cancer cell lines confirmed these predictions, particularly the upregulation of CLDN9 in del9p carriers. By demonstrating that RNA-seq data can capture large-scale chromosomal events and reveal their transcriptional consequences, this study establishes an efficient framework for genomic inference and biomarker discovery, introducing CLDN9 as a novel, deletion-specific marker with potential prognostic and therapeutic value for 9p-deleted lung cancers.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 4%
21.7%
2
Genome Medicine
154 papers in training set
Top 0.5%
10.1%
3
Molecular Cancer
14 papers in training set
Top 0.1%
6.1%
4
Nature Genetics
240 papers in training set
Top 1%
6.1%
5
Cell Genomics
162 papers in training set
Top 0.7%
6.1%
50% of probability mass above
6
Cell Reports
1338 papers in training set
Top 11%
4.7%
7
Cancer Discovery
61 papers in training set
Top 0.7%
3.5%
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 23%
3.0%
9
Developmental Cell
168 papers in training set
Top 6%
2.6%
10
eLife
5422 papers in training set
Top 33%
2.5%
11
Molecular Cell
308 papers in training set
Top 6%
2.0%
12
Nucleic Acids Research
1128 papers in training set
Top 10%
1.8%
13
Science Advances
1098 papers in training set
Top 19%
1.6%
14
Science
429 papers in training set
Top 15%
1.6%
15
Communications Biology
886 papers in training set
Top 10%
1.6%
16
Scientific Reports
3102 papers in training set
Top 63%
1.4%
17
Cell Systems
167 papers in training set
Top 9%
1.3%
18
Cancer Research
116 papers in training set
Top 3%
1.2%
19
Nature
575 papers in training set
Top 14%
0.9%
20
Cell Reports Medicine
140 papers in training set
Top 6%
0.9%
21
Cancer Cell
38 papers in training set
Top 1%
0.9%
22
PLOS ONE
4510 papers in training set
Top 65%
0.9%
23
EMBO Molecular Medicine
85 papers in training set
Top 4%
0.9%
24
Nature Cell Biology
99 papers in training set
Top 4%
0.9%
25
Advanced Science
249 papers in training set
Top 18%
0.8%
26
Nature Biotechnology
147 papers in training set
Top 8%
0.7%
27
Cell
370 papers in training set
Top 17%
0.7%
28
Journal of Experimental Medicine
106 papers in training set
Top 5%
0.6%
29
The American Journal of Human Genetics
206 papers in training set
Top 5%
0.6%