
Integrative prioritization of clinically and biologically relevant long noncoding RNAs across gastrointestinal cancers

Flowers, B.; Lialios, P.; DiLollo, I.; Smith, N.; Whalley, J.; Lee, J.-S.

2026-05-29 cancer biology
10.64898/2026.05.26.728026 bioRxiv

Across gastrointestinal (GI) cancers, shared malignant programs are layered onto strong anatomical, lineage, and microenvironmental variation, making it difficult to distinguish disease-relevant long noncoding RNAs (lncRNAs) from context-dependent transcriptional signals. We developed a pan-GI integrative framework to classify lncRNAs across colorectal adenocarcinoma, gastric adenocarcinoma, and esophageal cancer using bulk and single-cell transcriptomic resources. This framework evaluates lncRNAs across four complementary dimensions: recurrent tumor-associated expression, clinical association with disease progression and overall survival, co-expression network context, and malignant epithelial expression at single-cell resolution. Paired tumor-normal RNA-seq analyses identified extensive tumor-associated lncRNA dysregulation and defined recurrent pan-GI lncRNAs consistently upregulated across cancer types. Clinical analyses further nominated transcripts linked to tumor extension, nodal involvement, metastatic dissemination, progression-linked expression, and adverse overall survival. Co-expression network analysis identified lncRNAs embedded within disease-associated transcriptional modules, providing functional context for otherwise poorly annotated transcripts. In parallel, single-cell-derived metacell analysis nominated malignant epithelial-associated and detection-supported lncRNAs, helping distinguish tumor-compartment-associated signals from stromal, immune, endothelial, and other microenvironmental contributions. Together, this study establishes an evidence-structured pan-GI lncRNA resource and a generalizable prioritization strategy for nominating disease-associated noncoding transcripts. More broadly, the framework provides a transferable strategy for systematic lncRNA prioritization across other cancers and heterogeneous disease contexts.

Matching journals

The top 5 journals are the smallest set that accounts for at least 50% of the predicted probability mass (about 51.4% combined).

1 | Nature Communications | 4913 papers in training set | Top 7% | 18.1%
2 | Genome Medicine | 154 papers in training set | Top 0.2% | 17.0%
3 | Cell Genomics | 162 papers in training set | Top 0.3% | 8.2%
4 | Nucleic Acids Research | 1128 papers in training set | Top 5% | 4.2%
5 | Cancer Discovery | 61 papers in training set | Top 0.5% | 3.9%
50% of probability mass above
6 | Molecular Cancer | 14 papers in training set | Top 0.1% | 3.5%
7 | Cell Reports | 1338 papers in training set | Top 16% | 3.5%
8 | Genome Biology | 555 papers in training set | Top 3% | 2.5%
9 | Advanced Science | 249 papers in training set | Top 8% | 2.5%
10 | Cell Reports Medicine | 140 papers in training set | Top 2% | 2.3%
11 | Nature Cell Biology | 99 papers in training set | Top 2% | 2.0%
12 | Developmental Cell | 168 papers in training set | Top 7% | 2.0%
13 | Nature Genetics | 240 papers in training set | Top 4% | 1.8%
14 | Cell | 370 papers in training set | Top 12% | 1.6%
15 | Science | 429 papers in training set | Top 15% | 1.6%
16 | Cancer Research | 116 papers in training set | Top 2% | 1.3%
17 | Communications Biology | 886 papers in training set | Top 15% | 1.2%
18 | Nature | 575 papers in training set | Top 13% | 1.2%
19 | Nature Biotechnology | 147 papers in training set | Top 7% | 0.9%
20 | Gastroenterology | 40 papers in training set | Top 2% | 0.9%
21 | Genomics, Proteomics & Bioinformatics | 171 papers in training set | Top 5% | 0.9%
22 | Cancer Cell | 38 papers in training set | Top 2% | 0.8%
23 | Nature Cancer | 35 papers in training set | Top 1% | 0.8%
24 | Cell Systems | 167 papers in training set | Top 12% | 0.8%
25 | PLOS ONE | 4510 papers in training set | Top 66% | 0.8%
26 | PLOS Computational Biology | 1633 papers in training set | Top 26% | 0.7%
27 | Scientific Reports | 3102 papers in training set | Top 75% | 0.7%
28 | Molecular Cell | 308 papers in training set | Top 11% | 0.7%
29 | Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 47% | 0.7%
30 | Science Translational Medicine | 111 papers in training set | Top 7% | 0.7%
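
The 50% cutoff in the list above can be checked by summing the listed probabilities in rank order. The following is a minimal sketch of that arithmetic, not the site's own method: the function name `journals_to_reach` and the 50% default threshold are assumptions made for illustration, and the input values are the displayed (rounded) percentages for the seven top-ranked journals.

```python
from itertools import accumulate

# Displayed probabilities (percent) for ranks 1-7, copied from the list above.
probs = [18.1, 17.0, 8.2, 4.2, 3.9, 3.5, 3.5]


def journals_to_reach(probs, threshold=50.0):
    """Return (k, total): the smallest number of top-ranked entries whose
    cumulative probability reaches the threshold, and that cumulative sum.

    Hypothetical helper for illustration only; returns None if the
    threshold is never reached.
    """
    for k, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return k, total
    return None


k, total = journals_to_reach(probs)
print(k, round(total, 1))
```

With these rounded inputs, the first four journals sum to 47.5%, so the fifth is the one that carries the running total past the threshold.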