A Transformer-Based 2.5D Deep Learning Model for Preoperative Prediction of Lymph Node Metastasis in Papillary Thyroid Carcinoma

Xu, S.; Yan, X.; Su, Y.; Qi, J.; Chen, X.; Li, Y.; Xiong, H.; Jiang, J.; Wei, Z.; Chen, Z.; YALIKUN, Y.; Li, H.; Li, X.; Xi, Y.; Li, W.; Li, X.; Du, Y.

2026-04-02 oncology
10.64898/2026.04.01.26349933 medRxiv
Background: Accurate preoperative prediction of lymph node metastasis (LNM) in papillary thyroid carcinoma (PTC) remains challenging, particularly in clinically node-negative (cN0) patients, leading to potential overtreatment. We aimed to develop and validate a Transformer-based 2.5D deep learning model (ThyLNT) using preoperative computed tomography (CT) images for robust prediction of LNM and to explore its underlying biological basis through multi-omics analyses.

Methods: A total of 1,560 PTC patients from six hospitals were retrospectively included. The Tongji Hospital cohort (n=1,010) was divided into training (70%) and internal validation (30%) sets, while five independent institutions served as external test cohorts. For each lesion, seven 2.5D slices were extracted and modeled using a DenseNet201 backbone. Slice-level features were integrated using a Transformer-based feature-level fusion strategy and compared with ensemble learning, multi-instance learning (MIL), and traditional radiomics approaches. Model performance was assessed using area under the receiver operating characteristic curve (AUC), calibration analysis, decision curve analysis (DCA), and precision-recall curves. Multi-omics analyses, including bulk RNA-seq, single-cell RNA-seq, spatial transcriptomics, and spatial metabolomics, were performed to investigate biological correlates.

Results: The Transformer-based model consistently outperformed comparator models across cohorts. In the training and validation cohorts, ThyLNT achieved AUCs of 0.882 and 0.787, respectively, with external AUCs ranging from 0.772 to 0.827. Compared with ultrasound (US) and CT, ThyLNT showed superior predictive performance (all P < 0.001 in the validation cohort). Simulation analysis in cN0 patients suggested that ThyLNT could reduce unnecessary lymph node dissection (LND) from 52.16% to 4.88%. Transcriptomic analysis combined with WGCNA and correlation analysis identified VEGFA as the gene most strongly associated with ThyLNT prediction scores. Single-cell and spatial transcriptomic analyses suggested metastasis-related tumor microenvironment remodeling, while enrichment analysis of genes affected by virtual knockout of VEGFA indicated involvement of angiogenesis- and epithelial-mesenchymal transition (EMT)-related pathways. Spatial metabolomics further revealed coordinated lipid metabolic reprogramming in metastatic tissues. These findings suggest that ThyLNT provides robust predictive performance while capturing biologically relevant features associated with metastatic progression.
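
The abstract reports discrimination as AUC. As a reading aid, here is a minimal sketch of how an AUC can be computed from prediction scores, using the pairwise (Mann-Whitney) definition: the fraction of positive/negative pairs in which the positive case receives the higher score, with ties counted as one half. The function name `roc_auc` and the toy labels and scores are hypothetical illustrations and are not taken from the study; the authors' actual evaluation code is not described in the abstract.

```python
def roc_auc(labels, scores):
    """AUC via pairwise comparison: P(score of positive > score of negative).

    labels: iterable of 0/1 (1 = metastasis present, an assumed encoding)
    scores: iterable of model scores, higher meaning more likely positive
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


# Hypothetical toy data for illustration only (not study data).
toy_labels = [1, 0, 1, 1, 0, 0]
toy_scores = [0.9, 0.3, 0.6, 0.4, 0.5, 0.2]
toy_auc = roc_auc(toy_labels, toy_scores)
```

The pairwise form is quadratic in cohort size, which is acceptable for a sketch; a rank-based formulation gives the same value more efficiently on large cohorts.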

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1. Frontiers in Oncology | 95 papers in training set | Top 0.1% | 18.4%
2. Scientific Reports | 3102 papers in training set | Top 6% | 10.2%
3. eLife | 5422 papers in training set | Top 21% | 4.2%
4. British Journal of Cancer | 42 papers in training set | Top 0.3% | 4.0%
5. Journal of Translational Medicine | 46 papers in training set | Top 0.3% | 3.1%
6. Clinical Cancer Research | 58 papers in training set | Top 0.6% | 2.8%
7. Nature Communications | 4913 papers in training set | Top 45% | 2.5%
8. iScience | 1063 papers in training set | Top 9% | 2.4%
9. Cancers | 200 papers in training set | Top 2% | 2.1%
10. European Journal of Cancer | 10 papers in training set | Top 0.2% | 1.8%
50% of probability mass above
11. PLOS Computational Biology | 1633 papers in training set | Top 16% | 1.7%
12. npj Precision Oncology | 48 papers in training set | Top 0.7% | 1.5%
13. eBioMedicine | 130 papers in training set | Top 2% | 1.5%
14. PLOS ONE | 4510 papers in training set | Top 57% | 1.5%
15. Breast Cancer Research | 32 papers in training set | Top 0.3% | 1.5%
16. npj Digital Medicine | 97 papers in training set | Top 2% | 1.5%
17. JNCI: Journal of the National Cancer Institute | 16 papers in training set | Top 0.4% | 1.5%
18. Cell Reports Medicine | 140 papers in training set | Top 5% | 1.3%
19. JNCI Cancer Spectrum | 10 papers in training set | Top 0.3% | 1.3%
20. Theranostics | 33 papers in training set | Top 0.8% | 1.3%
21. International Journal of Cancer | 42 papers in training set | Top 0.9% | 1.1%
22. Cancer Letters | 32 papers in training set | Top 0.5% | 1.1%
23. Molecular Cancer | 14 papers in training set | Top 0.6% | 1.1%
24. Cancer Medicine | 24 papers in training set | Top 1% | 0.9%
25. JCO Precision Oncology | 14 papers in training set | Top 0.3% | 0.9%
26. JAMA Network Open | 127 papers in training set | Top 4% | 0.8%
27. Annals of Oncology | 13 papers in training set | Top 0.9% | 0.8%
28. Cancer Research | 116 papers in training set | Top 3% | 0.8%
29. Cancer Cell | 38 papers in training set | Top 2% | 0.8%
30. The Journal of Pathology | 22 papers in training set | Top 0.4% | 0.8%
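
The statement that the top 10 journals account for 50% of the predicted probability mass can be checked against the listed percentages with a running sum. The sketch below copies the 30 rounded probabilities from the list in rank order and finds the first rank at which the cumulative total reaches a threshold. The function name `rank_reaching` is a hypothetical helper, and because the listed values are rounded to one decimal place, the totals are approximate.

```python
# Probabilities (percent) copied from the ranked journal list, in rank order.
listed_probs = [
    18.4, 10.2, 4.2, 4.0, 3.1, 2.8, 2.5, 2.4, 2.1, 1.8,   # ranks 1-10
    1.7, 1.5, 1.5, 1.5, 1.5, 1.5, 1.5, 1.3, 1.3, 1.3,     # ranks 11-20
    1.1, 1.1, 1.1, 0.9, 0.9, 0.8, 0.8, 0.8, 0.8, 0.8,     # ranks 21-30
]


def rank_reaching(probs, threshold):
    """Return (rank, cumulative_total) at the first rank where the running
    sum reaches `threshold`, or (None, total) if it is never reached."""
    total = 0.0
    for rank, prob in enumerate(probs, start=1):
        total += prob
        if total >= threshold:
            return rank, total
    return None, total


rank_50, mass_at_rank = rank_reaching(listed_probs, 50.0)
mass_top_9 = sum(listed_probs[:9])
```

With these rounded values the first nine journals sum to about 49.7% and the first ten to about 51.5%, which is consistent with the 50% marker sitting after rank 10.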