Back

A Deep-Learning Atlas of XPO1-Mediated Nuclear Export at Proteome Scale

Dhungel, S.; de Zoysa, S.; Burns, D.; McGregor, L.; Pushpabai, R. R.; Alam, R.; Arain, D.; Bhaskar, V.; Jeong, J.; Kikani, A.; Kolli, E.; Mardini, Z.; Parasramka, A.; Potterton, E.; Thomas, S.; Kikani, C. K.

2026-03-27 cell biology
10.64898/2026.03.25.713363 bioRxiv
Show abstract

Exportin 1 (XPO1/CRM1) is the principal nuclear export receptor for cargos bearing hydrophobic nuclear export sequences (NESs). Dysregulation of XPO1-dependent export is implicated in cancer, neurodegeneration, and other diseases, yet a comprehensive view of XPO1 function remains limited by the poor reliability of sequence-based NES prediction. Existing predictors are largely derived from a small set of XPO1-cargo structures and are therefore biased toward canonical docking geometries, limiting their ability to detect NESs that engage XPO1 through noncanonical pocket-occupancy patterns. We hypothesized that deep learning-based structural modeling could overcome this limitation by directly sampling binding geometries. Using AlphaFold 3, we modeled full-length cargo-XPO1-RanGTP complexes for more than 4,000 human proteins and identified over 3,000 previously uncharacterized, high-confidence NESs. Integration of AlphaFold predictions with unsupervised structural geometry analysis and experimental validation identified both canonical NESs and noncanonical sequence patterns exhibiting atypical anchor-residue usage, expanding the structural language of XPO1-recognized NESs. Groove-resolved contact maps further revealed helix rotation within the export groove as a regulatory feature that can rewire pocket usage without altering the core NES sequence, enabling PTM- and cofactor-sensitive tuning of export strength. This exportome atlas resolves many previously ambiguous or unidentified NESs in disease-associated proteins and across major cellular systems, including centrosome organization, mRNA processing, ubiquitin signaling, kinase networks, ribosome quality control, and macroautophagy. We further identified recurrent NES-NLS tandem motifs encoded in primary sequence, suggesting coordinated regulation of nucleocytoplasmic transport. Together, our deep learning-based exportome atlas, integrated with NLS maps and accessible through a web-searchable resource, defines an expanded and regulatable code of nuclear transport at proteome scale and offers a framework for dissecting nuclear trafficking and its dysregulation in human disease.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Cell Biology
99 papers in training set
Top 0.1%
13.8%
2
Nature Communications
4913 papers in training set
Top 18%
10.1%
3
Nature
575 papers in training set
Top 4%
8.1%
4
Developmental Cell
168 papers in training set
Top 3%
6.9%
5
Nature Structural & Molecular Biology
218 papers in training set
Top 0.5%
6.9%
6
Cell
370 papers in training set
Top 6%
3.5%
7
Journal of Cell Biology
333 papers in training set
Top 1%
3.5%
50% of probability mass above
8
Molecular Cell
308 papers in training set
Top 4%
3.5%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 22%
3.5%
10
Science
429 papers in training set
Top 9%
3.5%
11
eLife
5422 papers in training set
Top 32%
2.6%
12
Cell Reports
1338 papers in training set
Top 19%
2.6%
13
Cell Systems
167 papers in training set
Top 5%
2.5%
14
Science Advances
1098 papers in training set
Top 14%
2.0%
15
Nature Methods
336 papers in training set
Top 4%
2.0%
16
Nature Chemical Biology
104 papers in training set
Top 2%
1.6%
17
Nature Biotechnology
147 papers in training set
Top 5%
1.6%
18
Advanced Science
249 papers in training set
Top 12%
1.6%
19
The EMBO Journal
267 papers in training set
Top 2%
1.6%
20
Cell Genomics
162 papers in training set
Top 5%
1.2%
21
Nature Immunology
71 papers in training set
Top 1%
1.2%
22
Genome Biology
555 papers in training set
Top 6%
0.9%
23
Nature Genetics
240 papers in training set
Top 7%
0.9%
24
Nature Medicine
117 papers in training set
Top 5%
0.8%
25
Nature Machine Intelligence
61 papers in training set
Top 3%
0.8%
26
Nucleic Acids Research
1128 papers in training set
Top 20%
0.6%
27
Nature Neuroscience
216 papers in training set
Top 7%
0.6%