Back

Genomic Variability of the HCT116 Cell Line Identified Using Oxford Nanopore Sequencing

Leonov, P.; Mikheeva, R.; Koryukov, M.; Ruleva, E.; Karabut, E.; Kechin, A.

2026-04-24 genomics
10.64898/2026.04.23.720331 bioRxiv
Show abstract

HCT116 is a colorectal cancer cell line frequently used in anti-tumor drug development experiments as well as in studies of the molecular machinery of eukaryotic cells. It is well characterized by the presence of several single-nucleotide and short mutations in multiple oncogenes and tumor suppressor genes, including KRAS, PIK3CA, MLH1, CTNNB1, CDKN2A, TGFBR2, and BRCA2. However, its landscape of large genomic rearrangements (LGRs) and copy number variants (CNVs) is still far from being fully understood. Therefore, the aim of this study was to identify LGRs and CNVs in several HCT116 cell line samples using Oxford Nanopore sequencing technology, including three samples from the SRA NCBI database, and to compare common and unique variants across all samples. Using the recently developed eLaRodON tool, we identified 22,666 common LGRs, among which more than 70% of tandem duplications and deletions larger than 80 kb were confirmed by CNV analysis. Among LGRs affecting protein-coding sequences, two in-frame rearrangements were identified: a deletion of exons 4-6 and a duplication of exon 10 in the CCSER1 gene, which encodes a cell division regulator protein. Given its high rearrangement rate in various tumors and the clinical significance of its overexpression, this finding may be potentially useful in future research on this cell line. Regarding differences between samples, we found that LGRs in the laboratory sample and in one of the three SRA NCBI samples occurred more frequently via ALR/Alpha repeats than via Alu repeats, in contrast to common LGRs and those unique to the other samples, a finding that may indicate the presence of unique mechanisms of genomic instability. Thus, this study reveals a broad spectrum of large genomic rearrangements and copy number variants that can be identified in the HCT116 cell line using Oxford Nanopore sequencing, including rearrangements specific to distinct cell line samples.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 2%
14.8%
2
Frontiers in Genetics
197 papers in training set
Top 0.3%
10.2%
3
BMC Genomics
328 papers in training set
Top 0.1%
8.5%
4
PLOS ONE
4510 papers in training set
Top 27%
6.4%
5
BMC Bioinformatics
383 papers in training set
Top 3%
3.6%
6
International Journal of Molecular Sciences
453 papers in training set
Top 3%
3.6%
7
Cancers
200 papers in training set
Top 2%
3.6%
50% of probability mass above
8
Genes
126 papers in training set
Top 0.4%
2.9%
9
Frontiers in Molecular Biosciences
100 papers in training set
Top 1%
2.1%
10
The Journal of Molecular Diagnostics
36 papers in training set
Top 0.2%
1.9%
11
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
12
Molecular Biology Reports
19 papers in training set
Top 0.2%
1.7%
13
BMC Cancer
52 papers in training set
Top 1%
1.7%
14
BMC Medical Genomics
36 papers in training set
Top 0.5%
1.7%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.5%
16
PeerJ
261 papers in training set
Top 9%
1.3%
17
Biology Methods and Protocols
53 papers in training set
Top 1%
1.2%
18
Genomics
60 papers in training set
Top 2%
1.1%
19
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 6%
1.1%
20
Heliyon
146 papers in training set
Top 4%
1.0%
21
DNA Research
23 papers in training set
Top 0.4%
1.0%
22
GigaScience
172 papers in training set
Top 2%
1.0%
23
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
1.0%
24
F1000Research
79 papers in training set
Top 3%
0.9%
25
Oncotarget
15 papers in training set
Top 0.3%
0.9%
26
Genome Medicine
154 papers in training set
Top 7%
0.9%
27
Journal of Clinical Medicine
91 papers in training set
Top 6%
0.8%
28
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.8%
29
Journal of Translational Medicine
46 papers in training set
Top 3%
0.8%
30
Diagnostics
48 papers in training set
Top 2%
0.7%