Back

Statistical and Evolutionary Analysis of Sequenced DNA from Breast Cancer FFPE Specimens

Kurpas, M. K.; Kus, P.; Jaksik, R.; Dinh, K. N.; Adamczyk, A.; Majchrzyk, K.; Kimmel, M.

2025-10-05 evolutionary biology
10.1101/2025.10.04.680485 bioRxiv
Show abstract

BackgroundDespite the introduction of instant freezing of tumor specimens, formalin-fixed paraffin-embedded (FFPE) blocks of tissue are still commonplace in clinical practice and constitute an important reference for genetic epidemiology of cancer. We carried out a study of a collection of breast tumors paired with lymph-node metastases and analyzed using advanced computational methods, to determine how much information can be obtained from mid-depth whole-exome bulk DNA sequencing. MethodsWe gathered 15 paired (primary and an involved lymph node) excised breast tumors of different molecular subtypes (HER2+, triple negative, luminal A and luminal B HER2-), from the National Research Institute of Oncology, Krakow (Poland) Branch. FFPE specimens contained typical artifacts, manifesting themselves in spurious DNA variant calls. We used several bioinformatics tools to remove the artifacts and analyzed the exomic data, using both commercial and original in-house computational techniques. ResultsWe used several of recent bioinformatics tools to remove the FFPE artifacts and found a serious dispersal of outcomes. After calibration, a series of analyses was performed, including copy number study, resulting in ploidy levels ranging from 1 to 5 (average of 2.5). Positive association was found between the frequency of oncogenes relative to tumor suppressor genes and DNA copy number. In addition, we carried out analyses of the clonal structure of the data using original computational methods based on evolutionary modeling. Interesting results concerning clonal structure, early tumor expansion, and interdependence of the primary tumor and lymph node metastases have been obtained. ConclusionsDespite the imperfections of the FFPE data, many important features of molecular evolution of tumor DNA can be recovered from routine clinical samples.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 8%
19.5%
2
Scientific Reports
3102 papers in training set
Top 3%
13.0%
3
Cancers
200 papers in training set
Top 0.5%
8.8%
4
Genes
126 papers in training set
Top 0.1%
6.6%
5
PeerJ
261 papers in training set
Top 2%
3.7%
50% of probability mass above
6
Data in Brief
13 papers in training set
Top 0.1%
3.7%
7
F1000Research
79 papers in training set
Top 0.4%
3.7%
8
Diagnostics
48 papers in training set
Top 0.6%
2.7%
9
FEBS Open Bio
29 papers in training set
Top 0.1%
2.2%
10
Frontiers in Genetics
197 papers in training set
Top 4%
2.0%
11
BMC Bioinformatics
383 papers in training set
Top 4%
1.9%
12
Frontiers in Oncology
95 papers in training set
Top 2%
1.9%
13
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 4%
1.8%
14
BMC Genomics
328 papers in training set
Top 2%
1.8%
15
BMC Cancer
52 papers in training set
Top 1%
1.5%
16
International Journal of Molecular Sciences
453 papers in training set
Top 9%
1.4%
17
Bioinformatics
1061 papers in training set
Top 9%
0.9%
18
Biology Methods and Protocols
53 papers in training set
Top 2%
0.9%
19
JNCI Cancer Spectrum
10 papers in training set
Top 0.4%
0.8%
20
Biology
43 papers in training set
Top 2%
0.8%
21
Gene Reports
13 papers in training set
Top 0.6%
0.8%
22
Molecular Biology Reports
19 papers in training set
Top 0.5%
0.8%
23
PLOS Computational Biology
1633 papers in training set
Top 25%
0.7%
24
European Journal of Human Genetics
49 papers in training set
Top 1%
0.7%
25
Epigenetics
43 papers in training set
Top 1%
0.5%
26
Gene
41 papers in training set
Top 3%
0.5%
27
Scientific Data
174 papers in training set
Top 3%
0.5%
28
International Journal of Cancer
42 papers in training set
Top 2%
0.5%