Back

DIANA: An integrated pipeline for analysis of long-read whole-genome sequencing data for molecular neuropathology.

Bope, c. D.; Leske, H.; Nagymihaly, R. M.; Vik-Mo, E. O.; Halldorsson, S.

2026-03-27 neuroscience
10.64898/2026.03.25.714119 bioRxiv
Show abstract

SummaryCentral nervous system (CNS) tumor diagnosis requires comprehensive genomic profiling including DNA-methylation classification, copy-number variants (CNV), gene fusion analysis, small variant detection and MGMT promoter methylation status. Long-read sequencing platforms such as nanopore sequencing by Oxford Nanopore Technologies and SMRTseq by PacBio can capture all these in a single assay, but integrating diverse analytical tools to leverage the advantages of long-read sequencing remains complex. We present DIANA (Diagnostic Integrated Analytics of Neoplastic Alterations), a pipeline providing fully automated end-to-end processing of long-read whole-genome sequencing data from aligned sequence reads. DIANA produces a human readable report that combines methylation classification with prioritized genetic variants to support CNS tumor diagnostics and clinical decision-making. Availability and implementationDIANA is an open-source Nextflow pipeline, freely available through Docker or Apptainer/Singularity technologies. The source code, comprehensive documentation, and installation protocols are available on GitHub: https://github.com/VilhelmMagnusLab/DIANA.git. Supplementary informationSupplementary data are available at Bioinformatics online.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 8%
17.2%
2
Nature Neuroscience
216 papers in training set
Top 1.0%
8.3%
3
Genome Medicine
154 papers in training set
Top 1%
6.2%
4
Nature Methods
336 papers in training set
Top 2%
6.2%
5
Nature Biotechnology
147 papers in training set
Top 2%
4.8%
6
Alzheimer's & Dementia
143 papers in training set
Top 1%
3.6%
7
Cell Reports Methods
141 papers in training set
Top 1.0%
3.5%
8
Nucleic Acids Research
1128 papers in training set
Top 7%
2.7%
50% of probability mass above
9
Acta Neuropathologica
51 papers in training set
Top 0.4%
2.6%
10
Bioinformatics
1061 papers in training set
Top 6%
2.4%
11
Brain
154 papers in training set
Top 2%
2.0%
12
Nature
575 papers in training set
Top 10%
1.9%
13
iScience
1063 papers in training set
Top 12%
1.9%
14
eneuro
389 papers in training set
Top 6%
1.7%
15
Genome Research
409 papers in training set
Top 2%
1.6%
16
Neuro-Oncology
30 papers in training set
Top 0.5%
1.3%
17
Scientific Reports
3102 papers in training set
Top 65%
1.3%
18
Nature Protocols
30 papers in training set
Top 0.1%
1.3%
19
eBioMedicine
130 papers in training set
Top 2%
1.2%
20
EMBO Molecular Medicine
85 papers in training set
Top 3%
1.2%
21
BMC Bioinformatics
383 papers in training set
Top 6%
0.9%
22
Communications Biology
886 papers in training set
Top 17%
0.9%
23
Genome Biology
555 papers in training set
Top 6%
0.9%
24
PLOS ONE
4510 papers in training set
Top 66%
0.8%
25
BMC Medical Genomics
36 papers in training set
Top 1%
0.8%
26
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 43%
0.8%
27
Science Advances
1098 papers in training set
Top 28%
0.8%
28
Neuropathology and Applied Neurobiology
14 papers in training set
Top 0.7%
0.7%
29
Nature Biomedical Engineering
42 papers in training set
Top 2%
0.7%
30
Cell Reports Medicine
140 papers in training set
Top 8%
0.7%