Back

FA-NIVA: A Nextflow framework for automated analysis of Nanopore based long-read sequencing data for genetic analysis in Fanconi anemia

Neurgaonkar, P.; Dierolf, M.; O'Gorman, L.; Remmele, C.; Schaeffer, J.; Popp, I.; Borst, A.; Rost, S.; Ankenbrand, M.; Kratz, C.; Bergmann, A.; Kalb, R.; Yu, J.

2026-03-04 genetic and genomic medicine
10.64898/2026.02.27.26346867 medRxiv
Show abstract

MotivationFanconi anemia (FA) is a rare disease mainly caused by biallelic pathogenic variants, including structural variants such as large deletions and insertions in FA genes. Currently, variant detection is based on short-read sequencing and probe-based approaches. However, determining the exact genomic breakpoint or achieving allelic discrimination remains challenging. Nanopore-based long-read sequencing enables a comprehensive detection of FA variants, but a unified bioinformatic analysis platform for these data is missing. ResultsWe present FA-NIVA (Fanconi anemia - Nanopore Indel and Variant Analysis), an automated and adaptable analysis workflow tailored for Nanopore-based long-read sequencing data in FA genetic analysis. FA-NIVA integrates state-of-the-art tools to comprehensively detect both single nucleotide variants (SNVs) and structural variants (SVs). Our analysis platform enhances genotyping accuracy for biallelic variants by a joint SNV-SV based phasing in FA associated genes. Built within the Nextflow ecosystem and powered by containerized Docker images, FA-NIVA ensures reproducibility, flexibility, scalability and transparency across different computing environments. Together, FA-NIVA provides a robust end-to-end solution for the automated analysis of SVs and SNVs and high-resolution phasing analysis in FA genes, enabling an accurate and efficient pipeline for genetic analysis. AvailabilityFA-NIVA is available on GitHub at: https://github.com/UKWgenommedizin/FA-NIVA.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
18.4%
2
Genome Medicine
154 papers in training set
Top 0.3%
12.2%
3
Nature Communications
4913 papers in training set
Top 23%
8.3%
4
Bioinformatics Advances
184 papers in training set
Top 0.3%
7.1%
5
Human Mutation
29 papers in training set
Top 0.1%
4.8%
50% of probability mass above
6
Nucleic Acids Research
1128 papers in training set
Top 4%
4.8%
7
Genetics in Medicine
69 papers in training set
Top 0.4%
4.3%
8
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.6%
9
BMC Medical Genomics
36 papers in training set
Top 0.2%
2.7%
10
GigaScience
172 papers in training set
Top 0.9%
2.3%
11
BMC Bioinformatics
383 papers in training set
Top 4%
2.1%
12
Scientific Reports
3102 papers in training set
Top 54%
1.9%
13
BMC Genomics
328 papers in training set
Top 2%
1.9%
14
Genome Biology
555 papers in training set
Top 4%
1.7%
15
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.3%
16
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
1.2%
17
Cell Genomics
162 papers in training set
Top 5%
0.9%
18
Communications Biology
886 papers in training set
Top 19%
0.9%
19
The Journal of Molecular Diagnostics
36 papers in training set
Top 0.4%
0.9%
20
Frontiers in Bioinformatics
45 papers in training set
Top 0.8%
0.8%
21
Clinical Chemistry
22 papers in training set
Top 0.8%
0.8%
22
Clinical and Translational Science
21 papers in training set
Top 1%
0.7%
23
iScience
1063 papers in training set
Top 33%
0.7%
24
Frontiers in Genetics
197 papers in training set
Top 11%
0.7%
25
PLOS Computational Biology
1633 papers in training set
Top 26%
0.7%
26
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.9%
0.7%
27
Nature Medicine
117 papers in training set
Top 6%
0.6%
28
Viruses
318 papers in training set
Top 6%
0.6%
29
Alzheimer's & Dementia
143 papers in training set
Top 3%
0.6%