Next-generation sequencing analysis with a population-specific reference genome

Suzuki, T.; Ninomiya, K.; Funayama, T.; Okamura, Y.; Tadaka, S.; the Tohoku Medical Megabank Project Study Group; Kinoshita, K.; Yamamoto, M.; Kure, S.; Kikuchi, A.; Tamiya, G.; Takayama, J.

2024-03-10 bioinformatics
10.1101/2024.03.07.584017 bioRxiv

Next-generation sequencing (NGS) has become widely available and is routinely used in basic research and clinical practice. The reference genome sequence is an essential resource for NGS analysis, and several population-specific reference genomes have recently been constructed to better accommodate the vast genetic diversity of human samples. However, the resources supporting population-specific references are insufficient, which makes analysis with these reference genomes burdensome. Here, we constructed a set of resources to support NGS analysis using the Japanese reference genome sequence, JG. We created resources for variant calling, gene and repeat element annotation, variant-effect prediction, read mappability, and RNA-seq analysis. We also provide a resource for converting reference coordinates to enable further annotation enrichment, together with a variant calling protocol that uses the JG-based resources. Our resources serve as a guide for preparing the resources needed to use population-specific reference genomes and can facilitate migration between reference genomes.

Matching journals

The top-ranked journal alone covers at least 50% of the predicted probability mass.

Rank | Journal | Papers in training set | Percentile | Predicted probability
1 | Genomics, Proteomics & Bioinformatics | 171 | Top 0.1% | 54.1%
(50% of probability mass above)
2 | Journal of Genetics and Genomics | 36 | Top 0.1% | 6.6%
3 | Briefings in Bioinformatics | 326 | Top 1% | 4.1%
4 | PLOS ONE | 4510 | Top 35% | 4.1%
5 | Database | 51 | Top 0.2% | 3.7%
6 | Nucleic Acids Research | 1128 | Top 8% | 2.2%
7 | Frontiers in Genetics | 197 | Top 4% | 2.0%
8 | Scientific Reports | 3102 | Top 55% | 1.8%
9 | Science China Life Sciences | 26 | Top 1% | 1.5%
10 | Clinical and Translational Medicine | 30 | Top 0.5% | 1.3%
11 | BMC Bioinformatics | 383 | Top 6% | 1.2%
12 | Genome Biology | 555 | Top 6% | 0.9%
13 | Computational and Structural Biotechnology Journal | 216 | Top 7% | 0.9%
14 | DNA Research | 23 | Top 0.4% | 0.8%
15 | BioMed Research International | 25 | Top 3% | 0.8%
16 | Human Genetics | 25 | Top 0.4% | 0.8%
17 | Biosensors and Bioelectronics | 52 | Top 1% | 0.7%
18 | Journal of Hospital Infection | 27 | Top 0.7% | 0.7%
19 | Microbial Genomics | 204 | Top 2% | 0.7%
20 | Forensic Science International: Genetics | 24 | Top 0.1% | 0.7%
21 | Small Methods | 26 | Top 1% | 0.7%
22 | Genome Medicine | 154 | Top 9% | 0.7%
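
The statement that a single journal covers half of the predicted probability mass can be checked with a running total over the ranked probabilities. The following is a minimal sketch, not the site's own method: the function name `journals_needed` and the threshold parameter are hypothetical, and the percentages are copied by hand from the ranked list above.

```python
# Predicted probabilities in percent, copied from the ranked list (ranks 1 to 22).
probs = [54.1, 6.6, 4.1, 4.1, 3.7, 2.2, 2.0, 1.8, 1.5, 1.3, 1.2,
         0.9, 0.9, 0.8, 0.8, 0.8, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7]


def journals_needed(probabilities, threshold=50.0):
    """Return how many top-ranked entries are needed for the running
    total to reach `threshold` percent, or None if it is never reached."""
    total = 0.0
    for count, p in enumerate(probabilities, start=1):
        total += p
        if total >= threshold:
            return count
    return None


# Number of top-ranked journals needed to reach 50% of the probability mass.
print(journals_needed(probs))
```

Because the 22 listed journals do not sum to 100%, a threshold close to 100 returns `None` for this list; the remaining mass presumably belongs to journals not shown.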