Medically relevant tandem repeats in nanopore sequencing of control cohorts

De Coster, W.; Hoijer, I.; Bruggeman, I.; D'Hert, S.; Melin, M.; Ameur, A.; Rademakers, R.

2024-03-14 genetic and genomic medicine
10.1101/2024.03.06.24303700 medRxiv
Research and diagnostics for medically relevant tandem repeats and repeat expansions are hampered by the lack of population-scale databases. We attempt to fill this gap using our pathSTR web tool, which leverages long-read sequencing of large cohorts to determine repeat length and sequence composition in the general population. The current version includes 878 individuals of the 1000 Genomes Project cohort sequenced on the Oxford Nanopore Technologies PromethION. A comprehensive set of medically relevant tandem repeats was genotyped using STRdust to determine the tandem repeat length and sequence composition. PathSTR provides rich visualizations of this dataset, as well as the option to upload one's own data for comparison alongside the control cohort. We demonstrate the implementation of this application using data from targeted nanopore sequencing of a patient with Myotonic Dystrophy type 1. This resource will empower the genetics community to get a more complete overview of normal variation in tandem repeat length and sequence composition, and enable a better assessment of the pathogenic impact of tandem repeats observed in patients. PathSTR is available at https://pathstr.bioinf.be

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

Rank | Journal | Papers in training set | Percentile | Predicted probability
1 | Human Mutation | 29 | Top 0.1% | 22.2%
2 | Genome Medicine | 154 | Top 0.6% | 8.3%
3 | The American Journal of Human Genetics | 206 | Top 0.9% | 4.8%
4 | npj Genomic Medicine | 33 | Top 0.1% | 4.8%
5 | Bioinformatics Advances | 184 | Top 0.9% | 4.3%
6 | European Journal of Human Genetics | 49 | Top 0.2% | 3.9%
7 | BMC Genomics | 328 | Top 0.9% | 3.5%
50% of probability mass above
8 | Genetics in Medicine | 69 | Top 0.4% | 3.5%
9 | Scientific Reports | 3102 | Top 38% | 3.5%
10 | Bioinformatics | 1061 | Top 6% | 3.2%
11 | BMC Medical Genomics | 36 | Top 0.2% | 3.0%
12 | Nature Communications | 4913 | Top 47% | 2.1%
13 | Nucleic Acids Research | 1128 | Top 10% | 1.9%
14 | Frontiers in Genetics | 197 | Top 5% | 1.7%
15 | GENETICS | 189 | Top 0.6% | 1.7%
16 | Genome Biology | 555 | Top 5% | 1.5%
17 | PLOS Computational Biology | 1633 | Top 20% | 1.2%
18 | Human Molecular Genetics | 130 | Top 2% | 1.2%
19 | Journal of Medical Genetics | 28 | Top 0.4% | 1.2%
20 | Journal of the American Medical Informatics Association | 61 | Top 2% | 1.2%
21 | Genes | 126 | Top 2% | 1.2%
22 | Human Genetics and Genomics Advances | 70 | Top 0.6% | 0.9%
23 | Genetics in Medicine Open | 10 | Top 0.1% | 0.9%
24 | Genetic Epidemiology | 46 | Top 0.7% | 0.9%
25 | BMC Bioinformatics | 383 | Top 7% | 0.8%
26 | Frontiers in Molecular Biosciences | 100 | Top 4% | 0.8%
27 | NAR Genomics and Bioinformatics | 214 | Top 4% | 0.7%
28 | Briefings in Bioinformatics | 326 | Top 7% | 0.7%
29 | Nature Medicine | 117 | Top 5% | 0.7%
30 | Circulation: Genomic and Precision Medicine | 42 | Top 1% | 0.7%
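
The statement that the top 7 journals account for 50% of the predicted probability mass can be checked with a running sum over the listed probabilities. The snippet below is a minimal sketch of that check, not the site's own method: the function name `journals_to_reach` is hypothetical, and the input is simply the first ten percentages copied from the listing above.

```python
from itertools import accumulate

# Predicted probabilities (%) of the ten highest-ranked journals,
# copied in rank order from the listing above.
probs = [22.2, 8.3, 4.8, 4.8, 4.3, 3.9, 3.5, 3.5, 3.5, 3.2]


def journals_to_reach(probabilities, threshold):
    """Return (count, cumulative) for the first rank at which the
    running sum of probabilities reaches the threshold.

    Hypothetical helper for illustration only.
    """
    for count, total in enumerate(accumulate(probabilities), start=1):
        if total >= threshold:
            return count, total
    raise ValueError("threshold not reached by the given probabilities")


count, total = journals_to_reach(probs, 50.0)
print(f"{count} journals reach a cumulative {total:.1f}%")
```

Under these inputs the running sum first passes 50% at the seventh journal, which matches where the "50% of probability mass above" marker sits in the listing.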