Back

T-Rex: Standardized Analysis of Germline Variants in Whole-Exome Sequencing Trios

Reh, S.-L.; Walter, C.; Lohse, J.; Ghete, T.; Metzler, M.; Quante, A.; Hauer, J.; Auer, F.

2026-04-01 bioinformatics
10.64898/2026.03.30.715083 bioRxiv
Show abstract

Whole-exome sequencing (WES) enables the identification of rare germline variants contributing to pediatric diseases. Trio-based sequencing, comparing affected children with their parents, is particularly effective for rare disease genetics. However, WES data analysis requires bioinformatics expertise, varies across institutions, and is often incompatible with clinical workflows. We developed T-Rex (Trio Rare variant analysis of EXomes), a cross-platform desktop application that enables the standardized and local analysis of WES germline Trio data without the need for programming knowledge. T-Rex integrates state-of-the-art tools for alignment, dual-variant calling (GATK HaplotypeCaller + VarScan2), annotation (SNPEff/SNPSift), rare-variant filtering based on population frequencies (gnomAD), and family-based statistical testing, including the Transmission Disequilibrium Test with multiple-testing correction. Benchmarking of the dual-caller strategy on the Genome in a Bottle Ashkenazim Trio demonstrates high precision (99.2%) while maintaining robust sensitivity (91.1%). User testing (n=13) confirmed quick learning across clinicians and researchers. Application to a cohort of n=121 pediatric cancer Trio datasets, filtering for rare protein-coding variants (MAF[≤]0.1% in gnomAD v4.0), validated all assessable previously reported pathogenic variants. Overall, T-Rex enables clinicians to robustly analyze WES Trio data in compliance with data protection regulations without requiring additional software licenses. As one of the first platforms for comprehensive WES Trio analysis that requires no programming expertise while providing clinical-grade, end-to-end workflows, T-Rex facilitates collaborative research between clinics and reduces reliance on external providers. Implementation and AvailabilityThe source code is available on GitHub (https://github.com/SaraLuisaReh/trex). The fully precompiled app is available on Zenodo (https://zenodo.org/records/19135262).

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.1%
18.9%
2
Nature Communications
4913 papers in training set
Top 17%
10.2%
3
Bioinformatics
1061 papers in training set
Top 3%
10.2%
4
Nature Biotechnology
147 papers in training set
Top 1%
7.2%
5
Nature Methods
336 papers in training set
Top 2%
6.4%
50% of probability mass above
6
The American Journal of Human Genetics
206 papers in training set
Top 0.7%
6.4%
7
BMC Bioinformatics
383 papers in training set
Top 3%
3.7%
8
Nucleic Acids Research
1128 papers in training set
Top 6%
3.6%
9
Bioinformatics Advances
184 papers in training set
Top 1%
3.6%
10
Nature Genetics
240 papers in training set
Top 4%
1.9%
11
PLOS ONE
4510 papers in training set
Top 52%
1.8%
12
Genome Biology
555 papers in training set
Top 4%
1.8%
13
Scientific Reports
3102 papers in training set
Top 62%
1.5%
14
Genome Research
409 papers in training set
Top 3%
1.3%
15
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.0%
16
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.7%
0.9%
17
npj Genomic Medicine
33 papers in training set
Top 0.7%
0.9%
18
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
19
Cell Systems
167 papers in training set
Top 12%
0.7%
20
Cell Genomics
162 papers in training set
Top 7%
0.7%
21
Human Mutation
29 papers in training set
Top 0.8%
0.7%
22
European Journal of Human Genetics
49 papers in training set
Top 2%
0.7%
23
Nature Medicine
117 papers in training set
Top 6%
0.7%
24
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.5%
25
Alzheimer's & Dementia
143 papers in training set
Top 3%
0.5%
26
Nature
575 papers in training set
Top 18%
0.5%
27
GigaScience
172 papers in training set
Top 4%
0.5%