STRategy: A support system for collecting and analyzing short tandem repeats for forensic science

Kulthammanit, N.; Sukawutthiya, P.; Noh, H.; Vongpaisarnsin, K.; Wichadakul, D.

2023-02-21 genetics
10.1101/2023.02.20.529208 bioRxiv
Short tandem repeats (STRs) are short repeated sequences commonly found in the human genome. They offer many advantages to forensic science, such as identifying individuals, estimating the likelihood of kinship, and analyzing mixtures. Next-generation sequencing (NGS) technologies, e.g., ForenSeq Signature Prep, have been proposed for sequencing STRs, obtaining the sequence of each locus and SNPs, and inferring length-based alleles. However, although STRs sequenced with ForenSeq offer more insight into the STRs, enabling genetic analysis of population and sub-population structures, no open-source software platform enables the collection and management of STR data from NGS and incorporates related analysis tools in one place. Here, we introduce STRategy, a standalone web-based application supporting essential STR data management and analysis capabilities. Analyzed data are visualized in various forms, for example, charts, maps, and pattern alignments. The system implements role-based access control, which lets users search or access specific data depending on their responsibilities. Public users can search for data and view statistical data, for example, detailed alleles and genetic variation. Lab users can add, update, and view the information of individuals and explore pattern alignments for a specific locus within the population. Administrators can customize the system, for example, by configuring maps according to the samples' geographic data and managing reference STR repeat motifs. We designed and developed STRategy using software engineering principles for flexible extension and easy deployment with Docker containers. The source code is publicly available at https://github.com/cucpbioinfo/STRategy. We also deployed a showcase system on a cloud computing service; its URL is listed in the GitHub repository. The current version supports only ForenSeq sample detail report files.
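The role-based access control described in the abstract can be illustrated with a minimal sketch. This is not STRategy's actual implementation: the `Role` enum, the permission names, and the `is_allowed` helper are all hypothetical, and the assumption that each role inherits the permissions of the one below it is ours, not something the abstract states.

```python
from enum import Enum


class Role(Enum):
    PUBLIC = "public"
    LAB = "lab"
    ADMIN = "admin"


# Hypothetical permission labels, one per capability named in the abstract.
PUBLIC_PERMS = frozenset({"search_data", "view_statistics"})

# Assumption: lab users can also do everything public users can.
LAB_PERMS = PUBLIC_PERMS | {
    "add_individual",
    "update_individual",
    "view_individual",
    "explore_pattern_alignment",
}

# Assumption: administrators can also do everything lab users can.
ADMIN_PERMS = LAB_PERMS | {"configure_maps", "manage_reference_motifs"}

PERMISSIONS = {
    Role.PUBLIC: PUBLIC_PERMS,
    Role.LAB: LAB_PERMS,
    Role.ADMIN: ADMIN_PERMS,
}


def is_allowed(role: Role, action: str) -> bool:
    """Return True if the given role may perform the given action."""
    return action in PERMISSIONS[role]
```

With this table, `is_allowed(Role.PUBLIC, "update_individual")` is false while `is_allowed(Role.LAB, "update_individual")` is true. Keeping the mapping in one place makes it straightforward to add a role or capability without touching the check itself.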

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.