PT - JOURNAL ARTICLE AU - Halman, Andreas AU - Dolzhenko, Egor AU - Oshlack, Alicia TI - STRipy: a graphical application for enhanced genotyping of pathogenic short tandem repeats in sequencing data AID - 10.1101/2021.06.13.448220 DP - 2021 Jan 01 TA - bioRxiv PG - 2021.06.13.448220 4099 - http://biorxiv.org/content/early/2021/06/14/2021.06.13.448220.short 4100 - http://biorxiv.org/content/early/2021/06/14/2021.06.13.448220.full AB - Short tandem repeats (STRs) are highly polymorphic with high mutation rates and expansions of STRs have been implicated as the causal variant in diseases. The application of genome sequencing in patients has recently allowed many new discoveries with over 50 disease causing loci known to date. There are several tools which allow genotyping of STRs from high-throughput sequencing (HTS) data. However, running these tools out of the box only allow around half of the known disease-causing loci to be genotyped, with lengths often limited to either read or fragment length which is less than the pathogenic cut-off for some diseases. While analysis tools can be customised to genotype extra loci, this requires proficiency in bioinformatics to set up, use, and analyse the resulting data, limiting their widespread usage by other researchers and clinicians.To address these issues, we have created a new software called STRipy that has an intuitive graphical interface and requires no specific skills for usage, thus significantly simplifying detection of STRs expansions from human HTS data. STRipy is able to target all known disease-causing STRs with genotyping performed with an established tool, ExpansionHunter, that is incorporated into the software. We have created additional functionality into STRipy to work with long alleles exceeding the fragment length.STRipy was validated using over 60 thousand simulated samples and was shown to work on whole genome sequencing of biological samples with pathogenic variants. Finally, we have used STRipy to acquire genotypes of pathogenic loci for thousands of samples from various populations which are provided to the user along with the data from the literature to assist with results interpretation. We believe the simplicity and breadth of STRipy will increase the testing of STR diseases in current datasets resulting in further diagnoses of rare diseases caused by STRs expansions.Competing Interest StatementEgor Dolzhenko is an employee of Illumina, Inc., a public company that develops and markets systems for genetic analysis.