NGS technologies are revolutionizing diagnostics and clinical medicine. However, these approaches have previously proven inefficient at identifying pathogenic repeat expansions (RE), which underlie at least 30 inherited human conditions such as ataxia, Fragile X and Friedreich ataxia. These disorders occur when a short tandem repeat (STR) is expanded beyond a locus-specific threshold. RE disorders display a broad onset age and can present with overlapping phenotypes, making clinical and molecular diagnoses challenging. Traditionally, molecular testing for RE has utilised single-gene PCR-based methods, however these can be slow and costly and often yield no result. We recently developed exSTRa (expanded short tandem repeat algorithm, [1]), which interrogates NGS data to identify RE. We demonstrated the tool could identify 11 of 12 common RE with high specificity and sensitivity, suggesting considerable clinical utility in comparison to current molecular testing methods.
One limitation of exSTRa and similar RE detection algorithms is the need to provide the genomic co-ordinates of STRs to be interrogated [2], precluding genome-wide discovery of novel RE. Here we applied a pipeline of bioinformatics tools, testing for both novel and known RE, to a cohort of 22 families with a clinical diagnosis of cerebellar ataxia with neuropathy and bilateral vestibular areflexia syndrome (CANVAS). Genome-wide analysis identified a novel recessively inherited intronic RE [(AAGGG)exp] in the gene encoding Replication Factor C1 (RFC1). Genetic analyses confirmed the pathogenic RE in 18 CANVAS families, demonstrating it is the predominant cause of this disorder [3].
In conclusion, these studies have demonstrated that improved bioinformatics tools increase the diagnostic utility of NGS to determine the genetic basis of a heterogeneous group of clinically overlapping neurogenetic disorders. They provide a powerful new approach for the identification of novel RE and add to the evidence base for implementation of WGS as cost effective single test for molecular diagnostics.