Fantastic tandem repeats and where to find them
Human genomes contain over a million tandem repeats [1], [2]. These highly instable and polymorphic variations have historically been divided into three groups depending on the size of the repeated motif. Microsatellites (or short tandem repeats, STR) correspond to repetitions of typically 2–6 and up to 9 nucleotides and are the most common repeats present in noncoding regions of genes. Minisatellites (10–99 bp repeats) is a heterogeneous category of repeated elements that is also present but more rarely in specific parts of genes while satellites (≥ 100 bp repeats) are restricted to gene-depleted regions and mainly constitute heterochromatin and centromeres. The distinction between microsatellites and minisatellites is blur and altogether, they form the variable number of tandem repeats (VNTR). Tandem repeats are mainly scattered in noncoding regions of the genome although repetitions of triplets can also be found in a subset of protein coding genes. This rapidly evolving variations are recent in terms of evolution and many of them are even specific to humans. Recent evidence suggests that tandem repeats overlapping regulatory and/or intronic regions could represent functional DNA elements able to regulate or modulate gene transcription [3].
From normal to pathological
The expansion of tandem repeats across generations is a well-known process described first in 1991 [4], [5], [6] and resulting in at least 50 human disorders [7]. The peripheral and central nervous system and the muscles are the main affected tissues in the vast majority of the expansion disorders known so far, but the examples of Fragile X-associated Primary Ovary Insufficiency (premutation: 55-200 CGG repeat expansions in FMR1 in females) and Fuchs’s corneal dystrophy (intronic CTG repeats in TCF4) suggest that repeat expansions could affect other tissues as well. Because of their repetitive nature, repeats are unstable in a length-dependent manner, with the longest, uninterrupted tandem repeats being by far the most prone to expand. Repeats can expand both during mitosis and meiosis, and their stability/instability is greatly influenced by DNA mismatch repair processes [8].
Coding or noncoding
Two main types of STR expansions exist: expansions affecting coding regions, mainly associated with codons encoding polyglutamine and polyalanine, and expansions altering non-coding regions of genes. The most frequent coding expansion disorders are associated with CAG triplet expansions, including those found in Huntington’s disease (HD), spinocerebellar ataxias type 1, 2, 3, 6, 7, 17, and are characterized by neuronal intranuclear protein inclusions that contain the polyQ-expanded protein [9]. Noncoding repeat expansions can be located in 5’ or 3’ untranslated (UTR) regions, promoter or introns of genes and they act via distinct mechanisms depending on their location, GC content and impact on gene transcription and translation [7], [10].
Hunting hidden repeats and discriminating the bad from the benign
The discovery of most repeat expansion disorders has been possible thanks to familial or multiple cases allowing to conduct linkage analyses and has usually taken many years and efforts. Indeed, repeats are difficult to study by short-read sequencing technologies that are still prevalent for clinical applications and repeat expansions are typically missed unless they are specifically looked for by appropriate techniques or bioinformatics tools. The last three years have witnessed an acceleration of repeat expansion discoveries related to the development of both computational tools and third generation sequencing technologies, including long-read sequencing. Yet, despite the technological advances, the study of repeats remains difficult because of their abundance of the human genome, the limited knowledge about their variability in control populations, and the difficulty in distinguishing benign from pathogenic expanded motifs.
In this context, this issue of Medizinische Genetik aims to provide an update in the field, featuring selected disorders caused by repeat expansions. The first review by Arning and Nguyen provides an update on Huntington disease. It particularly emphasizes the impact of CAG trinucleotide repeat instability, its relationship with DNA repair pathways and how this interaction acts as a modifier of disorder onset and symptom progression. The second review by Thieme et al. presents the molecular and clinical characteristics of Cerebellar Ataxia, Neuropathy and Vestibular Areflexia Syndrome (CANVAS), a recessive neurodegenerative disorder clinically recognized in 2011 but which cause, a biallelic AAGGG expansion in RFC1, has been identified in 2019. The third review by Peters et al. features Familial Adult Myoclonic Epilepsy (FAME) a dominant disorder caused by the same TTTCA repeat insertion in different genes and highlights how repeat expansion can be pathogenic independently of the gene where they occur. The fourth review by Pozojevic et al. describes the many facets of X-linked dystonia-parkinsonism (XDP), another neurodegenerative disorder resulting from the insertion of a short interspersed nuclear element (SINE)-VNTR-Alu (SVA) retrotransposon, associated with a polymorphic hexanucleotide (CCCTCT) repeat in TAF1. Finally, the last and fifth review by Schröder et al. provides an update on GC-rich expansions, which correspond to at least a third of all repeat expansions described so far and highlight the different mechanisms by which these expansions lead to disorder.
We hope that you will enjoy reading these reviews. Beyond their educational purpose, we hope that this issue will raise awareness of the difficulty to identify new pathogenic repeat expansions and the need to use specific techniques or bioinformatic tools in order to look specifically for them. Thirty years after their first description, we have acquired an important knowledge about the molecular mechanisms associated with repeats that can be used, in combination with recent long-read technologies, to systematically study repeat expansions and find those accounting for unsolved genetic disorders, just as any other variant type.
References
[1] Gymrek M. A genomic view of short tandem repeats. Curr Opin Genet Dev. 2017;44:9–16.10.1016/j.gde.2017.01.012Search in Google Scholar PubMed
[2] Willems T, Gymrek M, Highnam G, Genomes Project C, Mittelman D, Erlich Y. The landscape of human STR variation. Genome Res. 2014;24(11):1894–904.10.1101/gr.177774.114Search in Google Scholar PubMed PubMed Central
[3] Fotsing SF, Margoliash J, Wang C, Saini S, Yanicky R, Shleizer-Burko S et al. The impact of short tandem repeat variation on gene expression. Nat Genet. 2019;51(11):1652–9.10.1038/s41588-019-0521-9Search in Google Scholar PubMed PubMed Central
[4] Verkerk AJ, Pieretti M, Sutcliffe JS, Fu YH, Kuhl DP, Pizzuti A et al. Identification of a gene (FMR-1) containing a CGG repeat coincident with a breakpoint cluster region exhibiting length variation in fragile X syndrome. Cell. 1991;65(5):905–14.10.1016/0092-8674(91)90397-HSearch in Google Scholar PubMed
[5] La Spada AR, Wilson EM, Lubahn DB, Harding AE, Fischbeck KH. Androgen receptor gene mutations in X-linked spinal and bulbar muscular atrophy. Nature. 1991;352(6330):77–9.10.1038/352077a0Search in Google Scholar PubMed
[6] Oberle I, Rousseau F, Heitz D, Kretz C, Devys D, Hanauer A et al. Instability of a 550-base pair DNA segment and abnormal methylation in fragile X syndrome. Science. 1991;252(5009):1097–102.10.1126/science.252.5009.1097Search in Google Scholar PubMed
[7] Depienne C, Mandel JL. 30 years of repeat expansion disorders: What have we learned and what are the remaining challenges? Am J Hum Genet. 2021.10.1016/j.ajhg.2021.03.011Search in Google Scholar PubMed PubMed Central
[8] Pearson CE, Nichol Edamura K, Cleary JD. Repeat instability: mechanisms of dynamic mutations. Nat Rev Genet. 2005;6(10):729–42.10.1038/nrg1689Search in Google Scholar PubMed
[9] Stoyas CA, La Spada AR. The CAG-polyglutamine repeat diseases: a clinical, molecular, genetic, and pathophysiologic nosology. Handb Clin Neurol. 2018;147:143–70.10.1016/B978-0-444-63233-3.00011-7Search in Google Scholar PubMed
[10] Rohilla KJ, Gagnon KT. RNA biology of disease-associated microsatellite repeat expansions. Acta Neuropathol Commun. 2017;5(1):63.10.1186/s40478-017-0468-ySearch in Google Scholar PubMed PubMed Central
© 2021 Depienne, published by De Gruyter
This work is licensed under the Creative Commons Attribution 4.0 International License.
Articles in the same Issue
- Frontmatter
- MAIN TOPIC: Tandem repeat expansions: the good, the bad and the hidden
- Tandem repeat expansions: the good, the bad and the hidden
- Huntington disease update: new insights into the role of repeat instability in disease pathogenesis
- Cerebellar ataxia, neuropathy and vestibular areflexia syndrome (CANVAS): from clinical diagnosis towards genetic testing
- Familial adult myoclonic epilepsy (FAME): clinical features, molecular characteristics, pathophysiological aspects and diagnostic work-up
- X-linked dystonia-parkinsonism: over and above a repeat disorder
- GC-rich repeat expansions: associated disorders and mechanisms
- BERICHTE AUS DER HUMANGENETIK
- Aktuelle Debatte
- Wie wichtig ist die Kenntnis des genetischen Populationshintergrundes in der Medizin? Ein humangenetischer Beitrag vor dem Hintergrund der aktuellen Diskussion um die Verwendung des Begriffs „Rasse“
- Karriere und Perspektiven
- Career satisfaction of German human genetics residents
- Personalia
- News aus den humangenetischen Einrichtungen
- Tagungsbericht
- Angeborene Stoffwechselerkrankungen Klinik, Diagnostik, Therapie
- GfH-Verbandsmitteilungen
- Einladung zur 34. ordentlichen Mitgliederversammlung der Deutschen Gesellschaft für Humangenetik anlässlich der 33. GfH-Jahrestagung in Würzburg, 16.–18.3.2022
- Die GfH-Juniorakademie 2021 – Persönliche Begegnungen wieder möglich gemacht
- Aktuelle Nachrichten
- Aktuelle Nachrichten
Articles in the same Issue
- Frontmatter
- MAIN TOPIC: Tandem repeat expansions: the good, the bad and the hidden
- Tandem repeat expansions: the good, the bad and the hidden
- Huntington disease update: new insights into the role of repeat instability in disease pathogenesis
- Cerebellar ataxia, neuropathy and vestibular areflexia syndrome (CANVAS): from clinical diagnosis towards genetic testing
- Familial adult myoclonic epilepsy (FAME): clinical features, molecular characteristics, pathophysiological aspects and diagnostic work-up
- X-linked dystonia-parkinsonism: over and above a repeat disorder
- GC-rich repeat expansions: associated disorders and mechanisms
- BERICHTE AUS DER HUMANGENETIK
- Aktuelle Debatte
- Wie wichtig ist die Kenntnis des genetischen Populationshintergrundes in der Medizin? Ein humangenetischer Beitrag vor dem Hintergrund der aktuellen Diskussion um die Verwendung des Begriffs „Rasse“
- Karriere und Perspektiven
- Career satisfaction of German human genetics residents
- Personalia
- News aus den humangenetischen Einrichtungen
- Tagungsbericht
- Angeborene Stoffwechselerkrankungen Klinik, Diagnostik, Therapie
- GfH-Verbandsmitteilungen
- Einladung zur 34. ordentlichen Mitgliederversammlung der Deutschen Gesellschaft für Humangenetik anlässlich der 33. GfH-Jahrestagung in Würzburg, 16.–18.3.2022
- Die GfH-Juniorakademie 2021 – Persönliche Begegnungen wieder möglich gemacht
- Aktuelle Nachrichten
- Aktuelle Nachrichten