In
genetics
Genetics is the study of genes, genetic variation, and heredity in organisms.Hartl D, Jones E (2005) It is an important branch in biology because heredity is vital to organisms' evolution. Gregor Mendel, a Moravian Augustinians, Augustinian ...
, a minisatellite is a tract of repetitive
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
in which certain
DNA motifs (ranging in length from 10–60
base pairs
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
) are typically repeated two to several hundred times. Minisatellites occur at more than 1,000 locations in the
human genome
The human genome is a complete set of nucleic acid sequences for humans, encoded as the DNA within each of the 23 distinct chromosomes in the cell nucleus. A small DNA molecule is found within individual Mitochondrial DNA, mitochondria. These ar ...
and they are notable for their high
mutation
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, ...
rate and high diversity in the population. Minisatellites are prominent in the
centromere
The centromere links a pair of sister chromatids together during cell division. This constricted region of chromosome connects the sister chromatids, creating a short arm (p) and a long arm (q) on the chromatids. During mitosis, spindle fiber ...
s and
telomere
A telomere (; ) is a region of repetitive nucleotide sequences associated with specialized proteins at the ends of linear chromosomes (see #Sequences, Sequences). Telomeres are a widespread genetic feature most commonly found in eukaryotes. In ...
s of
chromosome
A chromosome is a package of DNA containing part or all of the genetic material of an organism. In most chromosomes, the very long thin DNA fibers are coated with nucleosome-forming packaging proteins; in eukaryotic cells, the most import ...
s, the latter protecting the chromosomes from damage. The name
"satellite" refers to the early observation that
centrifugation
Centrifugation is a mechanical process which involves the use of the centrifugal force to separate particles from a solution according to their size, shape, density, medium viscosity and rotor speed. The denser components of the mixture migrate ...
of genomic DNA in a test tube separates a prominent layer of bulk DNA from accompanying "satellite" layers of repetitive DNA. Minisatellites are small sequences of DNA that do not encode
proteins
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, re ...
but appear throughout the genome hundreds of times, with many repeated copies lying next to each other.
Minisatellites and their shorter cousins, the
microsatellite
A microsatellite is a tract of repetitive DNA in which certain Sequence motif, DNA motifs (ranging in length from one to six or more base pairs) are repeated, typically 5–50 times. Microsatellites occur at thousands of locations within an organ ...
s, together are classified as
VNTR (variable number of tandem repeats) DNA. Confusingly, minisatellites are often referred to as VNTRs, and microsatellites are often referred to as short tandem repeats (STRs) or simple sequence repeats (SSRs).
Structure
Minisatellites consist of repetitive, generally
GC-rich, motifs that range in length from 10 to over 100 base pairs. These variant repeats are tandemly intermingled. Some minisatellites contain a central sequence (or "core unit") of nucleobases "GGGCAGGANG" (where N can be any base) or more generally consist of sequence motifs of purines (
adenine
Adenine (, ) (nucleoside#List of nucleosides and corresponding nucleobases, symbol A or Ade) is a purine nucleotide base that is found in DNA, RNA, and Adenosine triphosphate, ATP. Usually a white crystalline subtance. The shape of adenine is ...
(A) and
guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
(G)) and pyrimidines (
cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
(C) and
thymine
Thymine () (symbol T or Thy) is one of the four nucleotide bases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine ...
(T)).
Hypervariable minisatellites have core units 9–64 bp long and are found mainly at the centromeric regions.
In humans, 90% of minisatellites are found at the sub-telomeric region of chromosomes. The human
telomere
A telomere (; ) is a region of repetitive nucleotide sequences associated with specialized proteins at the ends of linear chromosomes (see #Sequences, Sequences). Telomeres are a widespread genetic feature most commonly found in eukaryotes. In ...
sequence itself is a tandem repeat: TTAGGG TTAGGG TTAGGG ...
Function
Minisatellites have been implicated as regulators of
gene expression
Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
(e.g. at levels of
transcription,
alternative splicing
Alternative splicing, alternative RNA splicing, or differential splicing, is an alternative RNA splicing, splicing process during gene expression that allows a single gene to produce different splice variants. For example, some exons of a gene ma ...
, or
imprint control). They are generally
non-coding DNA
Non-coding DNA (ncDNA) sequences are components of an organism's DNA that do not encode protein sequences. Some non-coding DNA is transcribed into functional non-coding RNA molecules (e.g. transfer RNA, microRNA, piRNA, ribosomal RNA, and reg ...
but sometimes are part of
possible genes.
Minisatellites also constitute the chromosomal telomeres, which protect the ends of a chromosome from deterioration or from fusion with neighbouring chromosomes.
Mutability
Minisatellites have been associated with chromosome fragile sites and are proximal to a number of recurrent translocation breakpoints.
Some human minisatellites (~1%) have been demonstrated to be
hypermutable, with an average mutation rate in the germline higher than 0.5% up to over 20%, making them the most unstable region in the human genome known to date. While other genomes (mouse, rat and pig) contain minisatellite-like sequences, none was found to be hypermutable. Since all hypermutable minisatellites contain
internal variants, they provide extremely informative systems for analyzing the complex turnover processes that occur at this class of tandem repeat. Minisatellite variant repeat mapping by PCR (MVR-PCR) has been extensively used to chart the interspersion patterns of variant repeats along the array, which provides details on the structure of the alleles before and after mutation.
Studies have revealed distinct mutation processes operating in somatic and germline cells. Somatic instability detected in blood DNA shows simple and rare intra-allelic events two to three orders of magnitude lower than in sperm. In contrast, complex inter-allelic conversion-like events occur in the germline.
Additional analyses of DNA sequences flanking human minisatellites have also revealed an intense and highly localized
meiotic crossover hotspot that is centered upstream of the unstable side of minisatellite arrays. Repeat turnover therefore appears to be controlled by recombinational activity in DNA that flanks the repeat array and results in a polarity of mutation. These findings have suggested that minisatellites most probably evolved as bystanders of localized meiotic
recombination hotspots in the human genome.
It has been proposed that minisatellite sequences encourage chromosomes to swap DNA. In alternative models, it is the presence of neighbouring double-strand hotspots which is the primary cause of minisatellite repeat copy number variations.
Somatic changes are suggested to result from replication difficulties (which might include
replication slippage, among other phenomena).
Studies have shown that the evolutionary fate of minisatellites tends towards an equilibrium distribution in the size of alleles, until mutations in the flanking DNA affect the recombinational activity of a minisatellite by suppressing DNA instability. Such an event would ultimately lead to the extinction of a hypermutable minisatellite by
meiotic drive.
History
The first human minisatellite was discovered in 1980 by
A.R. Wyman and
R. White,. Discovering their high level of variability,
Sir Alec Jeffreys developed
DNA fingerprinting
DNA profiling (also called DNA fingerprinting and genetic fingerprinting) is the process of determining an individual's deoxyribonucleic acid (DNA) characteristics. DNA analysis intended to identify a species, rather than an individual, is cal ...
based on minisatellites, solving the first immigration case by DNA in 1985, and the first forensic murder case, the Enderby murders in the United Kingdom, in 1986. Minisatellites were subsequently also used for
genetic marker
A genetic marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify individuals or species. It can be described as a variation (which may arise due to mutation or alteration in the genomic loci) that can ...
s in
linkage analysis and population studies, but were soon replaced by microsatellite profiling in the 1990s.
The term satellite DNA originates from the observation in the 1960s of a fraction of sheared DNA that showed a distinct buoyant density, detectable as a "satellite peak" in density gradient centrifugation, and that was subsequently identified as large centromeric tandem repeats. When shorter (10–30-bp) tandem repeats were later identified, they came to be known as minisatellites. Finally, with the discovery of tandem iterations of simple sequence motifs, the term microsatellites was coined.
External links
* Search tools:
*
SERFDe Novo Genome Analysis and Tandem Repeats Finder
*
Tandem Repeats Finder
See also
*
Microsatellite
A microsatellite is a tract of repetitive DNA in which certain Sequence motif, DNA motifs (ranging in length from one to six or more base pairs) are repeated, typically 5–50 times. Microsatellites occur at thousands of locations within an organ ...
*
Tandem repeat
*
Telomere
A telomere (; ) is a region of repetitive nucleotide sequences associated with specialized proteins at the ends of linear chromosomes (see #Sequences, Sequences). Telomeres are a widespread genetic feature most commonly found in eukaryotes. In ...
References
{{Repeated sequence
Repetitive DNA sequences