
A frameshift mutation (also called a framing error or a reading frame shift) is a
genetic mutation caused by
indel
Indel (insertion-deletion) is a molecular biology term for an insertion or deletion of bases in the genome of an organism. Indels ≥ 50 bases in length are classified as structural variants.
In coding regions of the genome, unless the lengt ...
s (
insertions or
deletions) of a number of
nucleotide
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
s in a DNA sequence that is not divisible by three. Due to the triplet nature of
gene expression
Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
by
codon
Genetic code is a set of rules used by living cells to translate information encoded within genetic material (DNA or RNA sequences of nucleotide triplets or codons) into proteins. Translation is accomplished by the ribosome, which links prote ...
s, the insertion or deletion can change the
reading frame (the grouping of the codons), resulting in a completely different
translation
Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
from the original. The earlier in the sequence the deletion or insertion occurs, the more altered the protein.
A frameshift mutation is not the same as a
single-nucleotide polymorphism
In genetics and bioinformatics, a single-nucleotide polymorphism (SNP ; plural SNPs ) is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in a ...
in which a nucleotide is replaced, rather than inserted or deleted. A
frameshift mutation will in general cause the reading of the codons after the mutation to code for different amino acids. The frameshift mutation will also alter the first stop codon ("UAA", "UGA" or "UAG") encountered in the sequence. The polypeptide being created could be abnormally short or abnormally long, and will most likely not be functional.
Frameshift mutations are apparent in severe genetic diseases such as
Tay–Sachs disease; they increase susceptibility to certain cancers and classes of
familial hypercholesterolaemia; in 1997,
a frameshift mutation was linked to resistance to infection by the HIV retrovirus. Frameshift mutations have been proposed as a source of biological novelty, as with the alleged creation of
nylonase, however, this interpretation is controversial. A study by Negoro ''et al.'' (2006) found that a frameshift mutation was unlikely to have been the cause and that rather a two amino acid substitution in the
active site of an ancestral
esterase resulted in nylonase.
Background
The information contained in DNA determines protein function in the cells of all organisms. Transcription and translation allow this information to be communicated into making proteins. However, an error in reading this communication can cause protein function to be incorrect and eventually cause disease even as the cell incorporates a variety of corrective measures.Genetic information is conveyed by DNA for protein synthesis within cells. Misinterpretation can lead to faulty function and disease, despite cellular correction mechanisms.
Central dogma
In 1956
Francis Crick
Francis Harry Compton Crick (8 June 1916 – 28 July 2004) was an English molecular biologist, biophysicist, and neuroscientist. He, James Watson, Rosalind Franklin, and Maurice Wilkins played crucial roles in deciphering the Nucleic acid doub ...
described the flow of genetic information from
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
to a specific amino acid arrangement for making a
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
as the central dogma.
[ For a cell to properly function, proteins are required to be produced accurately for structural and for ]catalytic
Catalysis () is the increase in reaction rate, rate of a chemical reaction due to an added substance known as a catalyst (). Catalysts are not consumed by the reaction and remain unchanged after it. If the reaction is rapid and the catalyst ...
activities. An incorrectly made protein can have detrimental effects on cell viability and in most cases cause the higher organism
An organism is any life, living thing that functions as an individual. Such a definition raises more problems than it solves, not least because the concept of an individual is also difficult. Many criteria, few of them widely accepted, have be ...
to become unhealthy by abnormal cellular functions. To ensure that the genome
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
successfully passes the information on, proofreading
Proofreading is a phase in the process of publishing where galley proofs are compared against the original manuscripts or graphic artworks, to identify transcription errors in the typesetting process. In the past, proofreaders would place corr ...
mechanisms such as exonucleases and mismatch repair systems are incorporated in DNA replication
In molecular biology, DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule. DNA replication occurs in all life, living organisms, acting as the most essential part of heredity, biolog ...
.[
]
Transcription and translation
After DNA replication, the reading of a selected section of genetic information is accomplished by transcription.[
Nucleotides containing the genetic information are now on a single strand messenger template called ]mRNA
In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of Protein biosynthesis, synthesizing a protein.
mRNA is ...
. The mRNA is incorporated with a subunit of the ribosome
Ribosomes () are molecular machine, macromolecular machines, found within all cell (biology), cells, that perform Translation (biology), biological protein synthesis (messenger RNA translation). Ribosomes link amino acids together in the order s ...
and interacts with an rRNA
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosomal ...
. The genetic information carried in the codons of the mRNA are now read (decoded) by anticodons of the tRNA. As each codon (triplet) is read, amino acids
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the Proteinogenic amino acid, 22 α-amino acids incorporated into p ...
are being joined until a stop codon
In molecular biology, a stop codon (or termination codon) is a codon (nucleotide triplet within messenger RNA) that signals the termination of the translation process of the current protein. Most codons in messenger RNA correspond to the additio ...
(UAG, UGA or UAA) is reached. At this point the polypeptide
Peptides are short chains of amino acids linked by peptide bonds. A polypeptide is a longer, continuous, unbranched peptide chain. Polypeptides that have a molecular mass of 10,000 Da or more are called proteins. Chains of fewer than twenty ...
(protein) has been synthesised and is released.[ For every 1000 amino acid incorporated into the protein, no more than one is incorrect. This fidelity of codon recognition, maintaining the importance of the proper reading frame, is accomplished by proper base pairing at the ribosome A site, GTP hydrolysis activity of EF-Tu a form of kinetic stability, and a proofreading mechanism as EF-Tu is released.][
Frameshifting may also occur during ]prophase
Prophase () is the first stage of cell division in both mitosis and meiosis. Beginning after interphase, DNA has already been replicated when the cell enters prophase. The main occurrences in prophase are the condensation of the chromatin retic ...
translation, producing different proteins from overlapping open reading frames, such as the gag-pol-env retroviral proteins. This is fairly common in viruses
A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are found in almo ...
and also occurs in bacteria
Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
and yeast
Yeasts are eukaryotic, single-celled microorganisms classified as members of the fungus kingdom (biology), kingdom. The first yeast originated hundreds of millions of years ago, and at least 1,500 species are currently recognized. They are est ...
(Farabaugh, 1996). Reverse transcriptase
A reverse transcriptase (RT) is an enzyme used to convert RNA genome to DNA, a process termed reverse transcription. Reverse transcriptases are used by viruses such as HIV and hepatitis B to replicate their genomes, by retrotransposon mobi ...
, as opposed to RNA Polymerase II
RNA polymerase II (RNAP II and Pol II) is a Protein complex, multiprotein complex that Transcription (biology), transcribes DNA into precursors of messenger RNA (mRNA) and most small nuclear RNA (snRNA) and microRNA. It is one of the three RNA pol ...
, is thought to be a stronger cause of the occurrence of frameshift mutations. In experiments only 3–13% of all frameshift mutations occurred because of RNA Polymerase II. In prokaryotes
A prokaryote (; less commonly spelled procaryote) is a single-celled organism whose cell lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Ancient Greek (), meaning 'before', and (), meaning 'nut' ...
the error rate inducing frameshift mutations is only somewhere in the range of .0001 and .00001.
There are several biological processes that help to prevent frameshift mutations. Reverse mutations occur which change the mutated sequence back to the original wild type
The wild type (WT) is the phenotype of the typical form of a species as it occurs in nature. Originally, the wild type was conceptualized as a product of the standard "normal" allele at a locus, in contrast to that produced by a non-standard, " ...
sequence. Another possibility for mutation correction is the use of a suppressor mutation. This offsets the effect of the original mutation by creating a secondary mutation, shifting the sequence to allow for the correct amino acids to be read. Guide RNA can also be used to insert or delete Uridine into the mRNA after transcription, this allows for the correct reading frame.[
]
Codon-triplet importance
A codon
Genetic code is a set of rules used by living cells to translate information encoded within genetic material (DNA or RNA sequences of nucleotide triplets or codons) into proteins. Translation is accomplished by the ribosome, which links prote ...
is a set of three nucleotides
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
, a triplet that codes for a certain amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
. The first codon establishes the reading frame, whereby a new codon begins. A protein's amino acid backbone sequence
In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is cal ...
is defined by contiguous triplets. Codons are key to translation of genetic information for the synthesis of proteins. The reading frame is set when translating the mRNA begins and is maintained as it reads one triplet to the next. The reading of the genetic code is subject to three rules the monitor codons in mRNA. First, codons are read in a 5' to 3' direction. Second, codons are nonoverlapping and the message has no gaps. The last rule, as stated above, that the message is translated in a fixed reading frame.[
]
Mechanism
Frameshift mutations can occur randomly or be caused by an external stimulus. The detection of frameshift mutations can occur via several different methods. Frameshifts are just one type of mutation that can lead to incomplete or incorrect proteins, but they account for a significant percentage of errors in DNA.In an unaltered gene, codons (triplets of nucleotides) are sequentially interpreted, with each codon encoding a specific amino acid. This is known as the standard reading frame. However, in cases of frame shift mutations, an extra nucleotide (or more) is inserted into the DNA sequence, disrupting the typical reading frame and causing a shift in the sequence.
This insertion prompts a shift in the reading frame due to the triplet nature of the genetic code. For instance, the addition of an extra "A" leads to a sequence shift, triggering the reading of an entirely different set of codons. This deviation in genetic information causes the ribosome, which reads the mRNA for protein synthesis, to misinterpret the genetic data. Consequently, an entirely different series of amino acids is generated, resulting in the generation of an altered protein sequence. In most instances, the new reading frame results in an early encounter with a stop codon, leading to the formation of a shortened and usually inactive protein. This form of mutation is termed an early stop codon or a nonsense mutation.
Genetic or environmental
This is a genetic mutation at the level of nucleotide bases. Why and how frameshift mutations occur are continually being sought after. An environmental study, specifically the production of UV-induced frameshift mutations by DNA polymerases deficient in 3′ → 5′ exonuclease activity was done. The normal sequence 5′ GTC GTT TTA CAA 3′ was changed to GTC GTT T TTA CAA (MIDT) of GTC GTT C TTA CAA (MIDC) to study frameshifts. E. coli pol I Kf and T7 DNA polymerase mutant enzymes
An enzyme () is a protein that acts as a biological catalyst by accelerating chemical reactions. The molecules upon which enzymes may act are called substrates, and the enzyme converts the substrates into different molecules known as pro ...
devoid of 3′ → 5′ exonuclease activity produce UV-induced revertants at higher frequency than did their exonuclease proficient counterparts. The data indicates that loss of proofreading activity increases the frequency of UV-induced frameshifts.
Detection
Fluorescence
The effects of neighboring bases and secondary structure to detect the frequency of frameshift mutations has been investigated in depth using fluorescence
Fluorescence is one of two kinds of photoluminescence, the emission of light by a substance that has absorbed light or other electromagnetic radiation. When exposed to ultraviolet radiation, many substances will glow (fluoresce) with colore ...
. Fluorescently tagged DNA, by means of base analogues, permits one to study the local changes of a DNA sequence. Studies on the effects of the length of the primer strand reveal that an equilibrium mixture of four hybridization conformations was observed when template bases looped-out as a bulge, i.e. a structure flanked on both sides by duplex DNA. In contrast, a double-loop structure with an unusual unstacked DNA conformation at its downstream edge was observed when the extruded bases were positioned at the primer–template junction, showing that misalignments can be modified by neighboring DNA secondary structure.
Sequencing
Sanger sequencing
Sanger sequencing is a method of DNA sequencing that involves electrophoresis and is based on the random incorporation of chain-terminating dideoxynucleotides by DNA polymerase during in vitro DNA replication. After first being developed by Fred ...
and pyrosequencing
Pyrosequencing is a method of DNA sequencing (determining the order of nucleotides in DNA) based on the "sequencing by synthesis" principle, in which the sequencing is performed by detecting the nucleotide incorporated by a DNA polymerase. Pyrosequ ...
are two methods that have been used to detect frameshift mutations, however, it is likely that data generated will not be of the highest quality. Even still, 1.96 million indel
Indel (insertion-deletion) is a molecular biology term for an insertion or deletion of bases in the genome of an organism. Indels ≥ 50 bases in length are classified as structural variants.
In coding regions of the genome, unless the lengt ...
s have been identified through Sanger sequencing that do not overlap with other databases. When a frameshift mutation is observed it is compared against the Human Genome Mutation Database (HGMD) to determine if the mutation has a damaging effect. This is done by looking at four features. First, the ratio between the affected and conserved DNA, second the location of the mutation relative to the transcript, third the ratio of conserved and affected amino acids and finally the distance of the indel to the end of the exon
An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequence ...
.
Massively Parallel Sequencing is a newer method that can be used to detect mutations. Using this method, up to 17 gigabases can be sequenced at once, as opposed to limited ranges for Sanger sequencing
Sanger sequencing is a method of DNA sequencing that involves electrophoresis and is based on the random incorporation of chain-terminating dideoxynucleotides by DNA polymerase during in vitro DNA replication. After first being developed by Fred ...
of only about 1 kilobase. Several technologies are available to perform this test and it is being looked at to be used in clinical applications. When testing for different carcinomas, current methods only allow for looking at one gene at a time. Massively Parallel Sequencing can test for a variety of cancer causing mutations at once as opposed to several specific tests. An experiment to determine the accuracy of this newer sequencing method tested for 21 genes and had no false positive calls for frameshift mutations.
Diagnosis
A US patent
A patent is a type of intellectual property that gives its owner the legal right to exclude others from making, using, or selling an invention for a limited period of time in exchange for publishing an sufficiency of disclosure, enabling discl ...
(5,958,684) in 1999 by Leeuwen, details the methods and reagents for diagnosis of diseases caused by or associated with a gene having a somatic mutation giving rise to a frameshift mutation. The methods include providing a tissue or fluid sample and conducting gene analysis for frameshift mutation or a protein from this type of mutation. The nucleotide sequence of the suspected gene is provided from published gene sequences or from cloning
Cloning is the process of producing individual organisms with identical genomes, either by natural or artificial means. In nature, some organisms produce clones through asexual reproduction; this reproduction of an organism by itself without ...
and sequencing of the suspect gene. The amino acid sequence encoded by the gene is then predicted. NA Sequencing: Sanger sequencing or Next-Generation Sequencing (NGS) can be used to directly sequence the DNA and identify insertions or deletions.Polymerase Chain Reaction (PCR): PCR can be used to amplify the specific region containing the mutation for subsequent analysis.Multiplex Ligation-dependent Probe Amplification (MLPA): MLPA is a technique used to detect copy number variations and small insertions or deletions.Comparative Genomic Hybridization (CGH): CGH is used to detect chromosomal imbalances, which may include large insertions or deletions.
Frequency
Despite the rules that govern the genetic code and the various mechanisms present in a cell to ensure the correct transfer of genetic information during the process of DNA replication as well as during translation, mutations do occur; frameshift mutation is not the only type. There are at least two other types of recognized point mutations, specifically missense mutation and nonsense mutation.[ A frameshift mutation can drastically change the coding capacity (genetic information) of the message.][ Small insertions or deletions (those less than 20 base pairs) make up 24% of mutations that manifest in currently recognized genetic disease.]
Frameshift mutations are found to be more common in repeat regions of DNA. A reason for this is because of slipping of the polymerase enzyme in repeat regions, allowing for mutations to enter the sequence
In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is cal ...
. Experiment
An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs whe ...
s can be run to determine the frequency of the frameshift mutation by adding or removing a pre-set number of nucleotides. Experiments have been run by adding four basepairs, called the +4 experiments, but a team from Emory University
Emory University is a private university, private research university in Atlanta, Georgia, United States. It was founded in 1836 as Emory College by the Methodist Episcopal Church and named in honor of Methodist bishop John Emory. Its main campu ...
looked at the difference in frequency of the mutation by both adding and deleting a base pair. It was shown that there was no difference in the frequency between the addition and deletion of a base pair. There is however, a difference in the result of the protein.
Huntington's disease
Huntington's disease (HD), also known as Huntington's chorea, is an incurable neurodegenerative disease that is mostly Genetic disorder#Autosomal dominant, inherited. It typically presents as a triad of progressive psychiatric, cognitive, and ...
is one of the nine codon reiteration disorders caused by polyglutamine expansion mutations that include spino-cerebellar ataxia (SCA) 1, 2, 6, 7 and 3, spinobulbar muscular atrophy and dentatorubal-pallidoluysianatrophy. There may be a link between diseases caused by polyglutamine and polyalanine expansion mutations, as frame shifting of the original SCA3 gene product encoding CAG/polyglutamines to GCA/polyalanines. Ribosomal slippage during translation of the SCA3 protein has been proposed as the mechanism resulting in shifting from the polyglutamine to the polyalanine-encoding frame. A dinucleotide deletion or single nucleotide insertion within the polyglutamine tract of huntingtin exon 1 would shift the CAG, polyglutamineen coding frame by +1 (+1 frame shift) to the GCA, polyalanine-encoding frame and introduce a novel epitope to the C terminus of Htt exon 1 (APAAAPAATRPGCG).
Diseases
Several diseases have frameshift mutations as at least part of the cause. Knowing prevalent mutations can also aid in the diagnosis of the disease. Currently there are attempts to use frameshift mutations beneficially in the treatment of diseases, changing the reading frame of the amino acids.
Cancer
Frameshift mutations are known to be a factor in colorectal cancer as well as other cancers
Cancer is a group of diseases involving Cell growth#Disorders, abnormal cell growth with the potential to Invasion (cancer), invade or Metastasis, spread to other parts of the body. These contrast with benign tumors, which do not spread. Po ...
with microsatellite instability. As stated previously, frameshift mutations are more likely to occur in a region of repeat sequence. When DNA mismatch repair does not fix the addition or deletion of bases, these mutations are more likely to be pathogenic. This may be in part because the tumor is not told to stop growing. Experiments in yeast and bacteria help to show characteristics of microsatellites that may contribute to defective DNA mismatch repair. These include the length of the microsatellite
A microsatellite is a tract of repetitive DNA in which certain Sequence motif, DNA motifs (ranging in length from one to six or more base pairs) are repeated, typically 5–50 times. Microsatellites occur at thousands of locations within an organ ...
, the makeup of the genetic material and how pure the repeats are. Based on experimental results longer microsatellites have a higher rate of frameshift mutations. The flanking DNA can also contribute to frameshift mutations. In prostate cancer a frameshift mutation changes the open reading frame
In molecular biology, reading frames are defined as spans of DNA sequence between the start and stop codons. Usually, this is considered within a studied region of a prokaryotic DNA sequence, where only one of the six possible reading frames ...
(ORF) and prevents apoptosis
Apoptosis (from ) is a form of programmed cell death that occurs in multicellular organisms and in some eukaryotic, single-celled microorganisms such as yeast. Biochemistry, Biochemical events lead to characteristic cell changes (Morphology (biol ...
from occurring. This leads to an unregulated growth of the tumor
A neoplasm () is a type of abnormal and excessive growth of tissue. The process that occurs to form or produce a neoplasm is called neoplasia. The growth of a neoplasm is uncoordinated with that of the normal surrounding tissue, and persists ...
. While there are environmental factors that contribute to the progression of prostate cancer
Prostate cancer is the neoplasm, uncontrolled growth of cells in the prostate, a gland in the male reproductive system below the bladder. Abnormal growth of the prostate tissue is usually detected through Screening (medicine), screening tests, ...
, there is also a genetic component. During testing of coding regions to identify mutations, 116 genetic variants were discovered, including 61 frameshift mutations. There are over 500 mutations on chromosome 17 that seem to play a role in the development of breast and ovarian cancer in the BRCA1 gene, many of which are frameshift.
Crohn's disease
Crohn's disease
Crohn's disease is a type of inflammatory bowel disease (IBD) that may affect any segment of the gastrointestinal tract. Symptoms often include abdominal pain, diarrhea, fever, abdominal distension, and weight loss. Complications outside of the ...
has an association with the NOD2 gene. The mutation is an insertion of a Cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
at position 3020. This leads to a premature stop codon, shortening the protein that is supposed to be transcribed. When the protein is able to form normally, it responds to bacterial liposaccharides, where the 3020insC mutation prevents the protein from being responsive.
Cystic fibrosis
Cystic fibrosis
Cystic fibrosis (CF) is a genetic disorder inherited in an autosomal recessive manner that impairs the normal clearance of Sputum, mucus from the lungs, which facilitates the colonization and infection of the lungs by bacteria, notably ''Staphy ...
(CF) is a disease based on mutations in the CF transmembrane
A transmembrane protein is a type of integral membrane protein that spans the entirety of the cell membrane. Many transmembrane proteins function as gateways to permit the transport of specific substances across the membrane. They frequently u ...
conductance regulator (CFTR) gene. There are over 1500 mutations identified, but not all cause the disease. Most cases of cystic fibrosis are a result of the ∆F508 mutation, which deletes the entire amino acid. Two frameshift mutations are of interest in diagnosing CF, CF1213delT and CF1154-insTC. Both of these mutations commonly occur in tandem with at least one other mutation. They both lead to a small decrease in the function of the lungs
The lungs are the primary organs of the respiratory system in many animals, including humans. In mammals and most other tetrapods, two lungs are located near the backbone on either side of the heart. Their function in the respiratory syste ...
and occur in about 1% of patients tested. These mutations were identified through Sanger sequencing.
HIV
CCR5
C-C chemokine receptor type 5, also known as CCR5 or CD195, is a protein on the surface of white blood cells that is involved in the immune system as it acts as a receptor for chemokines.
In humans, the ''CCR5'' gene that encodes the CCR5 p ...
is one of the cell entry co-factors associated with HIV, most frequently involved with nonsyncytium-inducing strains, is most apparent in HIV patients as opposed to AIDS patients. A 32 base pair deletion in CCR5 has been identified as a mutation that negates the likelihood of an HIV infection. This region on the open reading frame ORF contains a frameshift mutation leading to a premature stop codon. This leads to the loss of the HIV-coreceptor function in vitro. CCR5-1 is considered the wild type and CCR5-2 is considered to be the mutant allele. Those with a heterozygous mutation for the CCR5 were less susceptible to the development of HIV. In a study, despite high exposure to the HIV virus, there was no one homozygous for the CCR5 mutation that tested positive for HIV.
Tay–Sachs disease
Tay–Sachs disease is a fatal disease affecting the central nervous system. It is most frequently found in infants and small children. Disease progression begins in the womb but symptoms do not appear until approximately 6 months of age. There is no cure for the disease. Mutations in the β-hexosaminidase A (Hex A) gene are known to affect the onset of Tay-Sachs, with 78 mutations of different types being described, 67 of which are known to cause disease. Most of the mutations observed (65/78) are single base substitutions or SNPs, 11 deletions, 1 large and 10 small, and 2 insertions. 8 of the observed mutations are frameshift, 6 deletions and 2 insertions. A 4 base pair insertion in exon 11 is observed in 80% of Tay-Sachs disease presence in the Ashkenazi
Ashkenazi Jews ( ; also known as Ashkenazic Jews or Ashkenazim) form a distinct subgroup of the Jewish diaspora, that Ethnogenesis, emerged in the Holy Roman Empire around the end of the first millennium Common era, CE. They traditionally spe ...
Jewish population. The frameshift mutations lead to an early stop codon which is known to play a role in the disease in infants. Delayed onset disease appears to be caused by 4 different mutations, one being a 3 base pair deletion.
Smith–Magenis syndrome
Smith–Magenis syndrome (SMS) is a complex syndrome
A syndrome is a set of medical signs and symptoms which are correlated with each other and often associated with a particular disease or disorder. The word derives from the Greek language, Greek σύνδρομον, meaning "concurrence". When a sy ...
involving intellectual disabilities, sleep disturbance, behavioural problems, and a variety of craniofacial, skeletal, and visceral anomalies. The majority of SMS cases harbor an ~3.5 Mb common deletion that encompasses the retinoic acid induced-1 ('' RAI1'') gene. Other cases illustrate variability in the SMS phenotype
In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology (physical form and structure), its developmental processes, its biochemical and physiological propert ...
not previously shown for RAI1 mutation, including hearing loss, self-abusive behaviours, and mild global delays. Sequencing of RAI1 revealed mutation of a heptamericC-tract (CCCCCCC) in exon 3 resulting in frameshift mutations. Of the seven reported frameshift mutations occurring in poly C-tracts in RAI1, four cases (~57%) occur at this heptameric C-tract. The results indicate that this heptameric C-tract is a preferential recombination hotspot insertion/deletions (SNindels) and therefore a primary target for analysis in patients suspected for mutations in RAI1.
Hypertrophic cardiomyopathy
Hypertrophic cardiomyopathy is the most common cause of sudden death in young people, including trained athletes, and is caused by mutations in genes encoding proteins of the cardiac sarcomere. Mutations in the Troponin C gene (''TNNC1
Troponin C, also known as TN-C or TnC, is a protein that resides in the troponin complex on actin thin filaments of striated muscle (cardiac, fast-twitch skeletal, or slow-twitch skeletal) and is responsible for binding calcium in biology, calci ...
'') are a rare genetic cause of hypertrophic cardiomyopathy. A recent study has indicated that a frameshift mutation (c.363dupG or p.Gln122AlafsX30) in Troponin C was the cause of hypertrophic cardiomyopathy (and sudden cardiac death) in a 19-year-old male.
Cures
Finding a cure for the diseases caused by frameshift mutations is rare. Research into this is ongoing. One example is a primary immunodeficiency (PID), an inherited condition which can lead to an increase in infections. There are 120 genes and 150 mutations that play a role in primary immunodeficiencies. The standard treatment is currently gene therapy, but this is a highly risky treatment and can often lead to other diseases, such as leukemia. Gene therapy procedures include modifying the zinc fringer nuclease fusion protein, cleaving both ends of the mutation, which in turn removes it from the sequence. Antisense-oligonucleotide mediated exon skipping is another possibility for Duchenne muscular dystrophy
Muscular dystrophies (MD) are a genetically and clinically heterogeneous group of rare neuromuscular diseases that cause progressive weakness and breakdown of skeletal muscles over time. The disorders differ as to which muscles are primarily affe ...
. This process allows for passing over the mutation so that the rest of the sequence remains in frame and the function of the protein stays intact. This, however, does not cure the disease, just treats symptoms, and is only practical in structural proteins or other repetitive genes. A third form of repair is revertant mosaicism, which is naturally occurring by creating a reverse mutation or a mutation at a second site that corrects the reading frame. This reversion may happen by intragenic recombination, mitotic gene conversion, second site DNA slipping or site-specific reversion. This is possible in several diseases, such as X-linked severe combined immunodeficiency (SCID), Wiskott–Aldrich syndrome
Wiskott–Aldrich syndrome (WAS) is a rare X-linked recessive disease characterized by eczema, thrombocytopenia (low platelet count), immune deficiency, and bloody diarrhea (secondary to the thrombocytopenia). It is also sometimes called the e ...
, and Bloom syndrome. There are no drugs or other pharmacogenomic methods that help with PIDs.
A European patent (EP1369126A1) in 2003 by Bork records a method used for prevention of cancers and for the curative treatment of cancers and precancers such as DNA-mismatch repair deficient (MMR) sporadic tumours and HNPCC associated tumours. The idea is to use immunotherapy with combinatorial mixtures of tumour-specific frameshift mutation-derived peptides to elicit a cytotoxic T-cell response specifically directed against tumour cells.[European Paten]
(December 10, 2003) "Use of coding microsatellite region frameshift mutation-derived peptides for treating cancer" by Bork ''et al''
See also
* Translational frameshift
* Mutation
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, ...
* Transcription (genetics)
Transcription is the process of copying a segment of DNA into RNA for the purpose of gene expression. Some segments of DNA are transcribed into RNA molecules that can encode proteins, called messenger RNA (mRNA). Other segments of DNA are transc ...
* Translation (biology)
In biology, translation is the process in living Cell (biology), cells in which proteins are produced using RNA molecules as templates. The generated protein is a sequence of amino acids. This sequence is determined by the sequence of nucleotide ...
* codon
Genetic code is a set of rules used by living cells to translate information encoded within genetic material (DNA or RNA sequences of nucleotide triplets or codons) into proteins. Translation is accomplished by the ribosome, which links prote ...
* protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
* reading frame
* point mutation
A point mutation is a genetic mutation where a single nucleotide base is changed, inserted or deleted from a DNA or RNA sequence of an organism's genome. Point mutations have a variety of effects on the downstream protein product—consequences ...
* Crohn's disease
Crohn's disease is a type of inflammatory bowel disease (IBD) that may affect any segment of the gastrointestinal tract. Symptoms often include abdominal pain, diarrhea, fever, abdominal distension, and weight loss. Complications outside of the ...
* Tay–Sachs disease
References
Further reading
*
*
*
External links
*
NCBI dbSNP database
— "a central repository for both single base nucleotide substitutions and short deletion and insertion polymorphisms"
- aligns a protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
against a DNA sequence allowing frameshifts and intron
An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e., a region inside a gene."The notion of the cistron .e., gen ...
s
FastY
- compare a DNA sequence to a protein sequence database, allowing gaps and frameshifts
Path
- tool that compares two frameshift proteins (back-translation
Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
principle)
HGMD
- Human Genome Mutation Database
{{Mutation
Mutation