Transcriptional modification or co-transcriptional modification is a set of biological processes common to most
eukaryotic
The eukaryotes ( ) constitute the Domain (biology), domain of Eukaryota or Eukarya, organisms whose Cell (biology), cells have a membrane-bound cell nucleus, nucleus. All animals, plants, Fungus, fungi, seaweeds, and many unicellular organisms ...
cells by which an
RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
primary transcript is chemically altered following
transcription from a
gene
In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protei ...
to produce a mature, functional RNA molecule that can then leave the
nucleus and perform any of a variety of different functions in the cell. There are many types of post-transcriptional modifications achieved through a diverse class of molecular mechanisms.
One example is the conversion of precursor
messenger RNA
In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.
mRNA is created during the ...
transcripts into mature messenger RNA that is subsequently capable of being
translated into
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
. This process includes three major steps that significantly modify the chemical structure of the RNA molecule: the addition of a
5' cap, the addition of a 3'
polyadenylated tail, and
RNA splicing
RNA splicing is a process in molecular biology where a newly-made precursor messenger RNA (pre-mRNA) transcription (biology), transcript is transformed into a mature messenger RNA (Messenger RNA, mRNA). It works by removing all the introns (non-cod ...
. Such processing is vital for the correct translation of eukaryotic
genome
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
s because the initial precursor mRNA produced by transcription often contains both
exon
An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequence ...
s (coding sequences) and
intron
An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e., a region inside a gene."The notion of the cistron .e., gen ...
s (non-coding sequences); splicing removes the introns and links the exons directly, while the cap and tail facilitate the transport of the mRNA to a
ribosome
Ribosomes () are molecular machine, macromolecular machines, found within all cell (biology), cells, that perform Translation (biology), biological protein synthesis (messenger RNA translation). Ribosomes link amino acids together in the order s ...
and protect it from molecular degradation.
Post-transcriptional modifications may also occur during the processing of other transcripts which ultimately become
transfer RNA,
ribosomal RNA
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosomal ...
, or any of the other types of RNA used by the cell.
mRNA processing
5' processing
Capping
Capping of the pre-mRNA involves the addition of
7-methylguanosine (m
7G) to the 5' end. To achieve this, the terminal 5' phosphate requires removal, which is done with the aid of enzyme
RNA triphosphatase. The enzyme
guanosyl transferase then catalyses the reaction, which produces the
diphosphate
In chemistry, pyrophosphates are phosphorus oxyanions that contain two phosphorus atoms in a linkage. A number of pyrophosphate salts exist, such as disodium pyrophosphate () and tetrasodium pyrophosphate (), among others. Often pyrophosphate ...
5' end. The diphosphate 5' end then attacks the alpha phosphorus atom of a
GTP molecule in order to add the
guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
residue in a 5'5' triphosphate link. The enzyme
(guanine-''N''7-)-methyltransferase ("cap MTase") transfers a methyl group from
S-adenosyl methionine to the guanine ring. This type of cap, with just the (m
7G) in position is called a cap 0 structure. The
ribose
Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally occurring form, , is a component of the ribonucleotides from which RNA is built, and so this comp ...
of the adjacent
nucleotide
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
may also be methylated to give a cap 1. Methylation of nucleotides downstream of the RNA molecule produce cap 2, cap 3 structures and so on. In these cases the methyl groups are added to the 2' OH groups of the ribose sugar.
The cap protects the 5' end of the primary RNA transcript from attack by
ribonucleases that have specificity to the 3'5'
phosphodiester bond
In chemistry, a phosphodiester bond occurs when exactly two of the hydroxyl groups () in phosphoric acid react with hydroxyl groups on other molecules to form two ester bonds. The "bond" involves this linkage . Discussion of phosphodiesters is d ...
s.
3' processing
Cleavage and polyadenylation
The pre-mRNA processing at the 3' end of the RNA molecule involves cleavage of its 3' end and then the addition of about 250
adenine
Adenine (, ) (nucleoside#List of nucleosides and corresponding nucleobases, symbol A or Ade) is a purine nucleotide base that is found in DNA, RNA, and Adenosine triphosphate, ATP. Usually a white crystalline subtance. The shape of adenine is ...
residues to form a
poly(A) tail. The cleavage and adenylation reactions occur primarily if a
polyadenylation signal sequence (5'- AAUAAA-3') is located near the 3' end of the pre-mRNA molecule, which is followed by another sequence, which is usually (5'-CA-3') and is the site of cleavage. A GU-rich sequence is also usually present further downstream on the pre-mRNA molecule. More recently, it has been demonstrated that alternate signal sequences such as UGUA upstream off the cleavage site can also direct cleavage and polyadenylation in the absence of the AAUAAA signal. These two signals are not mutually independent, and often coexist. After the synthesis of the sequence elements, several multi-subunit
proteins
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, re ...
are transferred to the RNA molecule. The transfer of these sequence specific binding proteins
cleavage and polyadenylation specificity factor (CPSF), Cleavage Factor I (CF I) and
cleavage stimulation factor (CStF) occurs from
RNA Polymerase II
RNA polymerase II (RNAP II and Pol II) is a Protein complex, multiprotein complex that Transcription (biology), transcribes DNA into precursors of messenger RNA (mRNA) and most small nuclear RNA (snRNA) and microRNA. It is one of the three RNA pol ...
. The three factors bind to the sequence elements. The AAUAAA signal is directly bound by CPSF. For UGUA dependent processing sites, binding of the multi protein complex is done by Cleavage Factor I (CF I). The resultant protein complex formed contains additional cleavage factors and the enzyme
Polyadenylate Polymerase (PAP). This complex cleaves the RNA between the polyadenylation sequence and the GU-rich sequence at the cleavage site marked by the (5'-CA-3') sequences. Poly(A) polymerase then adds about 200 adenine units to the new 3' end of the RNA molecule using
ATP as a precursor. As the poly(A) tail is synthesized, it binds multiple copies of
poly(A)-binding protein, which protects the 3'end from ribonuclease digestion by enzymes including the
CCR4-Not
Carbon Catabolite Repression 4—Negative On TATA-less, or CCR4-Not, is a multiprotein complex that functions in gene expression. The complex has multiple enzymatic activities as both a poly(A) 3′-5′ exonuclease and a ubiquitin ligase. T ...
complex.
Intron splicing
RNA splicing is the process by which
intron
An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e., a region inside a gene."The notion of the cistron .e., gen ...
s, regions of RNA that do not code for proteins, are removed from the pre-mRNA and the remaining
exon
An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequence ...
s connected to re-form a single continuous molecule. Exons are sections of mRNA which become "expressed" or translated into a protein. They are the coding portions of a mRNA molecule.
Although most RNA splicing occurs after the complete synthesis and end-capping of the pre-mRNA, transcripts with many exons can be spliced co-transcriptionally.
The splicing reaction is catalyzed by a large protein complex called the
spliceosome assembled from proteins and
small nuclear RNA molecules that recognize
splice sites in the pre-mRNA sequence. Many pre-mRNAs, including those encoding
antibodies
An antibody (Ab) or immunoglobulin (Ig) is a large, Y-shaped protein belonging to the immunoglobulin superfamily which is used by the immune system to identify and neutralize antigens such as bacteria and viruses, including those that caus ...
, can be spliced in multiple ways to produce different mature mRNAs that encode different
protein sequences. This process is known as
alternative splicing
Alternative splicing, alternative RNA splicing, or differential splicing, is an alternative RNA splicing, splicing process during gene expression that allows a single gene to produce different splice variants. For example, some exons of a gene ma ...
, and allows production of a large variety of proteins from a limited amount of DNA.
Histone mRNA processing
Histones H2A, H2B, H3 and H4 form the core of a
nucleosome
A nucleosome is the basic structural unit of DNA packaging in eukaryotes. The structure of a nucleosome consists of a segment of DNA wound around eight histone, histone proteins and resembles thread wrapped around a bobbin, spool. The nucleosome ...
and thus are called
core histones. Processing of core histones is done differently because typical histone mRNA lacks several features of other eukaryotic mRNAs, such as poly(A) tail and introns. Thus, such mRNAs do not undergo splicing and their 3' processing is done independent of most cleavage and polyadenylation factors. Core histone mRNAs have a special
stem-loop
Stem-loops are nucleic acid Biomolecular structure, secondary structural elements which form via intramolecular base pairing in single-stranded DNA or RNA. They are also referred to as hairpins or hairpin loops. A stem-loop occurs when two regi ...
structure at 3-prime end that is recognized by a
stem–loop binding protein and a downstream sequence, called histone downstream element (HDE) that recruits
U7 snRNA.
Cleavage and polyadenylation specificity factor 73 cuts mRNA between stem-loop and HDE
Histone variants, such as
H2A.Z or H3.3, however, have introns and are processed as normal mRNAs including splicing and polyadenylation.
See also
*
Post-translational modification
In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translation (biolog ...
*
RNA editing
*
RNA-Seq
References
Further reading
*
*
*
*
*
*
{{MolBioGeneExp
Cell biology
Molecular biology
Gene expression
RNA