
Directionality, in
molecular biology
Molecular biology is the branch of biology that seeks to understand the molecular basis of biological activity in and between cells, including biomolecular synthesis, modification, mechanisms, and interactions. The study of chemical and phys ...
and
biochemistry
Biochemistry or biological chemistry is the study of chemical processes within and relating to living organisms. A sub-discipline of both chemistry and biology, biochemistry may be divided into three fields: structural biology, enzymology ...
, is the end-to-end chemical orientation of a single strand of
nucleic acid
Nucleic acids are biopolymers, macromolecules, essential to all known forms of life. They are composed of nucleotides, which are the monomers made of three components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main ...
. In a single strand of
DNA or
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
, the chemical convention of naming carbon atoms in the
nucleotide
Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecul ...
pentose-sugar-ring means that there will be a 5′ end (usually pronounced "five-prime end"), which frequently contains a
phosphate group attached to the 5′ carbon of the
ribose ring, and a 3′ end (usually pronounced "three-prime end"), which typically is unmodified from the ribose -OH substituent. In a
DNA double helix, the strands run in opposite directions to permit
base pairing between them, which is essential for replication or
transcription of the encoded information.
Nucleic acids can only be synthesized
in vivo in the 5′-to-3′ direction, as the
polymerase
A polymerase is an enzyme ( EC 2.7.7.6/7/19/48/49) that synthesizes long chains of polymers or nucleic acids. DNA polymerase and RNA polymerase are used to assemble DNA and RNA molecules, respectively, by copying a DNA template strand using b ...
s that assemble various types of new strands generally rely on the energy produced by breaking
nucleoside triphosphate bonds to attach new nucleoside monophosphates to the 3′-
hydroxyl
In chemistry, a hydroxy or hydroxyl group is a functional group with the chemical formula and composed of one oxygen atom covalently bonded to one hydrogen atom. In organic chemistry, alcohols and carboxylic acids contain one or more hydrox ...
(−OH) group, via a
phosphodiester bond. The relative positions of structures along strands of nucleic acid, including
gene
In biology, the word gene (from , ; "... Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
s and various protein
binding site
In biochemistry and molecular biology, a binding site is a region on a macromolecule such as a protein that binds to another molecule with specificity. The binding partner of the macromolecule is often referred to as a ligand. Ligands may inclu ...
s, are usually noted as being either ''upstream'' (towards the 5′-end) or ''downstream'' (towards the 3′-end). (See also
upstream and downstream.)
Directionality is related to, but different from,
sense
A sense is a biological system used by an organism for sensation, the process of gathering information about the world through the detection of stimuli. (For example, in the human body, the brain which is part of the central nervous system rec ...
. Transcription of single-stranded RNA from a double-stranded DNA template requires the selection of one strand of the DNA template as the template strand that directly interacts with the nascent RNA due to
complementary sequence. The other strand is not copied directly, but necessarily its sequence will be similar to that of the RNA.
Transcription initiation sites generally occur on both strands of an organism's DNA, and specify the location, direction, and circumstances under which transcription will occur. If the transcript encodes one or (rarely) more
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
s, translation of each protein by the
ribosome will proceed in a 5′-to-3′ direction, and will extend the protein from its N terminus toward its C terminus. For example, in a typical gene a
start codon (5′-ATG-3′) is a DNA sequence within the sense strand. Transcription begins at an upstream site (relative to the sense strand), and as it proceeds through the region it copies the 3′-TAC-5′ from the template strand to produce 5′-AUG-3′ within a
messenger RNA (mRNA). The mRNA is scanned by the ribosome from the 5′ end, where the start codon directs the incorporation of a
methionine (
bacteria
Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were am ...
,
mitochondria, and
plastids
The plastid (Greek: πλαστός; plastós: formed, molded – plural plastids) is a membrane-bound organelle found in the cells of plants, algae, and some other eukaryotic organisms. They are considered to be intracellular endosymbiotic cy ...
use
''N''-formylmethionine instead) at the N terminus of the protein. By convention, single strands of
DNA and
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
sequences are written in a 5′-to-3′ direction except as needed to illustrate the pattern of base pairing.
5′-end

The 5′-end (pronounced "five prime end") designates the end of the DNA or RNA strand that has the fifth carbon in the
sugar-ring of the
deoxyribose
Deoxyribose, or more precisely 2-deoxyribose, is a monosaccharide with idealized formula H−(C=O)−(CH2)−(CHOH)3−H. Its name indicates that it is a deoxy sugar, meaning that it is derived from the sugar ribose by loss of a hydroxy group ...
or
ribose at its terminus. A
phosphate group attached to the 5′-end permits
ligation of two
nucleotide
Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecul ...
s, i.e., the covalent binding of a 5′-phosphate to the 3′-hydroxyl group of another nucleotide, to form a
phosphodiester bond. Removal of the 5′-phosphate prevents ligation. To prevent unwanted nucleic acid ligation (e.g. self-ligation of a
plasmid vector in
DNA cloning),
molecular biologists commonly remove the 5′-phosphate with a
phosphatase.
The 5′-end of nascent
messenger RNA is the site at which
post-transcriptional capping occurs, a process which is vital to producing mature messenger RNA. Capping increases the stability of the messenger RNA while it undergoes
translation
Translation is the communication of the Meaning (linguistic), meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The ...
, providing resistance to the degradative effects of
exonuclease
Exonucleases are enzymes that work by cleaving nucleotides one at a time from the end (exo) of a polynucleotide chain. A hydrolyzing reaction that breaks phosphodiester bonds at either the 3′ or the 5′ end occurs. Its close relative is t ...
s.
It consists of a
methylated nucleotide (
methylguanosine
7-Methylguanosine (m7G) is a modified purine nucleoside. It is a methylated version of guanosine and when found in human urine, it may be a biomarker of some types of cancer. In the RNAs, 7-methylguanosine have been used to study and examine the ...
) attached to the messenger RNA in a rare 5′- to 5′-triphosphate linkage.
The
5′-''flanking'' region of a
gene
In biology, the word gene (from , ; "... Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
often denotes a region of DNA which is not
transcribed into RNA. The 5′-flanking region contains the
gene promoter, and may also contain enhancers or other protein binding sites.
The
5′-''untranslated'' region (5′-UTR) is a region of a gene which is transcribed into mRNA, and is located at the 5′-end of the mRNA. This region of an
mRNA
In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.
mRNA is created during the ...
may or may not be
translated, but is usually involved in the regulation of translation. The 5′-untranslated region is the portion of the DNA starting from the cap site and extending to the base just before the AUG translation initiation codon of the main coding sequence. This region may have sequences, such as the
ribosome binding site and
Kozak sequence
The Kozak consensus sequence (Kozak consensus or Kozak sequence) is a Nucleic acid sequence, nucleic acid motif that functions as the protein Translation (biology), translation initiation site in most eukaryotic Messenger RNA, mRNA transcripts. Reg ...
, which determine the translation efficiency of the mRNA, or which may affect the stability of the mRNA.
3′-end

The 3′-end (three prime end) of a strand is so named due to it terminating at the
hydroxyl
In chemistry, a hydroxy or hydroxyl group is a functional group with the chemical formula and composed of one oxygen atom covalently bonded to one hydrogen atom. In organic chemistry, alcohols and carboxylic acids contain one or more hydrox ...
group of the third carbon in the
sugar-ring, and is known as the ''tail end''. The 3′-hydroxyl is necessary in the synthesis of new nucleic acid molecules as it is
ligated (joined) to the 5′-phosphate of a separate nucleotide, allowing the formation of strands of linked nucleotides.
Molecular biologists can use
nucleotides that lack a 3′-hydroxyl (dideoxyribonucleotides) to interrupt the replication of
DNA. This technique is known as the dideoxy chain-termination method or the Sanger method, and is used to
determine the order of nucleotides in DNA.
The 3′-end of nascent
messenger RNA is the site of
post-transcriptional polyadenylation, which attaches a chain of 50 to 250
adenosine residues to produce mature messenger RNA. This chain helps in determining how long the messenger RNA lasts in the cell, influencing how much protein is produced from it.
The 3′-''flanking'' region is a region of DNA that is not copied into the mature mRNA, but which is present adjacent to 3′-end of the gene. It was originally thought that the 3′-flanking DNA was not transcribed at all, but it was discovered to be transcribed into RNA and quickly removed during processing of the primary transcript to form the mature mRNA. The 3′-flanking region often contains sequences that affect the formation of the 3′-end of the message. It may also contain enhancers or other sites to which proteins may bind.
The
3′-''untranslated'' region (3′-UTR) is a region of the DNA which ''is'' transcribed into mRNA and becomes the 3′-end of the message, but which does not contain protein coding sequence. Everything between the
stop codon
In molecular biology (specifically protein biosynthesis), a stop codon (or termination codon) is a codon ( nucleotide triplet within messenger RNA) that signals the termination of the translation process of the current protein. Most codons in ...
and the
polyA tail
Polyadenylation is the addition of a poly(A) tail to an RNA transcript, typically a messenger RNA (mRNA). The poly(A) tail consists of multiple adenosine monophosphates; in other words, it is a stretch of RNA that has only adenine bases. In eu ...
is considered to be 3′-untranslated. The 3′-untranslated region may affect the translation efficiency of the mRNA or the stability of the mRNA. It also has sequences which are required for the addition of the poly(A) tail to the message, including the hexanucleotide AAUAAA.
See also
*
Sense (molecular biology)
Further reading
*
{{Reflist
External links
A Molecular Biology Glossary
DNA
Molecular genetics
RNA