
Regulation of gene expression, or gene regulation,
includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific
gene product
A gene product is the biochemical material, either RNA or protein, resulting from the expression of a gene. A measurement of the amount of gene product is sometimes used to infer how active a gene is. Abnormal amounts of gene product can be corre ...
s (
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
or
RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
). Sophisticated programs of
gene expression
Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
are widely observed in biology, for example to trigger developmental pathways, respond to environmental stimuli, or adapt to new food sources. Virtually any step of gene expression can be modulated, from
transcriptional initiation, to
RNA processing
Transcriptional modification or co-transcriptional modification is a set of biological processes common to most eukaryotic cells by which an RNA primary transcript is chemically altered following transcription from a gene to produce a mature, fu ...
, and to the
post-translational modification
In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translation (biolog ...
of a protein. Often, one gene regulator controls another, and so on, in a
gene regulatory network
A gene (or genetic) regulatory network (GRN) is a collection of molecular regulators that interact with each other and with other substances in the cell to govern the gene expression levels of mRNA and proteins which, in turn, determine the fu ...
.
Gene regulation is essential for
viruses
A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are found in almo ...
,
prokaryote
A prokaryote (; less commonly spelled procaryote) is a unicellular organism, single-celled organism whose cell (biology), cell lacks a cell nucleus, nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Ancient Gree ...
s and
eukaryote
The eukaryotes ( ) constitute the Domain (biology), domain of Eukaryota or Eukarya, organisms whose Cell (biology), cells have a membrane-bound cell nucleus, nucleus. All animals, plants, Fungus, fungi, seaweeds, and many unicellular organisms ...
s as it increases the versatility and adaptability of an
organism
An organism is any life, living thing that functions as an individual. Such a definition raises more problems than it solves, not least because the concept of an individual is also difficult. Many criteria, few of them widely accepted, have be ...
by allowing the cell to express protein when needed. Although as early as 1951,
Barbara McClintock
Barbara McClintock (June 16, 1902 – September 2, 1992) was an American scientist and cytogenetics, cytogeneticist who was awarded the 1983 Nobel Prize in Physiology or Medicine. McClintock received her PhD in botany from Cornell University ...
showed interaction between two genetic loci, Activator (''Ac'') and Dissociator (''Ds''), in the color formation of maize seeds, the first discovery of a gene regulation system is widely considered to be the identification in 1961 of the
''lac'' operon, discovered by
François Jacob
François Jacob (; 17 June 1920 – 19 April 2013) was a French biologist who, together with Jacques Monod, originated the idea that control of enzyme levels in all cells occurs through regulation of transcription. He shared the 1965 Nobel ...
and
Jacques Monod
Jacques Lucien Monod (; 9 February 1910 – 31 May 1976) was a French biochemist who won the Nobel Prize in Physiology or Medicine in 1965, sharing it with François Jacob and André Lwoff "for their discoveries concerning genetic control of e ...
, in which some enzymes involved in
lactose
Lactose is a disaccharide composed of galactose and glucose and has the molecular formula C12H22O11. Lactose makes up around 2–8% of milk (by mass). The name comes from (Genitive case, gen. ), the Latin word for milk, plus the suffix ''-o ...
metabolism are expressed by ''
E. coli'' only in the presence of lactose and absence of glucose.
In multicellular organisms, gene regulation drives
cellular differentiation
Cellular differentiation is the process in which a stem cell changes from one type to a differentiated one. Usually, the cell changes to a more specialized type. Differentiation happens multiple times during the development of a multicellula ...
and
morphogenesis
Morphogenesis (from the Greek ''morphê'' shape and ''genesis'' creation, literally "the generation of form") is the biological process that causes a cell, tissue or organism to develop its shape. It is one of three fundamental aspects of deve ...
in the embryo, leading to the creation of different cell types that possess different gene expression profiles from the same
genome
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
sequence. Although this does not explain how gene regulation originated, evolutionary biologists include it as a partial explanation of how
evolution
Evolution is the change in the heritable Phenotypic trait, characteristics of biological populations over successive generations. It occurs when evolutionary processes such as natural selection and genetic drift act on genetic variation, re ...
works at a
molecular level, and it is central to the science of
evolutionary developmental biology
Evolutionary developmental biology, informally known as evo-devo, is a field of biological research that compares the developmental biology, developmental processes of different organisms to infer how developmental processes evolution, evolved. ...
("evo-devo").
Regulated stages of gene expression
Any step of gene expression may be modulated, from
signaling to
transcription to
post-translational modification
In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translation (biolog ...
of a protein. The following is a list of stages where gene expression is regulated, where the most extensively utilized point is transcription initiation, the first stage in transcription:
*
Signal transduction
Signal transduction is the process by which a chemical or physical signal is transmitted through a cell as a biochemical cascade, series of molecular events. Proteins responsible for detecting stimuli are generally termed receptor (biology), rece ...
*
Chromatin
Chromatin is a complex of DNA and protein found in eukaryote, eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important r ...
,
chromatin remodeling
Chromatin remodeling is the dynamic modification of chromatin architecture to allow access of condensed genomic DNA to the regulatory transcription machinery proteins, and thereby control gene expression. Such remodeling is principally carried out ...
,
chromatin domains
*
Transcription
*
Post-transcriptional modification
Transcriptional modification or co-transcriptional modification is a set of biological processes common to most eukaryotic cells by which an RNA primary transcript is chemically altered following transcription from a gene to produce a mature, f ...
* RNA transport
*
Translation
Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
*
mRNA degradation
Modification of DNA

In eukaryotes, the accessibility of large regions of DNA can depend on its
chromatin
Chromatin is a complex of DNA and protein found in eukaryote, eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important r ...
structure, which can be altered as a result of
histone
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei and in most Archaeal phyla. They act as spools around which DNA winds to create structural units called nucleosomes ...
modifications directed by
DNA methylation
DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter (genetics), promoter, DNA methylati ...
,
ncRNA, or
DNA-binding protein. Hence these modifications may up or down regulate the expression of a gene. Some of these modifications that regulate gene expression are inheritable and are referred to as
epigenetic regulation.
Structural
Transcription of DNA is dictated by its structure. In general, the density of its packing is indicative of the frequency of transcription. Octameric protein complexes called
histones
In biology, histones are highly Base (chemistry), basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei and in most Archaea, Archaeal Phylum, phyla. They act as spools around which DNA winds to create st ...
together with a segment of DNA wound around the eight histone proteins (together referred to as a nucleosome) are responsible for the amount of
supercoiling of DNA, and these complexes can be temporarily modified by processes such as
phosphorylation
In biochemistry, phosphorylation is described as the "transfer of a phosphate group" from a donor to an acceptor. A common phosphorylating agent (phosphate donor) is ATP and a common family of acceptor are alcohols:
:
This equation can be writ ...
or more permanently modified by processes such as
methylation
Methylation, in the chemistry, chemical sciences, is the addition of a methyl group on a substrate (chemistry), substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replac ...
. Such modifications are considered to be responsible for more or less permanent changes in gene expression levels.
Chemical
Methylation of DNA is a common method of gene silencing. DNA is typically methylated by methyltransferase enzymes on cytosine nucleotides in a CpG dinucleotide sequence (also called "
CpG islands" when densely clustered). Analysis of the pattern of methylation in a given region of DNA (which can be a promoter) can be achieved through a method called bisulfite mapping. Methylated cytosine residues are unchanged by the treatment, whereas unmethylated ones are changed to uracil. The differences are analyzed by DNA sequencing or by methods developed to quantify SNPs, such as
Pyrosequencing
Pyrosequencing is a method of DNA sequencing (determining the order of nucleotides in DNA) based on the "sequencing by synthesis" principle, in which the sequencing is performed by detecting the nucleotide incorporated by a DNA polymerase. Pyrosequ ...
(
Biotage) or MassArray (
Sequenom), measuring the relative amounts of C/T at the CG dinucleotide. Abnormal methylation patterns are thought to be involved in oncogenesis.
Histone acetylation is also an important process in transcription.
Histone acetyltransferase
Histone acetyltransferases (HATs) are enzymes that acetylation, acetylate conserved lysine amino acids on histone proteins by transferring an acetyl group from acetyl-CoA to form ε-N-acetyllysine, ε-''N''-acetyllysine. DNA is wrapped around his ...
enzymes (HATs) such as
CREB-binding protein also dissociate the DNA from the histone complex, allowing transcription to proceed. Often, DNA methylation and histone deacetylation work together in
gene silencing
Gene silencing is the regulation of gene expression in a cell to prevent the expression of a certain gene. Gene silencing can occur during either Transcription (genetics), transcription or Translation (biology), translation and is often used in res ...
. The combination of the two seems to be a signal for DNA to be packed more densely, lowering gene expression.
Regulation of transcription

Regulation of transcription thus controls when transcription occurs and how much RNA is created. Transcription of a gene by
RNA polymerase
In molecular biology, RNA polymerase (abbreviated RNAP or RNApol), or more specifically DNA-directed/dependent RNA polymerase (DdRP), is an enzyme that catalyzes the chemical reactions that synthesize RNA from a DNA template.
Using the e ...
can be regulated by several mechanisms.
Specificity factors alter the specificity of RNA polymerase for a given
promoter or set of promoters, making it more or less likely to bind to them (i.e.,
sigma factor
A sigma factor (σ factor or specificity factor) is a protein needed for initiation of Transcription (biology), transcription in bacteria. It is a bacterial transcription initiation factor that enables specific binding of RNA polymerase (RNAP) to g ...
s used in
prokaryotic transcription).
Repressor
In molecular genetics, a repressor is a DNA- or RNA-binding protein that inhibits the expression of one or more genes by binding to the operator or associated silencers. A DNA-binding repressor blocks the attachment of RNA polymerase to the ...
s bind to the
Operator, coding sequences on the DNA strand that are close to or overlapping the promoter region, impeding RNA polymerase's progress along the strand, thus impeding the expression of the gene. The image to the right demonstrates regulation by a repressor in the lac operon.
General transcription factors position RNA polymerase at the start of a protein-coding sequence and then release the polymerase to transcribe the mRNA.
Activators enhance the interaction between RNA polymerase and a particular
promoter, encouraging the expression of the gene. Activators do this by increasing the attraction of RNA polymerase for the promoter, through interactions with subunits of the RNA polymerase or indirectly by changing the structure of the DNA.
Enhancers
In genetics, an enhancer is a short (50–1500 bp) region of DNA that can be bound by proteins ( activators) to increase the likelihood that transcription of a particular gene will occur. These proteins are usually referred to as transcriptio ...
are sites on the DNA helix that are bound by activators in order to loop the DNA bringing a specific promoter to the initiation complex. Enhancers are much more common in eukaryotes than prokaryotes, where only a few examples exist (to date).
Silencers are regions of DNA sequences that, when bound by particular transcription factors, can silence expression of the gene.
Regulation by RNA
RNA can be an important regulator of gene activity, e.g. by
microRNA
Micro ribonucleic acid (microRNA, miRNA, μRNA) are small, single-stranded, non-coding RNA molecules containing 21–23 nucleotides. Found in plants, animals, and even some viruses, miRNAs are involved in RNA silencing and post-transcr ...
(miRNA),
antisense-RNA, or
long non-coding RNA
Long non-coding RNAs (long ncRNAs, lncRNA) are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as mic ...
(lncRNA). LncRNAs differ from
mRNAs in the sense that they have specified subcellular locations and functions. They were first discovered to be located in the
nucleus and
chromatin
Chromatin is a complex of DNA and protein found in eukaryote, eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important r ...
, and the localizations and functions are highly diverse now. Some still reside in chromatin where they interact with proteins. While this lncRNA ultimately affects gene expression in neuronal disorders such as
Parkinson,
Huntington, and
Alzheimer disease, others, such as, PNCTR(pyrimidine-rich non-coding transcriptors), play a role in
lung cancer
Lung cancer, also known as lung carcinoma, is a malignant tumor that begins in the lung. Lung cancer is caused by genetic damage to the DNA of cells in the airways, often caused by cigarette smoking or inhaling damaging chemicals. Damaged ...
. Given their role in disease, lncRNAs are potential
biomarker
In biomedical contexts, a biomarker, or biological marker, is a measurable indicator of some biological state or condition. Biomarkers are often measured and evaluated using blood, urine, or soft tissues to examine normal biological processes, ...
s and may be useful targets for drugs or
gene therapy
Gene therapy is Health technology, medical technology that aims to produce a therapeutic effect through the manipulation of gene expression or through altering the biological properties of living cells.
The first attempt at modifying human DNA ...
, although there are no approved drugs that target lncRNAs yet. The number of lncRNAs in the human genome remains poorly defined, but some estimates range from 16,000 to 100,000 lnc genes.
Epigenetic gene regulation
Epigenetics
In biology, epigenetics is the study of changes in gene expression that happen without changes to the DNA sequence. The Greek prefix ''epi-'' (ἐπι- "over, outside of, around") in ''epigenetics'' implies features that are "on top of" or "in ...
refers to the modification of genes that is not changing the DNA or RNA sequence. Epigenetic modifications are also a key factor in influencing
gene expression
Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
. They occur on genomic DNA and
histone
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei and in most Archaeal phyla. They act as spools around which DNA winds to create structural units called nucleosomes ...
s and their chemical modifications regulate gene expression in a more efficient manner. There are several modifications of DNA (usually
methylation
Methylation, in the chemistry, chemical sciences, is the addition of a methyl group on a substrate (chemistry), substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replac ...
) and more than 100 modifications of RNA in mammalian cells.” Those modifications result in altered protein binding to DNA and a change in
RNA stability and
translation efficiency.
Special cases in human biology and disease
Regulation of transcription in cancer
In vertebrates, the majority of gene
promoters contain a
CpG island with numerous
CpG sites.
When many of a gene's promoter CpG sites are
methylated
Methylation, in the chemical sciences, is the addition of a methyl group on a substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These term ...
the gene becomes silenced.
Colorectal cancers typically have 3 to 6
driver mutations and 33 to 66
hitchhiker or passenger mutations.
However, transcriptional silencing may be of more importance than mutation in causing progression to cancer. For example, in colorectal cancers about 600 to 800 genes are transcriptionally silenced by CpG island methylation (see
regulation of transcription in cancer Generally, in progression to cancer, hundreds of genes are silenced or activated. Although silencing of some genes in cancers occurs by mutation, a large proportion of carcinogenic gene silencing is a result of altered DNA methylation (see DNA met ...
). Transcriptional repression in cancer can also occur by other
epigenetic
In biology, epigenetics is the study of changes in gene expression that happen without changes to the DNA sequence. The Greek prefix ''epi-'' (ἐπι- "over, outside of, around") in ''epigenetics'' implies features that are "on top of" or "in ...
mechanisms, such as altered expression of
microRNAs.
In breast cancer, transcriptional repression of
BRCA1
Breast cancer type 1 susceptibility protein is a protein that in humans is encoded by the ''BRCA1'' () gene. Orthologs are common in other vertebrate species, whereas invertebrate genomes may encode a more distantly related gene. ''BRCA1'' is a ...
may occur more frequently by over-expressed microRNA-182 than by hypermethylation of the BRCA1 promoter (see
Low expression of BRCA1 in breast and ovarian cancers).
Regulation of transcription in addiction
One of the cardinal features of addiction is its persistence. The persistent behavioral changes appear to be due to long-lasting changes, resulting from
epigenetic
In biology, epigenetics is the study of changes in gene expression that happen without changes to the DNA sequence. The Greek prefix ''epi-'' (ἐπι- "over, outside of, around") in ''epigenetics'' implies features that are "on top of" or "in ...
alterations affecting gene expression, within particular regions of the brain.
Drugs of abuse cause three types of epigenetic alteration in the brain. These are (1)
histone
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei and in most Archaeal phyla. They act as spools around which DNA winds to create structural units called nucleosomes ...
acetylations and
histone methylations, (2) DNA methylation at
CpG sites, and (3) epigenetic
downregulation or upregulation of
microRNA
Micro ribonucleic acid (microRNA, miRNA, μRNA) are small, single-stranded, non-coding RNA molecules containing 21–23 nucleotides. Found in plants, animals, and even some viruses, miRNAs are involved in RNA silencing and post-transcr ...
s.
(See
Epigenetics of cocaine addiction for some details.)
Chronic nicotine intake in mice alters brain cell epigenetic control of gene expression through
acetylation of histones. This increases expression in the brain of the protein FosB, important in addiction.
Cigarette addiction was also studied in about 16,000 humans, including never smokers, current smokers, and those who had quit smoking for up to 30 years.
In blood cells, more than 18,000
CpG sites (of the roughly 450,000 analyzed CpG sites in the genome) had frequently altered methylation among current smokers. These CpG sites occurred in over 7,000 genes, or roughly a third of known human genes. The majority of the differentially methylated
CpG sites returned to the level of never-smokers within five years of smoking cessation. However, 2,568 CpGs among 942 genes remained differentially methylated in former versus never smokers. Such remaining epigenetic changes can be viewed as “molecular scars”
that may affect gene expression.
In rodent models, drugs of abuse, including cocaine,
methamphetamine,
alcohol
and tobacco smoke products,
all cause DNA damage in the brain. During repair of DNA damages some individual repair events can alter the methylation of DNA and/or the acetylations or methylations of histones at the sites of damage, and thus can contribute to leaving an epigenetic scar on chromatin.
Such epigenetic scars likely contribute to the persistent epigenetic changes found in addiction.
Regulation of transcription in learning and memory

In mammals, methylation of cytosine (see Figure) in DNA is a major regulatory mediator. Methylated cytosines primarily occur in dinucleotide sequences where cytosine is followed by a guanine, a
CpG site. The total number of
CpG sites in the human genome is approximately 28 million.
and generally about 70% of all CpG sites have a methylated cytosine.

In a rat, a painful learning experience, contextual
fear conditioning
Pavlovian fear conditioning is a behavioral paradigm in which organisms learn to predict aversive events. It is a form of learning in which an aversive stimulus (e.g. an electrical shock) is associated with a particular neutral context (e.g., a r ...
, can result in a life-long fearful memory after a single training event.
Cytosine methylation is altered in the promoter regions of about 9.17% of all genes in the hippocampus neuron DNA of a rat that has been subjected to a brief
fear conditioning
Pavlovian fear conditioning is a behavioral paradigm in which organisms learn to predict aversive events. It is a form of learning in which an aversive stimulus (e.g. an electrical shock) is associated with a particular neutral context (e.g., a r ...
experience.
The
hippocampus
The hippocampus (: hippocampi; via Latin from Ancient Greek, Greek , 'seahorse'), also hippocampus proper, is a major component of the brain of humans and many other vertebrates. In the human brain the hippocampus, the dentate gyrus, and the ...
is where new memories are initially stored.
Methylation of CpGs in a promoter region of a gene represses transcription
while methylation of CpGs in the body of a gene increases expression.
TET enzymes play a central role in demethylation of methylated cytosines. Demethylation of CpGs in a gene promoter by
TET enzyme activity increases transcription of the gene.
When contextual
fear conditioning
Pavlovian fear conditioning is a behavioral paradigm in which organisms learn to predict aversive events. It is a form of learning in which an aversive stimulus (e.g. an electrical shock) is associated with a particular neutral context (e.g., a r ...
is applied to a rat, more than 5,000
differentially methylated regions (DMRs) (of 500 nucleotides each) occur in the rat
hippocampus
The hippocampus (: hippocampi; via Latin from Ancient Greek, Greek , 'seahorse'), also hippocampus proper, is a major component of the brain of humans and many other vertebrates. In the human brain the hippocampus, the dentate gyrus, and the ...
neural genome both one hour and 24 hours after the conditioning in the hippocampus.
This causes about 500 genes to be up-regulated (often due to demethylation of CpG sites in a promoter region) and about 1,000 genes to be down-regulated (often due to newly formed 5-methylcytosine at CpG sites in a promoter region). The pattern of induced and repressed genes within neurons appears to provide a molecular basis for forming the first transient memory of this training event in the hippocampus of the rat brain.
Post-transcriptional regulation
After the DNA is transcribed and mRNA is formed, there must be some sort of regulation on how much the mRNA is translated into proteins. Cells do this by modulating the capping, splicing, addition of a Poly(A) Tail, the sequence-specific nuclear export rates, and, in several contexts, sequestration of the RNA transcript. These processes occur in eukaryotes but not in prokaryotes. This modulation is a result of a protein or transcript that, in turn, is regulated and may have an affinity for certain sequences.
Three prime untranslated regions and microRNAs
Three prime untranslated regions (3'-UTRs) of
messenger RNA
In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.
mRNA is created during the ...
s (mRNAs) often contain regulatory sequences that post-transcriptionally influence gene expression.
Such 3'-UTRs often contain both binding sites for
microRNA
Micro ribonucleic acid (microRNA, miRNA, μRNA) are small, single-stranded, non-coding RNA molecules containing 21–23 nucleotides. Found in plants, animals, and even some viruses, miRNAs are involved in RNA silencing and post-transcr ...
s (miRNAs) as well as for regulatory proteins. By binding to specific sites within the 3'-UTR, miRNAs can decrease gene expression of various mRNAs by either inhibiting translation or directly causing degradation of the transcript. The 3'-UTR also may have silencer regions that bind repressor proteins that inhibit the expression of a mRNA.
The 3'-UTR often contains
miRNA response elements (MREs). MREs are sequences to which miRNAs bind. These are prevalent motifs within 3'-UTRs. Among all regulatory motifs within the 3'-UTRs (e.g. including silencer regions), MREs make up about half of the motifs.
As of 2014, the
miRBase web site, an archive of miRNA
sequences
In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is call ...
and annotations, listed 28,645 entries in 233 biologic species. Of these, 1,881 miRNAs were in annotated human miRNA loci. miRNAs were predicted to have an average of about four hundred target mRNAs (affecting expression of several hundred genes).
Freidman et al.
estimate that >45,000 miRNA target sites within human mRNA 3'-UTRs are conserved above background levels, and >60% of human protein-coding genes have been under selective pressure to maintain pairing to miRNAs.
Direct experiments show that a single miRNA can reduce the stability of hundreds of unique mRNAs.
Other experiments show that a single miRNA may repress the production of hundreds of proteins, but that this repression often is relatively mild (less than 2-fold).
The effects of miRNA dysregulation of gene expression seem to be important in cancer.
For instance, in gastrointestinal cancers, a 2015 paper identified nine miRNAs as
epigenetically altered and effective in down-regulating DNA repair enzymes.
The effects of miRNA dysregulation of gene expression also seem to be important in neuropsychiatric disorders, such as
schizophrenia
Schizophrenia () is a mental disorder characterized variously by hallucinations (typically, Auditory hallucination#Schizophrenia, hearing voices), delusions, thought disorder, disorganized thinking and behavior, and Reduced affect display, f ...
,
bipolar disorder
Bipolar disorder (BD), previously known as manic depression, is a mental disorder characterized by periods of Depression (mood), depression and periods of abnormally elevated Mood (psychology), mood that each last from days to weeks, and in ...
,
major depressive disorder
Major depressive disorder (MDD), also known as clinical depression, is a mental disorder characterized by at least two weeks of pervasive depression (mood), low mood, low self-esteem, and anhedonia, loss of interest or pleasure in normally ...
,
Parkinson's disease
Parkinson's disease (PD), or simply Parkinson's, is a neurodegenerative disease primarily of the central nervous system, affecting both motor system, motor and non-motor systems. Symptoms typically develop gradually and non-motor issues become ...
,
Alzheimer's disease
Alzheimer's disease (AD) is a neurodegenerative disease and the cause of 60–70% of cases of dementia. The most common early symptom is difficulty in remembering recent events. As the disease advances, symptoms can include problems wit ...
and
autism spectrum
Autism, also known as autism spectrum disorder (ASD), is a neurodevelopmental disorder characterized by differences or difficulties in social communication and interaction, a preference for predictability and routine, sensory processing d ...
disorders.
Regulation of translation
The translation of mRNA can also be controlled by a number of mechanisms, mostly at the level of initiation. Recruitment of the small ribosomal subunit can indeed be modulated by mRNA secondary structure, antisense RNA binding, or protein binding. In both prokaryotes and eukaryotes, a large number of RNA binding proteins exist, which often are directed to their target sequence by the secondary structure of the transcript, which may change depending on certain conditions, such as temperature or presence of a ligand (aptamer). Some transcripts act as
ribozyme
Ribozymes (ribonucleic acid enzymes) are RNA molecules that have the ability to Catalysis, catalyze specific biochemical reactions, including RNA splicing in gene expression, similar to the action of protein enzymes. The 1982 discovery of ribozy ...
s and self-regulate their expression.
Examples of gene regulation
*
Enzyme induction is a process in which a molecule (e.g., a drug) induces (i.e., initiates or enhances) the expression of an enzyme.
* The induction of
heat shock protein
Heat shock proteins (HSPs) are a family of proteins produced by cells in response to exposure to stressful conditions. They were first described in relation to heat shock, but are now known to also be expressed during other stresses including ex ...
s in the fruit fly ''
Drosophila melanogaster
''Drosophila melanogaster'' is a species of fly (an insect of the Order (biology), order Diptera) in the family Drosophilidae. The species is often referred to as the fruit fly or lesser fruit fly, or less commonly the "vinegar fly", "pomace fly" ...
''.
* The
Lac operon is an interesting example of how gene expression can be regulated.
* Viruses, despite having only a few genes, possess mechanisms to regulate their gene expression, typically into an early and late phase, using collinear systems regulated by anti-terminators (
lambda phage
Lambda phage (coliphage λ, scientific name ''Lambdavirus lambda'') is a bacterial virus, or bacteriophage, that infects the bacterial species ''Escherichia coli'' (''E. coli''). It was discovered by Esther Lederberg in 1950. The wild type of ...
) or splicing modulators (
HIV).
*
Gal4 is a transcriptional activator that controls the expression of GAL1, GAL7, and GAL10 (all of which code for the metabolic of galactose in yeast). The
GAL4/UAS system has been used in a variety of organisms across various phyla to study gene expression.
Developmental biology
A large number of studied regulatory systems come from
developmental biology
Developmental biology is the study of the process by which animals and plants grow and develop. Developmental biology also encompasses the biology of Regeneration (biology), regeneration, asexual reproduction, metamorphosis, and the growth and di ...
. Examples include:
* The colinearity of the
Hox gene
Hox genes, a subset of homeobox, homeobox genes, are a gene cluster, group of related genes that Evolutionary developmental biology, specify regions of the body plan of an embryo along the craniocaudal axis, head-tail axis of animals. Hox protein ...
cluster with their nested antero-posterior patterning
* Pattern generation of the hand (digits - interdigits): the gradient of
sonic hedgehog
Sonic hedgehog protein (SHH) is a major signaling molecule of embryonic development in humans and animals, encoded by the ''SHH'' gene.
This signaling molecule is key in regulating embryonic morphogenesis in all animals. SHH controls organoge ...
(secreted inducing factor) from the
zone of polarizing activity in the limb, which creates a gradient of active Gli3, which activates Gremlin, which inhibits BMPs also secreted in the limb, results in the formation of an alternating pattern of activity as a result of this
reaction–diffusion system
Reaction–diffusion systems are mathematical models that correspond to several physical phenomena. The most common is the change in space and time of the concentration of one or more chemical substances: local chemical reactions in which the su ...
.
* Somitogenesis is the creation of segments (somites) from a uniform tissue (Pre-somitic
Mesoderm
The mesoderm is the middle layer of the three germ layers that develops during gastrulation in the very early development of the embryo of most animals. The outer layer is the ectoderm, and the inner layer is the endoderm.Langman's Medical ...
). They are formed sequentially from anterior to posterior. This is achieved in amniotes possibly by means of two opposing gradients, Retinoic acid in the anterior (wavefront) and Wnt and Fgf in the posterior, coupled to an oscillating pattern (segmentation clock) composed of FGF + Notch and Wnt in antiphase.
* Sex determination in the soma of a Drosophila requires the sensing of the ratio of autosomal genes to
sex chromosome
Sex chromosomes (also referred to as allosomes, heterotypical chromosome, gonosomes, heterochromosomes, or idiochromosomes) are chromosomes that
carry the genes that determine the sex of an individual. The human sex chromosomes are a typical pair ...
-encoded genes, which results in the production of sexless splicing factor in females, resulting in the female isoform of doublesex.
[Gilbert SF (2003). Developmental biology, 7th ed., Sunderland, Mass: Sinauer Associates, 65–6. .]
Circuitry
Up-regulation and down-regulation
Up-regulation is a process which occurs within a cell triggered by a signal (originating internal or external to the cell), which results in increased expression of one or more genes and as a result the proteins encoded by those genes. Conversely, down-regulation is a process resulting in decreased gene and corresponding protein expression.
*
Up-regulation occurs, for example, when a cell is deficient in some kind of receptor. In this case, more receptor protein is synthesized and transported to the membrane of the cell and, thus, the sensitivity of the cell is brought back to normal, reestablishing
homeostasis
In biology, homeostasis (British English, British also homoeostasis; ) is the state of steady internal physics, physical and chemistry, chemical conditions maintained by organism, living systems. This is the condition of optimal functioning fo ...
.
*
Down-regulation occurs, for example, when a cell is overstimulated by a
neurotransmitter
A neurotransmitter is a signaling molecule secreted by a neuron to affect another cell across a Chemical synapse, synapse. The cell receiving the signal, or target cell, may be another neuron, but could also be a gland or muscle cell.
Neurotra ...
,
hormone
A hormone (from the Ancient Greek, Greek participle , "setting in motion") is a class of cell signaling, signaling molecules in multicellular organisms that are sent to distant organs or tissues by complex biological processes to regulate physio ...
, or drug for a prolonged period of time, and the expression of the receptor protein is decreased in order to protect the cell (see also
tachyphylaxis).
Inducible vs. repressible systems

Gene Regulation can be summarized by the response of the respective system:
* Inducible systems - An inducible system is off unless there is the presence of some molecule (called an inducer) that allows for gene expression. The molecule is said to "induce expression". The manner by which this happens is dependent on the control mechanisms as well as differences between prokaryotic and eukaryotic cells.
* Repressible systems - A repressible system is on except in the presence of some molecule (called a corepressor) that suppresses gene expression. The molecule is said to "repress expression". The manner by which this happens is dependent on the control mechanisms as well as differences between prokaryotic and eukaryotic cells.
The
GAL4/UAS system is an example of both an inducible and repressible system.
Gal4 binds an upstream activation sequence (UAS) to activate the transcription of the GAL1/GAL7/GAL10 cassette. On the other hand, a
MIG1 response to the presence of glucose can inhibit GAL4 and therefore stop the expression of the GAL1/GAL7/GAL10 cassette.
Theoretical circuits
* Repressor/Inducer: an activation of a sensor results in the change of expression of a gene
* negative feedback: the gene product downregulates its own production directly or indirectly, which can result in
** keeping transcript levels constant/proportional to a factor
** inhibition of run-away reactions when coupled with a positive feedback loop
** creating an oscillator by taking advantage in the time delay of transcription and translation, given that the mRNA and protein half-life is shorter
* positive feedback: the gene product upregulates its own production directly or indirectly, which can result in
** signal amplification
** bistable switches when two genes inhibit each other and both have positive feedback
** pattern generation
Study methods

In general, most experiments investigating differential expression used whole cell extracts of RNA, called steady-state levels, to determine which genes changed and by how much. These are, however, not informative of where the regulation has occurred and may mask conflicting regulatory processes (''see
post-transcriptional regulation
Post-transcriptional regulation is the control of gene expression at the RNA level. It occurs once the RNA polymerase has been attached to the gene's promoter and is synthesizing the nucleotide sequence. Therefore, as the name indicates, it occur ...
''), but it is still the most commonly analysed (
quantitative PCR
A real-time polymerase chain reaction (real-time PCR, or qPCR when used quantitatively) is a laboratory technique of molecular biology based on the polymerase chain reaction (PCR). It monitors the amplification of a targeted DNA molecule duri ...
and
DNA microarray
A DNA microarray (also commonly known as a DNA chip or biochip) is a collection of microscopic DNA spots attached to a solid surface. Scientists use DNA microarrays to measure the expression levels of large numbers of genes simultaneously or t ...
).
When studying gene expression, there are several methods to look at the various stages. In eukaryotes these include:
* The local chromatin environment of the region can be determined by
ChIP-chip analysis by pulling down
RNA Polymerase II
RNA polymerase II (RNAP II and Pol II) is a Protein complex, multiprotein complex that Transcription (biology), transcribes DNA into precursors of messenger RNA (mRNA) and most small nuclear RNA (snRNA) and microRNA. It is one of the three RNA pol ...
,
Histone 3 modifications,
Trithorax-group protein,
Polycomb-group protein, or any other DNA-binding element to which a good antibody is available.
*
Epistatic
Epistasis is a phenomenon in genetics in which the effect of a gene mutation is dependent on the presence or absence of mutations in one or more other genes, respectively termed modifier genes. In other words, the effect of the mutation is depe ...
interactions can be investigated by
synthetic genetic array Synthetic genetic array analysis (SGA) is a high-throughput technique for exploring synthetic lethal and synthetic sick genetic interactions ( SSL). SGA allows for the systematic construction of double mutants using a combination of recombinant ...
analysis
* Due to post-transcriptional regulation, transcription rates and total RNA levels differ significantly. To measure the transcription rates
nuclear run-on assays can be done and newer high-throughput methods are being developed, using
thiol
In organic chemistry, a thiol (; ), or thiol derivative, is any organosulfur compound of the form , where R represents an alkyl or other organic substituent. The functional group itself is referred to as either a thiol group or a sulfhydryl grou ...
labelling instead of
radioactivity
Radioactive decay (also known as nuclear decay, radioactivity, radioactive disintegration, or nuclear disintegration) is the process by which an unstable atomic nucleus loses energy by radiation. A material containing unstable nuclei is conside ...
.
* Only 5% of the RNA polymerised in the nucleus exits,
and not only introns,
abortive products, and non-sense transcripts are degradated. Therefore, the differences in nuclear and cytoplasmic levels can be seen by separating the two fractions by gentle lysis.
* Alternative splicing can be analysed with a splicing array or with a tiling array (''see
DNA microarray
A DNA microarray (also commonly known as a DNA chip or biochip) is a collection of microscopic DNA spots attached to a solid surface. Scientists use DNA microarrays to measure the expression levels of large numbers of genes simultaneously or t ...
'').
* All
in vivo
Studies that are ''in vivo'' (Latin for "within the living"; often not italicized in English) are those in which the effects of various biological entities are tested on whole, living organisms or cells, usually animals, including humans, an ...
RNA is complexed as
RNPs. The quantity of transcripts bound to specific protein can be also analysed by
RIP-Chip. For example,
DCP2
mRNA-decapping enzyme 2 is a protein that in humans is encoded by the ''DCP2'' gene
In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is ...
will give an indication of sequestered protein;
ribosome
Ribosomes () are molecular machine, macromolecular machines, found within all cell (biology), cells, that perform Translation (biology), biological protein synthesis (messenger RNA translation). Ribosomes link amino acids together in the order s ...
-bound gives and indication of transcripts active in transcription (although a more dated method, called
polysome
A polyribosome (or polysome or ergosome) is a group of ribosomes bound to an mRNA molecule like “beads” on a “thread”. It consists of a complex of an mRNA molecule and two or more ribosomes that act to translate mRNA instructions into p ...
fractionation, is still popular in some labs)
* Protein levels can be analysed by
Mass spectrometry
Mass spectrometry (MS) is an analytical technique that is used to measure the mass-to-charge ratio of ions. The results are presented as a ''mass spectrum'', a plot of intensity as a function of the mass-to-charge ratio. Mass spectrometry is used ...
, which can be compared only to
quantitative PCR
A real-time polymerase chain reaction (real-time PCR, or qPCR when used quantitatively) is a laboratory technique of molecular biology based on the polymerase chain reaction (PCR). It monitors the amplification of a targeted DNA molecule duri ...
data, as
microarray
A microarray is a multiplex (assay), multiplex lab-on-a-chip. Its purpose is to simultaneously detect the expression of thousands of biological interactions. It is a two-dimensional array on a Substrate (materials science), solid substrate—usu ...
data is relative and not absolute.
* RNA and protein degradation rates are measured by means of transcription inhibitors (
actinomycin D or
α-Amanitin) or translation inhibitors (
Cycloheximide), respectively.
See also
*
Artificial transcription factors (small molecules that mimic transcription factor protein)
*
Cellular model
*
Conserved non-coding DNA sequence
*
Enhancer (genetics)
In genetics, an enhancer is a short (50–1500 bp) region of DNA that can be bound by proteins ( activators) to increase the likelihood that transcription of a particular gene will occur. These proteins are usually referred to as transcriptio ...
*
Gene structure
*
Spatiotemporal gene expression
Spatiotemporal gene expression is the gene expression, activation of genes within specific biological tissue, tissues of an organism at specific times during developmental biology, development. Gene activation patterns vary widely in complexity. ...
*
Regulator gene glucosyltransferases (Rgg/SHP) systems
Notes and references
Bibliography
*
External links
Plant Transcription Factor Database and Plant Transcriptional Regulation Data and Analysis Platform*
ChIPBaseAn open database for decoding the transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data.
{{DEFAULTSORT:Regulation Of Gene Expression
Gene expression
DNA
RNA
Post-translational modification
Evolutionary developmental biology