An E-box (enhancer box) is a DNA response element found in some eukaryotes that acts as a protein-binding site and has been found to regulate gene expression in neurons, muscles, and other tissues. Its specific DNA sequence, CANNTG (where N can be any nucleotide), with a palindromic canonical sequence of CACGTG, is recognized and bound by transcription factors to initiate gene transcription. Once the transcription factors bind to the promoters through the E-box, other enzymes can bind to the promoter and facilitate transcription from DNA to mRNA.
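As a rough illustration of the CANNTG consensus and the palindromic property of CACGTG, the following Python sketch scans a DNA string for E-box hexamers and reports whether each one equals its own reverse complement. The function names and the example sequence are made up for illustration and do not come from any particular library or dataset.

```python
import re

# CANNTG consensus: N stands for any of the four DNA bases.
# The lookahead lets overlapping sites be reported as well.
EBOX_CONSENSUS = re.compile(r"(?=(CA[ACGT]{2}TG))")

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_eboxes(seq: str):
    """Return (position, site, is_palindromic) for every CANNTG match."""
    seq = seq.upper()
    hits = []
    for match in EBOX_CONSENSUS.finditer(seq):
        site = match.group(1)
        hits.append((match.start(), site, site == reverse_complement(site)))
    return hits


# Hypothetical example sequence containing three E-box hexamers.
example = "TTGACACGTGAACAGCTGTTCATTTGA"
for position, site, palindromic in find_eboxes(example):
    print(position, site, "palindromic" if palindromic else "not palindromic")
```

Because the reverse complement of any CANNTG site is itself a CANNTG site, scanning one strand is enough to find every consensus match on both strands.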
Discovery
The E-box was discovered in a collaboration between Susumu Tonegawa's and Walter Gilbert's laboratories in 1985 as a control element in the immunoglobulin heavy-chain enhancer. They found that a region of 140 base pairs in the tissue-specific transcriptional enhancer element was sufficient for different levels of transcription enhancement in different tissues and sequences. They suggested that proteins made by specific tissues acted on these enhancers to activate sets of genes during cell differentiation.
In 1989, David Baltimore's lab discovered the first two E-box binding proteins, E12 and E47. These immunoglobulin enhancer-binding proteins could pair with other proteins through their bHLH domains and bind as heterodimers. In 1990, another E-protein, ITF-2A (later renamed E2-2Alt), was discovered that can bind to immunoglobulin light chain enhancers. Two years later, the third E-box binding protein, HEB, was discovered by screening a cDNA library from HeLa cells. A splice variant of E2-2 was discovered in 1997 and was found to inhibit the promoter of a muscle-specific gene.
Since then, researchers have established that the E-box affects gene transcription in several eukaryotes and found E-box binding factors that identify E-box consensus sequences. [Mädge B.: E-Box. In: Schwab M. (Ed.) Encyclopedia of Cancer. Springer-Verlag Berlin Heidelberg, 2009.] In particular, several experiments have shown that the E-box is an integral part of the transcription-translation feedback loop that comprises the circadian clock.
Binding
E-box binding proteins play a major role in regulating transcriptional activity. These proteins usually contain the basic helix-loop-helix (bHLH) protein structural motif, which allows them to bind as dimers. This motif consists of two amphipathic α-helices, separated by a small sequence of amino acids that forms one or more β-turns. The hydrophobic interactions between these α-helices stabilize dimerization. In addition, each bHLH monomer has a basic region, which helps mediate recognition between the bHLH monomer and the E-box (the basic region interacts with the major groove of the DNA). Depending on the DNA motif ("CAGCTG" versus "CACGTG"), the bHLH protein has a different set of basic residues.

E-box binding is modulated by Zn²⁺ in mice. The CT-rich region (CTRR) located about 23 nucleotides upstream of the E-box is important for E-box binding, transactivation (an increased rate of gene expression), and transcription of circadian genes by the BMAL1/NPAS2 and BMAL1/CLOCK complexes.
The binding specificity of different E-boxes is essential to their function. E-boxes with different functions differ in the number and type of binding factors they recruit.
The consensus sequence of the E-box is usually CANNTG; however, there exist other E-boxes of similar sequences called noncanonical E-boxes. These include, but are not limited to:
* CACGTT sequence 20 bp upstream of the mouse Period2 (PER2) gene, which regulates its expression
* CAGCTT sequence found within the MyoD core enhancer
* CACCTCGTGAC sequence in the proximal promoter region of human and rat APOE, which is a protein component of lipoproteins.
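One way to see how the hexamer examples above deviate from the CANNTG consensus is to count mismatches at the fixed (non-N) positions. The sketch below is only an illustration: the helper names are hypothetical, and the 11-base APOE sequence is left out because it cannot be compared position by position with a 6-base consensus.

```python
CONSENSUS = "CANNTG"
CANONICAL = "CACGTG"


def mismatches_to_consensus(site: str, consensus: str = CONSENSUS) -> int:
    """Count positions where `site` differs from a fixed consensus base.

    Positions marked N in the consensus accept any base and never count.
    """
    site = site.upper()
    if len(site) != len(consensus):
        raise ValueError("site and consensus must have the same length")
    return sum(1 for s, c in zip(site, consensus) if c != "N" and s != c)


def classify(site: str) -> str:
    """Label a hexamer as canonical, consensus, or noncanonical."""
    if site.upper() == CANONICAL:
        return "canonical"
    n = mismatches_to_consensus(site)
    if n == 0:
        return "consensus"
    return f"noncanonical ({n} mismatch(es) to CANNTG)"


for hexamer in ("CACGTG", "CAGCTG", "CACGTT", "CAGCTT"):
    print(hexamer, "->", classify(hexamer))
```

Under this simple measure, both CACGTT and CAGCTT differ from the consensus only at the final position.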
Role in the circadian clock
The link between E-box-regulated genes and the circadian clock was discovered in 1997, when Hao, Allen, and Hardin (Department of Biology at Texas A&M University) analyzed rhythmicity in the period (per) gene in ''Drosophila melanogaster''. They found a circadian transcriptional enhancer upstream of the per gene within a 69 bp DNA fragment. Depending upon PER protein levels, the enhancer drove high levels of mRNA transcription in both LD (light-dark) and DD (constant darkness) conditions. The enhancer was found to be necessary for high-level gene expression but not for circadian rhythmicity. It also works independently as a target of the BMAL1/CLOCK complex.
The E-box plays an important role in circadian genes; so far, nine E/E'-box-controlled circadian genes have been identified: PER1, PER2, BHLHB2, BHLHB3, CRY1, DBP, Nr1d1, Nr1d2, and RORC. As the E-box is connected to several circadian genes, it is possible that the genes and proteins associated with it are "crucial and vulnerable points in the (circadian) system."
The E-box is one of the top five transcription factor families associated with the circadian phase and is found in most tissues. A total of 320 E-box-controlled genes are found in the SCN (suprachiasmatic nucleus), liver, aorta, adrenal, WAT (white adipose tissue), brain, atria, ventricle, prefrontal cortex, skeletal muscle, BAT (brown adipose tissue), and calvarial bone.
E-box-like CLOCK-related elements (EL-box; GGCACGAGGC) are also important in maintaining circadian rhythmicity in clock-controlled genes. Similarly to the E-box, the E-box-like CLOCK-related element can also induce transcription of BMAL1/CLOCK, which can then lead to expression of other EL-box-containing genes (Ank, DBP, Nr1d1). However, there are differences between the EL-box and the regular E-box. Suppressing DEC1 and DEC2 has a stronger effect on the E-box than on the EL-box. Furthermore, HES1, which can bind to a different consensus sequence (CACNAG, known as the N-box), has a suppressive effect on the EL-box but not on the E-box.
Both non-canonical E-boxes and E-box-like sequences are crucial for circadian oscillation. Recent research has led to the hypothesis that either a canonical or non-canonical E-box followed by an E-box-like sequence, with a 6-base-pair interval in between, is a necessary combination for circadian transcription. In silico analysis also suggests that such an interval exists in other known clock-controlled genes.
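The spacing hypothesis above lends itself to a simple in silico search: look for an E-box, exactly six arbitrary bases, and then an E-box-like sequence. The Python sketch below is a minimal illustration under stated assumptions. The text does not define the E-box-like sequence, so `EBOX_LIKE` is a placeholder borrowed from the core of the EL-box sequence GGCACGAGGC quoted earlier, and the function name and example sequence are hypothetical.

```python
import re

# First element: consensus CANNTG, or the noncanonical CACGTT listed earlier.
EBOX = "CA[ACGT]{2}TG|CACGTT"

# Placeholder only (assumption): the text does not specify the E-box-like
# sequence, so the CACGAG core of the EL-box is used for illustration.
EBOX_LIKE = "CACGAG"


def find_spaced_pairs(seq: str, first: str, second: str, spacer: int = 6):
    """Find `first`, then exactly `spacer` bases, then `second`.

    `first` and `second` are regular-expression fragments. Returns a list of
    (start, first_site, spacer_bases, second_site) tuples.
    """
    seq = seq.upper()
    pattern = re.compile(rf"(?=({first})([ACGT]{{{spacer}}})({second}))")
    return [
        (m.start(), m.group(1), m.group(2), m.group(3))
        for m in pattern.finditer(seq)
    ]


# Hypothetical sequence: CACGTG, six spacer bases, then CACGAG.
example = "AACACGTGAAATTTCACGAGCC"
print(find_spaced_pairs(example, EBOX, EBOX_LIKE))
print(find_spaced_pairs(example, EBOX, EBOX_LIKE, spacer=5))  # no hit expected
```

Keeping the two patterns and the spacer length as parameters makes it easy to substitute whatever definition of an E-box-like sequence a given study uses.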
Role of proteins which bind to E-boxes
There are several proteins that bind to the E-box and affect gene transcription.
CLOCK-ARNTL complex
The CLOCK-ARNTL (BMAL1) complex is an integral part of the mammalian circadian cycle and vital in maintaining circadian rhythmicity.
Knowing that binding of this complex to the E-box in the promoter region activates transcription of the per gene, researchers discovered in 2002 that DEC1 and DEC2 (bHLH transcription factors) repressed the CLOCK-BMAL1 complex through direct interaction with BMAL1 and/or competition for E-box elements. They concluded that DEC1 and DEC2 were regulators of the mammalian molecular clock.
In 2006, Ripperger and Schibler discovered that the binding of this complex to the E-box drove circadian DBP transcription and chromatin transitions (a change from chromatin to facultative heterochromatin). It was concluded that CLOCK regulates DBP expression by binding to E-box motifs in enhancer regions located in the first and second introns.
MYC (c-Myc, an oncogene)
MYC (c-Myc), a gene that codes for the transcription factor Myc, is important in regulating mammalian cell proliferation and apoptosis.
In 1991, researchers tested whether c-Myc could bind to DNA by dimerizing it with E12. Dimers of E6, the chimeric protein, were able to bind to an E-box element (GGCCACGTGACC) that was recognized by other HLH proteins. Expression of E6 suppressed the function of c-Myc, which showed a link between the two.
In 1996, it was found that Myc heterodimerizes with MAX and that this heterodimeric complex could bind to the CAC(G/A)TG E-box sequence and activate transcription.
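Degenerate motifs such as CAC(G/A)TG are often written with IUPAC nucleotide codes, where R stands for A or G, so this site becomes CACRTG. The following Python sketch, with hypothetical helper names, converts an IUPAC motif into a regular expression and tests a few hexamers against it. It illustrates the notation only and does not model binding affinity.

```python
import re

# Standard IUPAC nucleotide ambiguity codes for DNA.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}


def iupac_to_regex(motif: str) -> re.Pattern:
    """Compile an IUPAC-coded DNA motif into a regular expression."""
    return re.compile("".join(IUPAC[base] for base in motif.upper()))


MYC_MAX_SITE = iupac_to_regex("CACRTG")   # CAC(G/A)TG from the text
EBOX_CONSENSUS = iupac_to_regex("CANNTG")
N_BOX = iupac_to_regex("CACNAG")          # N-box consensus from the text

for hexamer in ("CACGTG", "CACATG", "CACCTG"):
    print(hexamer, bool(MYC_MAX_SITE.fullmatch(hexamer)))
```

The same helper covers the general CANNTG consensus and the N-box consensus CACNAG mentioned earlier, since both are plain IUPAC strings.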
In 1998, it was concluded that the function of c-Myc depends upon activating transcription of particular genes through E-box elements.
MYOD1 (MyoD)
MyoD comes from the Mrf bHLH family and its main role is myogenesis, the formation of muscular tissue.
Other members in this family include myogenin, Myf5, Myf6, Mist1, and Nex-1.
When MyoD binds to the E-box motif CANNTG, muscle differentiation and expression of muscle-specific proteins are initiated. Researchers ablated various parts of the recombinant MyoD sequence and concluded that MyoD used encompassing elements to bind the E-box and the tetraplex structure of the promoter sequence of the muscle-specific genes α7 integrin and sarcomeric ''sMtCK''.
MyoD regulates HB-EGF (heparin-binding EGF-like growth factor), a member of the EGF (epidermal growth factor) family that stimulates cell growth and proliferation. HB-EGF plays a role in the development of hepatocellular carcinoma, prostate cancer, breast cancer, esophageal cancer, and gastric cancer.
MyoD can also bind to noncanonical E-boxes of MyoG and regulate its expression.
MyoG (Myogenin)
MyoG belongs to the MyoD transcription factor family. Binding of MyoG to the E-box is necessary for neuromuscular synapse formation, as an HDAC-Dach2-myogenin signaling pathway in skeletal muscle gene expression has been identified. Decreased MyoG expression has been observed in patients with muscle-wasting symptoms.
MyoG and MyoD have also been shown to be involved in myoblast differentiation. They act by transactivating cathepsin B promoter activity and inducing its mRNA expression.
TCF3 (E47)
E47 is produced by alternative splicing of E2A in E47-specific bHLH-encoding exons. Its role is to regulate tissue-specific gene expression and differentiation. Many kinases have been associated with E47, including 3pk and MK2. These two proteins form a complex with E47 and reduce its transcriptional activity. CKII and PKA have also been shown to phosphorylate E47 in vitro.
Similar to other E-box binding proteins, E47 also binds to the CANNTG sequence in the E-box. In homozygous E2A knock-out mice, B cell development stops before the DJ rearrangement stage and the B cells fail to mature. E47 has been shown to bind either as a heterodimer (with E12) or as a homodimer (though more weakly).
Recent research
Although the structural basis for how BMAL1/CLOCK interacts with the E-box is unknown, recent research has shown that the bHLH protein domains of BMAL1/CLOCK are highly similar to those of other bHLH-containing proteins, e.g. Myc/Max, which have been crystallized with E-boxes. It is surmised that specific bases are necessary to support this high-affinity binding. Furthermore, the sequence constraints on the region around the circadian E-box are not fully understood: it is believed to be necessary but not sufficient for E-boxes to be randomly spaced from each other in the genetic sequence in order for circadian transcription to occur. Recent research involving the E-box has been aimed at finding more binding proteins as well as discovering more mechanisms for inhibiting binding.
Researchers at the Medical School of Nanjing University found that the amplitude of FBXL3 (F-box/leucine-rich repeat protein) is expressed via an E-box. They studied mice with FBXL3 deficiency and found that it regulates feedback loops in circadian rhythms by affecting circadian period length.
A study published April 4, 2013 by researchers at Harvard Medical School found that the nucleotides on either side of an E-box influence which transcription factors can bind to the E-box itself. These nucleotides determine the 3-D spatial arrangement of the DNA strand and restrict the size of binding transcription factors. The study also found differences in binding patterns between in vivo and in vitro strands.
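Since flanking nucleotides appear to matter, an analysis of E-box sites would usually record the bases on either side of each hexamer as well as the core. The Python sketch below shows one possible way to do this. The flank width of two bases, the function name, and the example sequence are assumptions chosen for illustration and are not taken from the study.

```python
import re

EBOX = re.compile(r"(?=(CA[ACGT]{2}TG))")
EBOX_LENGTH = 6


def eboxes_with_flanks(seq: str, flank: int = 2):
    """Return (upstream, core, downstream) for each CANNTG site.

    Sites too close to either end of the sequence to have complete flanks
    are skipped.
    """
    seq = seq.upper()
    results = []
    for match in EBOX.finditer(seq):
        start = match.start()
        end = start + EBOX_LENGTH
        if start < flank or end + flank > len(seq):
            continue
        results.append((seq[start - flank:start], match.group(1), seq[end:end + flank]))
    return results


# Hypothetical sequence: the same CACGTG core in two different contexts.
example = "GGCACGTGTTAACACGTGCC"
for upstream, core, downstream in eboxes_with_flanks(example):
    print(upstream, core, downstream)
```

Grouping sites by their flanks rather than by the core alone is one way to compare contexts that share the same CANNTG hexamer.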