DNA-binding proteins are
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
s that have
DNA-binding domain
A DNA-binding domain (DBD) is an independently folded protein domain that contains at least one structural motif that recognizes double- or single-stranded DNA. A DBD can recognize a specific DNA sequence (a recognition sequence) or have a gener ...
s and thus have a specific or general affinity for
single- or double-stranded DNA. Sequence-specific DNA-binding proteins generally interact with the
major groove of
B-DNA, because it exposes more
functional group
In organic chemistry, a functional group is a substituent or moiety in a molecule that causes the molecule's characteristic chemical reactions. The same functional group will undergo the same or similar chemical reactions regardless of the res ...
s that identify a
base pair. However, there are some known
minor groove DNA-binding ligands such as
netropsin,
distamycin,
Hoechst 33258,
pentamidine,
DAPI and others.
Examples
DNA-binding
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
s include
transcription factor
In molecular biology, a transcription factor (TF) (or sequence-specific DNA-binding factor) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The fu ...
s which
modulate the process of transcription, various
polymerase
A polymerase is an enzyme ( EC 2.7.7.6/7/19/48/49) that synthesizes long chains of polymers or nucleic acids. DNA polymerase and RNA polymerase are used to assemble DNA and RNA molecules, respectively, by copying a DNA template strand using b ...
s,
nucleases which cleave DNA molecules, and
histone
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei. They act as spools around which DNA winds to create structural units called nucleosomes. Nucleosomes in turn ar ...
s which are involved in
chromosome
A chromosome is a long DNA molecule with part or all of the genetic material of an organism. In most chromosomes the very long thin DNA fibers are coated with packaging proteins; in eukaryotic cells the most important of these proteins ar ...
packaging and transcription in the
cell nucleus. DNA-binding proteins can incorporate such domains as the
zinc finger
A zinc finger is a small protein structural motif that is characterized by the coordination of one or more zinc ions (Zn2+) in order to stabilize the fold. It was originally coined to describe the finger-like appearance of a hypothesized struct ...
, the
helix-turn-helix
Helix-turn-helix is a DNA-binding protein (DBP). The helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove o ...
, and the
leucine zipper (among many others) that facilitate binding to nucleic acid. There are also more unusual examples such as
transcription activator like effectors.
Non-specific DNA-protein interactions
Structural proteins that bind DNA are well-understood examples of non-specific DNA-protein interactions. Within chromosomes, DNA is held in complexes with structural proteins. These proteins organize the DNA into a compact structure called
chromatin
Chromatin is a complex of DNA and protein found in eukaryote, eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important ...
. In
eukaryote
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bact ...
s, this structure involves DNA binding to a complex of small basic proteins called
histone
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei. They act as spools around which DNA winds to create structural units called nucleosomes. Nucleosomes in turn ar ...
s. In
prokaryote
A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Con ...
s, multiple types of proteins are involved. The histones form a disk-shaped complex called a
nucleosome, which contains two complete turns of double-stranded DNA wrapped around its surface. These non-specific interactions are formed through basic residues in the histones making
ionic bond
Ionic bonding is a type of chemical bonding that involves the electrostatic attraction between oppositely charged ions, or between two atoms with sharply different electronegativities, and is the primary interaction occurring in ionic compoun ...
s to the acidic sugar-phosphate backbone of the DNA, and are therefore largely independent of the base sequence.
Chemical
A chemical substance is a form of matter having constant chemical composition and characteristic properties. Some references add that chemical substance cannot be separated into its constituent elements by physical separation methods, i.e., wit ...
modifications of these basic
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha ...
residues include
methylation
In the chemical sciences, methylation denotes the addition of a methyl group on a substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These t ...
,
phosphorylation
In chemistry, phosphorylation is the attachment of a phosphate group to a molecule or an ion. This process and its inverse, dephosphorylation, are common in biology and could be driven by natural selection. Text was copied from this source, ...
and
acetylation. These chemical changes alter the strength of the interaction between the DNA and the histones, making the DNA more or less accessible to
transcription factor
In molecular biology, a transcription factor (TF) (or sequence-specific DNA-binding factor) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The fu ...
s and changing the rate of transcription. Other non-specific DNA-binding proteins in chromatin include the high-mobility group (HMG) proteins, which bind to bent or distorted DNA. Biophysical studies show that these architectural HMG proteins bind, bend and loop DNA to perform its biological functions. These proteins are important in bending arrays of nucleosomes and arranging them into the larger structures that form chromosomes. Recently FK506 binding protein 25 (FBP25) was also shown to non-specifically bind to DNA which helps in DNA repair.
Proteins that specifically bind single-stranded DNA
A distinct group of DNA-binding proteins are the DNA-binding proteins that specifically bind single-stranded DNA. In humans,
replication protein A
Replication protein A (RPA) is the major protein that binds to single-stranded DNA (ssDNA) in eukaryotic cells. In vitro, RPA shows a much higher affinity for ssDNA than RNA or double-stranded DNA. RPA is required in replication, recombinatio ...
is the best-understood member of this family and is used in processes where the double helix is separated, including DNA replication, recombination and DNA repair. These binding proteins seem to stabilize single-stranded DNA and protect it from forming
stem-loop
Stem-loop intramolecular base pairing is a pattern that can occur in single-stranded RNA. The structure is also known as a hairpin or hairpin loop. It occurs when two regions of the same strand, usually complementary in nucleotide sequence wh ...
s or being degraded by
nucleases.
Binding to specific DNA sequences

In contrast, other proteins have evolved to bind to specific DNA sequences. The most intensively studied of these are the various
transcription factor
In molecular biology, a transcription factor (TF) (or sequence-specific DNA-binding factor) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The fu ...
s, which are proteins that regulate transcription. Each transcription factor binds to one specific set of DNA sequences and activates or inhibits the transcription of genes that have these sequences near their promoters. The transcription factors do this in two ways. Firstly, they can bind the RNA polymerase responsible for transcription, either directly or through other mediator proteins; this locates the polymerase at the promoter and allows it to begin transcription. Alternatively, transcription factors can bind
enzyme
Enzymes () are proteins that act as biological catalysts by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different molecule ...
s that modify the histones at the promoter. This alters the accessibility of the DNA template to the polymerase.
These DNA targets can occur throughout an organism's genome. Thus, changes in the activity of one type of transcription factor can affect thousands of genes. Thus, these proteins are often the targets of the
signal transduction
Signal transduction is the process by which a chemical or physical signal is transmitted through a cell as a series of molecular events, most commonly protein phosphorylation catalyzed by protein kinases, which ultimately results in a cellular ...
processes that control responses to environmental changes or
cellular differentiation and development. The specificity of these transcription factors' interactions with DNA come from the proteins making multiple contacts to the edges of the DNA bases, allowing them to ''read'' the DNA sequence. Most of these base-interactions are made in the major groove, where the bases are most accessible. Mathematical descriptions of protein-DNA binding taking into account sequence-specificity, and competitive and cooperative binding of proteins of different types are usually performed with the help of the
lattice models. Computational methods to identify the DNA binding sequence specificity have been proposed to make a good use of the abundant sequence data in the post-genomic era.
Protein–DNA interactions
Protein–DNA interactions occur when a
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
binds a molecule of
DNA, often to regulate the
biological function of DNA, usually the
expression of a
gene
In biology, the word gene (from , ; "... Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
. Among the proteins that bind to DNA are
transcription factors
In molecular biology, a transcription factor (TF) (or sequence-specific DNA-binding factor) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The f ...
that activate or repress gene expression by binding to DNA motifs and
histones
In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei. They act as spools around which DNA winds to create structural units called nucleosomes. Nucleosomes in turn are ...
that form part of the structure of DNA and bind to it less specifically. Also proteins that
repair DNA such as
uracil-DNA glycosylase
Uracil-DNA glycosylase is also known as UNG or UDG. Its most important function is to prevent mutagenesis by eliminating uracil from DNA molecules by cleaving the N-glycosidic bond and initiating the base-excision repair (BER) pathway.
Function ...
interact closely with it.
In general, proteins bind to DNA in the
major groove; however, there are exceptions.
Protein–DNA interaction are of mainly two types, either specific interaction, or non-specific interaction. Recent single-molecule experiments showed that DNA binding proteins undergo of rapid rebinding in order to bind in correct orientation for recognizing the target site.
Design
Designing DNA-binding proteins that have a specified DNA-binding site has been an important goal for biotechnology.
Zinc finger
A zinc finger is a small protein structural motif that is characterized by the coordination of one or more zinc ions (Zn2+) in order to stabilize the fold. It was originally coined to describe the finger-like appearance of a hypothesized struct ...
proteins have been designed to bind to specific DNA sequences and this is the basis of
zinc finger nucleases. Recently
transcription activator-like effector nucleases (TALENs) have been created which are based on natural
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
s secreted by ''
Xanthomonas'' bacteria via their
type III secretion system when they infect various
plant
Plants are predominantly Photosynthesis, photosynthetic eukaryotes of the Kingdom (biology), kingdom Plantae. Historically, the plant kingdom encompassed all living things that were not animals, and included algae and fungi; however, all curr ...
species.
Detection methods
There are many ''in vitro'' and ''in vivo'' techniques which are useful in detecting DNA-Protein Interactions. The following lists some methods currently in use:
Electrophoretic mobility shift assay (EMSA) is a widespread qualitative technique to study protein–DNA interactions of known DNA binding proteins.
DNA-Protein-Interaction - Enzyme-Linked ImmunoSorbant Assay (DPI-ELISA) allows the qualitative and quantitative analysis of DNA-binding preferences of known proteins ''in vitro''. This technique allows the analysis of protein complexes that bind to DNA (DPI-Recruitment-ELISA) or is suited for automated screening of several nucleotide probes due to its standard ELISA plate formate .
DNase footprinting assay can be used to identify the specific sites of binding of a protein to DNA at basepair resolution.
Chromatin immunoprecipitation is used to identify the ''in vivo'' DNA target regions of a known transcription factor. This technique when combined with high throughput sequencing is known as
ChIP-Seq and when combined with
microarray
A microarray is a multiplex lab-on-a-chip. Its purpose is to simultaneously detect the expression of thousands of genes from a sample (e.g. from a tissue). It is a two-dimensional array on a solid substrate—usually a glass slide or silic ...
s it is known as
ChIP-chip.
Yeast one-hybrid System (Y1H) is used to identify which protein binds to a particular DNA fragment.
Bacterial one-hybrid system (B1H) is used to identify which protein binds to a particular DNA fragment. Structure determination using
X-ray crystallography
X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angle ...
has been used to give a highly detailed atomic view of protein–DNA interactions.
Besides these methods, other techniques such as SELEX, PBM (protein binding microarrays), DNA microarray screens, DamID, FAIRE or more recently DAP-seq are used in the laboratory to investigate DNA-protein interaction ''in vivo'' and ''in vitro''.
Manipulating the interactions
The protein–DNA interactions can be modulated using stimuli like ionic strength of the buffer, macromolecular crowding,
temperature, pH and electric field. This can lead to reversible dissociation/association of the protein–DNA complex.
See also
*
bZIP domain
*
ChIP-exo
*
Comparison of nucleic acid simulation software
*
DNA-binding domain
A DNA-binding domain (DBD) is an independently folded protein domain that contains at least one structural motif that recognizes double- or single-stranded DNA. A DBD can recognize a specific DNA sequence (a recognition sequence) or have a gener ...
*
Helix-loop-helix
*
Helix-turn-helix
Helix-turn-helix is a DNA-binding protein (DBP). The helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove o ...
*
HMG-box
*
Leucine zipper
*
Lexitropsin (a semi-synthetic DNA-binding ligand)
*
Deoxyribonucleoprotein
Nucleoproteins are proteins conjugated with nucleic acids (either DNA or RNA). Typical nucleoproteins include ribosomes, nucleosomes and viral nucleocapsid proteins.
Structures
Nucleoproteins tend to be positively charged, facilitating inte ...
*
Protein–DNA interaction site prediction software
*
RNA-binding protein
RNA-binding proteins (often abbreviated as RBPs) are proteins that bind to the double or single stranded RNA in cells and participate in forming ribonucleoprotein complexes.
RBPs contain various structural motifs, such as RNA recognition moti ...
*
Single-strand binding protein
*
Zinc finger
A zinc finger is a small protein structural motif that is characterized by the coordination of one or more zinc ions (Zn2+) in order to stabilize the fold. It was originally coined to describe the finger-like appearance of a hypothesized struct ...
References
External links
Protein-DNA binding: data, tools & models (annotated list, constantly updated)tool for modeling DNA-ligand interactions.
DBD database of predicted transcription factorsUses a curated set of DNA-binding domains to predict transcription factors in all completely sequenced genomes
*
{{DEFAULTSORT:Dna-Binding Protein
DNA-binding proteins
Molecular genetics
DNA replication
Transcription factors
Biophysics