
A palindromic sequence is a
nucleic acid
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a pentose, 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nuclei ...
sequence in a double-stranded
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
or
RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
molecule whereby reading in a certain direction (e.g.
5' to 3') on one strand is identical to the sequence in the same direction (e.g. 5' to 3') on the
complementary strand. This definition of palindrome thus depends on complementary strands being palindromic of each other.
The meaning of
palindrome
A palindrome (Help:IPA/English, /ˈpæl.ɪn.droʊm/) is a word, palindromic number, number, phrase, or other sequence of symbols that reads the same backwards as forwards, such as ''madam'' or ''racecar'', the date "Twosday, 02/02/2020" and th ...
in the context of
genetics
Genetics is the study of genes, genetic variation, and heredity in organisms.Hartl D, Jones E (2005) It is an important branch in biology because heredity is vital to organisms' evolution. Gregor Mendel, a Moravian Augustinians, Augustinian ...
is slightly different from the definition used for words and sentences. Since a
double helix
In molecular biology, the term double helix refers to the structure formed by base pair, double-stranded molecules of nucleic acids such as DNA. The double Helix, helical structure of a nucleic acid complex arises as a consequence of its Nuclei ...
is formed by two paired
antiparallel strands of
nucleotides
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
that run in opposite
directions, and the nucleotides always pair in the same way (
adenine
Adenine (, ) (nucleoside#List of nucleosides and corresponding nucleobases, symbol A or Ade) is a purine nucleotide base that is found in DNA, RNA, and Adenosine triphosphate, ATP. Usually a white crystalline subtance. The shape of adenine is ...
(A) with
thymine
Thymine () (symbol T or Thy) is one of the four nucleotide bases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine ...
(T) in DNA or
uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
(U) in RNA;
cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
(C) with
guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
(G)), a (single-stranded) nucleotide sequence is said to be a palindrome if it is equal to its
reverse complement. For example, the DNA sequence
ACCTAGGT
is palindromic with its nucleotide-by-nucleotide
complement TGGATCCA
because reversing the order of the nucleotides in the complement gives the original sequence.
A palindromic nucleotide sequence is capable of forming a
hairpin. The stem portion of the hairpin is a ''pseudo-double stranded'' portion since the entire hairpin is a part of same (single) strand of nucleic acid. Palindromic
motifs are found in most
genome
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
s or sets of
gene
In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protei ...
tic instructions. They have been specially researched in
bacteria
Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
l
chromosome
A chromosome is a package of DNA containing part or all of the genetic material of an organism. In most chromosomes, the very long thin DNA fibers are coated with nucleosome-forming packaging proteins; in eukaryotic cells, the most import ...
s and in the so-called Bacterial Interspersed Mosaic Elements (BIMEs) scattered over them. In 2008, a genome sequencing project discovered that large portions of the human
X and
Y chromosomes are arranged as palindromes.
A palindromic structure allows the Y chromosome to repair itself by bending over at the middle if one side is damaged.
Palindromes also appear to be found frequently in the
peptide
Peptides are short chains of amino acids linked by peptide bonds. A polypeptide is a longer, continuous, unbranched peptide chain. Polypeptides that have a molecular mass of 10,000 Da or more are called proteins. Chains of fewer than twenty am ...
sequences that make up proteins,
[
] but their role in protein function is not clearly known. It has been suggested that the existence of palindromes in peptides might be related to the prevalence of low-complexity regions in proteins, as palindromes are frequently associated with low-complexity sequences. Their prevalence may also be related to the propensity of such sequences to form
alpha helices or protein/protein complexes.
Examples
Restriction enzyme sites
Palindromic sequences play an important role in
molecular biology
Molecular biology is a branch of biology that seeks to understand the molecule, molecular basis of biological activity in and between Cell (biology), cells, including biomolecule, biomolecular synthesis, modification, mechanisms, and interactio ...
. Because a DNA sequence is double stranded, the
base pair
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
s are read, (not just the bases on one strand), to determine a palindrome. Many
restriction endonucleases (restriction enzymes) recognize specific palindromic sequences and cut them. The restriction enzyme
EcoR1 recognizes the following palindromic sequence:
5'- G A A T T C -3'
3'- C T T A A G -5'
The top strand reads 5'-GAATTC-3', while the bottom strand reads 3'-CTTAAG-5'. If the DNA strand is flipped over, the sequences are exactly the same (5'GAATTC-3' and 3'-CTTAAG-5'). Here are more restriction enzymes and the palindromic sequences which they recognize:
Methylation sites
Palindromic sequences may also have
methylation
Methylation, in the chemistry, chemical sciences, is the addition of a methyl group on a substrate (chemistry), substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replac ...
sites.
These are the sites where a methyl group can be attached to the palindromic sequence. Methylation makes the resultant gene inactive; this is called insertional inactivation or
insertional mutagenesis. For example, in
PBR322 methylation at the tetracyclin resistant gene makes the plasmid liable to tetracyclin; after methylation at the tetracyclin resistant gene if the plasmid is exposed to
antibiotic
An antibiotic is a type of antimicrobial substance active against bacteria. It is the most important type of antibacterial agent for fighting pathogenic bacteria, bacterial infections, and antibiotic medications are widely used in the therapy ...
tetracyclin, it does not survive.
Palindromic nucleotides in T cell receptors
Diversity of
T cell receptor (TCR) genes is generated by
nucleotide
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
insertions upon
V(D)J recombination
V(D)J recombination (variable–diversity–joining rearrangement) is the mechanism of somatic recombination that occurs only in developing lymphocytes during the early stages of T and B cell maturation. It results in the highly diverse repertoire ...
from their
germline-encoded V, D and J segments. Nucleotide insertions at V-D and D-J junctions are random, but some small subsets of these insertions are exceptional, in that one to three
base pairs
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
inversely repeat the sequence of the germline DNA. These short complementary palindromic sequences are called
P nucleotides.
References
{{DEFAULTSORT:Palindromic Sequence
DNA
Palindromes