Pyrimidine (; ) is an
aromatic
In organic chemistry, aromaticity is a chemical property describing the way in which a conjugated system, conjugated ring of unsaturated bonds, lone pairs, or empty orbitals exhibits a stabilization stronger than would be expected from conjugati ...
,
heterocyclic,
organic compound
Some chemical authorities define an organic compound as a chemical compound that contains a carbon–hydrogen or carbon–carbon bond; others consider an organic compound to be any chemical compound that contains carbon. For example, carbon-co ...
similar to
pyridine ().
One of the three
diazines (six-membered heterocyclics with two
nitrogen
Nitrogen is a chemical element; it has Symbol (chemistry), symbol N and atomic number 7. Nitrogen is a Nonmetal (chemistry), nonmetal and the lightest member of pnictogen, group 15 of the periodic table, often called the Pnictogen, pnictogens. ...
atoms in the ring), it has nitrogen atoms at positions 1 and 3 in the ring.
The other diazines are
pyrazine (nitrogen atoms at the 1 and 4 positions) and
pyridazine (nitrogen atoms at the 1 and 2 positions).
In
nucleic acids
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic a ...
, three types of
nucleobases
Nucleotide bases (also nucleobases, nitrogenous bases) are nitrogen-containing biological compounds that form nucleosides, which, in turn, are components of nucleotides, with all of these monomers constituting the basic building blocks of nuc ...
are pyrimidine
derivative
In mathematics, the derivative is a fundamental tool that quantifies the sensitivity to change of a function's output with respect to its input. The derivative of a function of a single variable at a chosen input value, when it exists, is t ...
s:
cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
(C),
thymine (T), and
uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
(U).
Occurrence and history

The pyrimidine ring system has wide occurrence in nature
as substituted and ring fused compounds and derivatives, including the
nucleotides
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
,
thymine and
uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
,
thiamine
Thiamine, also known as thiamin and vitamin B1, is a vitamin – an Nutrient#Micronutrients, essential micronutrient for humans and animals. It is found in food and commercially synthesized to be a dietary supplement or medication. Phosp ...
(vitamin B1) and
alloxan. It is also found in many synthetic compounds such as
barbiturates and the HIV drug
zidovudine. Although pyrimidine derivatives such as alloxan were known in the early 19th century, a laboratory synthesis of a pyrimidine was not carried out until 1879,
[ when Grimaux reported the preparation of barbituric acid from ]urea
Urea, also called carbamide (because it is a diamide of carbonic acid), is an organic compound with chemical formula . This amide has two Amine, amino groups (–) joined by a carbonyl functional group (–C(=O)–). It is thus the simplest am ...
and malonic acid in the presence of phosphorus oxychloride.[ ]
The systematic study of pyrimidines began in 1884 with Pinner,[ ]
who synthesized derivatives by condensing ethyl acetoacetate with amidine
Amidines are organic compounds with the functional group RC(NR)NR2, where the R groups can be the same or different. They are the imine derivatives of amides (RC(O)NR2). The simplest amidine is formamidine, HC(=NH)NH2.
Examples of amidines includ ...
s. Pinner first proposed the name “pyrimidin” in 1885.[ ] The parent compound was first prepared by Gabriel
In the Abrahamic religions (Judaism, Christianity, Islam), Gabriel ( ) is an archangel with the power to announce God's will to mankind, as the messenger of God. He is mentioned in the Hebrew Bible, the New Testament and the Quran. Many Chris ...
and Colman in 1900,[ ]
by conversion of barbituric acid to 2,4,6-trichloropyrimidine followed by reduction using zinc
Zinc is a chemical element; it has symbol Zn and atomic number 30. It is a slightly brittle metal at room temperature and has a shiny-greyish appearance when oxidation is removed. It is the first element in group 12 (IIB) of the periodic tabl ...
dust in hot water.
Nomenclature
The nomenclature of pyrimidines is straightforward. However, like other heterocyclics, tautomeric hydroxyl groups yield complications since they exist primarily in the cyclic amide
In organic chemistry, an amide, also known as an organic amide or a carboxamide, is a chemical compound, compound with the general formula , where R, R', and R″ represent any group, typically organyl functional group, groups or hydrogen at ...
form. For example, 2-hydroxypyrimidine is more properly named 2-pyrimidone. A partial list of trivial names of various pyrimidines exists.
Physical properties
Physical properties are shown in the data box. A more extensive discussion, including spectra, can be found in Brown ''et al.''
Chemical properties
Per the classification by Albert, six-membered heterocycles can be described as π-deficient. Substitution by electronegative groups or additional nitrogen atoms in the ring significantly increase the π-deficiency. These effects also decrease the basicity.
Like pyridines, in pyrimidines the π-electron density is decreased to an even greater extent. Therefore, electrophilic aromatic substitution is more difficult while nucleophilic aromatic substitution is facilitated. An example of the last reaction type is the displacement of the amino group in 2-aminopyrimidine by chlorine
Chlorine is a chemical element; it has Symbol (chemistry), symbol Cl and atomic number 17. The second-lightest of the halogens, it appears between fluorine and bromine in the periodic table and its properties are mostly intermediate between ...
and its reverse.
Electron lone pair
In chemistry, a lone pair refers to a pair of valence electrons that are not shared with another atom in a covalent bondIUPAC ''Gold Book'' definition''lone (electron) pair''/ref> and is sometimes called an unshared pair or non-bonding pair. Lone ...
availability (basicity
In chemistry, there are three definitions in common use of the word "base": ''Arrhenius bases'', ''Brønsted bases'', and ''Lewis bases''. All definitions agree that bases are substances that react with acids, as originally proposed by Guilla ...
) is decreased compared to pyridine. Compared to pyridine, ''N''-alkylation and ''N''-oxidation are more difficult. The p''K''a value for protonated pyrimidine is 1.23 compared to 5.30 for pyridine. Protonation and other electrophilic additions will occur at only one nitrogen due to further deactivation by the second nitrogen. The 2-, 4-, and 6- positions on the pyrimidine ring are electron deficient analogous to those in pyridine and nitro- and dinitrobenzene. The 5-position is less electron deficient and substituents there are quite stable. However, electrophilic substitution is relatively facile at the 5-position, including nitration and halogenation.
Reduction in resonance stabilization of pyrimidines may lead to addition and ring cleavage reactions rather than substitutions. One such manifestation is observed in the Dimroth rearrangement.
Pyrimidine is also found in meteorite
A meteorite is a rock (geology), rock that originated in outer space and has fallen to the surface of a planet or Natural satellite, moon. When the original object enters the atmosphere, various factors such as friction, pressure, and chemical ...
s, but scientists still do not know its origin. Pyrimidine also photolytically decomposes into uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
under ultraviolet
Ultraviolet radiation, also known as simply UV, is electromagnetic radiation of wavelengths of 10–400 nanometers, shorter than that of visible light, but longer than X-rays. UV radiation is present in sunlight and constitutes about 10% of ...
light.
Synthesis
Pyrimidine biosynthesis creates derivatives —like orotate, thymine, cytosine, and uracil— ''de novo'' from carbamoyl phosphate and aspartate.
As is often the case with parent heterocyclic ring systems, the synthesis of pyrimidine is not that common and is usually performed by removing functional groups from derivatives. Primary syntheses in quantity involving formamide have been reported.
As a class, pyrimidines are typically synthesized by the principal synthesis involving cyclization of β-di carbonyl compounds with N–C–N compounds. Reaction of the former with amidine
Amidines are organic compounds with the functional group RC(NR)NR2, where the R groups can be the same or different. They are the imine derivatives of amides (RC(O)NR2). The simplest amidine is formamidine, HC(=NH)NH2.
Examples of amidines includ ...
s to give 2-substituted pyrimidines, with urea
Urea, also called carbamide (because it is a diamide of carbonic acid), is an organic compound with chemical formula . This amide has two Amine, amino groups (–) joined by a carbonyl functional group (–C(=O)–). It is thus the simplest am ...
to give 2- pyrimidinones, and guanidines to give 2- aminopyrimidines are typical.
Pyrimidines can be prepared via the Biginelli reaction and other multicomponent reactions. Many other methods rely on condensation
Condensation is the change of the state of matter from the gas phase into the liquid phase, and is the reverse of vaporization. The word most often refers to the water cycle. It can also be defined as the change in the state of water vapor ...
of carbonyls with diamines for instance the synthesis of 2-thio-6-methyluracil from thiourea and ethyl acetoacetate or the synthesis of 4-methylpyrimidine with 4,4-dimethoxy-2-butanone and formamide.
A novel method is by reaction of ''N''-vinyl and ''N''-aryl amide
In organic chemistry, an amide, also known as an organic amide or a carboxamide, is a chemical compound, compound with the general formula , where R, R', and R″ represent any group, typically organyl functional group, groups or hydrogen at ...
s with carbonitriles under electrophilic activation of the amide with 2-chloro-pyridine and trifluoromethanesulfonic anhydride:
:
Reactions
Because of the decreased basicity compared to pyridine, electrophilic substitution of pyrimidine is less facile. Protonation
In chemistry, protonation (or hydronation) is the adding of a proton (or hydron, or hydrogen cation), usually denoted by H+, to an atom, molecule, or ion, forming a conjugate acid. (The complementary process, when a proton is removed from a Brø ...
or alkylation Alkylation is a chemical reaction that entails transfer of an alkyl group. The alkyl group may be transferred as an alkyl carbocation, a free radical, a carbanion, or a carbene (or their equivalents). Alkylating agents are reagents for effecting al ...
typically takes place at only one of the ring nitrogen atoms. Mono-''N''-oxidation occurs by reaction with peracids.
Electrophilic ''C''-substitution of pyrimidine occurs at the 5-position, the least electron-deficient. Nitration, nitrosation, azo coupling, halogenation, sulfonation, formylation, hydroxymethylation, and aminomethylation have been observed with substituted pyrimidines.
Nucleophilic ''C''-substitution should be facilitated at the 2-, 4-, and 6-positions but there are only a few examples. Amination and hydroxylation have been observed for substituted pyrimidines. Reactions with Grignard or alkyllithium reagents yield 4-alkyl- or 4-aryl pyrimidine after aromatization.
Free radical attack has been observed for pyrimidine and photochemical reactions have been observed for substituted pyrimidines. Pyrimidine can be hydrogenated to give tetrahydropyrimidine.
Derivatives
Nucleotides
Three nucleobase
Nucleotide bases (also nucleobases, nitrogenous bases) are nitrogen-containing biological compounds that form nucleosides, which, in turn, are components of nucleotides, with all of these monomers constituting the basic building blocks of nuc ...
s found in nucleic acid
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a pentose, 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nuclei ...
s, cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
(C), thymine (T), and uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
(U), are pyrimidine derivatives:
:
In DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
and RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
, these bases form hydrogen bond
In chemistry, a hydrogen bond (H-bond) is a specific type of molecular interaction that exhibits partial covalent character and cannot be described as a purely electrostatic force. It occurs when a hydrogen (H) atom, Covalent bond, covalently b ...
s with their complementary purines. Thus, in DNA, the purines
Purine is a heterocyclic compound, heterocyclic aromatic organic compound that consists of two rings (pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which inc ...
adenine (A) and guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
(G) pair up with the pyrimidines thymine (T) and cytosine (C), respectively.
In RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
, the complement of adenine (A) is uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
(U) instead of thymine (T), so the pairs that form are adenine:uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
and guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
:cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
.
Very rarely, thymine can appear in RNA, or uracil in DNA, but when the other three major pyrimidine bases are represented, some minor pyrimidine bases can also occur in nucleic acids
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic a ...
. These minor pyrimidines are usually methylated versions of major ones and are postulated to have regulatory functions.
These hydrogen bonding modes are for classical Watson–Crick base pair
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
ing. Other hydrogen bonding modes ("wobble pairings") are available in both DNA and RNA, although the additional 2′-hydroxyl group of RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
expands the configurations, through which RNA can form hydrogen bonds.
Theoretical aspects
In March 2015, NASA Ames scientists reported that, for the first time, complex DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
and RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
organic compound
Some chemical authorities define an organic compound as a chemical compound that contains a carbon–hydrogen or carbon–carbon bond; others consider an organic compound to be any chemical compound that contains carbon. For example, carbon-co ...
s of life
Life, also known as biota, refers to matter that has biological processes, such as Cell signaling, signaling and self-sustaining processes. It is defined descriptively by the capacity for homeostasis, Structure#Biological, organisation, met ...
, including uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
, cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
and thymine, have been formed in the laboratory under outer space conditions, using starting chemicals, such as pyrimidine, found in meteorite
A meteorite is a rock (geology), rock that originated in outer space and has fallen to the surface of a planet or Natural satellite, moon. When the original object enters the atmosphere, various factors such as friction, pressure, and chemical ...
s. Pyrimidine, like polycyclic aromatic hydrocarbons (PAHs), the most carbon-rich chemical found in the universe
The universe is all of space and time and their contents. It comprises all of existence, any fundamental interaction, physical process and physical constant, and therefore all forms of matter and energy, and the structures they form, from s ...
, may have been formed in red giants or in interstellar dust and gas clouds.[, also published as ]
Prebiotic synthesis of pyrimidine nucleotides
In order to understand how life
Life, also known as biota, refers to matter that has biological processes, such as Cell signaling, signaling and self-sustaining processes. It is defined descriptively by the capacity for homeostasis, Structure#Biological, organisation, met ...
arose, knowledge is required of the chemical pathways that permit formation of the key building blocks of life under plausible prebiotic conditions. The RNA world hypothesis holds that in the primordial soup there existed free-floating ribonucleotides, the fundamental molecules that combine in series to form RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
. Complex molecules such as RNA must have emerged from relatively small molecules whose reactivity was governed by physico-chemical processes. RNA is composed of pyrimidine and purine nucleotides, both of which are necessary for reliable information transfer, and thus natural selection and Darwinian evolution
Evolution is the change in the heritable Phenotypic trait, characteristics of biological populations over successive generations. It occurs when evolutionary processes such as natural selection and genetic drift act on genetic variation, re ...
. Becker et al. showed how pyrimidine nucleosides can be synthesized from small molecules and ribose
Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally occurring form, , is a component of the ribonucleotides from which RNA is built, and so this comp ...
, driven solely by wet-dry cycles.[Becker S, Feldmann J, Wiedemann S, Okamura H, Schneider C, Iwan K, Crisp A, Rossa M, Amatov T, Carell T. Unified prebiotically plausible synthesis of pyrimidine and purine RNA ribonucleotides. Science. 2019 Oct 4;366(6461):76-82. doi: 10.1126/science.aax2747. PMID 31604305] Purine nucleosides can be synthesized by a similar pathway. 5’-mono-and diphosphates also form selectively from phosphate-containing minerals, allowing concurrent formation of polyribonucleotides with both the pyrimidine and purine bases. Thus a reaction network towards the pyrimidine and purine RNA building blocks can be established starting from simple atmospheric or volcanic molecules.
See also
* ANRORC mechanism
* Purine
* Pyrimidine metabolism
* Simple aromatic rings
* Transition
* Transversion
References
{{Authority control
Biomolecules
Aromatic bases
Simple aromatic rings
Substances discovered in the 19th century