Pyrimidine (; ) is an
aromatic
In organic chemistry, aromaticity is a chemical property describing the way in which a conjugated system, conjugated ring of unsaturated bonds, lone pairs, or empty orbitals exhibits a stabilization stronger than would be expected from conjugati ...
,
heterocyclic
A heterocyclic compound or ring structure is a cyclic compound that has atoms of at least two different elements as members of its ring(s). Heterocyclic organic chemistry is the branch of organic chemistry dealing with the synthesis, proper ...
,
organic compound
Some chemical authorities define an organic compound as a chemical compound that contains a carbon–hydrogen or carbon–carbon bond; others consider an organic compound to be any chemical compound that contains carbon. For example, carbon-co ...
similar to
pyridine
Pyridine is a basic (chemistry), basic heterocyclic compound, heterocyclic organic compound with the chemical formula . It is structurally related to benzene, with one methine group replaced by a nitrogen atom . It is a highly flammable, weak ...
().
One of the three
diazines (six-membered heterocyclics with two
nitrogen
Nitrogen is a chemical element; it has Symbol (chemistry), symbol N and atomic number 7. Nitrogen is a Nonmetal (chemistry), nonmetal and the lightest member of pnictogen, group 15 of the periodic table, often called the Pnictogen, pnictogens. ...
atoms in the ring), it has nitrogen atoms at positions 1 and 3 in the ring.
The other diazines are
pyrazine
Pyrazine is a heterocyclic aromatic organic compound with the chemical formula C4H4N2. It is a symmetrical molecule with point group D2h. Pyrazine is less basic than pyridine, pyridazine and pyrimidine. It is a ''"deliquescent crystal or wax-lik ...
(nitrogen atoms at the 1 and 4 positions) and
pyridazine (nitrogen atoms at the 1 and 2 positions).
In
nucleic acids
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic a ...
, three types of
nucleobases
Nucleotide bases (also nucleobases, nitrogenous bases) are nitrogen-containing biological compounds that form nucleosides, which, in turn, are components of nucleotides, with all of these monomers constituting the basic building blocks of nuc ...
are pyrimidine
derivative
In mathematics, the derivative is a fundamental tool that quantifies the sensitivity to change of a function's output with respect to its input. The derivative of a function of a single variable at a chosen input value, when it exists, is t ...
s:
cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
(C),
thymine
Thymine () (symbol T or Thy) is one of the four nucleotide bases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine ...
(T), and
uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
(U).
Occurrence and history

The pyrimidine ring system has wide occurrence in nature
as substituted and ring fused compounds and derivatives, including the
nucleotides
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
,
thymine
Thymine () (symbol T or Thy) is one of the four nucleotide bases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine ...
and
uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
,
thiamine
Thiamine, also known as thiamin and vitamin B1, is a vitamin – an Nutrient#Micronutrients, essential micronutrient for humans and animals. It is found in food and commercially synthesized to be a dietary supplement or medication. Phosp ...
(vitamin B1) and
alloxan
Alloxan, sometimes referred to as alloxan monohydrate, is an organic compound with the formula . It is classified as a derivative of pyrimidine. The anhydrous derivative is also known, as well as a dimeric derivative. These are some of the earlie ...
. It is also found in many synthetic compounds such as
barbiturate
Barbiturates are a class of depressant, depressant drugs that are chemically derived from barbituric acid. They are effective when used medication, medically as anxiolytics, hypnotics, and anticonvulsants, but have physical and psychological a ...
s and the HIV drug
zidovudine
Zidovudine (ZDV), also known as azidothymidine (AZT), was the first antiretroviral medication used to prevent and treat HIV/AIDS. It is generally recommended for use in combination with other antiretrovirals. It may be used to prevent mothe ...
. Although pyrimidine derivatives such as alloxan were known in the early 19th century, a laboratory synthesis of a pyrimidine was not carried out until 1879,
[ when Grimaux reported the preparation of ]barbituric acid
Barbituric acid or malonylurea or 6-hydroxyuracil is an organic compound based on a pyrimidine heterocyclic skeleton. It is an odorless powder soluble in water. Barbituric acid is the parent compound of barbiturate drugs, although barbituric acid i ...
from urea
Urea, also called carbamide (because it is a diamide of carbonic acid), is an organic compound with chemical formula . This amide has two Amine, amino groups (–) joined by a carbonyl functional group (–C(=O)–). It is thus the simplest am ...
and malonic acid in the presence of phosphorus oxychloride
Phosphoryl chloride (commonly called phosphorus oxychloride) is a colourless liquid with the formula . It hydrolyses in moist air releasing phosphoric acid and fumes of hydrogen chloride. It is manufactured industrially on a large scale from phos ...
.[ ]
The systematic study of pyrimidines began in 1884 with Pinner,[ ]
who synthesized derivatives by condensing ethyl acetoacetate with amidine
Amidines are organic compounds with the functional group RC(NR)NR2, where the R groups can be the same or different. They are the imine derivatives of amides (RC(O)NR2). The simplest amidine is formamidine, HC(=NH)NH2.
Examples of amidines includ ...
s. Pinner first proposed the name “pyrimidin” in 1885.[ ] The parent compound was first prepared by Gabriel
In the Abrahamic religions (Judaism, Christianity, Islam), Gabriel ( ) is an archangel with the power to announce God's will to mankind, as the messenger of God. He is mentioned in the Hebrew Bible, the New Testament and the Quran. Many Chris ...
and Colman in 1900,[ ]
by conversion of barbituric acid
Barbituric acid or malonylurea or 6-hydroxyuracil is an organic compound based on a pyrimidine heterocyclic skeleton. It is an odorless powder soluble in water. Barbituric acid is the parent compound of barbiturate drugs, although barbituric acid i ...
to 2,4,6-trichloropyrimidine followed by reduction using zinc
Zinc is a chemical element; it has symbol Zn and atomic number 30. It is a slightly brittle metal at room temperature and has a shiny-greyish appearance when oxidation is removed. It is the first element in group 12 (IIB) of the periodic tabl ...
dust in hot water.
Nomenclature
The nomenclature of pyrimidines is straightforward. However, like other heterocyclics, tautomer
In chemistry, tautomers () are structural isomers (constitutional isomers) of chemical compounds that readily interconvert.
The chemical reaction interconverting the two is called tautomerization. This conversion commonly results from the reloca ...
ic hydroxyl
In chemistry, a hydroxy or hydroxyl group is a functional group with the chemical formula and composed of one oxygen atom covalently bonded to one hydrogen atom. In organic chemistry, alcohols and carboxylic acids contain one or more hydroxy ...
groups yield complications since they exist primarily in the cyclic amide
In organic chemistry, an amide, also known as an organic amide or a carboxamide, is a chemical compound, compound with the general formula , where R, R', and R″ represent any group, typically organyl functional group, groups or hydrogen at ...
form. For example, 2-hydroxypyrimidine is more properly named 2-pyrimidone. A partial list of trivial names of various pyrimidines exists.
Physical properties
Physical properties are shown in the data box. A more extensive discussion, including spectra, can be found in Brown ''et al.''
Chemical properties
Per the classification by Albert, six-membered heterocycles can be described as π-deficient. Substitution by electronegative groups or additional nitrogen atoms in the ring significantly increase the π-deficiency. These effects also decrease the basicity.
Like pyridines, in pyrimidines the π-electron density is decreased to an even greater extent. Therefore, electrophilic aromatic substitution
Electrophilic aromatic substitution (SEAr) is an organic reaction in which an atom that is attached to an aromatic ring, aromatic system (usually hydrogen) is replaced by an electrophile. Some of the most important electrophilic aromatic substitut ...
is more difficult while nucleophilic aromatic substitution is facilitated. An example of the last reaction type is the displacement of the amino
In chemistry, amines (, ) are organic compounds that contain carbon-nitrogen bonds. Amines are formed when one or more hydrogen atoms in ammonia are replaced by alkyl or aryl groups. The nitrogen atom in an amine possesses a lone pair of elec ...
group in 2-aminopyrimidine by chlorine
Chlorine is a chemical element; it has Symbol (chemistry), symbol Cl and atomic number 17. The second-lightest of the halogens, it appears between fluorine and bromine in the periodic table and its properties are mostly intermediate between ...
and its reverse.
Electron lone pair
In chemistry, a lone pair refers to a pair of valence electrons that are not shared with another atom in a covalent bondIUPAC ''Gold Book'' definition''lone (electron) pair''/ref> and is sometimes called an unshared pair or non-bonding pair. Lone ...
availability (basicity
In chemistry, there are three definitions in common use of the word "base": ''Arrhenius bases'', ''Brønsted bases'', and ''Lewis bases''. All definitions agree that bases are substances that react with acids, as originally proposed by Guilla ...
) is decreased compared to pyridine. Compared to pyridine, ''N''-alkylation and ''N''-oxidation are more difficult. The p''K''a value for protonated pyrimidine is 1.23 compared to 5.30 for pyridine. Protonation and other electrophilic additions will occur at only one nitrogen due to further deactivation by the second nitrogen. The 2-, 4-, and 6- positions on the pyrimidine ring are electron deficient analogous to those in pyridine and nitro- and dinitrobenzene. The 5-position is less electron deficient and substituents there are quite stable. However, electrophilic substitution is relatively facile at the 5-position, including nitration
In organic chemistry, nitration is a general class of chemical processes for the introduction of a nitro group () into an organic compound. The term also is applied incorrectly to the different process of forming nitrate esters () between Alcohol ...
and halogenation.
Reduction in resonance stabilization
In chemistry, resonance, also called mesomerism, is a way of describing bonding in certain molecules or polyatomic ions by the combination of several contributing structures (or ''forms'', also variously known as ''resonance structures'' or '' ...
of pyrimidines may lead to addition and ring cleavage reactions rather than substitutions. One such manifestation is observed in the Dimroth rearrangement.
Pyrimidine is also found in meteorite
A meteorite is a rock (geology), rock that originated in outer space and has fallen to the surface of a planet or Natural satellite, moon. When the original object enters the atmosphere, various factors such as friction, pressure, and chemical ...
s, but scientists still do not know its origin. Pyrimidine also photolytically decomposes into uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
under ultraviolet
Ultraviolet radiation, also known as simply UV, is electromagnetic radiation of wavelengths of 10–400 nanometers, shorter than that of visible light, but longer than X-rays. UV radiation is present in sunlight and constitutes about 10% of ...
light.
Synthesis
Pyrimidine biosynthesis creates derivatives —like orotate, thymine, cytosine, and uracil— ''de novo'' from carbamoyl phosphate and aspartate.
As is often the case with parent heterocyclic ring systems, the synthesis of pyrimidine is not that common and is usually performed by removing functional groups from derivatives. Primary syntheses in quantity involving formamide
Formamide is an amide derived from formic acid. It is a colorless liquid which is miscible with water and has an ammonia-like odor. It is chemical feedstock for the manufacture of sulfa drugs and other pharmaceuticals, herbicides and pesticides, ...
have been reported.
As a class, pyrimidines are typically synthesized by the principal synthesis involving cyclization of β-dicarbonyl
In organic chemistry, a carbonyl group is a functional group with the formula , composed of a carbon atom double bond, double-bonded to an oxygen atom, and it is divalent at the C atom. It is common to several classes of organic compounds (such a ...
compounds with N–C–N compounds. Reaction of the former with amidine
Amidines are organic compounds with the functional group RC(NR)NR2, where the R groups can be the same or different. They are the imine derivatives of amides (RC(O)NR2). The simplest amidine is formamidine, HC(=NH)NH2.
Examples of amidines includ ...
s to give 2-substituted pyrimidines, with urea
Urea, also called carbamide (because it is a diamide of carbonic acid), is an organic compound with chemical formula . This amide has two Amine, amino groups (–) joined by a carbonyl functional group (–C(=O)–). It is thus the simplest am ...
to give 2- pyrimidinones, and guanidine
Guanidine is the compound with the formula HNC(NH2)2. It is a colourless solid that dissolves in polar solvents. It is a strong base that is used in the production of plastics and explosives. It is found in urine predominantly in patients experi ...
s to give 2- aminopyrimidines are typical.
Pyrimidines can be prepared via the Biginelli reaction and other multicomponent reactions. Many other methods rely on condensation
Condensation is the change of the state of matter from the gas phase into the liquid phase, and is the reverse of vaporization. The word most often refers to the water cycle. It can also be defined as the change in the state of water vapor ...
of carbonyl
In organic chemistry, a carbonyl group is a functional group with the formula , composed of a carbon atom double bond, double-bonded to an oxygen atom, and it is divalent at the C atom. It is common to several classes of organic compounds (such a ...
s with diamines for instance the synthesis of 2-thio-6-methyluracil from thiourea
Thiourea () is an organosulfur compound with the formula and the structure . It is structurally similar to urea (), with the oxygen atom replaced by sulfur atom (as implied by the '' thio-'' prefix). The properties of urea and thiourea differ s ...
and ethyl acetoacetate or the synthesis of 4-methylpyrimidine with 4,4-dimethoxy-2-butanone and formamide
Formamide is an amide derived from formic acid. It is a colorless liquid which is miscible with water and has an ammonia-like odor. It is chemical feedstock for the manufacture of sulfa drugs and other pharmaceuticals, herbicides and pesticides, ...
.
A novel method is by reaction of ''N''-vinyl and ''N''-aryl amide
In organic chemistry, an amide, also known as an organic amide or a carboxamide, is a chemical compound, compound with the general formula , where R, R', and R″ represent any group, typically organyl functional group, groups or hydrogen at ...
s with carbonitriles under electrophilic activation of the amide with 2-chloro-pyridine and trifluoromethanesulfonic anhydride:
:
Reactions
Because of the decreased basicity compared to pyridine, electrophilic substitution of pyrimidine is less facile. Protonation
In chemistry, protonation (or hydronation) is the adding of a proton (or hydron, or hydrogen cation), usually denoted by H+, to an atom, molecule, or ion, forming a conjugate acid. (The complementary process, when a proton is removed from a Brø ...
or alkylation Alkylation is a chemical reaction that entails transfer of an alkyl group. The alkyl group may be transferred as an alkyl carbocation, a free radical, a carbanion, or a carbene (or their equivalents). Alkylating agents are reagents for effecting al ...
typically takes place at only one of the ring nitrogen atoms. Mono-''N''-oxidation occurs by reaction with peracids.
Electrophilic
In chemistry, an electrophile is a chemical species that forms bonds with nucleophiles by accepting an electron pair. Because electrophiles accept electrons, they are Lewis acids. Most electrophiles are positively charged, have an atom that carr ...
''C''-substitution of pyrimidine occurs at the 5-position, the least electron-deficient. Nitration
In organic chemistry, nitration is a general class of chemical processes for the introduction of a nitro group () into an organic compound. The term also is applied incorrectly to the different process of forming nitrate esters () between Alcohol ...
, nitrosation, azo coupling
In organic chemistry, an azo coupling is an organic reaction, reaction between a diazonium compound () and another aromatic compound that produces an azo compound (). In this electrophilic aromatic substitution reaction, the aryldiazonium cation ...
, halogen
The halogens () are a group in the periodic table consisting of six chemically related elements: fluorine (F), chlorine (Cl), bromine (Br), iodine (I), and the radioactive elements astatine (At) and tennessine (Ts), though some authors would ...
ation, sulfonation
In organic chemistry, aromatic sulfonation is a reaction in which a hydrogen atom on an arene is replaced by a sulfonic acid () group. Together with nitration and chlorination, aromatic sulfonation is a widely used electrophilic aromatic substi ...
, formylation, hydroxymethylation, and aminomethylation have been observed with substituted pyrimidines.
Nucleophilic ''C''-substitution should be facilitated at the 2-, 4-, and 6-positions but there are only a few examples. Amination and hydroxylation have been observed for substituted pyrimidines. Reactions with Grignard or alkyllithium reagents yield 4-alkyl- or 4-aryl pyrimidine after aromatization.
Free radical attack has been observed for pyrimidine and photochemical reactions have been observed for substituted pyrimidines. Pyrimidine can be hydrogenated to give tetrahydropyrimidine.
Derivatives
Nucleotides
Three nucleobase
Nucleotide bases (also nucleobases, nitrogenous bases) are nitrogen-containing biological compounds that form nucleosides, which, in turn, are components of nucleotides, with all of these monomers constituting the basic building blocks of nuc ...
s found in nucleic acid
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a pentose, 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nuclei ...
s, cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
(C), thymine
Thymine () (symbol T or Thy) is one of the four nucleotide bases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine ...
(T), and uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
(U), are pyrimidine derivatives:
:
In DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
and RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
, these bases form hydrogen bond
In chemistry, a hydrogen bond (H-bond) is a specific type of molecular interaction that exhibits partial covalent character and cannot be described as a purely electrostatic force. It occurs when a hydrogen (H) atom, Covalent bond, covalently b ...
s with their complementary purine
Purine is a heterocyclic aromatic organic compound that consists of two rings (pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which include substituted puri ...
s. Thus, in DNA, the purines
Purine is a heterocyclic compound, heterocyclic aromatic organic compound that consists of two rings (pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which inc ...
adenine
Adenine (, ) (nucleoside#List of nucleosides and corresponding nucleobases, symbol A or Ade) is a purine nucleotide base that is found in DNA, RNA, and Adenosine triphosphate, ATP. Usually a white crystalline subtance. The shape of adenine is ...
(A) and guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
(G) pair up with the pyrimidines thymine (T) and cytosine (C), respectively.
In RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
, the complement of adenine
Adenine (, ) (nucleoside#List of nucleosides and corresponding nucleobases, symbol A or Ade) is a purine nucleotide base that is found in DNA, RNA, and Adenosine triphosphate, ATP. Usually a white crystalline subtance. The shape of adenine is ...
(A) is uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
(U) instead of thymine
Thymine () (symbol T or Thy) is one of the four nucleotide bases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine ...
(T), so the pairs that form are adenine
Adenine (, ) (nucleoside#List of nucleosides and corresponding nucleobases, symbol A or Ade) is a purine nucleotide base that is found in DNA, RNA, and Adenosine triphosphate, ATP. Usually a white crystalline subtance. The shape of adenine is ...
:uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
and guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
:cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
.
Very rarely, thymine can appear in RNA, or uracil in DNA, but when the other three major pyrimidine bases are represented, some minor pyrimidine bases can also occur in nucleic acids
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic a ...
. These minor pyrimidines are usually methylated
Methylation, in the chemical sciences, is the addition of a methyl group on a substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These term ...
versions of major ones and are postulated to have regulatory functions.
These hydrogen bonding modes are for classical Watson–Crick base pair
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
ing. Other hydrogen bonding modes ("wobble pairings") are available in both DNA and RNA, although the additional 2′-hydroxyl group of RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
expands the configurations, through which RNA can form hydrogen bonds.
Theoretical aspects
In March 2015, NASA Ames scientists reported that, for the first time, complex DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
and RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
organic compound
Some chemical authorities define an organic compound as a chemical compound that contains a carbon–hydrogen or carbon–carbon bond; others consider an organic compound to be any chemical compound that contains carbon. For example, carbon-co ...
s of life
Life, also known as biota, refers to matter that has biological processes, such as Cell signaling, signaling and self-sustaining processes. It is defined descriptively by the capacity for homeostasis, Structure#Biological, organisation, met ...
, including uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
, cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
and thymine
Thymine () (symbol T or Thy) is one of the four nucleotide bases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine ...
, have been formed in the laboratory under outer space conditions, using starting chemicals, such as pyrimidine, found in meteorite
A meteorite is a rock (geology), rock that originated in outer space and has fallen to the surface of a planet or Natural satellite, moon. When the original object enters the atmosphere, various factors such as friction, pressure, and chemical ...
s. Pyrimidine, like polycyclic aromatic hydrocarbons (PAHs), the most carbon-rich chemical found in the universe
The universe is all of space and time and their contents. It comprises all of existence, any fundamental interaction, physical process and physical constant, and therefore all forms of matter and energy, and the structures they form, from s ...
, may have been formed in red giant
A red giant is a luminous giant star of low or intermediate mass (roughly 0.3–8 solar masses ()) in a late phase of stellar evolution. The stellar atmosphere, outer atmosphere is inflated and tenuous, making the radius large and the surface t ...
s or in interstellar dust
Cosmic dustalso called extraterrestrial dust, space dust, or star dustis dust that occurs in outer space or has fallen onto Earth. Most cosmic dust particles measure between a few molecules and , such as micrometeoroids (30 μm). Cosmic dust can ...
and gas clouds.[, also published as ]
Prebiotic synthesis of pyrimidine nucleotides
In order to understand how life
Life, also known as biota, refers to matter that has biological processes, such as Cell signaling, signaling and self-sustaining processes. It is defined descriptively by the capacity for homeostasis, Structure#Biological, organisation, met ...
arose, knowledge is required of the chemical pathways that permit formation of the key building blocks of life under plausible prebiotic conditions. The RNA world
The RNA world is a hypothetical stage in the evolutionary history of life on Earth in which self-replicating RNA molecules proliferated before the evolution of DNA and proteins. The term also refers to the hypothesis that posits the existence ...
hypothesis holds that in the primordial soup
Primordial soup, also known as prebiotic soup and Haldane soup, is the hypothetical set of conditions present on the Earth around 3.7 to 4.0 billion years ago. It is an aspect of the heterotrophic theory (also known as the Oparin–Haldane hypothes ...
there existed free-floating ribonucleotides, the fundamental molecules that combine in series to form RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
. Complex molecules such as RNA must have emerged from relatively small molecules whose reactivity was governed by physico-chemical processes. RNA is composed of pyrimidine and purine
Purine is a heterocyclic aromatic organic compound that consists of two rings (pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which include substituted puri ...
nucleotides, both of which are necessary for reliable information transfer, and thus natural selection and Darwinian evolution
Evolution is the change in the heritable Phenotypic trait, characteristics of biological populations over successive generations. It occurs when evolutionary processes such as natural selection and genetic drift act on genetic variation, re ...
. Becker et al. showed how pyrimidine nucleoside
Nucleosides are glycosylamines that can be thought of as nucleotides without a phosphate group. A nucleoside consists simply of a nucleobase (also termed a nitrogenous base) and a five-carbon sugar (ribose or 2'-deoxyribose) whereas a nucleotid ...
s can be synthesized from small molecules and ribose
Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally occurring form, , is a component of the ribonucleotides from which RNA is built, and so this comp ...
, driven solely by wet-dry cycles.[Becker S, Feldmann J, Wiedemann S, Okamura H, Schneider C, Iwan K, Crisp A, Rossa M, Amatov T, Carell T. Unified prebiotically plausible synthesis of pyrimidine and purine RNA ribonucleotides. Science. 2019 Oct 4;366(6461):76-82. doi: 10.1126/science.aax2747. PMID 31604305] Purine nucleosides can be synthesized by a similar pathway. 5’-mono-and diphosphates also form selectively from phosphate-containing minerals, allowing concurrent formation of polyribonucleotides with both the pyrimidine and purine bases. Thus a reaction network towards the pyrimidine and purine RNA building blocks can be established starting from simple atmospheric or volcanic molecules.
See also
* ANRORC mechanism
The ANRORC mechanism in organic chemistry describes a special type of substitution reaction. ANRORC stands for Addition of the Nucleophile, Ring Opening, and Ring Closure in nucleophilic attack on ring systems and it helps to explain product format ...
* Purine
Purine is a heterocyclic aromatic organic compound that consists of two rings (pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which include substituted puri ...
* Pyrimidine metabolism
* Simple aromatic rings
* Transition
* Transversion
References
{{Authority control
Biomolecules
Aromatic bases
Simple aromatic rings
Substances discovered in the 19th century