Pyrrolysine
   HOME

TheInfoList



OR:

Pyrrolysine (symbol Pyl or O), encoded by the 'amber'
stop codon In molecular biology, a stop codon (or termination codon) is a codon (nucleotide triplet within messenger RNA) that signals the termination of the translation process of the current protein. Most codons in messenger RNA correspond to the additio ...
UAG, is a
proteinogenic amino acid Proteinogenic amino acids are amino acids that are incorporated biosynthetically into proteins during translation from RNA. The word "proteinogenic" means "protein creating". Throughout known life, there are 22 genetically encoded (proteinogenic) ...
that is used in some
methanogen Methanogens are anaerobic archaea that produce methane as a byproduct of their energy metabolism, i.e., catabolism. Methane production, or methanogenesis, is the only biochemical pathway for Adenosine triphosphate, ATP generation in methanogens. A ...
ic
archaea Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
and in
bacteria Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
. It consists of
lysine Lysine (symbol Lys or K) is an α-amino acid that is a precursor to many proteins. Lysine contains an α-amino group (which is in the protonated form when the lysine is dissolved in water at physiological pH), an α-carboxylic acid group ( ...
with a 4-methylpyrroline-5-carboxylate in amide linkage with the εN of the lysine. Its pyrroline side-chain is similar to that of lysine in being basic and positively charged at neutral pH.


Genetics

Nearly all genes are translated using only 20 standard
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
building blocks. Two unusual genetically-encoded amino acids are selenocysteine and pyrrolysine. Pyrrolysine was discovered in 2002 at the active site of methyltransferase enzyme from a methane-producing archeon, '' Methanosarcina barkeri''. This amino acid is encoded by UAG (normally a stop codon), and its synthesis and incorporation into protein is mediated via the biological machinery encoded by the ''pylTSBCD'' cluster of genes.


Synthesis

Pyrrolysine is synthesized ''in vivo'' by joining two molecules of L-lysine. One molecule of lysine is first converted to (3''R'')-3-methyl-D-ornithine, which is then ligated to a second lysine. An NH2 group is eliminated, followed by cyclization and dehydration step to yield L-pyrrolysine.


Catalytic function

The extra pyrroline ring is incorporated into the active site of several methyltransferases, where it is believed to rotate relatively freely. It is believed that the ring is involved in positioning and displaying the
methyl group In organic chemistry, a methyl group is an alkyl derived from methane, containing one carbon atom bonded to three hydrogen atoms, having chemical formula (whereas normal methane has the formula ). In formulas, the group is often abbreviated a ...
of methylamine for attack by a corrinoid cofactor. The proposed model is that a nearby
carboxylic acid In organic chemistry, a carboxylic acid is an organic acid that contains a carboxyl group () attached to an Substituent, R-group. The general formula of a carboxylic acid is often written as or , sometimes as with R referring to an organyl ...
bearing residue,
glutamate Glutamic acid (symbol Glu or E; known as glutamate in its anionic form) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a Essential amino acid, non-essential nutrient for humans, meaning that ...
, becomes protonated, and the proton can then be transferred to the
imine In organic chemistry, an imine ( or ) is a functional group or organic compound containing a carbon–nitrogen double bond (). The nitrogen atom can be attached to a hydrogen or an organic group (R). The carbon atom has two additional single bon ...
ring nitrogen, exposing the adjacent ring carbon to nucleophilic addition by methylamine. The positively charged nitrogen created by this interaction may then interact with the deprotonated glutamate, causing a shift in ring orientation and exposing the methyl group derived from the methylamine to the binding cleft where it can interact with corrinoid. In this way a net is transferred to the cofactor's
cobalt Cobalt is a chemical element; it has Symbol (chemistry), symbol Co and atomic number 27. As with nickel, cobalt is found in the Earth's crust only in a chemically combined form, save for small deposits found in alloys of natural meteoric iron. ...
atom with a change of
oxidation state In chemistry, the oxidation state, or oxidation number, is the hypothetical Electrical charge, charge of an atom if all of its Chemical bond, bonds to other atoms are fully Ionic bond, ionic. It describes the degree of oxidation (loss of electrons ...
from +1 to +3. The methylamine-derived
ammonia Ammonia is an inorganic chemical compound of nitrogen and hydrogen with the chemical formula, formula . A Binary compounds of hydrogen, stable binary hydride and the simplest pnictogen hydride, ammonia is a colourless gas with a distinctive pu ...
is then released, restoring the original imine.


Genetic coding

Unlike
posttranslational modification In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translate mRNA ...
s of lysine such as hydroxylysine, methyllysine, and hypusine, pyrrolysine is incorporated during
translation Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
( protein synthesis) as directed by the
genetic code Genetic code is a set of rules used by living cell (biology), cells to Translation (biology), translate information encoded within genetic material (DNA or RNA sequences of nucleotide triplets or codons) into proteins. Translation is accomplished ...
, just like the standard amino acids. It is encoded in
mRNA In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of Protein biosynthesis, synthesizing a protein. mRNA is ...
by the UAG codon, which in most organisms is the 'amber'
stop codon In molecular biology, a stop codon (or termination codon) is a codon (nucleotide triplet within messenger RNA) that signals the termination of the translation process of the current protein. Most codons in messenger RNA correspond to the additio ...
. This requires only the presence of the ''pylT'' gene, which encodes an unusual
transfer RNA Transfer ribonucleic acid (tRNA), formerly referred to as soluble ribonucleic acid (sRNA), is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes). In a cell, it provides the physical link between the gene ...
(tRNA) with a CUA anticodon, and the '' pylS'' gene, which encodes a class II aminoacyl-tRNA synthetase that charges the ''pylT''-derived tRNA with pyrrolysine. It was originally proposed that a specific downstream sequence "PYLIS", forming a
stem-loop Stem-loops are nucleic acid Biomolecular structure, secondary structural elements which form via intramolecular base pairing in single-stranded DNA or RNA. They are also referred to as hairpins or hairpin loops. A stem-loop occurs when two regi ...
in the
mRNA In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of Protein biosynthesis, synthesizing a protein. mRNA is ...
, forced the incorporation of pyrrolysine instead of terminating
translation Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
in methanogenic archaea. This would be analogous to the SECIS element for selenocysteine incorporation. However, the PYLIS model has lost favor in view of the lack of structural homology between PYLIS elements and the lack of UAG stops in those species.


Use in bioengineering

The tRNA-aaRS pair for pyrrolysine ("orthogonal pair") is independent of other synthetases and tRNAs in most organisms including ''
Escherichia coli ''Escherichia coli'' ( )Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. is a gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus '' Escherichia'' that is commonly fo ...
'', and further possesses some flexibility in the range of amino acids processed (the aaRS accepts some different molecules that are structually similar to pyrrolysine), making it an attractive tool to allow the placement of a possibly wide range of functional chemical groups at arbitrarily specified locations in modified proteins. For example, the system provided one of two fluorophores incorporated site-specifically within calmodulin to allow the real-time examination of changes within the protein by FRET spectroscopy, and site-specific introduction of a photocaged lysine derivative. ''(See Expanded genetic code)'' The recognition of a tRNA by an aaRS is by its acceptor stem sequence. The pyrrolysine tRNA can be modified to have an acceptor stem of another tRNA, allowing a different aaRS to act on it. In 2024, it was reported a version modified to accept
alanine Alanine (symbol Ala or A), or α-alanine, is an α-amino acid that is used in the biosynthesis of proteins. It contains an amine group and a carboxylic acid group, both attached to the central carbon atom which also carries a methyl group sid ...
can effectively supress premature termination codons (all three of them) in human cell lines.


Evolution

The ''pylT'' (tRNA) and ''pylS'' (aa-tRNA synthase) genes are part of an operon of ''
Methanosarcina ''Methanosarcina'' is a genus of euryarchaeote archaea that produce methane. These single-celled organisms are known as anaerobic methanogens that produce methane using all three metabolic pathways for methanogenesis. They live in diverse e ...
barkeri'', with homologues in other sequenced members of the ''Methanosarcinaceae'' family: ''M. acetivorans'', ''M. mazei'', and ''M. thermophila''. Pyrrolysine-containing proteins are known to include monomethylamine methyltransferase (''mtmB''), dimethylamine methyltransferase (''mtbB''), and trimethylamine methyltransferase (''mttB''). Homologs of ''pylS'' and ''pylT'' have also been found in an Antarctic archaeon, '' Methanosarcina barkeri'' and a
Gram-positive In bacteriology, gram-positive bacteria are bacteria that give a positive result in the Gram stain test, which is traditionally used to quickly classify bacteria into two broad categories according to their type of cell wall. The Gram stain is ...
bacterium Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were among the ...
, '' Desulfitobacterium hafniense''.Reviewed in The other genes of the ''Pyl'' operon mediate pyrrolysine biosynthesis, leading to description of the operon as a "natural genetic code expansion cassette". A number of evolutionary scenarios have been proposed for the pyrrolysine system. The current (2022) view, given available sequences for tRNA and Pyl-tRNA (PylRS) synthase genes, is that: * tRNA(Pyl) diverged from tRNA(Phe) some time between the divergence of the three domains (~ LUCA) and the divergence of archaeal phyla, but was lost in non-archaeal lineages; * PylRS originated within a common ancestor of all
archaea Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
. A number of domain organizations of PylRS is known: ''pylS'' itself consists of an N-terminal tRNA-binding domain and a C-terminal synthase domain, but other organizations consist of two domains in separate proteins or a protein made up of a lone C-terminal domain. The CTD probably originated from PheRS. The NTD is an archaeal innovation with no known relative. The ancestral PylRS probably adopted the "two separate proteins" configuration. * The "genetic code expansion cassette" was later transferred into various
bacteria Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
. This cassette's PylRS has a split-domain configuration. Earlier evolutionary scenarios were limited by the taxonomic range of known synthases: * In 2007, when use of the amino acid appeared confined to the ''Methanosarcinaceae'', the system was described as a "late archaeal invention" by which a 21st amino acid was added to the genetic code. It is now known that a wide range of prokaryotes have these two genes. * In 2009, structure comparison suggested that PylRS may have originated in the LUCA, but it only persisted in organisms using methylamines as energy sources. It is now known that some non-methanogens also have these two genes, but the dating was not too far off. * In 2009, it was suggested that the system could have migrated into bacteria by
horizontal gene transfer Horizontal gene transfer (HGT) or lateral gene transfer (LGT) is the movement of genetic material between organisms other than by the ("vertical") transmission of DNA from parent to offspring (reproduction). HGT is an important factor in the e ...
. This is probably true based on the 2022 study, though the paper originally assumed a link to methanogenesis.


Evolution of pyrrolysine methyltransferases

''MttB'', ''MtbB'', and ''MtmB'' exhibit no obvious sequence homology. Solved crystal structures of MttB and MtmB show the same TIM barrel fold. Both are also homohexamers with D3 symmetry, but the relative positions in the homohexamer are different. Most importantly, the active-site pyrrolysine residue in both point toward the center of the TIM-barrel, but are located at completely different sides of the barrel. This suggests that the use of pyrrolysine was not a feature of the shared ancestor; instead, pyrrolysine may have evolved for use in one of these enzymes and was co-opted into the other two. ''MttB'' is known to have several relatives without pyrrolysine in the ''MttB'' superfamily, all with high sequence similarity. Among all structures in the PDB as of 2023, the most similar (by structure and sequence identity) to ''MttB'' are the glycine betaine methyltransferase ''MtgB'' and a protein of unknown function 4YYC. ''MtmB'' is the third most similar structurally (among all PDB structures as of 2023) with a great decrease in similarity. A non-pyrrolysine member of the ''MtmB'' superfamily was discovered via metagenomics of a "''Ca.'' Formimonas warabiya" DCMF in 2022. The only thing known about its function is that its expression is increased in the presence of dichloromethane. There has been no report of any member of the ''MtbB'' superfamily without pyrrolysine in literature as of 2025.


Non-methyltransferase function

The tRNAHis guanylyltransferase gene ''Thg1'' from the archaeon '' Methanosarcina acetivorans'' has a pyrrolysine residue. The gene works as usual if the pyrrolysine is substituted for other amino acids. In this case, the presence of Pyl results from simple neutral evolution. The insertion of Pyl into a protein requires no special signal in the mRNA, only a UAG codon, so it stands to reason that the barrier for changing a residue into Pyl in a protein sequence is the same as any other amino-acid substitution. The bacterium '' Acetohalobium arabaticum'' has a serine dehydratase with two Pyl residues in addition to the regular methyltransferaseses. It also only expresses the Pyl machinery when trimethylamine is present. Altogether, this causes the bacterium to only produce a functional version of the serine dehydratase, the methyltransferaseses, and potentially the rest of its many proteins coded by a gene (about 20% of all its ORFs) with an in-frame TAG when trimethylamine is present. Although the incorporation of Pyl in those non-methyltransferase genes probably have no particular catalytic function, it has been adapted into a new way to regulate the production of protein products. The prevelance of ORFs with a in-frame TAG is much smaller in archaeons, as expected for their "always-on" (constitutent) expression of the Pyl machinery, at about 5%. Still, with a great number of genomes carrying the Pyl machinery in archaea, 5% is no small number. A 2024 preprint examined 425 archaeal genomes with a Pyl machinery and found 360 cases where a TAG occurs in the middle of a predicted protein with known non-TAG homologs. Some of these may have arised through neutral mutations like previously described for ''Thg1'' and indeed the rates are mostly consistent with random mutation. However, a few occurences of Pyl are conserved and may reflect a new, beneficial function. In any case, the high prevelance of Pyl in some genomes, especially non-methanogenic ones, show that Pyl has become another regular piece of the genetic code in those lineages.


Potential for an alternative translation

The tRNACUA can be charged with
lysine Lysine (symbol Lys or K) is an α-amino acid that is a precursor to many proteins. Lysine contains an α-amino group (which is in the protonated form when the lysine is dissolved in water at physiological pH), an α-carboxylic acid group ( ...
''in vitro'' by the concerted action of the ''M. barkeri'' Class I and Class II lysyl-tRNA synthetases (LysRS1 and LysRS2), neither of which recognizes pyrrolysine. Charging a tRNACUA with lysine was originally hypothesized to be the first step in translating UAG amber codons as pyrrolysine, a mechanism analogous to that used for selenocysteine. More recent data favor direct charging of pyrrolysine on to the tRNACUA by the protein product of the ''pylS'' gene, leading to the suggestion that the LysRS1:LysRS2 complex may participate in a parallel pathway designed to ensure that proteins containing the UAG codon can be fully translated using lysine as a substitute amino acid in the event of pyrrolysine deficiency. Further study found that the genes encoding LysRS1 and LysRS2 are not required for normal growth on
methanol Methanol (also called methyl alcohol and wood spirit, amongst other names) is an organic chemical compound and the simplest aliphatic Alcohol (chemistry), alcohol, with the chemical formula (a methyl group linked to a hydroxyl group, often ab ...
and methylamines with normal methyltransferase levels, and they cannot replace ''pylS'' in a recombinant system for UAG amber stop codon suppression.


References


Further reading

* *


External links

* {{Amino acids Alpha-Amino acids Proteinogenic amino acids Pyrrolines Secondary amino acids