''N''
6-Methyladenosine (m
6A) was originally identified and partially characterised in the 1970s,
and is an abundant modification in
mRNA
In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.
mRNA is created during the ...
and DNA. It is found within some viruses,
and most eukaryotes including mammals,
insects,
plants
and yeast.
It is also found in
tRNA
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino a ...
,
rRNA
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from riboso ...
, and
small nuclear RNA
Small nuclear RNA (snRNA) is a class of small RNA molecules that are found within the splicing speckles and Cajal bodies of the cell nucleus in eukaryotic cells. The length of an average snRNA is approximately 150 nucleotides. They are transcr ...
(snRNA) as well as several
long non-coding RNA
Long non-coding RNAs (long ncRNAs, lncRNA) are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as mi ...
, such as ''
Xist''.
The
methylation
In the chemical sciences, methylation denotes the addition of a methyl group on a substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These t ...
of
adenosine
Adenosine (symbol A) is an organic compound that occurs widely in nature in the form of diverse derivatives. The molecule consists of an adenine attached to a ribose via a β-N9-glycosidic bond. Adenosine is one of the four nucleoside buildin ...
is directed by a large m
6A
methyltransferase
Methyltransferases are a large group of enzymes that all methylate their substrates but can be split into several subclasses based on their structural features. The most common class of methyltransferases is class I, all of which contain a Ros ...
complex containing
METTL3 as the SAM-binding sub-unit.
''In vitro'', this methyltransferase complex preferentially methylates RNA
oligonucleotides containing GGACU
and a similar preference was identified ''in vivo'' in mapped m
6A sites in Rous sarcoma virus genomic RNA
and in bovine prolactin mRNA.
More recent studies have characterized other key components of the m
6A methyltransferase complex in mammals, including
METTL14
Methyltransferase like 14 is a protein that in humans is encoded by the METTL14 gene
In biology, the word gene (from , ; "...Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ' ...
,
Wilms tumor 1 associated protein (WTAP),
KIAA1429 and METTL5. Following a 2010 speculation of m
6A in mRNA being dynamic and reversible, the discovery of the first m
6A demethylase, fat mass and obesity-associated protein (FTO) in 2011 confirmed this hypothesis and revitalized the interests in the study of m
6A. A second m
6A demethylase alkB homolog 5 (ALKBH5) was later discovered as well.
The biological functions of m
6A are mediated through a group of RNA binding proteins that specifically recognize the methylated adenosine on RNA. These binding proteins are named m
6A readers. The YT521-B homology (YTH) domain family of proteins (
YTHDF1,
YTHDF2,
YTHDF3 and
YTHDC1) have been characterized as direct m
6A readers and have a conserved m
6A-binding pocket.
Insulin-like growth factor-2 mRNA-binding proteins 1, 2, and 3 (IGF2BP1–3) are reported as a novel class of m
6A readers.
IGF2BPs use K homology (KH) domains to selectively recognize m6A-containing RNAs and promote their translation and stability.
These m
6A readers, together with m
6A methyltransferases (writers) and demethylases (erasers), establish a complex mechanism of m
6A regulation in which writers and erasers determine the distributions of m
6A on RNA, whereas readers mediate m
6A-dependent functions. m
6A has also been shown to mediate a structural switch termed m
6A switch.
Species distribution
Yeast
In budding yeast (''Saccharomyces cerevisiae''), the
homologue of
METTL3, IME4 is induced in diploid cells in response to nitrogen and fermentable carbon source starvation and is required for mRNA methylation and the initiation of correct meiosis and sporulation.
mRNAs of IME1 and IME2, key early regulators of
meiosis
Meiosis (; , since it is a reductional division) is a special type of cell division of germ cells in sexually-reproducing organisms that produces the gametes, such as sperm or egg cells. It involves two rounds of division that ultimately ...
, are known to be targets for
methylation
In the chemical sciences, methylation denotes the addition of a methyl group on a substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These t ...
, as are
transcripts of IME4 itself.
Plants
In plants, the majority of the m
6A is found within 150 nucleotides before the start of the
poly(A) tail
Polyadenylation is the addition of a poly(A) tail to an RNA transcript, typically a messenger RNA (mRNA). The poly(A) tail consists of multiple adenosine monophosphates; in other words, it is a stretch of RNA that has only adenine bases. In euk ...
.
Mutations of MTA, the ''
Arabidopsis thaliana
''Arabidopsis thaliana'', the thale cress, mouse-ear cress or arabidopsis, is a small flowering plant native to Eurasia and Africa. ''A. thaliana'' is considered a weed; it is found along the shoulders of roads and in disturbed land.
A winter ...
'' homologue of METTL3, results in
embryo
An embryo is an initial stage of development of a multicellular organism. In organisms that reproduce sexually, embryonic development is the part of the life cycle that begins just after fertilization of the female egg cell by the male sperm ...
arrest at the globular stage. A >90% reduction of m
6A levels in mature plants leads to dramatically altered growth patterns and floral homeotic abnormalities.
Mammals
Mapping of m
6A in human and mouse RNA has identified over 18,000 m
6A sites in the transcripts of more than 7,000 human genes with a
consensus sequence
In molecular biology and bioinformatics, the consensus sequence (or canonical sequence) is the calculated order of most frequent residues, either nucleotide or amino acid, found at each position in a sequence alignment. It serves as a simplified r ...
of
/A/UG>A]m
6AC
A/C">>A/Cref name="Meyer_2012"/>
consistent with the previously identified motif. The localization of individual m
6A sites in many mRNAs is highly similar between
human
Humans (''Homo sapiens'') are the most abundant and widespread species of primate, characterized by bipedalism and exceptional cognitive skills due to a large and complex brain. This has enabled the development of advanced tools, culture, ...
and
mouse
A mouse ( : mice) is a small rodent. Characteristically, mice are known to have a pointed snout, small rounded ears, a body-length scaly tail, and a high breeding rate. The best known mouse species is the common house mouse (''Mus musculus' ...
,
and
transcriptome
The transcriptome is the set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells. The term can also sometimes be used to refer to all RNAs, or just mRNA, depending on the particular experiment. The ...
-wide analysis reveals that m
6A is found in regions of high
evolutionary conservation.
m
6A is found within long internal
exons
An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequenc ...
and is preferentially enriched within 3' UTRs and around
stop codons
In molecular biology (specifically protein biosynthesis), a stop codon (or termination codon) is a codon (nucleotide triplet within messenger RNA) that signals the termination of the translation process of the current protein. Most codons in mess ...
. m
6A within 3' UTRs is also associated with the presence of microRNA binding sites; roughly 2/3 of the mRNAs which contain an m
6A site within their 3' UTR also have at least one microRNA binding site.
By integrating all m
6A sequencing data, a novel database called RMBase has identified and provided ~200,000 sites in the human and mouse genomes corresponding to N6-Methyladenosines (m
6A) in RNA.
Precise m6A mapping by m6A-CLIP/IP
(briefly m6A-
CLIP) revealed that a majority of m6A locates in the last exon of mRNAs in multiple tissues/cultured cells of mouse and human,
and the m6A enrichment around stop codons is a coincidence that many stop codons locate round the start of last exons where m6A is truly enriched.
The major presence of m6A in last exon (>=70%) allows the potential for 3'UTR regulation, including alternative polyadenylation.
The study combining m6A-CLIP with rigorous cell fractionation biochemistry reveals that m6A mRNA modifications are deposited in nascent pre-mRNA and are not required for splicing but do specify cytoplasmic turnover.
m
6A is susceptible to dynamic regulation both throughout development and in response to cellular stimuli. Analysis of m
6A in
mouse brain
The mouse brain refers to the brain of Mus musculus. Various brain atlases exist.
For reasons of reproducibility, genetically characterized, stable strains like C57BL/6 were chosen to produce high-resolution images and databases. Well known onl ...
RNA reveals that m
6A levels are low during embryonic development and increase dramatically by adulthood.
Additionally, silencing the m
6A
methyltransferase
Methyltransferases are a large group of enzymes that all methylate their substrates but can be split into several subclasses based on their structural features. The most common class of methyltransferases is class I, all of which contain a Ros ...
significantly affects gene expression and alternative
RNA splicing
RNA splicing is a process in molecular biology where a newly-made precursor messenger RNA (pre-mRNA) transcript is transformed into a mature messenger RNA (mRNA). It works by removing all the introns (non-coding regions of RNA) and ''splicing'' b ...
patterns, resulting in modulation of the
p53
p53, also known as Tumor protein P53, cellular tumor antigen p53 (UniProt name), or transformation-related protein 53 (TRP53) is a regulatory protein that is often mutated in human cancers. The p53 proteins (originally thought to be, and often s ...
(also known as
TP53
p53, also known as Tumor protein P53, cellular tumor antigen p53 (UniProt name), or transformation-related protein 53 (TRP53) is a regulatory protein that is often mutated in human cancers. The p53 proteins (originally thought to be, and often s ...
) signalling pathway and
apoptosis.
m
6A is also found on the RNA components of R-loops in human cells, where it is involved in regulation of stability of RNA:DNA hybrids.
The importance of m
6A methylation for physiological processes was recently demonstrated. Inhibition of m
6A methylation via pharmacological inhibition of cellular methylations or more specifically by siRNA-mediated silencing of the m
6A methylase ''Mettl3'' led to the elongation of the
circadian
A circadian rhythm (), or circadian cycle, is a natural, internal process that regulates the sleep–wake cycle and repeats roughly every 24 hours. It can refer to any process that originates within an organism (i.e., endogenous) and responds to ...
period. In contrast, overexpression of ''Mettl3'' led to a shorter period. The mammalian
circadian clock
A circadian clock, or circadian oscillator, is a biochemical oscillator that cycles with a stable phase and is synchronized with solar time.
Such a clock's ''in vivo'' period is necessarily almost exactly 24 hours (the earth's current solar day) ...
, composed of a transcription feedback loop tightly regulated to oscillate with a period of about 24 hours, is therefore extremely sensitive to perturbations in m
6A-dependent RNA processing, likely due to the presence of m
6A sites within clock gene transcripts.
The effects of global methylation inhibition on the
circadian
A circadian rhythm (), or circadian cycle, is a natural, internal process that regulates the sleep–wake cycle and repeats roughly every 24 hours. It can refer to any process that originates within an organism (i.e., endogenous) and responds to ...
period in mouse cells can be prevented by ectopic expression of an enzyme from the bacterial methyl metabolism. Mouse cells expressing this bacterial protein were resistant to pharmacological inhibition of methyl metabolism, showing no decrease in mRNA m
6A methylation or protein
methylation
In the chemical sciences, methylation denotes the addition of a methyl group on a substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replacing a hydrogen atom. These t ...
.
Clinical significance
Considering the versatile functions of m
6A in various physiological processes, it is thus not surprising to find links between m
6A and numerous human diseases; many originated from mutations or single nucleotide polymorphisms (SNPs) of cognate factors of m
6A. The linkages between m
6A and numerous cancer types have been indicated in reports that include stomach cancer, prostate cancer, breast cancer, pancreatic cancer, kidney cancer, mesothelioma, sarcoma, and leukaemia. The impacts of m
6A on cancer cell proliferation might be much more profound with more data emerging. The depletion of METTL3 is known to cause apoptosis of cancer cells and reduce invasiveness of cancer cells, while the activation of ALKBH5 by hypoxia was shown to cause cancer stem cell enrichment. m
6A has also been indicated in the regulation of energy homeostasis and obesity, as FTO is a key regulatory gene for energy metabolism and obesity. SNPs of ''FTO'' have been shown to associate with body mass index in human populations and occurrence of obesity and diabetes. The influence of FTO on pre-adipocyte differentiation has been suggested. The connection between m
6A and neuronal disorders has also been studied. For instance, neurodegenerative diseases may be affected by m
6A as the cognate dopamine signalling was shown to be dependent on FTO and correct m
6A methylation on key signalling transcripts. The mutations in HNRNPA2B1, a potential reader of m
6A, have been known to cause neurodegeneration. The IGF2BP1–3, a novel class of m
6A reader, has oncogenic functions. IGF2BP1–3 knockdown or knockout decreased MYC protein expression, cell proliferation and colony formation in human cancer cell lines.
The ZC3H13, a member of the m6A methyltransferase complex, markedly inhibited colorectal cancer cells growth when knocked down.
Additionally, m
6A has been reported to impact viral infections. Many RNA viruses including SV40, adenovirus, herpes virus, Rous sarcoma virus, and influenza virus have been known to contain internal m
6A methylation on virus genomic RNA. Several more recent studies have revealed that m
6A regulators govern the efficiency of infection and replication of RNA viruses such as human immunodeficiency virus (HIV), hepatitis C virus (HCV), and Zika virus (ZIKV).
These results suggest m
6A and its cognate factors play crucial roles in regulating virus life cycles and host-viral interactions.
References
{{DEFAULTSORT:Methyladenosine, N6-
Nucleosides
Purines
Hydroxymethyl compounds