RNA-targeting small molecules represent a class of
small molecule
Within the fields of molecular biology and pharmacology, a small molecule or micromolecule is a low molecular weight (≤ 1000 daltons) organic compound that may regulate a biological process, with a size on the order of 1 nm. Many drugs are ...
s,
organic compound
In chemistry, organic compounds are generally any chemical compounds that contain carbon- hydrogen or carbon-carbon bonds. Due to carbon's ability to catenate (form chains with other carbon atoms), millions of organic compounds are known. Th ...
s with traditional drug properties (e.g.,
Lipinski's rule of five) that can bind to
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
secondary or tertiary structures and alter translation patterns, localization, and degradation.
Origins
Recent discoveries implicating RNA in the
pathogenesis
Pathogenesis is the process by which a disease or disorder develops. It can include factors which contribute not only to the onset of the disease or disorder, but also to its progression and maintenance. The word comes from Greek πάθος ''pat ...
of several forms of cancer and
neuromuscular disease
A neuromuscular disease is any disease affecting the peripheral nervous system (PNS), the neuromuscular junction, or skeletal muscle, all of which are components of the motor unit. Damage to any of these structures can cause muscle atrophy and we ...
s have created a paradigm shift in
drug discovery
In the fields of medicine, biotechnology and pharmacology, drug discovery is the process by which new candidate medications are discovered.
Historically, drugs were discovered by identifying the active ingredient from traditional remedies or by ...
. This work combined with advances in structural characterization techniques such as
NMR spectroscopy
Nuclear magnetic resonance spectroscopy, most commonly known as NMR spectroscopy or magnetic resonance spectroscopy (MRS), is a spectroscopic technique to observe local magnetic fields around atomic nuclei. The sample is placed in a magnetic fiel ...
and
X-ray crystallography
X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angle ...
together with
computational modeling
Computer simulation is the process of mathematical modelling, performed on a computer, which is designed to predict the behaviour of, or the outcome of, a real-world or physical system. The reliability of some mathematical models can be dete ...
, has pushed forward the realization that RNA is a dynamic yet viable
drug target. Traditionally, RNA was thought to be a mediator between
DNA sequence-encoded instructions and functional
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
. However, recent reports have shown that there are a large number of
non-coding RNA
A non-coding RNA (ncRNA) is a functional RNA molecule that is not translated into a protein. The DNA sequence from which a functional non-coding RNA is transcribed is often called an RNA gene. Abundant and functionally important types of non- ...
s (ncRNAs) that are not translated into protein. Whereas 85% of the
human genome
The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the ...
is transcribed into RNA only 3% of the transcripts code for functional protein.
Although, ncRNAs do affect
gene expression
Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. ...
levels by a variety of mechanisms.
Further, RNA can adopt discrete secondary or tertiary structures which play a pivotal role in many biological processes and disease pathology. For these reasons, RNA is being recognized as an attractive drug target for
small molecule
Within the fields of molecular biology and pharmacology, a small molecule or micromolecule is a low molecular weight (≤ 1000 daltons) organic compound that may regulate a biological process, with a size on the order of 1 nm. Many drugs are ...
s.
The earliest attempts to target RNA led to the discovery that
aminoglycoside
Aminoglycoside is a medicinal and bacteriologic category of traditional Gram-negative antibacterial medications that inhibit protein synthesis and contain as a portion of the molecule an amino-modified glycoside ( sugar). The term can also refer ...
s could bind to human RNA. In an early report, Noller discovered that several classes of antibiotics (
streptomycin
Streptomycin is an antibiotic medication used to treat a number of bacterial infections, including tuberculosis, ''Mycobacterium avium'' complex, endocarditis, brucellosis, ''Burkholderia'' infection, plague, tularemia, and rat bite fever ...
,
tetracycline
Tetracycline, sold under various brand names, is an oral antibiotic in the tetracyclines family of medications, used to treat a number of infections, including acne, cholera, brucellosis, plague, malaria, and syphilis.
Common side effects ...
,
spectinomycin
Spectinomycin, sold under the tradename Trobicin among others, is an antibiotic useful for the treatment of gonorrhea infections. It is given by injection into a muscle.
Common side effects include pain at the area of injection, rash, nausea, f ...
, edeine, hygromycin, and the
neomycin
Neomycin is an aminoglycoside antibiotic that displays bactericidal activity against gram-negative aerobic bacilli and some anaerobic bacilli where resistance has not yet arisen. It is generally not effective against gram-positive bacilli and ...
s) could "protect"
nucleotide
Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecul ...
s in
16S ribosomal RNA
16 S ribosomal RNA (or 16 S rRNA) is the RNA component of the 30S subunit of a prokaryotic ribosome ( SSU rRNA). It binds to the Shine-Dalgarno sequence and provides most of the SSU structure.
The genes coding for it are referred to as 16S r ...
by binding to this RNA. Subsequent studies by Schroeder and Green began to plant the seed that RNA could be targeted. Schroeder uncovered that aminoglycosides could inhibit protein synthesis by interacting with the
ribosome through interactions with the 3’ end of the 16S RNA of ''
E. coli'' taking advantage of RNA conformational changes. Green and coworkers further confirmed this idea, discovering that aminoglycosides blocked the interaction of
HIV-1
The subtypes of HIV include two major types, HIV type 1 (HIV-1) and HIV type 2 (HIV-2). HIV-1 is related to viruses found in chimpanzees and gorillas living in western Africa, while HIV-2 viruses are related to viruses found in the sooty mangabey ...
Rev protein and its viral RNA-binding site.
David Wilson and David Draper were the first to suggest that RNA structures could be targeted by small molecules. They hypothesized that RNA could be "druggable" by targeting the 3D structure in the same way as protein 3D structures are used as drug targets and furthered the idea that targeting RNA could be used to treat diseases. Czanik and co-workers at Parke Davis completed a screen on HIV Tat. They found multiple small molecule inhibitors of the HIV-1 Tat—TAR system that recognized the bulge, lower stem, or loop region of the TAR RNA. One of the compounds discovered, 2,4,5,6-tetraaminoquinozaline, binds to the loop region of TAR, downregulates cellular Tat transactivation, and ultimately inhibits HIV-1 replication.
The use of aminoglycosides, while an early start to RNA-targeting, came with some challenges. These molecules were only modestly selective and showed unfavorable toxicity levels at relevant therapeutic concentrations.
As another strategy for targeting RNA,
antisense oligonucleotides were developed which have been pushed forward through the clinic for several diseases. By this principle, if one can identify an RNA involved in disease then the sequence can be used to design a complementary
antisense oligonucleotide, and that agent can be introduced into cells to treat the disease.
But, this approach in its basic form has been met with several challenges. The most obvious are their large size and propensity to degradation by nucleases. In order for cellular RNA to be effective it must enter the cells intact. While backbone modifications to
antisense oligonucleotides in order to prevent nuclease degradation have been shown to work, this approach is still somewhat limited. Small molecules may present a better way to target RNA and subsequently DNA because they can be designed to be more "drug-like" and have a better chance of reaching their target, most by oral administration. For this reason, there is an emerging interest in designing and discovering small molecules to target RNA secondary and tertiary structures to ultimately treat new diseases.
Human treatment

There are limited examples of small molecules that target RNA and are approved drugs for the treatment of human disease.
Ribavirin
Ribavirin, also known as tribavirin, is an antiviral medication used to treat RSV infection, hepatitis C and some viral hemorrhagic fevers. For hepatitis C, it is used in combination with other medications such as simeprevir, sofosbuvir, ...
was approved in 2002 to treat
Hepatitis C
Hepatitis C is an infectious disease caused by the hepatitis C virus (HCV) that primarily affects the liver; it is a type of viral hepatitis. During the initial infection people often have mild or no symptoms. Occasionally a fever, dark urine, ...
and
viral hemorrhagic fever
Viral hemorrhagic fevers (VHFs) are a diverse group of animal and human illnesses in which fever and hemorrhage are caused by a viral infection. VHFs may be caused by five distinct families of RNA viruses: the families ''Filoviridae'', ''Flavi ...
. As a nucleoside inhibitor, the guanosine analog prodrug is used to stop viral RNA synthesis and viral
mRNA
In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.
mRNA is created during the ...
capping by incorporating into RNA and pairing to uracil or cytosine.
Branaplam
Branaplam (development codes LMI070 and NVS-SM1) is a pyridazine derivative that is being studied as an experimental drug. It was originally developed by Novartis to treat spinal muscular atrophy (SMA); since 2020 it is being developed to treat H ...
is currently in phase I/II clinical trial for the treatment of
Spinal Muscular Atrophy
Spinal muscular atrophy (SMA) is a rare neuromuscular disorder that results in the loss of motor neurons and progressive muscle wasting. It is usually diagnosed in infancy or early childhood and if left untreated it is the most common genet ...
(SMA). This molecule is from a class of pyridazine small molecules and enhances the inclusion of exon 7, resulting in a full-length and functional protein product.
Branaplam
Branaplam (development codes LMI070 and NVS-SM1) is a pyridazine derivative that is being studied as an experimental drug. It was originally developed by Novartis to treat spinal muscular atrophy (SMA); since 2020 it is being developed to treat H ...
represents the first mechanistic study of splicing modulation using a sequence-selective small molecule. The drug stabilizes the transient double-stranded RNA (dsRNA) structure formed between the
SMN2
Survival of motor neuron 2 (''SMN2'') is a gene that encodes the SMN protein (full and truncated) in humans.
Gene
The ''SMN2'' gene is part of a 500 kb inverted duplication on chromosome 5q13. This duplicated region contains at least four ge ...
pre-mRNA and U1
snRNP
snRNPs (pronounced "snurps"), or small nuclear ribonucleoproteins, are RNA-protein complexes that combine with unmodified pre-mRNA and various other proteins to form a spliceosome, a large RNA-protein molecular complex upon which splicing of pr ...
complex, a key component of the
splicesome. Further, this compound acts by increasing the binding affinity of U1 snRNP to the 5’ splice site (5’ss) in a sequence-selective manner that is discrete from constitutive recognition.
Ataluren is in clinical trials for the treatment of
Duchenne Muscular Dystrophy
Duchenne muscular dystrophy (DMD) is a severe type of muscular dystrophy that primarily affects boys. Muscle weakness usually begins around the age of four, and worsens quickly. Muscle loss typically occurs first in the thighs and pelvis follo ...
(DMD). It is believed that
Ataluren acts by promoting insertion of near-cognate
tRNAs
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino a ...
at the site of the nonsense codon without affecting
transcription
Transcription refers to the process of converting sounds (voice, music etc.) into letters or musical notes, or producing a copy of something in another medium, including:
Genetics
* Transcription (biology), the copying of DNA into RNA, the fir ...
, mRNA processing, mRNA stability, or protein stability to give nonsense suppression. This drug would be effective for ~10% of patients with DMD who have a single mutation in the DMD gene causing a stop codon to appear prematurely (
nonsense mutation
In genetics, a nonsense mutation is a point mutation in a sequence of DNA that results in a premature stop codon, or a ''nonsense codon'' in the transcribed mRNA, and in leading to a truncated, incomplete, and usually nonfunctional protein produc ...
).
Animal treatment
Compared to the number of drug candidates that have successfully made it to the clinical trial phase, there are many more lead compounds which have been tested ''in vivo'' using various animal models. Much of the current work that has progressed to ''in vivo'' testing has been directed to the RNA repeat expansions implicated in genetic neuromuscular diseases. In
myotonic dystrophy
Myotonic dystrophy (DM) is a type of muscular dystrophy, a group of genetic disorders that cause progressive muscle loss and weakness. In DM, muscles are often unable to relax after contraction. Other manifestations may include cataracts, intell ...
type 1 (DM1), r(CUG)exp mRNAs sequester proteins including the alternative splicing regulator
MBNL1 into the nucleus causing missplicing. Several groups have developed compounds which bind the toxic RNA and dissolve nuclear foci. In 2011, Artero and coworkers discovered that a peptide could reduce the toxicity associated with r(CUG) repeats in ''
Drosophila
''Drosophila'' () is a genus of flies, belonging to the family Drosophilidae, whose members are often called "small fruit flies" or (less frequently) pomace flies, vinegar flies, or wine flies, a reference to the characteristic of many s ...
'' and mouse models. Disney and colleagues provided the first small molecules that targeted r(CUG) repeats in animals models by using rational designing to identify many small molecules directly targeting this toxic RNA and the compounds improved disease defects in a DM1 mouse model.
Other works by the Disney group has shown that in cellular models of various RNA-mediated diseases that are causes by RNA repeats such as r(CAG) in Huntington's disease and r(CCUG) repeats in Myotonic Dystrophy Types 2 could also be targeted with small molecules. Nakamori and colleagues also reported in 2012 that
erythromycin
Erythromycin is an antibiotic used for the treatment of a number of bacterial infections. This includes respiratory tract infections, skin infections, chlamydia infections, pelvic inflammatory disease, and syphilis. It may also be used du ...
could be orally dosed in DM1 mouse models to restore missplicing defects and inhibit the complex formed between r(CUG) and
MBNL1. In that same year, Miller and coworkers screened a library of compounds to find a small molecule drug that could improve splicing defects in a mouse model. The Zimmerman group has taken a rational design approach to discovering small molecule drugs that target r(CUG). One such compound contains a selective triaminotriazine recognition motif which binds to the UU mismatches in r(CUG) selectively most likely in a base triplet combined with an amidinium RNA groove binding unit. Studies using a ''Drosophila'' model for DM1 showed an influence on related phenotypic outcomes such as eye morphology and climbing distance.
Aside from studies involved r(CUG) repeats, other complex RNA structures have also been targeted. Pearson and coworkers discovered that a cationic
porphyrin
Porphyrins ( ) are a group of heterocyclic macrocycle organic compounds, composed of four modified pyrrole subunits interconnected at their α carbon atoms via methine bridges (=CH−). The parent of porphyrin is porphine, a rare chemical ...
(TMPyP4) bound a
G-quadruplex
In molecular biology, G-quadruplex secondary structures (G4) are formed in nucleic acids by sequences that are rich in guanine. They are helical in shape and contain guanine tetrads that can form from one, two or four strands. The unimolecular ...
r(G
4C
2) and inhibited the binding of proteins to r(G
4C
2). Work by Disney and Petrucelli rationally identified small molecules that can target this repeat and affect disease biology in model cellular systems and also in patient-derive iNeurons. Further studies by Rothstein and colleagues determined that TMPyP4 could suppress r(G
4C
2)-mediated
neurodegeneration
A neurodegenerative disease is caused by the progressive loss of structure or function of neurons, in the process known as neurodegeneration. Such neuronal damage may ultimately involve cell death. Neurodegenerative diseases include amyotrophic ...
in a ''Drosophila'' model. Additionally targets have been rationally identified by using a powerful seqecune-based design approach termed informal to identify dozens of bioactive small molecules that target disease causing non-coding RNA termed INFORNA.
This study important showed for the first time that small molecules appear to have selectivities that are competitive with oligonucleotides with cell-permeable and medicinally optimizable small molecules. Additionally, compounds have been shown to be bioactive in diverse disease settings that ranged from breast cancer.
and hepatocellular carcinoma. More recently, the Disney group further used their prediction database INFORNA to design Targaprimir-96 to target miRNA precursors in animal models of cancer, the first small molecules to do so. This compound has a nanomolar affinity for the miRNA hairpin precursor selectively over other sequences. Targaprimir-96 was further tested in cells and in mice, inhibiting tumor growth in a xenograft mouse model of triple negative breast cancer upon i.p. injection.
Cellular use
RNA-targeting small molecule drug discovery has greatly benefitted from the available cellular models for disease. The use of cell culture in early development has become a requirement for assessing the basic efficacy of a drug candidate. Thus, more research groups have implemented these techniques in their programs. In a leading example, Al-Hashimi and coworkers identified six small molecules with high affinity for TAR of HIV-1 through a computational approach. They docked a library of small molecules onto RNA dynamic structures generated by NMR and
Molecular Dynamics
Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of th ...
(MD) simulations. The hit molecules inhibited the Tat—TAR interaction ''in vitro''. They arrived at lead molecule,
netilmicin
Netilmicin (1-N-ethylsisomicin) is a semisynthetic aminoglycoside antibiotic, and a derivative of sisomicin, produced by ''Micromonospora inyoensis''. Aminoglycoside antibiotics have the ability to kill a wide variety of bacteria. Netilmicin is not ...
, that had the best selectivity for HIV-1 TAR and inhibited HIV-1 replication in cells with a low IC50. The Disney group has studied aminoglycoside derivatives in 2009 for their ability to inhibit interactions between repeat RNA and proteins. Using their prediction database INFORNA, they discovered that a compound could bind to 1 x 1 UU internal loops on an N-methyl peptide backbone. They confirmed that like other compounds that target DM1 r(CUG), they could inhibit the complex between r(CUG)-MBNL1, disrupt nuclear foci, and increase nucleocytoplasmic transport of the gene in patient-derived DM1
fibroblast
A fibroblast is a type of biological cell that synthesizes the extracellular matrix and collagen, produces the structural framework ( stroma) for animal tissues, and plays a critical role in wound healing. Fibroblasts are the most common cells of ...
s.
In that study the Disney group also described several approaches to validate the RNA targets of small molecules. In the first approach termed chemical cross-linking and isolation by pull down (Chem-CLIP) and chemical cross-linking and isolation by pull down to map binding sites (Chem-CLIP-Map).
These studies showed in cells that small molecules can direct target disease-causing r(CUG) repeats in DM1 and that impressively the compound can discriminate against other RNAs with shorter repeats and also between the mutant and wild type allele of the DMPK mRNA that contains r(CUG) disease-causing repeats. In an additional approach, dubbed small–molecule nucleic acid profiling by cleavage applied to RNA (Ribo-SNAP)
showed that small molecules can be used to cleave RNA targets in cells and also importantly demonstrated that designer small molecules target precisely disease causing RNA repeats and discriminate against RNAs that are not disease causing but have short repeats of r(CUG). Thus, targeting RNA structure with small molecules can have important selectively discrimination implications in cells.
In 2014, Chenowith and colleagues reported a cationic triptycene scaffold that targets RNA and DNA three-way junctions. Subsequent studies showed that these molecules exhibited favorable cellular uptake and cytotoxicity in human ovarian cancer cell lines. In 2017, the Xodo group reported anthrafurandione and anthrathiophenedione small molecules with aminoethyl side chains could bind to RNA
G-quadraplexes at the 5’-UTR of certain mRNAs. Further, these compounds were shown to suppress the
KRAS
''KRAS'' ( Kirsten rat sarcoma virus) is a gene that provides instructions for making a protein called K-Ras, a part of the RAS/MAPK pathway. The protein relays signals from outside the cell to the cell's nucleus. These signals instruct the cel ...
oncogene in
pancreatic cancer
Pancreatic cancer arises when cells in the pancreas, a glandular organ behind the stomach, begin to multiply out of control and form a mass. These cancerous cells have the ability to invade other parts of the body. A number of types of panc ...
cells and induce
apoptosis by reducing the
metabolic activity of the cells.
In vitro

In 2009, the Zimmerman group discovered a compound to target the trinucleotide repeat expanded RNA and DNA that cause DM1. Through rational design, they utilized a triaminotriazine recognition unit to target TT or UU mismatches through a Janus Wedge type binding mode, creating a base triplet with the mismatch. The combined use of an
acridine
Acridine is an organic compound and a nitrogen heterocycle with the formula C13H9N. Acridines are substituted derivatives of the parent ring. It is a planar molecule that is structurally related to anthracene with one of the central CH groups re ...
intercalator to pi-pi stack on the target gave a nanomolar binding affinity for TT or UU mismatches over others. Along with high binding affinity, this molecule was shown to displace MBNL from the complex with r(CUG) with a micromolar K
i. Additionally, HIV-1 RNA has been targeted extensively ''in vitro'' by RNA-binding small molecules. In 2007, Miller and coworkers used
dynamic combinatorial chemistry
Dynamic combinatorial chemistry (DCC); also known as constitutional dynamic chemistry (CDC) is a method to the generation of new molecules formed by reversible reaction of simple building blocks under thermodynamic control.Schaufelberger, F.; Ti ...
to screen a compound library against HIV-1 frameshift regulatory stem-loop RNA. They identified a hit compound that was selective for the regulatory sequence with micromolar binding affinity.
In 2011, Butcher and colleagues discovered a frameshifting stimulator (DB213) which bound to HIV-1 FS RNA with moderate binding affinity. An NMR structure of the RNA in complex with DB213, showed that the small molecule bound to the major groove of the RNA duplex. Schneekloth and
Hargrove have taken a different approach by targeting the HIV-1 TAR RNA hairpin. In a small molecule microarray screening, the Schneekloth group identified a thienopyridine derivative that interacts with HIV-1 TAR RNA hairpin. Further SAR studies provided more information on the structure and binding mode. The lead analogue was found to bind to the 5’-UTR of HIV with an IC
50 of 40 μM for displacing a Tat-derived peptide. The
Hargrove group developed a small library of
amiloride
Amiloride, sold under the trade name Midamor among others, is a medication typically used with other medications to treat high blood pressure or swelling due to heart failure or cirrhosis of the liver. Amiloride is classified as a potassium-spar ...
derivatives with changes at the C(5) and C(6) positions to improve the binding affinity of
amiloride
Amiloride, sold under the trade name Midamor among others, is a medication typically used with other medications to treat high blood pressure or swelling due to heart failure or cirrhosis of the liver. Amiloride is classified as a potassium-spar ...
to the loop and bulge of the HIV-1 TAR RNA. Using ''in vitro'' studies and modeling, they found a hit compound whose inhibition activity was increased by more than 100x compared to the parent amiloride. This compound is reported to be one of the tightest non-
aminoglycoside
Aminoglycoside is a medicinal and bacteriologic category of traditional Gram-negative antibacterial medications that inhibit protein synthesis and contain as a portion of the molecule an amino-modified glycoside ( sugar). The term can also refer ...
TAR ligands reported to date.
References
{{reflist
Anti–RNA virus drugs
Drug discovery
Cell biology
Molecular biology
RNA