ORF3b

ORF3b is a gene found in coronaviruses of the subgenus ''Sarbecovirus'', encoding a short non-structural protein. It is present in both SARS-CoV (which causes the disease SARS) and SARS-CoV-2 (which causes COVID-19), though the protein product has very different lengths in the two viruses. The encoded protein is significantly shorter in SARS-CoV-2, at only 22 amino acid residues compared to 153–155 in SARS-CoV. Both the longer SARS-CoV and shorter SARS-CoV-2 proteins have been reported as interferon antagonists. It is unclear whether the SARS-CoV-2 gene expresses a functional protein.


Nomenclature

There has been significant confusion in the scientific literature around the nomenclature used for the accessory proteins of SARS-CoV-2, especially several genes that overlap ORF3a. Due to differences in the genomes of SARS-CoV and SARS-CoV-2, two distinct open reading frames (ORFs) in the SARS-CoV-2 genome have been referred to as "ORF3b". In SARS-CoV, ORF3b is a gene of 155 codons. In SARS-CoV-2, the homologous region of the genome includes several stop codons in the same reading frame, resulting in a truncated gene of 22 codons. As a result, some papers have used the term "ORF3b" to refer to a later ORF with 57 codons. Exacerbating the confusion, both the 57-codon protein product and the 22-codon protein product have been described as having similar effects as interferon antagonists. In addition, the putative product of yet a third ORF of 41 codons has at least once been described as "3b protein". Numerous publications on SARS-CoV-2 refer ambiguously to "ORF3b". The recommended nomenclature for SARS-CoV-2 uses the term ''ORF3b'' for the 22-codon gene homologous to the 5' end of ORF3b in SARS-CoV. The term ''ORF3c'' is used for the 41-codon gene and the term ''ORF3d'' is used for the 57-codon gene.
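
The effect of an in-frame stop codon on ORF length can be illustrated with a minimal sketch. The sequences below are invented toy strings, not real SARS-CoV or SARS-CoV-2 sequence, and the function name `orf_codon_count` is hypothetical. The sketch assumes the standard stop codons (TAA, TAG, TGA) and counts codons up to, but not including, the first stop in the chosen reading frame.

```python
# Standard stop codons, written in DNA alphabet.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def orf_codon_count(seq, start=0):
    """Count codons from `start` up to (not including) the first in-frame stop codon.

    Hypothetical helper for illustration only. If no stop codon is found,
    the number of complete codons in the frame is returned.
    """
    seq = seq.upper()
    count = 0
    # Step through the sequence three nucleotides at a time in one reading frame.
    for i in range(start, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon in STOP_CODONS:
            return count
        count += 1
    return count


# Toy sequences (assumptions, not viral sequence):
# an uninterrupted ORF of 10 codons followed by a stop codon...
full_length = "ATG" + "GCT" * 9 + "TAA"
# ...and the same frame with a premature stop codon after 4 codons.
truncated = "ATG" + "GCT" * 3 + "TGA" + "GCT" * 5 + "TAA"

print(orf_codon_count(full_length))
print(orf_codon_count(truncated))
```

In the toy example, a single stop codon introduced into the same reading frame ends the ORF early, which is the kind of change that shortens a gene without removing the downstream sequence.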


Comparative genomics

Like other genes encoding accessory proteins, ORF3b is located in the genome near the genes encoding viral structural proteins. It is one of several overlapping genes in this region of the genome, overlapping ORF3a and, in SARS-CoV, the E gene encoding the envelope protein. Its length varies significantly, from 22 amino acids in SARS-CoV-2 to around 155 residues in SARS-CoV, with other related bat coronaviruses exhibiting intermediate truncations of varying lengths. It is the only ORF in the ''Sarbecovirus'' subgenus with significant length variations among known related viruses. Its sequence is not well conserved within the SARSr-CoV species.


Expression and localization

In SARS-CoV, the ORF3b protein is translated through an internal ribosome entry site (IRES). It has a nuclear localization signal at the C-terminus and has been localized to the nucleolus and mitochondria. It is not essential for viral replication. In SARS-CoV-2, it is unclear if ORF3b is functional. Proteomics studies, RNA sequencing of subgenomic RNA, ribosome profiling, and comparative genomics have all been used to examine the functional gene content of SARS-CoV-2 and found little evidence that ORF3b expresses a functional protein. The SARS-CoV-2 protein has been reported to localize primarily to the cytosol when expressed in cell culture. Truncated forms of the protein from bat coronaviruses are also reportedly cytosolic, likely due to loss of the C-terminal nuclear localization sequence.


Function


Cell growth

In SARS-CoV, ORF3b has been reported to induce G0/G1 cell cycle arrest and apoptosis when studied in cell culture.


Interferon antagonist

In SARS-CoV, ORF3b has been described as an interferon antagonist, suppressing the type I interferon response through inhibition of IRF3. Studies of the truncated SARS-CoV-2 ORF3b protein in cell culture suggest it is a more potent interferon antagonist than the SARS-CoV protein, which may be related to its length and to differences in subcellular localization.


Effect on AP-1

In SARS-CoV, ORF3b protein reportedly activates the transcription factor AP-1 through the JNK and ERK signaling pathways.

