Endogenous retroviruses (ERVs) are
endogenous viral element
An endogenous viral element (EVE) is a DNA sequence derived from a virus, and present within the germline of a non-viral organism. EVEs may be entire viral genomes ( proviruses), or fragments of viral genomes. They arise when a viral DNA sequence ...
s in the
genome
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
that closely resemble and can be derived from
retrovirus
A retrovirus is a type of virus that inserts a DNA copy of its RNA genome into the DNA of a host cell that it invades, thus changing the genome of that cell. After invading a host cell's cytoplasm, the virus uses its own reverse transcriptase e ...
es. They are abundant in the genomes of
jawed vertebrates
Gnathostomata (; from Ancient Greek: (') 'jaw' + (') 'mouth') are jawed vertebrates. Gnathostome diversity comprises roughly 60,000 species, which accounts for 99% of all extant vertebrates, including all living bony fishes (both ray-finned ...
, and they comprise up to 5–8% of the human genome (lower estimates of ~1%).
[
An ERV is a vertically inherited proviral sequence and a subclass of a type of ]gene
In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protei ...
called a transposon
A transposable element (TE), also transposon, or jumping gene, is a type of mobile genetic element, a nucleic acid sequence in DNA that can change its position within a genome.
The discovery of mobile genetic elements earned Barbara McClinto ...
, which can normally be packaged and moved within the genome to serve a vital role in gene expression
Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
and in regulation
Regulation is the management of complex systems according to a set of rules and trends. In systems theory, these types of rules exist in various fields of biology and society, but the term has slightly different meanings according to context. Fo ...
. ERVs, however, lack most transposon functions, are typically not infectious, and are often defective genomic remnants of the retroviral replication cycle. They are distinguished as germline
In biology and genetics, the germline is the population of a multicellular organism's cells that develop into germ cells. In other words, they are the cells that form gametes ( eggs and sperm), which can come together to form a zygote. They dif ...
provirus
A provirus is a virus genome that is integrated into the DNA of a host cell. In the case of bacterial viruses (bacteriophages), proviruses are often referred to as prophages. However, proviruses are distinctly different from prophages and these te ...
retroelements
Retrotransposons (also called Class I transposable elements) are mobile elements which move in the host genome by converting their transcribed RNA into DNA through reverse transcription. Thus, they differ from Class II transposable elements, or ...
due to their integration and reverse-transcription into the nuclear genome of the host cell.
Researchers have suggested that retroviruses evolved from a type of transposon called a retrotransposon
Retrotransposons (also called Class I transposable elements) are mobile elements which move in the host genome by converting their transcribed RNA into DNA through reverse transcription. Thus, they differ from Class II transposable elements, or ...
, a Class I element;[ these genes can mutate and instead of moving to another location in the genome, they can become exogenous or pathogenic. This means that not all ERVs may have originated as an insertion by a retrovirus but that some may have been the source for the genetic information in the retroviruses they resemble. When integration of viral DNA occurs in the germ-line, it can give rise to an ERV, which can later become fixed in the gene pool of the host population.]
Formation
The replication cycle of a retrovirus
A retrovirus is a type of virus that inserts a DNA copy of its RNA genome into the DNA of a host cell that it invades, thus changing the genome of that cell. After invading a host cell's cytoplasm, the virus uses its own reverse transcriptase e ...
entails the insertion ("integration") of a DNA copy of the viral genome
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
into the nuclear
Nuclear may refer to:
Physics
Relating to the nucleus of the atom:
*Nuclear engineering
*Nuclear physics
*Nuclear power
*Nuclear reactor
*Nuclear weapon
*Nuclear medicine
*Radiation therapy
*Nuclear warfare
Mathematics
* Nuclear space
*Nuclear ...
genome of the host
A host is a person responsible for guests at an event or for providing hospitality during it.
Host may also refer to:
Places
* Host, Pennsylvania, a village in Berks County
* Host Island, in the Wilhelm Archipelago, Antarctica
People
* ...
cell
Cell most often refers to:
* Cell (biology), the functional basic unit of life
* Cellphone, a phone connected to a cellular network
* Clandestine cell, a penetration-resistant form of a secret or outlawed organization
* Electrochemical cell, a de ...
. Most retroviruses infect somatic cells
In cellular biology, a somatic cell (), or vegetal cell, is any biological cell forming the body of a multicellular organism other than a gamete, germ cell, gametocyte or undifferentiated stem cell. Somatic cells compose the body of an organism a ...
, but occasional infection of germline
In biology and genetics, the germline is the population of a multicellular organism's cells that develop into germ cells. In other words, they are the cells that form gametes ( eggs and sperm), which can come together to form a zygote. They dif ...
cells (cells that produce eggs and sperm) can also occur. Rarely, retroviral integration may occur in a germline cell that goes on to develop into a viable organism. This organism will carry the inserted retroviral genome as an integral part of its own genome—an "endogenous" retrovirus (ERV) that may be inherited by its offspring
In biology, offspring are the young creation of living organisms, produced either by sexual reproduction, sexual or asexual reproduction. Collective offspring may be known as a brood or progeny. This can refer to a set of simultaneous offspring ...
as a novel allele
An allele is a variant of the sequence of nucleotides at a particular location, or Locus (genetics), locus, on a DNA molecule.
Alleles can differ at a single position through Single-nucleotide polymorphism, single nucleotide polymorphisms (SNP), ...
. Many ERVs have persisted in the genome of their hosts for millions of years. However, most of these have acquired inactivating mutation
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, ...
s during host DNA replication
In molecular biology, DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule. DNA replication occurs in all life, living organisms, acting as the most essential part of heredity, biolog ...
and are no longer capable of producing the virus. ERVs can also be partially excised from the genome by a process known as recombinational deletion, in which recombination between the identical sequences that flank newly integrated retroviruses results in deletion of the internal, protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
-coding regions of the viral genome.
The general retrovirus genome consists of three genes vital for the invasion, replication, escape, and spreading of its viral genome. These three genes are gag (encodes for structural proteins for the viral core
A capsid is the protein shell of a virus, enclosing its genetic material. It consists of several oligomeric (repeating) structural subunits made of protein called protomers. The observable 3-dimensional morphological subunits, which may or ma ...
), pol (encodes for reverse transcriptase
A reverse transcriptase (RT) is an enzyme used to convert RNA genome to DNA, a process termed reverse transcription. Reverse transcriptases are used by viruses such as HIV and hepatitis B to replicate their genomes, by retrotransposon mobi ...
, integrase
Retroviral integrase (IN) is an enzyme
An enzyme () is a protein that acts as a biological catalyst by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme ...
, and protease
A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalysis, catalyzes proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the formation of new protein products ...
), and env (encodes for coat proteins for the virus's exterior). These viral proteins are encoded as polyprotein
Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Protein degradation is a major regulatory mechanism of gene expression and contributes substantially to shaping mammalian proteomes. Uncatalysed, the hydrolysis o ...
s. In order to carry out their life cycle, the retrovirus relies heavily on the host cell's machinery. Protease degrades peptide bonds of the viral polyproteins, making the separate proteins functional. Reverse transcriptase functions to synthesize viral DNA from the viral RNA in the host cell's cytoplasm before it enters the nucleus. Integrase guides the integration of viral DNA into the host genome.
Over time, the genome of ERVs not only acquire point mutations, but also shuffle and recombine with other ERVs. ERVs with a decayed sequence for the ''env'' become more likely to propagate.
Role in genomic evolution
Endogenous retroviruses can play an active role in shaping genomes. Most studies in this area have focused on the genomes of humans and higher primates, but other vertebrates, such as mice and sheep, have also been studied in depth. The long terminal repeat ( LTR) sequences that flank ERV genomes frequently act as alternate promoters and enhancers
In genetics, an enhancer is a short (50–1500 bp) region of DNA that can be bound by proteins ( activators) to increase the likelihood that transcription of a particular gene will occur. These proteins are usually referred to as transcriptio ...
, often contributing to the transcriptome
The transcriptome is the set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells. The term can also sometimes be used to refer to all RNAs, or just mRNA, depending on the particular experiment. The ...
by producing tissue-specific variants. In addition, the retroviral protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
s themselves have been co-opted to serve novel host functions, particularly in reproduction and development. Recombination between homologous retroviral sequences has also contributed to gene shuffling and the generation of genetic variation. Furthermore, in the instance of potentially antagonistic effects of retroviral sequences, repressor genes have co-evolved to combat them.
About 90% of endogenous retroviruses are solo LTRs, lacking all open reading frame
In molecular biology, reading frames are defined as spans of DNA sequence between the start and stop codons. Usually, this is considered within a studied region of a prokaryotic DNA sequence, where only one of the six possible reading frames ...
s (ORFs). Solo LTRs and LTRs associated with complete retroviral sequences have been shown to act as transcriptional elements on host genes. Their range of action is mainly by insertion into the 5' untranslated regions (UTRs) of protein coding genes; however, they have been known to act upon genes up to 70–100 kb away. The majority of these elements are inserted in the sense direction (5' to 3') to their corresponding genes, but there has been evidence of LTRs acting in the antisense direction (3' to 5') and as a bidirectional promoter for neighboring genes. In a few cases, the LTR functions as the major promoter for the gene.
For example, in humans, amylase alpha 1C (AMY1C) has a complete ERV sequence in its promoter region; the associated LTR confers salivary specific expression of the digestive enzyme
An enzyme () is a protein that acts as a biological catalyst by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different mol ...
amylase
An amylase () is an enzyme that catalysis, catalyses the hydrolysis of starch (Latin ') into sugars. Amylase is present in the saliva of humans and some other mammals, where it begins the chemical process of digestion. Foods that contain large ...
. Also, the primary promoter for bile acid-CoA:amino acid N-acyltransferase (BAAT), which codes for an enzyme that is integral in bile metabolism, is of LTR origin.
The insertion of a solo ERV-9 LTR may have produced a functional open reading frame, causing the rebirth of the human immunity related GTPase gene (IRGM). ERV insertions have also been shown to generate alternative splice sites either by direct integration into the gene, as with the human leptin hormone receptor, or driven by the expression of an upstream LTR, as with the phospholipase A-2 like protein.
Most of the time, however, the LTR functions as one of many alternate promoters, often conferring tissue-specific expression related to reproduction and development. In fact, 64% of known LTR-promoted transcription variants are expressed in reproductive tissues. For example, the gene CYP19 codes for aromatase
Aromatase (), also called estrogen synthetase or estrogen synthase, is an enzyme responsible for a key step in the biosynthesis of estrogens. It is CYP19A1, a member of the cytochrome P450 superfamily, which are monooxygenases that catalyze man ...
P450, an important enzyme for estrogen synthesis, that is normally expressed in the brain and reproductive organs of most mammals. However, in primates, an LTR-promoted transcriptional variant confers expression to the placenta and is responsible for controlling estrogen levels during pregnancy. Furthermore, the neuronal apoptosis inhibitory protein (NAIP), normally widespread, has an LTR of the human ERV-P (HERV-P) family acting as a promoter that confers expression to the testis and prostate. Other proteins, such as nitric oxide synthase 3 (NOS3
Endothelial NOS (eNOS), also known as nitric oxide synthase 3 (NOS3) or constitutive NOS (cNOS), is an enzyme that in humans is encoded by the ''NOS3'' gene located in the 7q35-7q36 region of chromosome 7. This enzyme is one of three isoforms tha ...
), interleukin-2 receptor B ( IL-2RB), and another mediator of estrogen synthesis, HSD17B1, are also alternatively regulated by LTRs that confer placental expression, but their specific functions are not yet known. The high degree of reproductive expression is thought to be an after effect of the method by which they were endogenized; however, this also may be due to a lack of DNA methylation
DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter (genetics), promoter, DNA methylati ...
in germ-line tissues.
The best-characterized instance of placental protein expression comes not from an alternatively promoted host gene but from a complete co-option of a retroviral protein. Retroviral fusogenic env proteins, which play a role in the entry of the virion into the host cell, have had an important impact on the development of the mammalian placenta
The placenta (: placentas or placentae) is a temporary embryonic and later fetal organ that begins developing from the blastocyst shortly after implantation. It plays critical roles in facilitating nutrient, gas, and waste exchange between ...
. In mammals, intact env proteins called syncytins are responsible for the formation and function of syncytiotrophoblasts
The syncytiotrophoblast (from the Greek 'syn'- "together"; 'cytio'- "of cells"; 'tropho'- "nutrition"; 'blast'- "bud") is the epithelial covering of the highly vascular embryonic placental villi, which invades the wall of the uterus to establish ...
. These multinucleated cells are mainly responsible for maintaining nutrient exchange and separating the fetus from the mother's immune system. It has been suggested that the selection and fixation of these proteins for this function have played a critical role in the evolution of viviparity
In animals, viviparity is development of the embryo inside the body of the mother, with the maternal circulation providing for the metabolic needs of the embryo's development, until the mother gives birth to a fully or partially developed juv ...
.
In addition, the insertion of ERVs and their respective LTRs have the potential to induce chromosomal rearrangement due to recombination between viral sequences at inter-chromosomal loci. These rearrangements have been shown to induce gene duplications and deletions that largely contribute to genome plasticity and dramatically change the dynamic of gene function. Furthermore, retroelements in general are largely prevalent in rapidly evolving, mammal-specific gene families whose function is largely related to the response to stress and external stimuli. In particular, both human class I and class II MHC genes have a high density of HERV elements as compared to other multi-locus-gene families. It has been shown that HERVs have contributed to the formation of extensively duplicated duplicon blocks that make up the human leukocyte antigen (HLA) class 1 family of genes. More specifically, HERVs primarily occupy regions within and between the break points of these blocks, suggesting that considerable duplication and deletion events, typically associated with unequal crossover, facilitated their formation. The generation of these blocks, inherited as immunohaplotypes, act as a protective polymorphism against a wide range of antigens that may have imbued humans with an advantage over other primates.
The characteristic of placenta
The placenta (: placentas or placentae) is a temporary embryonic and later fetal organ that begins developing from the blastocyst shortly after implantation. It plays critical roles in facilitating nutrient, gas, and waste exchange between ...
s being very evolutionary distinct organs between different species has been suggested to result from the co-option
Co-option, also known as co-optation and sometimes spelt cooption or cooptation, is a term with three common meanings. It may refer to:
1) The process of adding members to an elite Social group, group at the discretion of members of the body, us ...
of ERV enhancers. Regulatory mutations, instead of mutations in genes that encode for hormone
A hormone (from the Ancient Greek, Greek participle , "setting in motion") is a class of cell signaling, signaling molecules in multicellular organisms that are sent to distant organs or tissues by complex biological processes to regulate physio ...
s and growth factor
A growth factor is a naturally occurring substance capable of stimulating cell proliferation, wound healing, and occasionally cellular differentiation. Usually it is a secreted protein or a steroid hormone. Growth factors are important for ...
s, support the known evolution of placental morphology, especially since the majority of hormone and growth factor genes are expressed in response to pregnancy, not during placental development. Researchers studied the regulatory landscape of placental development between the rat and mouse, two closely related species. This was done by mapping all regulatory elements of the rat trophoblast
The trophoblast (from Greek language, Greek : to feed; and : germinator) is the outer layer of cells of the blastocyst. Trophoblasts are present four days after Human fertilization, fertilization in humans. They provide nutrients to the embryo an ...
stem cell
In multicellular organisms, stem cells are undifferentiated or partially differentiated cells that can change into various types of cells and proliferate indefinitely to produce more of the same stem cell. They are the earliest type of cell ...
s (TSCs) and comparing them to their ortholog
Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a speci ...
s in mouse TSCs. TSCs were observed because they reflect the initial cells that develop in the fetal placenta. Regardless of their tangible similarities, enhancer and repressed regions were mostly species-specific. However, most promoter sequences were conserved between mouse and rat. In conclusion to their study, researchers proposed that ERVs influenced species-specific placental evolution through mediation of placental growth, immunosuppression
Immunosuppression is a reduction of the activation or efficacy of the immune system. Some portions of the immune system itself have immunosuppressive effects on other parts of the immune system, and immunosuppression may occur as an adverse react ...
, and cell fusion Cell fusion is an important cellular process in which several uninucleate cells (cells with a single nucleus) combine to form a multinucleate cell, known as a syncytium. Cell fusion occurs during differentiation of myoblasts, osteoclasts and ...
.
Another example of ERV exploiting cellular mechanisms is p53
p53, also known as tumor protein p53, cellular tumor antigen p53 (UniProt name), or transformation-related protein 53 (TRP53) is a regulatory transcription factor protein that is often mutated in human cancers. The p53 proteins (originally thou ...
, a tumor suppressor gene
A tumor suppressor gene (TSG), or anti-oncogene, is a gene that regulates a cell (biology), cell during cell division and replication. If the cell grows uncontrollably, it will result in cancer. When a tumor suppressor gene is mutated, it results ...
(TSG). DNA damage and cellular stress induces the p53 pathway, which results in cell apoptosis
Apoptosis (from ) is a form of programmed cell death that occurs in multicellular organisms and in some eukaryotic, single-celled microorganisms such as yeast. Biochemistry, Biochemical events lead to characteristic cell changes (Morphology (biol ...
. Using chromatin immunoprecipitation with sequencing, thirty-percent of all p53-binding sites were located within copies of a few primate-specific ERV families. A study suggested that this benefits retroviruses because p53's mechanism provides a rapid induction of transcription, which leads to the exit of viral RNA from the host cell.[
Finally, the insertion of ERVs or ERV elements into genic regions of host DNA, or overexpression of their transcriptional variants, has a much higher potential to produce deleterious effects than positive ones. Their appearance into the genome has created a ]coevolution
In biology, coevolution occurs when two or more species reciprocally affect each other's evolution through the process of natural selection. The term sometimes is used for two traits in the same species affecting each other's evolution, as well a ...
ary dynamic that proliferated the duplication and expansion of repressor genes. The most clear-cut example of this involves the rapid duplication and proliferation of tandem zinc-finger
A zinc finger is a small protein structural motif that is characterized by the coordination of one or more zinc ions (Zn2+) which stabilizes the fold. The term ''zinc finger'' was originally coined to describe the finger-like appearance of a hy ...
genes in mammal genomes. Zinc-finger genes, particularly those that include a (Krüppel associated box) KRAB domain, exist in high copy number in vertebrate genomes, and their range of functions are limited to transcriptional roles. It has been shown in mammals, however, that the diversification of these genes was due to multiple duplication and fixation events in response to new retroviral sequences or their endogenous copies to repress their transcription.
Role in disease
The majority of ERVs that occur in vertebrate genomes are ancient, inactivated by mutation, and have reached genetic fixation in their host species. For these reasons, they are extremely unlikely to have negative effects on their hosts except under unusual circumstances. Nevertheless, it is clear from studies in birds and non-human mammal species including mice, cats and koalas
The koala (''Phascolarctos cinereus''), sometimes inaccurately called the koala bear, is an arboreal herbivorous marsupial native to Australia. It is the only extant representative of the family '' Phascolarctidae''. Its closest living re ...
, that younger (i.e., more recently integrated) ERVs can be associated with disease. The number of active ERVs in the genome of mammals is negatively related to their body size, suggesting a contribution to Peto's paradox
Peto's paradox is the observation that, at the species level, the incidence of cancer does not appear to correlate with the number of cells in an organism. For example, the incidence of cancer in humans is much higher than the incidence of cancer ...
through cancer pathogenesis. This has led researchers to propose a role for ERVs in several forms of human cancer and autoimmune disease
An autoimmune disease is a condition that results from an anomalous response of the adaptive immune system, wherein it mistakenly targets and attacks healthy, functioning parts of the body as if they were foreign organisms. It is estimated tha ...
, although conclusive evidence is lacking.
Neurological disorders
In humans, ERVs have been proposed to be involved in multiple sclerosis
Multiple sclerosis (MS) is an autoimmune disease resulting in damage to myelinthe insulating covers of nerve cellsin the brain and spinal cord. As a demyelinating disease, MS disrupts the nervous system's ability to Action potential, transmit ...
(MS). A specific association between MS and the ERVWE1, or "syncytin" gene, which is derived from an ERV insertion, has been reported, along with the presence of an "MS-associated retrovirus" (MSRV), in patients with the disease. HERVs have also been implicated in amyotrophic lateral sclerosis (ALS
Amyotrophic lateral sclerosis (ALS), also known as motor neuron disease (MND) or—in the United States—Lou Gehrig's disease (LGD), is a rare, terminal neurodegenerative disorder that results in the progressive loss of both upper and low ...
) and addiction.
In 2004 it was reported that antibodies to HERVs were found in greater frequency in the sera of people with schizophrenia
Schizophrenia () is a mental disorder characterized variously by hallucinations (typically, Auditory hallucination#Schizophrenia, hearing voices), delusions, thought disorder, disorganized thinking and behavior, and Reduced affect display, f ...
. Additionally, the cerebrospinal fluid
Cerebrospinal fluid (CSF) is a clear, colorless Extracellular fluid#Transcellular fluid, transcellular body fluid found within the meninges, meningeal tissue that surrounds the vertebrate brain and spinal cord, and in the ventricular system, ven ...
of people with recent onset schizophrenia contained levels of a retroviral marker, reverse transcriptase
A reverse transcriptase (RT) is an enzyme used to convert RNA genome to DNA, a process termed reverse transcription. Reverse transcriptases are used by viruses such as HIV and hepatitis B to replicate their genomes, by retrotransposon mobi ...
, four times higher than control subjects. Researchers continue to look at a possible link between HERVs and schizophrenia, with the additional possibility of a triggering infection inducing schizophrenia.
Immunity
ERVs have been found to be associated to disease not only through disease-causing relations, but also through immunity. The frequency of ERVs in LTRs likely correlates to viral adaptations to take advantage of immunity signaling pathways that promote viral transcription and replication. A study done in 2016 investigated the benefit of ancient viral DNA integrated into a host through gene regulation networks induced by interferons
Interferons (IFNs, ) are a group of signaling proteins made and released by host cells in response to the presence of several viruses. In a typical scenario, a virus-infected cell will release interferons causing nearby cell (biology), cell ...
, a branch of innate immunity. These cytokines
Cytokines () are a broad and loose category of small proteins (~5–25 kDa) important in cell signaling.
Cytokines are produced by a broad range of cells, including immune cells like macrophages, B cell, B lymphocytes, T cell, T lymphocytes ...
are first to respond to viral infection and are also important in immunosurveillance for malignant cells. ERVs are predicted to act as cis-regulatory elements, but much of the adaptive consequences for certain physiological functions is still unknown. There is data that supports the general role of ERVs in the regulation of human interferon (IFN) response, specifically to interferon-gamma
Interferon gamma (IFNG or IFN-γ) is a dimerized soluble cytokine that is the only member of the type II class of interferons. The existence of this interferon, which early in its history was known as immune interferon, was described by E. F. ...
(IFN-G). For example, IFN-stimulated genes were found to be greatly enriched with ERVs bound by signal transducer and activator of transcription 1 (STAT1) and/or Interferon regulatory factor
Interferon regulatory factors (IRF) are proteins which regulate transcription of interferons (see regulation of gene expression). Interferon regulatory factors contain a conserved N-terminal region of about 120 amino acids, which folds into a s ...
(IRF1) in CD14+ macrophages.
HERVs also play various roles shaping the human innate immunity
The innate immune system or nonspecific immune system is one of the two main immunity strategies in vertebrates (the other being the adaptive immune system). The innate immune system is an alternate defense strategy and is the dominant immune s ...
response, with some sequences activating the system and others suppressing it. They may also protect from exogenous retroviral infections: the virus-like transcripts can activate pattern recognition receptor
Pattern recognition receptors (PRRs) play a crucial role in the proper function of the innate immune system. PRRs are germline-encoded host sensors, which detect molecules typical for the pathogens. They are proteins expressed mainly by cells of th ...
s, and the proteins can interfere with active retroviruses. A gag protein from HERV-K(HML2) is shown to mix with HIV Gag, impairing HIV capsid formation as a result.
Gene regulation
Another idea proposed was that ERVs from the same family played a role in recruiting multiple genes into the same network of regulation. It was found that MER41 elements provided additional redundant regulatory enhancement to the genes located near STAT1 binding sites.
Role in medicine
Porcine endogenous retrovirus
For humans, porcine
The pig (''Sus domesticus''), also called swine (: swine) or hog, is an omnivorous, domesticated, even-toed, hoofed mammal. It is named the domestic pig when distinguishing it from other members of the genus '' Sus''. Some authorities consid ...
endogenous retroviruses (PERVs) pose a concern when using porcine tissues and organs in xenotransplantation, the transplanting of living cells, tissues, and organs from an organism of one species to an organism of different species. Although pigs are generally the most suitable donors to treat human organ diseases due to practical, financial, safety, and ethical reasons, PERVs previously could not be removed from pigs, due to their viral ability to integrate into the host genome and pass onto host offspring. This changed in 2017 when one lab, using CRISPR-Cas9
Cas9 (CRISPR associated protein 9, formerly called Cas5, Csn1, or Csx12) is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic ...
, removed all 62 retroviruses from the pig genome. The consequences of cross-species transmission remain unexplored and have dangerous potential.
Researchers have indicated that infection of human tissues by PERVs is very possible, especially in immunosuppressed individuals. An immunosuppressed condition could potentially permit a more rapid and tenacious replication of viral DNA, and would later have less difficulty adapting to human-to-human transmission. Although known infectious pathogens present in the donor organ/tissue can be eliminated by breeding pathogen-free herds, unknown retroviruses can be present in the donor. These retroviruses are often latent and asymptomatic in the donor, but can become active in the recipient. Some examples of endogenous viruses that can infect and multiply in human cells are from baboons (BaEV), cats (RD114), and mice.
There are three different classes of PERVs, PERV-A, PERV-B, and PERV-C. PERV-A and PERV-B are polytropic
A polytropic process is a thermodynamic process that obeys the relation:
p V^ = C
where ''p'' is the pressure, ''V'' is volume, ''n'' is the polytropic index, and ''C'' is a constant. The polytropic process equation describes expansion and com ...
and can infect human cells in vitro, while PERV-C is ecotropic Ecotropism or ecotropic (from ''eco'' – hearth and ''tropic'' – to turn towards) refers to the philosophy that for human culture to be healthy, it must exist as in an ecological niche and thereby relate appropriately with all the fields of force ...
and does not replicate on human cells. The major differences between the classes is in the receptor binding domain of the ''env'' protein and the LTRs that influence the replication of each class. PERV-A and PERV-B display LTRs that have repeats in the U3 region. However, PERV-A and PERV-C show repeatless LTRs. Researchers found that PERVs in culture actively adapted to the repeat structure of their LTR in order to match the best replication performance a host cell could perform. At the end of their study, researchers concluded that repeatless PERV LTR evolved from the repeat-harboring LTR. This was likely to have occurred from insertional mutation and was proven through use of data on LTR and ''env''/Env. It is thought that the generation of repeatless LTRs could be reflective of an adaptation process of the virus, changing from an exogenous to an endogenous lifestyle.
A clinical trial study performed in 1999 sampled 160 patients who were treated with different living pig tissues and observed no evidence of a persistent PERV infection in 97% of the patients from whom a sufficient amount of DNA was available to PCR for amplification of PERV sequences. This study stated found that retrospective studies are limited to finding the true incidence of infection or associated clinical symptoms. It suggested using closely monitored prospective trials which would provide a more complete and detailed evaluation of the possible cross-species PERV transmission and a comparison of the PERV.
Human endogenous retroviruses
HERVs comprise a significant part of the human genome
The human genome is a complete set of nucleic acid sequences for humans, encoded as the DNA within each of the 23 distinct chromosomes in the cell nucleus. A small DNA molecule is found within individual Mitochondrial DNA, mitochondria. These ar ...
, with approximately 98,000 ERV elements and fragments making up 5–8%. According to a study published in 2005, no HERVs capable of replication had been identified; all appeared to be defective, containing major deletions or nonsense mutations (not true for HERV-K). This is because most HERVs are merely traces of original viruses, having first integrated millions of years ago. An analysis of HERV integrations is ongoing as part of the 100,000 Genomes Project
The 100,000 Genomes Project is a now-completed UK Government project managed by Genomics England that is sequencing whole genomes from National Health Service patients. The project is focusing on rare diseases, some common types of cancer, and i ...
.
A 2023 study found HERV can become awakened from dormant states and contribute to aging
Ageing (or aging in American English) is the process of becoming Old age, older until death. The term refers mainly to humans, many other animals, and fungi; whereas for example, bacteria, perennial plants and some simple animals are potentiall ...
which could be blocked by neutralizing antibodies
A neutralizing antibody (NAb) is an antibody that defends a cell from a pathogen or infectious particle by neutralizing any effect it has biologically. Neutralization renders the particle no longer infectious or pathogenic.
Neutralizing antibo ...
.
HERVs were originally discovered when human genomic libraries were screened under low-stringency conditions using either probes from animal retroviruses or by using oligonucleotide
Oligonucleotides are short DNA or RNA molecules, oligomers, that have a wide range of applications in genetic testing, Recombinant DNA, research, and Forensic DNA, forensics. Commonly made in the laboratory by Oligonucleotide synthesis, solid-phase ...
s with similarity to virus sequences.
Classification
HERVs are classified based on their homologies to animal retroviruses. Families belonging to Class I are similar in sequence to mammalian ''Gammaretrovirus
''Gammaretrovirus'' is a genus in the ''Retroviridae'' family. Example species are the murine leukemia virus and the feline leukemia virus. They cause various sarcomas, leukemias and immune deficiencies in mammals, reptiles and birds.
Introdu ...
es'' (type C) and ''Epsilonretrovirus
''Epsilonretrovirus'' is a genus of viruses in the family ''Retroviridae
A retrovirus is a type of virus that inserts a DNA copy of its RNA genome into the DNA of a host cell that it invades, thus changing the genome of that cell. After inv ...
es'' (Type E). Families belonging to Class II show homology to mammalian ''Betaretrovirus
''Betaretrovirus'' is a genus of viruses in the Family (biology), family ''Retroviridae''.
Example of viruses in the genus include: Mouse mammary tumor virus, enzootic nasal tumor virus (ENTV-1, ENTV-2), and simian retrovirus types 1, 2 and 3 ( ...
es'' (Type B) and ''Deltaretrovirus
''Deltaretrovirus'' is a genus of the ''Retroviridae'' family. It consists of exogenous horizontally transmitted viruses found in several groups of mammals. The genus contains Bovine leukemia virus and three species of primate T-lymphotropic v ...
es'' (Type D). Families belonging to Class III are similar to foamy viruses. For all classes, if homologies appear well conserved in the ''gag'', ''pol'', and ''env'' gene, they are grouped into a superfamily
SUPERFAMILY is a database and search platform of structural and functional annotation for all proteins and genomes. It classifies amino acid sequences into known structural domains, especially into SCOP superfamilies. Domains are functional, str ...
. There are more Class I families known to exist. The families themselves are named in a less uniform manner, with a mixture of naming based on an exogenous retrovirus, the priming tRNA ( HERV-W, HERV-K), or some neighboring gene (HERV-ADP), clone number (HERV-S71), or some amino acid motif (HERV-FRD). A proposed nomenclature aims to clean up the sometimes paraphyletic standards.
Origin
Sometime during human evolution, exogenous progenitors of HERV inserted themselves into germ line cells and then replicated along with the host's genes using and exploiting the host's cellular mechanisms. Because of their distinct genomic structure, HERVs were subjected to many rounds of amplification and transposition, which led to a more widespread distribution of retroviral DNA.
Nevertheless, one family of viruses has been active since the divergence of human
Humans (''Homo sapiens'') or modern humans are the most common and widespread species of primate, and the last surviving species of the genus ''Homo''. They are Hominidae, great apes characterized by their Prehistory of nakedness and clothing ...
s and chimpanzee
The chimpanzee (; ''Pan troglodytes''), also simply known as the chimp, is a species of Hominidae, great ape native to the forests and savannahs of tropical Africa. It has four confirmed subspecies and a fifth proposed one. When its close rel ...
s. This family, termed HERV-K (HML2), makes up less than 1% of HERV elements but is one of the most studied. There are indications it has even been active in the past few hundred thousand years, e.g., some human individuals carry more copies of HML2 than others. Traditionally, age estimates of HERVs are performed by comparing the 5' and 3' LTR of a HERV; however, this method is only relevant for full-length HERVs. A recent method, called cross-sectional dating, uses variations within a single LTR to estimate the ages of HERV insertions. This method is more precise in estimating HERV ages and can be used for any HERV insertions. Cross-sectional dating has been used to suggest that two members of HERV-K (HML2), HERV-K106 and HERV-K116, were active in the last 800,000 years and that HERV-K106 may have infected modern humans 150,000 years ago. However, the absence of known infectious members of the HERV-K (HML2) family, and the lack of elements with a full coding potential within the published human genome sequence, suggests to some that the family is less likely to be active at present. In 2006 and 2007, researchers working independently in France and USA recreated functional versions of HERV-K (HML2).
Expression of HERV proteins
The expression of HERV-K, a biologically active family of HERV, produces proteins found in placenta. Furthermore, the expression of the envelope genes of HERV-W
ERVW-1
) and HERV-FRD
ERVFRD-1
) produces syncytins which are important for the generation of the syncytiotrophoblast
The syncytiotrophoblast (from the Greek 'syn'- "together"; 'cytio'- "of cells"; 'tropho'- "nutrition"; 'blast'- "bud") is the epithelial covering of the highly vascular embryonic placental villi, which invades the wall of the uterus to establish ...
cell layer during placentogenesis by inducing cell-cell fusion. The HUGO Gene Nomenclature Committee
The HUGO Gene Nomenclature Committee (HGNC) is a committee of the Human Genome Organisation (HUGO) that sets the standards for human gene nomenclature. The HGNC approves a ''unique'' and ''meaningful'' name for every known human gene, based on a ...
(HGNC) approves gene symbols for transcribed human ERVs.
Functional impact
MER41.AIM2 is an HERV that regulates the transcription of AIM2 (Absent in Melanoma 2) which encodes for a sensor of foreign cytosolic DNA. This acts as a binding site for AIM2, meaning that it is necessary for the transcription of AIM2. Researchers had shown this by deleting MER41.AIM2 in HeLa cells using CRISPR/Cas9, leading to an undetectable transcript level of AIM2 in modified HeLa cells. The control cells, which still contained the MER41.AIM2 ERV, were observed with normal amounts of AIM2 transcript. In terms of immunity, researchers concluded that MER41.AIM2 is necessary for an inflammatory response to infection.
Activation by exogenous viruses
Considerable evidence indicates that HERVs can be reactivated by viral infections, such as:
* retroviruses – human immunodeficiency virus type-1 (HIV-1
The subtypes of HIV include two main subtypes, known as HIV type 1 (HIV-1) and HIV type 2 (HIV-2). These subtypes have distinct genetic differences and are associated with different epidemiological patterns and clinical characteristics.
HIV-1 e ...
), human T-lymphotropic virus 1
Human T-cell lymphotropic virus type 1 or human T-lymphotropic virus (HTLV-I), also called the adult T-cell lymphoma virus type 1, is a retrovirus of the human T-lymphotropic virus (HTLV) family.
Most people with HTLV-1 infection do not appear t ...
(HTLV-1);
* RNA viruses – influenza A virus
''Influenza A virus'' (''Alphainfluenzavirus influenzae'') or IAV is the only species of the genus ''Alphainfluenzavirus'' of the virus family '' Orthomyxoviridae''. It is a pathogen with strains that infect birds and some mammals, as well as c ...
, hepatitis C virus
The hepatitis C virus (HCV) is a small (55–65 nm in size), enveloped, positive-sense single-stranded RNA virus of the family ''Flaviviridae''. The hepatitis C virus is the cause of hepatitis C and some cancers such as liver cancer ( hepatoc ...
(HCV), severe acute respiratory syndrome coronavirus-2 ( SARSCoV-2);
* DNA viruses – herpes simplex virus type-1 (HSV-1), Epstein-Barr virus (EBV), human cytomegalovirus
Human cytomegalovirus (HCMV), also called human herpesvirus 5 (HHV-5), is a species of virus in the genus ''Cytomegalovirus'', which in turn is a member of the viral family known as ''Herpesviridae'' or herpesviruses. It is also commonly call ...
(CMV), Kaposi’s sarcoma-associated herpesvirus
Kaposi's sarcoma-associated herpesvirus (KSHV) is the ninth known human herpesvirus. It is also called Human herpesvirus 8, or HHV-8 in short. This virus causes Kaposi's sarcoma, a cancer commonly occurring in AIDS patients, as well as primary ef ...
(KSHV)
Several studies have shown that EBV is able to transactivate the expression of the normally inactive HERV-K18 Env protein, e.g., interacting with resting B cell
B cells, also known as B lymphocytes, are a type of the lymphocyte subtype. They function in the humoral immunity component of the adaptive immune system. B cells produce antibody molecules which may be either secreted or inserted into the plasm ...
s via the CD21
Complement receptor type 2 (CR2), also known as complement C3d receptor, Epstein–Barr virus receptor, and CD21 (cluster of differentiation 21), is a protein that in humans is encoded by the CR2 gene.
CR2 is involved in the complement system. I ...
receptor. Further studies revealed that the mechanism of transactivation depends on the expression of the major EBV late gene transactivator, EBNA-2. In-depth analysis completed the picture identifying the EBV latent membrane protein LMP-2A as a strong candidate for the HERV-K18 transactivation. HERV-K18 has also been reported to have superantigen activity (i.e., polyclonal T and B cell activation regardless of the specificity of their antigen receptor).
It has also been shown that in vitro binding of the EBV gp350 protein caused activation of MSRV''env'' and syncytin-1 in B-cells, monocytes, macrophages, and astrocytes - cells that are involved in pathogenesis of multiple sclerosis
Multiple sclerosis (MS) is an autoimmune disease resulting in damage to myelinthe insulating covers of nerve cellsin the brain and spinal cord. As a demyelinating disease, MS disrupts the nervous system's ability to Action potential, transmit ...
. Monocytes, especially after their differentiation into macrophages, appeared to be the most responsive to EBVgp350, expressing even higher levels of HERV-W''env'' than B cells. This finding is concordant with another study, which demonstrated that during infectious mononucleosis, EBV promoted the strongest activation of HERV-W/MSRV expression in monocytes compared to other blood cell types.
Immune response to HERVs
Despite having been integrated into genomes of vertebrates for millions of years, ERVs represent an intermediate stage between exogenous viruses and the host genome; it is suggested that immunological tolerance to HERV-derived proteins and peptides is imperfect due to the epigenetic silencing of HERV in the thymus and bone marrow, which prevents deletion of all HERV-specific T and B cells. As evidence of this, immunization of non-human primates with ERV-derived antigens mounted robust polyfunctional cytotoxic T cell
A cytotoxic T cell (also known as TC, cytotoxic T lymphocyte, CTL, T-killer cell, cytolytic T cell, CD8+ T-cell or killer T cell) is a T lymphocyte (a type of white blood cell) that kills cancer cells, cells that are infected by intracellular ...
response as well as high antibody titers. According to phylogenetic studies, among 30 HERV families existing in the human genome, HERV-K (HML-2) elements which integrated most recently are the most intact and biologically active forms. HERV-K ''env'' and HERV-H ''env'', considered to be a new class of tumor-associated antigens, have been found to promote strong cytotoxic T-cell responses in patients with various types of cancers.
On a level of the innate immune sensing of nucleic acids, single-stranded RNA (ssRNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
) and double-stranded RNA
(dsRNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself ( non-coding RNA) or by forming a template for the production of proteins ( messenger RNA). RNA and deoxy ...
) derived from endogenous retroviruses are recognized by pattern recognition receptor
Pattern recognition receptors (PRRs) play a crucial role in the proper function of the innate immune system. PRRs are germline-encoded host sensors, which detect molecules typical for the pathogens. They are proteins expressed mainly by cells of th ...
s (PRRs).
SsRNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
s can be sensed by Toll-like receptor
Toll-like receptors (TLRs) are a class of proteins that play a key role in the innate immune system. They are single-pass membrane protein, single-spanning receptor (biochemistry), receptors usually expressed on sentinel cells such as macrophages ...
s TLR-7 and TLR-8, resulting in secretion of IFN-α
The type-I interferons (IFN) are cytokines which play essential roles in inflammation, immunoregulation, tumor cells recognition, and T-cell responses. In the human genome, a cluster of thirteen functional IFN genes is located at the 9p21.3 cyt ...
by stimulated dendritic cell
A dendritic cell (DC) is an antigen-presenting cell (also known as an ''accessory cell'') of the mammalian immune system. A DC's main function is to process antigen material and present it on the cell surface to the T cells of the immune system ...
s (DCs) and macrophage
Macrophages (; abbreviated MPhi, φ, MΦ or MP) are a type of white blood cell of the innate immune system that engulf and digest pathogens, such as cancer cells, microbes, cellular debris and foreign substances, which do not have proteins that ...
s, which was observed for ssRNAs derived from HIV-1
The subtypes of HIV include two main subtypes, known as HIV type 1 (HIV-1) and HIV type 2 (HIV-2). These subtypes have distinct genetic differences and are associated with different epidemiological patterns and clinical characteristics.
HIV-1 e ...
.
DsRNAs might be one of the most immunogenic nucleic acid pathogen-associated molecular pattern
Pathogen-associated molecular patterns (PAMPs) are small molecular motifs conserved within a class of microbes, but not present in the host. They are recognized by toll-like receptors (TLRs) and other pattern recognition receptors (PRRs) in both p ...
s (PAMPs), since they are not found in cells in a normal state. HERV-derived dsRNA can be recognized by TLR-3, RIG-I
RIG-I (retinoic acid-inducible gene I) is a cytosolic pattern recognition receptor (PRR) that can mediate induction of a type-I interferon (IFN1) response. RIG-I is an essential molecule in the innate immune system for recognizing cells that ...
, and MDA5
MDA5 (melanoma differentiation-associated protein 5) is a RIG-I-like receptor dsRNA helicase enzyme that is encoded by the ''IFIH1'' gene in humans. MDA5 is part of the RIG-I-like receptor (RLR) family, which also includes RIG-I and LGP2, and f ...
; RIG-I and MDA5 are known to induce a type I IFN response.
When retrotranscribed into DNA, retroviruses can be sensed by cyclic guanosine monophosphate (GMP)-adenosine monophosphate (AMP) synthase-stimulator of interferon genes (cGAS-STING) pathway, leading to the activation of nuclear factor-kappa B
Nuclear may refer to:
Physics
Relating to the nucleus of the atom:
* Nuclear engineering
* Nuclear physics
* Nuclear power
*Nuclear reactor
*Nuclear weapon
* Nuclear medicine
*Radiation therapy
*Nuclear warfare
Mathematics
* Nuclear space
* Nu ...
(NF-kB) and IFN regulatory factor 3 (IRF3), which in turn trigger a type I IFN response. DsDNA could also be sensed by DNA-dependent activator of IFN-regulatory factors (DAI); DNA:RNA hybrids could be recognized by TLR-9
The recognition of nucleic acids through PRRs provides a very efficient strategy to fight against viral infections, at the same time exposing the host to a risk due to the possibility of recognizing self-nucleic acids and promotion of autoimmunity. Not surprisingly, HERVs have been found to be associated with different autoimmune and inflammatory diseases, such as multiple sclerosis
Multiple sclerosis (MS) is an autoimmune disease resulting in damage to myelinthe insulating covers of nerve cellsin the brain and spinal cord. As a demyelinating disease, MS disrupts the nervous system's ability to Action potential, transmit ...
, amyotrophic lateral sclerosis (ALS), systemic lupus erythematosus (SLE)
Lupus, formally called systemic lupus erythematosus (SLE), is an autoimmune disease in which the body's immune system mistakenly attacks healthy tissue in many parts of the body. Symptoms vary among people and may be mild to severe. Common ...
, rheumathoid arthritis (RA), and Sjögren syndrome (SS).
On a protein level, a direct interaction between TLRs and certain HERV proteins has been shown. For example, the surface unit of HERV-W Env (also known as multiple sclerosis-associated retroviral element (MSRV) env) was found to bind to TLR4
Toll-like receptor 4 (TLR4), also designated as CD284 (cluster of differentiation 284), is a key activator of the innate immune response and plays a central role in the fight against bacterial infections. TLR4 is a transmembrane protein of approx ...
and CD14
CD14 ( cluster of differentiation 14) is a human protein made mostly by macrophages as part of the innate immune system. It helps to detect bacteria in the body by binding lipopolysaccharide (LPS), a pathogen-associated molecular pattern (PAMP). ...
, stimulating the production of pro-inflammatory cytokines including IL-1β
Interleukin-1 beta (IL-1β) also known as leukocytic pyrogen, leukocytic endogenous mediator, mononuclear cell factor, lymphocyte activating factor and other names, is a cytokine protein that in humans is encoded by the ''IL1B'' gene."Catabolin" ...
, IL-6, and TNFα
Tumor necrosis factor (TNF), formerly known as TNF-α, is a chemical messenger produced by the immune system that induces inflammation. TNF is produced primarily by activated macrophages, and induces inflammation by binding to its receptors o ...
. HERV-W Env can trigger a maturation process in human dendritic cell
A dendritic cell (DC) is an antigen-presenting cell (also known as an ''accessory cell'') of the mammalian immune system. A DC's main function is to process antigen material and present it on the cell surface to the T cells of the immune system ...
s, endowing them with the capacity to support a Th1-like type of Th cell differentiation.
Immunological studies have shown some evidence for T cell
T cells (also known as T lymphocytes) are an important part of the immune system and play a central role in the adaptive immune response. T cells can be distinguished from other lymphocytes by the presence of a T-cell receptor (TCR) on their cell ...
immune responses against HERVs in HIV-infected individuals. The hypothesis that HIV induces HERV expression in HIV-infected cells led to the proposal that a vaccine targeting HERV antigens could specifically eliminate HIV-infected cells. The potential advantage of this novel approach is that, by using HERV antigens as surrogate markers of HIV-infected cells, it could circumvent the difficulty inherent in directly targeting notoriously diverse and fast-mutating HIV antigens.
Techniques for characterizing ERVs
Whole genome sequencing
Example: A PERV Chinese-born minipig isolate, PERV-A-BM, was sequenced completely and along with different breeds and cell lines in order to understand its genetic variation and evolution. The observed number of nucleotide substitutions and among the different genome sequences helped researchers determine an estimated age that PERV-A-BM was integrated into its host genome, which was found to be of an evolutionary age earlier than the European-born pigs isolates.
Chromatin immunoprecipitation with sequencing (ChIP-seq)
This technique is used to find histone marks indicative of promoters and enhancers, which are binding sites for DNA proteins, and repressed regions and trimethylation. DNA methylation has been shown to be vital to maintain silencing of ERVs in mouse somatic cells, while histone marks are vital for the same purpose in embryonic stem cells
Embryonic stem cells (ESCs) are Cell potency#Pluripotency, pluripotent stem cells derived from the inner cell mass of a blastocyst, an early-stage pre-Implantation (human embryo), implantation embryo. Human embryos reach the blastocyst stage 4� ...
(ESCs) and early embryogenesis.
Applications
Constructing phylogenies
Because most HERVs have no function, are selectively neutral, and are very abundant in primate genomes, they easily serve as phylogenetic markers for linkage analysis. They can be exploited by comparing the integration site polymorphisms or the evolving, proviral, nucleotide sequences of orthologs. To estimate when integration occurred, researchers used distances from each phylogenetic tree to find the rate of molecular evolution at each particular locus. It is also useful that ERVs are abundant within the genomes of many species (i.e., plants, insects, mollusks, fish, rodents, domestic pets, and livestock) and the application of this ERV approach can be used to answer a variety of phylogenetic questions.
Designating the age of provirus and the time points of species separation events
This is accomplished by comparing the different HERV from different evolutionary periods. For example, this study was done for different hominoids, which ranged from humans to apes and to monkeys. This is difficult to do with PERV because of the large diversity present.
Further research
Epigenetic variability
Researchers could analyze individual epigenomes and transcriptomes
The transcriptome is the set of all RNA transcripts, including coding and non-coding RNA, non-coding, in an individual or a population of cell (biology), cells. The term can also sometimes be used to refer to RNA#Types of RNA, all RNAs, or just Mes ...
to study the reactivation of dormant transposable elements through epigenetic release and their potential associations with human disease and exploring the specifics of gene regulatory networks.
Immunological problems of xenotransplantation
Little is known about an effective way to overcoming hyperacute rejection
Transplant rejection occurs when transplanted tissue is rejected by the recipient's immune system, which destroys the transplanted tissue. Transplant rejection can be lessened by determining the molecular similitude between donor and recipient ...
(HAR), which follows the activation of complement initiated by xenoreactive antibodies recognizing galactosyl-alpha1-3galatosyl (alpha-Gal) antigens on the donor epithelium.
Risk factors of HERVs in gene therapy
Because retroviruses are able to recombine with each other and with other endogenous DNA sequences, it would be beneficial for gene therapy to explore the potential risks HERVs can cause, if any. Also, this ability of HERVs to recombine can be manipulated for site-directed integration by including HERV sequences in retroviral vectors.
HERV gene expression
Researchers believe that RNA and proteins encoded for by HERV genes should continue to be explored for putative function in cell physiology and in pathological conditions. These studies would examine and more deeply define the biological significance of the proteins synthesized.
See also
*Avian sarcoma leukosis virus
Avian sarcoma leukosis virus (ASLV) is an endogenous retrovirus that infects and can lead to cancer in chickens; experimentally it can infect other species of birds and mammals. ASLV replicates in chicken embryo fibroblasts, the cells that contri ...
(ASLV)
*Endogenous viral element
An endogenous viral element (EVE) is a DNA sequence derived from a virus, and present within the germline of a non-viral organism. EVEs may be entire viral genomes ( proviruses), or fragments of viral genomes. They arise when a viral DNA sequence ...
*Endogeny (biology)
Endogeny, in biology, refers to the property of originating or developing from within an organism, Tissue (biology), tissue, or Cell (biology), cell.
For example, ''endogenous substances'', and ''endogenous processes'' are those that originate ...
*ERV3
HERV-R_7q21.2 provirus ancestral envelope (Env) polyprotein is a protein that in humans is encoded by the ''ERV3'' gene.
Function
The human genome includes many retroelements including the human endogenous retroviruses (HERVs), which compose a ...
* HERV-FRD
*Horizontal gene transfer
Horizontal gene transfer (HGT) or lateral gene transfer (LGT) is the movement of genetic material between organisms other than by the ("vertical") transmission of DNA from parent to offspring (reproduction). HGT is an important factor in the e ...
*Jaagsiekte sheep retrovirus
Jaagsiekte sheep retrovirus (JSRV) is a betaretrovirus which is the causative agent of a contagious lung cancer in sheep, called ovine pulmonary adenocarcinoma.
Natural history
JSRV is the virus that is the cause of the contagious lung tumors i ...
(JSRV)
*Koala retrovirus
Koala retrovirus (KoRV) is a retrovirus that is present in many populations of koalas. It has been implicated as the agent of koala immune deficiency syndrome (KIDS), an HIV/AIDS, AIDS-like immunodeficiency that leaves infected koalas more susce ...
(KoRV)
*Mouse mammary tumor virus
Mouse mammary tumor virus (MMTV) is a milk-transmitted retrovirus like the HTL viruses, HI viruses, and BLV. It belongs to the genus ''Betaretrovirus''. MMTV was formerly known as Bittner virus, and previously the "milk factor", referring to t ...
(MMTV)
*Murine leukemia virus
The murine leukemia viruses (MLVs or MuLVs) are retroviruses named for their ability to cause cancer in murine (mouse) hosts. Some MLVs may infect other vertebrates. MLVs include both exogenous and endogenous viruses. Replicating MLVs have a p ...
(MLV) and