A fusion gene is a hybrid gene formed from two previously independent genes. It can occur as a result of
translocation
Translocation may refer to:
* Chromosomal translocation, a chromosome abnormality caused by rearrangement of parts
** Robertsonian translocation, a chromosomal rearrangement in pairs 13, 14, 15, 21, and 22
** Nonreciprocal translocation, transfer ...
,
interstitial deletion, or
chromosomal inversion. Fusion genes have been found to be prevalent in all main types of human
neoplasia. The identification of these fusion genes play a prominent role in being a diagnostic and prognostic marker.
History
The first fusion gene was described in cancer cells in the early 1980s. The finding was based on the discovery in 1960 by
Peter Nowell
Peter Carey Nowell (February 8, 1928 – December 26, 2016) was a cancer researcher and co-discoverer of the Philadelphia chromosome. At the time of his death, he was the Gaylord P. and Mary Louise Harnwell Emeritus Professor of Pathology and Labo ...
and
David Hungerford in Philadelphia of a small abnormal marker
chromosome
A chromosome is a long DNA molecule with part or all of the genetic material of an organism. In most chromosomes the very long thin DNA fibers are coated with packaging proteins; in eukaryotic cells the most important of these proteins ar ...
in patients with
chronic myeloid leukemia—the first consistent chromosome abnormality detected in a human malignancy, later designated the
Philadelphia chromosome
The Philadelphia chromosome or Philadelphia translocation (Ph) is a specific genetic abnormality in Chromosome 22 (human), chromosome 22 of Leukemia, leukemia cancer cells (particularly chronic myeloid leukemia (CML) cells). This chromosome is defe ...
. In 1973,
Janet Rowley in Chicago showed that the Philadelphia chromosome had originated through a translocation between chromosomes
9 and
22, and not through a simple deletion of chromosome 22 as was previously thought. Several investigators in the early 1980s showed that the Philadelphia
chromosome translocation led to the formation of a new BCR::ABL1 fusion gene, composed of the 3' part of the ABL1 gene in the breakpoint on chromosome 9 and the 5' part of a gene called BCR in the breakpoint in chromosome 22. In 1985 it was clearly established that the fusion gene on chromosome 22 produced an abnormal
chimeric BCR::ABL1 protein with the capacity to induce chronic myeloid leukemia.
Oncogenes
It has been known for 30 years that the corresponding gene fusion plays an important role in tumorigenesis. Fusion genes can contribute to tumor formation because fusion genes can produce much more active abnormal protein than non-fusion genes. Often, fusion genes are
oncogenes that cause
cancer
Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal bl ...
; these include
BCR-ABL, TEL-AML1 (
ALL with t(12 ; 21)), AML1-ETO (
M2 AML with t(8 ; 21)), and
TMPRSS2-
ERG with an interstitial deletion on
chromosome 21, often occurring in prostate cancer.
In the case of TMPRSS2-ERG, by disrupting androgen receptor (AR) signaling and inhibiting AR expression by oncogenic ETS transcription factor, the fusion product regulates the prostate cancer.
Most fusion genes are found from
hematological cancer
Tumors of the hematopoietic and lymphoid tissues (American English) or tumours of the haematopoietic and lymphoid tissues (British English) are tumors that affect the blood, bone marrow, lymph, and lymphatic system. Because these tissues are all ...
s,
sarcoma
A sarcoma is a malignant tumor, a type of cancer that arises from transformed cells of mesenchymal (connective tissue) origin. Connective tissue is a broad term that includes bone, cartilage, fat, vascular, or hematopoietic tissues, and sa ...
s, and
prostate cancer
Prostate cancer is cancer of the prostate. Prostate cancer is the second most common cancerous tumor worldwide and is the fifth leading cause of cancer-related mortality among men. The prostate is a gland in the male reproductive system that su ...
.
BCAM-AKT2 is a fusion gene that is specific and unique to high-grade
serous ovarian cancer.
Oncogenic fusion genes may lead to a gene product with a new or different function from the two fusion partners. Alternatively, a proto-oncogene is fused to a strong
promoter, and thereby the oncogenic function is set to function by an
upregulation
In the biological context of organisms' production of gene products, downregulation is the process by which a cell decreases the quantity of a cellular component, such as RNA or protein, in response to an external stimulus. The complementary pr ...
caused by the strong promoter of the upstream fusion partner. The latter is common in
lymphoma
Lymphoma is a group of blood and lymph tumors that develop from lymphocytes (a type of white blood cell). In current usage the name usually refers to just the cancerous versions rather than all such tumours. Signs and symptoms may include enl ...
s, where oncogenes are juxtaposed to the promoters of the
immunoglobulin
An antibody (Ab), also known as an immunoglobulin (Ig), is a large, Y-shaped protein used by the immune system to identify and neutralize foreign objects such as pathogenic bacteria and viruses. The antibody recognizes a unique molecule of the ...
genes. Oncogenic
fusion transcripts may also be caused by
trans-splicing or
read-through events.
Since chromosomal translocations play such a significant role in neoplasia, a specialized database of chromosomal aberrations and gene fusions in cancer has been created. This database is called Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer.
Diagnostics
Presence of certain chromosomal aberrations and their resulting fusion genes is commonly used within cancer diagnostics in order to set a precise diagnosis. Chromosome
banding analysis,
fluorescence ''in situ'' hybridization (FISH), and
reverse transcription polymerase chain reaction (RT-PCR) are common methods employed at diagnostic laboratories. These methods all have their distinct shortcomings due to the very complex nature of cancer
genome
In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ...
s. Recent developments such as
high-throughput sequencing and custom
DNA microarray
A DNA microarray (also commonly known as DNA chip or biochip) is a collection of microscopic DNA spots attached to a solid surface. Scientists use DNA microarrays to measure the expression levels of large numbers of genes simultaneously or to ...
s bear promise of introduction of more efficient methods.
Evolution
Gene fusion plays a key role in the evolution of gene architecture. We can observe its effect if gene fusion occurs in coding sequences. Duplication, sequence divergence, and recombination are the major contributors at work in gene evolution. These events can probably produce new genes from already existing parts. When gene fusion happens in non-coding sequence region, it can lead to the misregulation of the expression of a gene now under the control of the
cis-regulatory
''Cis''-regulatory elements (CREs) or ''Cis''-regulatory modules (CRMs) are regions of non-coding DNA which regulate the transcription of neighboring genes. CREs are vital components of genetic regulatory networks, which in turn control morphoge ...
sequence of another gene. If it happens in coding sequences, gene fusion cause the assembly of a new gene, then it allows the appearance of new functions by adding peptide modules into multi domain protein. The detecting methods to inventory gene fusion events on a large biological scale can provide insights about the multi modular architecture of proteins.
Purine biosynthesis
The
purine
Purine is a heterocyclic aromatic organic compound that consists of two rings ( pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which include substituted purin ...
s
adenine and
guanine
Guanine () (symbol G or Gua) is one of the four main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside is c ...
are two of the four information encoding bases of the universal
genetic code
The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links ...
. Biosynthesis of these purines occurs by similar, but not identical, pathways in different species of the three domains of life, the
Archaea,
Bacteria
Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were am ...
and
Eukaryote
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bact ...
s. A major distinctive feature of the purine biosynthetic pathways in Bacteria is the prevalence of gene fusions where two or more purine biosynthetic enzymes are encoded by a single gene. Such gene fusions are almost exclusively between genes that encode enzymes that perform sequential steps in the biosynthetic pathway. Eukaryotic species generally exhibit the most common gene fusions seen in the Bacteria, but in addition have new fusions that potentially increase metabolic flux.
Detection
In recent years, next generation sequencing technology has already become available to screen known and novel gene fusion events on a genome wide scale. However, the precondition for large scale detection is a paired-end sequencing of the cell's
transcriptome. The direction of fusion gene detection is mainly towards data analysis and visualization. Some researchers already developed a new tool called Transcriptome Viewer (TViewer) to directly visualize detected gene fusions on the transcript level.
Research applications
Biologists may also deliberately create fusion genes for research purposes. The fusion of
reporter gene
In molecular biology, a reporter gene (often simply reporter) is a gene that researchers attach to a regulatory sequence of another gene of interest in bacteria, cell culture, animals or plants. Such genes are called reporters because the char ...
s to the regulatory elements of genes of interest allows researches to study gene expression. Reporter gene fusions can be used to measure activity levels of gene regulators, identify the regulatory sites of genes (including the signals required), identify various genes that are regulated in response to the same stimulus, and artificially control the expression of desired genes in particular cells. For example, by creating a fusion gene of a protein of interest and
green fluorescent protein
The green fluorescent protein (GFP) is a protein that exhibits bright green fluorescence when exposed to light in the blue to ultraviolet range. The label ''GFP'' traditionally refers to the protein first isolated from the jellyfish '' Aeq ...
, the protein of interest may be observed in
cells or
tissue using fluorescence
microscopy
Microscopy is the technical field of using microscopes to view objects and areas of objects that cannot be seen with the naked eye (objects that are not within the resolution range of the normal eye). There are three well-known branches of mi ...
.
The protein synthesized when a fusion gene is
expressed is called a ''
fusion protein''.
See also
*
De novo gene birth
*
ETV6-NTRK3 gene fusion
*
Exon shuffling
*
Gene duplication
Gene duplication (or chromosomal duplication or gene amplification) is a major mechanism through which new genetic material is generated during molecular evolution. It can be defined as any duplication of a region of DNA that contains a gene ...
*
Horizontal gene transfer
Horizontal gene transfer (HGT) or lateral gene transfer (LGT) is the movement of genetic material between unicellular and/or multicellular organisms other than by the ("vertical") transmission of DNA from parent to offspring ( reproduction). ...
*
List of RNA-Seq bioinformatics tools
*
Mobile genetic elements
Mobile genetic elements (MGEs) sometimes called selfish genetic elements are a type of genetic material that can move around within a genome, or that can be transferred from one species or replicon to another. MGEs are found in all organisms. In ...
References
External links
ChiTaRS 5.0 The Improved Database of Chimeric Ttanscripts and RNA-seq Data.
ChiPPI The Server Protein-Protein Interaction of Chimeric Proteins.
ChimerDB 2.0 a knowledgebase for fusion genes updated.
dbCRID a new, comprehensive database of human CR events and associated diseases (both tumor and non-tumor) with detailed documentation of the CR events.
Mitelman Database: a database relates chromosomal aberrations to tumor characteristics, based either on individual cases or associations.
{{Genetic recombination
Mutation
Modification of genetic information