A conjoined gene (CG) is defined as a
gene, which gives rise to transcripts by combining at least part of one exon from each of two or more distinct known (parent) genes which lie on the same chromosome, are in the same orientation, and often (95%) translate independently into different proteins. In some cases, the transcripts formed by CGs are translated to form chimeric or completely novel proteins.
Several alternative names are used to address conjoined genes, including combined gene and complex gene,
fusion gene,
fusion protein
Fusion proteins or chimeric (kī-ˈmir-ik) proteins (literally, made of parts from different sources) are proteins created through the joining of two or more genes that originally coded for separate proteins. Translation of this ''fusion gene'' r ...
, read-through transcript, co-transcribed genes, bridged genes, spanning genes, hybrid genes, locus-spanning transcripts, etc.
At present, 800 CGs have been identified in the entire human genome by different research groups across the world including Prakash et al., Akiva et al., Parra et al., Kim et al., and in the 1% of the human genome in the ENCODE pilot project. 36% of all these CGs could be validated experimentally using RT-PCR and sequencing techniques. However, only a very limited number of these CGs are found in the public human genome resources such as the
Entrez Gene database, the
UCSC Genome Browser and the
Vertebrate Genome Annotation (Vega) database. More than 70% of the human conjoined genes are found to be conserved across other vertebrate genomes with higher order vertebrates showing more conservation, including the closest human ancestor, chimpanzee. Formation of CGs is not only limited to the human genome but some CGs have also been identified in other eukaryotic genomes, including mouse and drosophila. There are a few web resources which include information about some CGs in addition to the other fusion genes, for example,
ChimerDB
ChimerDB in computational biology is a database of fusion sequences.
ChimerDB currently consists of three searchable datasets.
ChimerKBis a curated knowledge base of 1,066 fusion genes sourced from publicly available scientific literature.ChimerP ...
an
HYBRIDdb Another database
ConjoinG is a comprehensive resource dedicated only to the 800 Conjoined Genes identified in the entire human genome.
See also
*
Gene expression
Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. The ...
References
{{DEFAULTSORT:Conjoined Gene
Genes
Gene expression