hCONDELs
   HOME

TheInfoList



OR:

hCONDELs refer to regions of deletions within the human genome containing sequences that are highly conserved among closely related relatives. Almost all of these deletions fall within regions that perform non-coding functions. These represent a new class of
regulatory sequences Regulation is the management of complex systems according to a set of rules and trends. In systems theory, these types of rules exist in various fields of biology and society, but the term has slightly different meanings according to context. F ...
and may have played an important role in the development of specific traits and behavior that distinguish closely related organisms from each other.


Nomenclature

The group of CONDELs of a specific organism is specified by prefixing the CONDELs with the first letter of the organism. For instance, hCONDELs refer to the group of CONDELs found in humans whereas mCONDELs and cCONDELs refer to mouse and chimpanzee CONDELs respectively.


Identification of CONDELs

The term hCONDEL was first used in the 2011 ''
Nature Nature is an inherent character or constitution, particularly of the Ecosphere (planetary), ecosphere or the universe as a whole. In this general sense nature refers to the Scientific law, laws, elements and phenomenon, phenomena of the physic ...
'' article by McLean et al. in whole-genome comparison analysis. This involved firstly identifying a subset of 37,251 human deletions (hDELs) through pairwise comparisons of
chimpanzee The chimpanzee (; ''Pan troglodytes''), also simply known as the chimp, is a species of Hominidae, great ape native to the forests and savannahs of tropical Africa. It has four confirmed subspecies and a fifth proposed one. When its close rel ...
and
macaque The macaques () constitute a genus (''Macaca'') of gregarious Old World monkeys of the subfamily Cercopithecinae. The 23 species of macaques inhabit ranges throughout Asia, North Africa, and Europe (in Gibraltar). Macaques are principally f ...
genomes. Chimpanzee sequences highly conserved in other species were then identified by pairwise alignment of chimpanzee with macaque, mouse and chicken sequences with BLASTZ followed by multiple alignment of the pairwise alignments done with MULTIZ. The highly conserved chimpanzee sequences were searched against the human genome using BLAT to identify conserved regions not present in humans. This identified 583 regions of deletions that were then referred to as hCONDELs. 510 of these identified hCONDELs were then validated computationally with 39 of these being validated by
polymerase chain reaction The polymerase chain reaction (PCR) is a method widely used to make millions to billions of copies of a specific DNA sample rapidly, allowing scientists to amplify a very small sample of DNA (or a part of it) sufficiently to enable detailed st ...
(PCR).


Characteristics

hCONDELs in humans cover approximately 0.14% of chimpanzee genome. The number of hCONDELs currently identified is 583 using the genome-wide comparison method; however, validation of these predicated regions of deletions through polymerase chain reaction methods produces 510 hCONDELs. The remainder of these hCONDELs are either false-positives or non-existent genes. hCONDELs have been confirmed through PCR with 88 percent of these shown to have been lost from the draft
Neanderthal Neanderthals ( ; ''Homo neanderthalensis'' or sometimes ''H. sapiens neanderthalensis'') are an extinction, extinct group of archaic humans who inhabited Europe and Western and Central Asia during the Middle Pleistocene, Middle to Late Plei ...
genome. hCONDELs, on average, remove about 95 base pairs (bp) of highly conserved sequences from the human genome. The median size of these 510 validated CONDELs is about 2,804 bp, thus showing a diverse range in length of the characteristic deletions. Another noticeable characteristic of hCONDELs (and other groups of identified CONDELs such as those from mouse and chimpanzee) is that they tend to be specifically skewed towards GC poor regions. Simulations show that hCONDELs are enriched near genes involved in hormone receptor signaling and neural function, and near genes encoding fibronectin-type-III-or
CD80 The Cluster of differentiation 80 (also CD80 and B7-1) is a B7, type I membrane protein in the immunoglobulin superfamily, with an extracellular immunoglobulin constant-like domain and a variable-like domain required for receptor binding. It is c ...
-like
immunoglobulin An antibody (Ab) or immunoglobulin (Ig) is a large, Y-shaped protein belonging to the immunoglobulin superfamily which is used by the immune system to identify and neutralize antigens such as pathogenic bacteria, bacteria and viruses, includin ...
C2-set domains.


Impact in humans


Sialic acid loss

Of the 510 identified hCONDELs, only one of these deletions has been shown to remove a 92 bp sequence that is part of a protein-coding region in the human sequence. The deletion that affects the protein
coding region The coding region of a gene, also known as the coding DNA sequence (CDS), is the portion of a gene's DNA or RNA that codes for a protein. Studying the length, composition, regulation, splicing, structures, and functions of coding regions compared ...
in humans results in a
frameshift mutation A frameshift mutation (also called a framing error or a reading frame shift) is a genetic mutation caused by indels ( insertions or deletions) of a number of nucleotides in a DNA sequence that is not divisible by three. Due to the triplet natur ...
in the CMAH gene which codes for the cytidine monophosphate-N-acetylneurminic acid hydroxylase-like protein, an enzyme involved in the production of N-glycolylneuraminic acid, one type of
sialic acid Sialic acids are a class of alpha-keto acid sugars with a nine-carbon backbone. The term "sialic acid" () was first introduced by Swedish biochemist Gunnar Blix in 1952. The most common member of this group is ''N''-acetylneuraminic acid ...
. Sialic acid is known to play a crucial part in cell signaling pathways and interaction processes. The loss of this gene is evident in the undetectable levels of sialic acid in humans but highly present in mouse, pig, chimpanzee and other mammal tissues and may provide more insight into the historic background of human evolution. The mechanisms and time of occurrence of hCONDELs are not entirely understood but given that conserved non-coding sequences play a major developmental role through regulation of genes, their loss in regions of deletions, it is expected that their loss in hCONDELs will result in developmental consequences that can be observed in human-specific traits.
In situ hybridization ''In situ'' hybridization (ISH) is a type of Hybridisation (molecular biology), hybridization that uses a labeled complementary DNA, RNA or modified nucleic acid strand (i.e., a Hybridization probe, probe) to localize a specific DNA or RNA seq ...
experiments done by Mclean et al. by fusion of mouse constructs fused to basal promoter with LacZ expression for hCONDELs near the
androgen receptor The androgen receptor (AR), also known as NR3C4 (nuclear receptor subfamily 3, group C, member 4), is a type of nuclear receptor that is activated by binding any of the androgenic hormones, including testosterone and dihydrotestosterone, in th ...
(AR) locus and the growth arrest and DNA-damage-inducible protein GADD45 gamma ( GADD45G) locus suggest a role in deletions that affect regulatory sequences in humans.


Loss of whiskers and penile spine

An hCONDEL located near the locus of the
androgen receptor The androgen receptor (AR), also known as NR3C4 (nuclear receptor subfamily 3, group C, member 4), is a type of nuclear receptor that is activated by binding any of the androgenic hormones, including testosterone and dihydrotestosterone, in th ...
(AR) gene may be responsible for the loss of whiskers and
penile spines Many mammalian species have developed keratinized penile spines along the glans or shaft, which may be involved in sexual selection. These spines have been described as being simple, single-pointed structures (macaques) or complex with two or t ...
in humans compared to its close relatives, including chimpanzees. The 60.7kb hCONDEL which is located near the AR locus has been found to be responsible for removing a 5 kb sequence that codes for an enhancer for the AR locus. Using the mouse construct with LacZ expression showed localization of this hCONDEL region (AR enhancer) to the
mesenchyme Mesenchyme () is a type of loosely organized animal embryonic connective tissue of undifferentiated cells that give rise to most tissues, such as skin, blood, or bone. The interactions between mesenchyme and epithelium help to form nearly ever ...
of vibrissae follicles and the
mesoderm The mesoderm is the middle layer of the three germ layers that develops during gastrulation in the very early development of the embryo of most animals. The outer layer is the ectoderm, and the inner layer is the endoderm.Langman's Medical ...
cells of penile organs.


Expansion of brain size

Many hCONDELs are located around genes expressed during cortical
neurogenesis Neurogenesis is the process by which nervous system cells, the neurons, are produced by neural stem cells (NSCs). This occurs in all species of animals except the porifera (sponges) and placozoans. Types of NSCs include neuroepithelial cells ( ...
. A 3,181 bp hCONDEL which is located near the GADD45G gene removes a forebrain-specific p300 enhancer binding site. The removal of this region, known to function as a suppressor, specifically increases the proliferation of the subventricular zone (SVZ) of the septum. The loss of this SVZ enhancer region in an hCONDEL may provide further insights into the role of DNA sequence changes that may have resulted in evolution of the human brain and may provide a better understanding of the evolution of humans.


References

{{DISPLAYTITLE:hCONDELs Human genetics Human evolution