HOME

TheInfoList



OR:

A topologically associating domain (TAD) is a self-interacting genomic region, meaning that
DNA sequence A nucleic acid sequence is a succession of bases within the nucleotides forming alleles within a DNA (using GACT) or RNA (GACU) molecule. This succession is denoted by a series of a set of five different letters that indicate the order of the nu ...
s within a TAD physically interact with each other more frequently than with sequences outside the TAD. The average size of a topologically associating domain (TAD) is 1000 kb in humans, 880 kb in mouse cells, and 140 kb in fruit flies. Boundaries at both side of these domains are conserved between different mammalian cell types and even across species and are highly enriched with CCCTC-binding factor (CTCF) and
cohesin Cohesin is a protein complex that mediates Establishment of sister chromatid cohesion, sister chromatid cohesion, homologous recombination, and Topologically associating domain, DNA looping. Cohesin is formed of SMC3, SMC1A, SMC1, RAD21, SCC1 an ...
. In addition, some types of genes (such as
transfer RNA Transfer ribonucleic acid (tRNA), formerly referred to as soluble ribonucleic acid (sRNA), is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes). In a cell, it provides the physical link between the gene ...
genes and
housekeeping genes In molecular biology, housekeeping genes are typically constitutive genes that are required for the maintenance of basic cellular function, and are gene expression, expressed in all cells of an organism under normal and patho-physiological condit ...
) appear near TAD boundaries more often than would be expected by chance. The functions of TADs are not fully understood and are still a matter of debate. Most of the studies indicate TADs regulate
gene expression Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
by limiting the enhancer- promoter interaction to each TAD; however, a recent study uncouples TAD organization and gene expression. Disruption of TAD boundaries are found to be associated with wide range of diseases such as
cancer Cancer is a group of diseases involving Cell growth#Disorders, abnormal cell growth with the potential to Invasion (cancer), invade or Metastasis, spread to other parts of the body. These contrast with benign tumors, which do not spread. Po ...
, variety of limb malformations such as
synpolydactyly Synpolydactyly (SPD) is a joint presentation of syndactyly (fusion of digits) and polydactyly (production of supernumerary digits).Goodman FR, Mundlos S, Muragaki Y, et al. Synpolydactyly phenotypes correlate with size of expansions in hoxd13 poly ...
, Cooks syndrome, and F-syndrome, and number of brain disorders like Hypoplastic corpus callosum and Adult-onset demyelinating leukodystrophy. Furthermore, studies have revealed that interactions between promoters and enhancers spanning single or multiple TADs, are fundamental to the exact dynamics of gene expression. The genomic elements underlying these interactions are named distal tethering elements (DTEs) and it has been shown that these elements are important for precise gene activation of Hox genes in early embryogenesis of '' D. melanogaster''. The mechanisms underlying TAD formation are also complex and not yet fully elucidated, though a number of
protein complex A protein complex or multiprotein complex is a group of two or more associated polypeptide chains. Protein complexes are distinct from multidomain enzymes, in which multiple active site, catalytic domains are found in a single polypeptide chain. ...
es and DNA elements are associated with TAD boundaries. However, the handcuff model and the loop extrusion model describe the TAD formation by the aid of CTCF and cohesin proteins. Furthermore, it has been proposed that the stiffness of TAD boundaries itself could cause the domain insulation and TAD formation.


Discovery and diversity

TADs are defined as regions whose DNA sequences preferentially contact each other. They were discovered in 2012 using
chromosome conformation capture Chromosome conformation capture techniques (often abbreviated to 3C technologies or 3C-based methods) are a set of molecular biology methods used to analyze the spatial organization of chromatin in a cell. These methods quantify the number of in ...
techniques including
Hi-C Hi-C is an American fruit juice-flavored drink made by the Minute Maid division of The Coca-Cola Company. It was created by Niles Foster in 1946 and released in 1947. The sole original flavor was orange, with additional flavours introduced in ...
. They have been shown to be present in multiple species, including fruit flies (
Drosophila ''Drosophila'' (), from Ancient Greek δρόσος (''drósos''), meaning "dew", and φίλος (''phílos''), meaning "loving", is a genus of fly, belonging to the family Drosophilidae, whose members are often called "small fruit flies" or p ...
),
mouse A mouse (: mice) is a small rodent. Characteristically, mice are known to have a pointed snout, small rounded ears, a body-length scaly tail, and a high breeding rate. The best known mouse species is the common house mouse (''Mus musculus'' ...
, plants, fungi and
human Humans (''Homo sapiens'') or modern humans are the most common and widespread species of primate, and the last surviving species of the genus ''Homo''. They are Hominidae, great apes characterized by their Prehistory of nakedness and clothing ...
genomes. In bacteria, they are referred to as Chromosomal Interacting Domains (CIDs).


Analytical tools and databases

TAD locations are defined by applying an algorithm to Hi-C data. For example, TADs are often called according to the so-called "directionality index". The directionality index is calculated for individual 40kb bins, by collecting the reads that fall in the bin, and observing whether their paired reads map upstream or downstream of the bin (read pairs are required to span no more than 2Mb). A positive value indicates that more read pairs lie downstream than upstream, and a negative value indicates the reverse. Mathematically, the directionality index is a signed chi-square statistic. The development of specialized genome browsers and visualization tools such as Juicebox, HiGlass/HiPiler, The 3D Genome Browser, 3DIV, 3D-GNOME, and TADKB have enabled us to visualize the TAD organization of regions of interest in different cell types.


Mechanisms of formation

A number of proteins are known to be associated with TAD formation including the protein
CTCF Transcriptional repressor CTCF also known as 11-zinc finger protein or CCCTC-binding factor is a transcription factor that in humans is encoded by the ''CTCF'' gene. CTCF is involved in many cellular processes, including transcriptional regulati ...
and the protein complex
cohesin Cohesin is a protein complex that mediates Establishment of sister chromatid cohesion, sister chromatid cohesion, homologous recombination, and Topologically associating domain, DNA looping. Cohesin is formed of SMC3, SMC1A, SMC1, RAD21, SCC1 an ...
. It is also unknown what components are required at TAD boundaries; however, in mammalian cells, it has been shown that these boundary regions have comparatively high levels of CTCF binding. In addition, some types of genes (such as
transfer RNA Transfer ribonucleic acid (tRNA), formerly referred to as soluble ribonucleic acid (sRNA), is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes). In a cell, it provides the physical link between the gene ...
genes and
housekeeping genes In molecular biology, housekeeping genes are typically constitutive genes that are required for the maintenance of basic cellular function, and are gene expression, expressed in all cells of an organism under normal and patho-physiological condit ...
) appear near TAD boundaries more often than would be expected by chance. Computer simulations have shown that chromatin loop extrusion driven by cohesin motors can generate TADs. In the loop extrusion model, cohesin binds chromatin, pulls it in, and extrudes chromatin to progressively grow a loop. Chromatin on both sides of the cohesin complex is extruded until cohesin encounters a chromatin-bound CTCF protein, typically located at the boundary of a TAD. In this way, TAD boundaries can be brought together as the anchors of a chromatin loop. Indeed, in vitro, cohesin has been observed to processively extrude DNA loops in an ATP-dependent manner and stall at CTCF. However, some in vitro data indicates that the observed loops may be artifacts. Importantly, since cohesins can dynamically unbind from chromatin, this model suggests that TADs (and associated chromatin loops) are dynamic, transient structures, in agreement with in vivo observations. Other mechanisms for TAD formation have been suggested. For example, some simulations suggest that transcription-generated supercoiling can relocalize cohesin to TAD boundaries or that passively diffusing cohesin “slip links” can generate TADs.


Properties


Conservation

TADs have been reported to be relatively constant between different cell types (in stem cells and blood cells, for example), and even between species in specific cases. Comparative TAD analysis between ''Drosophila melanogaster'' and ''
Drosophila subobscura ''Drosophila subobscura'' is a species of fruit fly in the family Drosophilidae. Originally found around the Mediterranean, it has spread to most of Europe and the Near East. It has been introduced into the west coasts of Canada, the United Stat ...
'', with a divergence time of approximately 49 million years, has revealed a conservation in range of 30-40%.


Relationship with promoter-enhancer contacts

The majority of observed interactions between promoters and enhancers do not cross TAD boundaries. Removing a TAD boundary (for example, using
CRISPR CRISPR (; acronym of clustered regularly interspaced short palindromic repeats) is a family of DNA sequences found in the genomes of prokaryotic organisms such as bacteria and archaea. Each sequence within an individual prokaryotic CRISPR is d ...
to delete the relevant region of the genome) can allow new promoter-enhancer contacts to form. This can affect
gene expression Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
nearby - such misregulation has been shown to cause limb malformations (e.g.
polydactyly Polydactyly is a birth defect that results in extra fingers or toes. The hands are more commonly involved than the feet. Extra fingers may be painful, affect self-esteem, or result in clumsiness. It is associated with at least 39 genetic mut ...
) in humans and mice. Computer simulations have shown that transcription-induced supercoiling of chromatin fibres can explain how TADs are formed and how they can assure very efficient interactions between enhancers and their cognate promoters located in the same TAD.


Relationship with other structural features of the genome

Replication timing domains have been shown to be associated with TADs as their boundary is co localized with the boundaries of TADs that are located at either sides of compartments.
Insulated neighborhood In mammalian biology, insulated neighborhoods are chromosomal loop structures formed by the physical interaction of two DNA loci bound by the transcription factor CTCF and co-occupied by cohesin. Insulated neighborhoods are thought to be structur ...
s, DNA loops formed by CTCF/cohesin-bound regions, are proposed to functionally underlie TADs. Genome rearrangement breakpoint have shown to be enriched at the TAD boundaries in ''D. melanogaster''.


Role in disease

Disruption of TAD boundaries can affect the expression of nearby genes, and this can cause disease. For example, genomic structural variants that disrupt TAD boundaries have been reported to cause developmental disorders such as human limb malformations. Additionally, several studies have provided evidence that the disruption or rearrangement of TAD boundaries can provide growth advantages to certain cancers, such as T-cell acute lymphoblastic leukemia (T-ALL), gliomas, and lung cancer.


Lamina-associated domains

Lamina-associated domains (LADs) are parts of the chromatin that heavily interact with the lamina, a network-like structure at the inner membrane of the nucleus. LADs consist mostly of transcriptionally silent chromatin, being enriched with trimethylated Lys27 on
histone H3 Histone H3 is one of the five main histones involved in the structure of chromatin in eukaryotic cells. Featuring a main globular domain and a long N-terminal end, N-terminal tail, H3 is involved with the structure of the nucleosomes of the 'b ...
, (i.e.
H3K27me3 H3K27me3 is an epigenetic modification to the DNA packaging protein histone H3. It is a mark that indicates the tri-methylation of lysine 27 on histone H3 protein. This tri-methylation is associated with the Downregulation and upregulation, down ...
); which is a common
posttranslational In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translate mRNA int ...
histone In biology, histones are highly basic proteins abundant in lysine and arginine residues that are found in eukaryotic cell nuclei and in most Archaeal phyla. They act as spools around which DNA winds to create structural units called nucleosomes ...
modification of
heterochromatin Heterochromatin is a tightly packed form of DNA or '' condensed DNA'', which comes in multiple varieties. These varieties lie on a continuum between the two extremes of constitutive heterochromatin and facultative heterochromatin. Both play a rol ...
. LADs have CTCF-binding sites at their periphery.


See also

*
Insulated neighborhood In mammalian biology, insulated neighborhoods are chromosomal loop structures formed by the physical interaction of two DNA loci bound by the transcription factor CTCF and co-occupied by cohesin. Insulated neighborhoods are thought to be structur ...


References

{{reflist


External links


JuiceboxHiGlassHiPiler3DIV3D-GNOMETADKB
Gene expression Nuclear organization