HOME

TheInfoList



OR:

Transactive response DNA binding protein 43  kDa (TAR DNA-binding protein 43 or TDP-43) is a
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
that in humans is encoded by the ''TARDBP''
gene In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protei ...
.


Structure

TDP-43 is 414
amino acid residues Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid m ...
long. It consists of four domains: an N-terminal domain spanning residues 1–76 (NTD) with a well-defined fold that has been shown to form a dimer or
oligomer In chemistry and biochemistry, an oligomer () is a molecule that consists of a few repeating units which could be derived, actually or conceptually, from smaller molecules, monomers.Quote: ''Oligomer molecule: A molecule of intermediate relativ ...
; two highly conserved folded RNA recognition motifs spanning residues 106–176 (RRM1) and 191–259 (RRM2), respectively, required to bind target
RNA Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
and
DNA Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
; an unstructured C-terminal domain encompassing residues 274–414 (CTD), which contains a
glycine Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid. Glycine is one of the proteinogenic amino acids. It is encoded by all the codons starting with GG (G ...
-rich region, is involved in protein-protein interactions, and harbors most of the
mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, ...
s associated with familial
amyotrophic lateral sclerosis Amyotrophic lateral sclerosis (ALS), also known as motor neuron disease (MND) or—in the United States—Lou Gehrig's disease (LGD), is a rare, Terminal illness, terminal neurodegenerative disease, neurodegenerative disorder that results i ...
. The entire protein devoid of large solubilising tags has been purified. The full-length protein is a dimer. The dimer is formed due to a self-interaction between two NTD domains, where the dimerisation can be propagated to form higher-order oligomers. The protein sequence also has a
nuclear localization signal A nuclear localization signal ''or'' sequence (NLS) is an amino acid sequence that 'tags' a protein for import into the cell nucleus by nuclear transport. Typically, this signal consists of one or more short sequences of positively charged lysin ...
(NLS, residues 82–98), a former nuclear export signal (NES residues 239–250) and 3 putative caspase-3 cleavage sites (residues 13, 89, 219). In December 2021 the structure of TDP-43 was resolved with
cryo-EM Cryogenic electron microscopy (cryo-EM) is a transmission electron microscopy technique applied to samples cooled to cryogenic temperatures. For biological specimens, the structure is preserved by embedding in an environment of vitreous ice. An ...
but shortly after it was argued that in the context of FTLD-TDP the protein involved could be TMEM106B (which has been also resolved with cryo-EM), rather than of TDP-43.


N-Terminal domain (NTD)

The NTD located between residues 1 and 76 is involved in TDP-43
polymerization In polymer chemistry, polymerization (American English), or polymerisation (British English), is a process of reacting monomer molecules together in a chemical reaction to form polymer chains or three-dimensional networks. There are many fo ...
. Indeed, dimers are formed by head-to-head interactions between NTDs, and the polymer thus obtained allows for
pre-mRNA A primary transcript is the single-stranded ribonucleic acid (RNA) product synthesized by Transcription (genetics), transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcript ...
splicing. However, further
oligomer In chemistry and biochemistry, an oligomer () is a molecule that consists of a few repeating units which could be derived, actually or conceptually, from smaller molecules, monomers.Quote: ''Oligomer molecule: A molecule of intermediate relativ ...
ization brings to more toxic accumulates. This process of polymerization into dimers, larger forms or just stabilizing monomers is dependent on TDP-43 conformational equilibrium between monomers, homodimers and oligomers. Hence, in TDP-43 diseased cells, TDP-43's over-expression leads to the NTD showing high propensity to aggregate. Contrary to this, in normal cells, normal levels of TDP-43 allow for folded NTD, preventing aggregates and polymers formation. More recently, this domain was found to have a
ubiquitin Ubiquitin is a small (8.6  kDa) regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ''ubiquitously''. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 19 ...
-like structure. It bears 27,6% of homology with Ubiquitin-1 and a β1-β2- α1-β3-β4-β5-β6 + 2* SO42- form. Ubiquitin-like domain are usually associated with a greater affinity for
RNA Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
/
DNA Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
. However, in the unique case of TDP-43, the Ubiquitin-like NTD binds directly to ssDNA. This interaction permits the conformational equilibrium cited higher to shift towards non-aggregated forms. The domain spanning from ,80has a
solenoid upright=1.20, An illustration of a solenoid upright=1.20, Magnetic field created by a seven-loop solenoid (cross-sectional view) described using field lines A solenoid () is a type of electromagnet formed by a helix, helical coil of wire whos ...
-like structure which sterically impedes interactions between aggregation prone C-term regions. All of this raises the possibility that NTD and the RNA recognition motifs (later on defined) could cooperatively interact with nucleic acids to accomplish TDP-43's physiological functions.


Mitochondrial localization signal

There are six
mitochondrial A mitochondrion () is an organelle found in the cells of most eukaryotes, such as animals, plants and fungi. Mitochondria have a double membrane structure and use aerobic respiration to generate adenosine triphosphate (ATP), which is used ...
localization signals to be accounted on TDP-43's
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
sequence, although only M1, M3, and M5 were shown to be essential for mitochondrial localization. Indeed, their ablation leads to a lessened mitochondrial localization. These localizing sequences are found on the following amino acids: M1: 5, 41 M2: 05, 112 M3: 46-150 M4: 28, 235 M5: 94, 300 M6: 28, 236


Nuclear localization signal (NLS)

The
nuclear Nuclear may refer to: Physics Relating to the nucleus of the atom: *Nuclear engineering *Nuclear physics *Nuclear power *Nuclear reactor *Nuclear weapon *Nuclear medicine *Radiation therapy *Nuclear warfare Mathematics * Nuclear space *Nuclear ...
localization signal (NLS) domain is located between residues 82 and 98 is of critical importance in
ALS Amyotrophic lateral sclerosis (ALS), also known as motor neuron disease (MND) or—in the United States—Lou Gehrig's disease (LGD), is a rare, terminal neurodegenerative disorder that results in the progressive loss of both upper and low ...
, and such is witnessed by the depletion or the mutations (notably A90V) of this domain, which cause loss-of-function from nucleus and promote aggregating, two processes very likely to conduct to TDP-43's toxic gain of function. It is thereby of the utmost importance to note that TDP-43's nuclear localization is absolutely critical for it to fulfill its physiological functions.


RNA recognition motif

The RNA recognition motif ranges between residues 105 and 181, much like many hnRNPs, TDP-43's RRMs encompass highly conserved motifs of primary importance for fulfilling their function. Both RRMs follow this pattern: β1-α1-β2-β3-α2-β4-β5, which allows them to bind to both
RNA Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
and
DNA Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
onto U G/ T G-repeats of 3'UTR (Untranslated Terminal Regions) end of
mRNA In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of Protein biosynthesis, synthesizing a protein. mRNA is ...
/DNA. These sequences mainly ensure mRNA processing, RNA export and RNA stabilizing. It is notably thanks to these sequences that TDP-43 importantly binds to its own mRNA regulates its very own
solubility In chemistry, solubility is the ability of a chemical substance, substance, the solute, to form a solution (chemistry), solution with another substance, the solvent. Insolubility is the opposite property, the inability of the solute to form su ...
and
polymerization In polymer chemistry, polymerization (American English), or polymerisation (British English), is a process of reacting monomer molecules together in a chemical reaction to form polymer chains or three-dimensional networks. There are many fo ...
.


RRM2

RRM2 spans between residues 181 and 261. In pathological conditions, it notably binds to p65/NF-kB, an
apoptosis Apoptosis (from ) is a form of programmed cell death that occurs in multicellular organisms and in some eukaryotic, single-celled microorganisms such as yeast. Biochemistry, Biochemical events lead to characteristic cell changes (Morphology (biol ...
implicated factor, and is thus a potential therapeutic target. Moreover it can be burdened with a mutation, D169G, altering a key cleaving site for regulating formation of toxic inclusions.


Nuclear export signal (NES)

The nuclear export signal is located between residues 239 and 251 sequence probably bears a role in TDP-43's shuttling function, and was recently found using a prediction algorithm.


Disordered glycin rich C-terminal domain (CTD)

The Disordered Glycin Rich C-terminal domain is located between residues 277 and 414. Much like 70 other RNA binding proteins, TDP-43 bears a Q/ N rich domain 44, 366which resembles
yeast Yeasts are eukaryotic, single-celled microorganisms classified as members of the fungus kingdom (biology), kingdom. The first yeast originated hundreds of millions of years ago, and at least 1,500 species are currently recognized. They are est ...
prion A prion () is a Proteinopathy, misfolded protein that induces misfolding in normal variants of the same protein, leading to cellular death. Prions are responsible for prion diseases, known as transmissible spongiform encephalopathy (TSEs), w ...
sequence. This sequence is called a Prion-Like Domain (PLD). PLDs are low complexity sequences that have been reported to mediate gene regulation via Liquid-Liquid Phase Transition (LLP) thus driving RNP granule assembly. Forming these microscopically visible RNP granules is thought to induce more effective gene regulatory process. It is here noted that LLP are reversible phenomenons of de-mixing a solution into two distinct liquid phases, hereby forming granules. Mutations within the TDP-43 proteins Glycine Rich Region (GRR) have recently been identified as associates that can contribute to various neurodegenerative diseases, with the most notable and common NDD being ALS, about 10% of the mutations causing familial ALS are accredited with the TDP-43 protein This CTD is often reported to play important role in pathogenic behavior of TDP-43: RNPs granules could have a role in stress response, and thus, aging, or persistence stress could lead the LLPs to turn into irreversible Liquid Solid Phase separation, pathological aggregates notably found in
ALS Amyotrophic lateral sclerosis (ALS), also known as motor neuron disease (MND) or—in the United States—Lou Gehrig's disease (LGD), is a rare, terminal neurodegenerative disorder that results in the progressive loss of both upper and low ...
neurons. CTD's disorganized structure can turn into a full fledged
amyloid Amyloids are aggregates of proteins characterised by a fibrillar morphology of typically 7–13 nm in diameter, a β-sheet secondary structure (known as cross-β) and ability to be stained by particular dyes, such as Congo red. In the human ...
-like
beta-sheet The beta sheet (β-sheet, also β-pleated sheet) is a common structural motif, motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone chain, backbon ...
rich structure, causing it to adopt
prion A prion () is a Proteinopathy, misfolded protein that induces misfolding in normal variants of the same protein, leading to cellular death. Prions are responsible for prion diseases, known as transmissible spongiform encephalopathy (TSEs), w ...
-like properties. Moreover, CTFs are a common maker in diseased
neuron A neuron (American English), neurone (British English), or nerve cell, is an membrane potential#Cell excitability, excitable cell (biology), cell that fires electric signals called action potentials across a neural network (biology), neural net ...
s and are argued to be of high toxicity. However, notice is to be taken that some points are not always consensual. Indeed, due to its
hydrophobic In chemistry, hydrophobicity is the chemical property of a molecule (called a hydrophobe) that is seemingly repelled from a mass of water. In contrast, hydrophiles are attracted to water. Hydrophobic molecules tend to be nonpolar and, thu ...
structure, TDP-43 can be hard to analyze, and parts of it remain somewhat vague. Precise sites of
phosphorylation In biochemistry, phosphorylation is described as the "transfer of a phosphate group" from a donor to an acceptor. A common phosphorylating agent (phosphate donor) is ATP and a common family of acceptor are alcohols: : This equation can be writ ...
,
methylation Methylation, in the chemistry, chemical sciences, is the addition of a methyl group on a substrate (chemistry), substrate, or the substitution of an atom (or group) by a methyl group. Methylation is a form of alkylation, with a methyl group replac ...
, or even binding are still a bit elusive.


Function

TDP-43 is a transcriptional
repressor In molecular genetics, a repressor is a DNA- or RNA-binding protein that inhibits the expression of one or more genes by binding to the operator or associated silencers. A DNA-binding repressor blocks the attachment of RNA polymerase to the ...
that binds to chromosomally integrated TAR DNA and represses
HIV-1 The subtypes of HIV include two main subtypes, known as HIV type 1 (HIV-1) and HIV type 2 (HIV-2). These subtypes have distinct genetic differences and are associated with different epidemiological patterns and clinical characteristics. HIV-1 e ...
transcription. In addition, this protein regulates alternate splicing of the CFTR gene. In particular, TDP-43 is a splicing factor binding to the intron8/exon9 junction of the CFTR gene and to the intron2/exon3 region of the apoA-II gene. A similar pseudogene is present on chromosome 20. TDP-43 has been shown to bind both DNA and RNA and have multiple functions in transcriptional repression, pre-mRNA splicing and translational regulation. Recent work has characterized the transcriptome-wide binding sites revealing that thousands of RNAs are bound by TDP-43 in neurons. TDP-43 was originally identified as a transcriptional repressor that binds to chromosomally integrated trans-activation response element (TAR) DNA and represses
HIV-1 The subtypes of HIV include two main subtypes, known as HIV type 1 (HIV-1) and HIV type 2 (HIV-2). These subtypes have distinct genetic differences and are associated with different epidemiological patterns and clinical characteristics. HIV-1 e ...
transcription. It was also reported to regulate alternate splicing of the CFTR gene and the apoA-II gene. In spinal motor neurons TDP-43 has also been shown in humans to be a low molecular weight neurofilament (hNFL) mRNA-binding protein. It has also shown to be a neuronal activity response factor in the dendrites of hippocampal neurons suggesting possible roles in regulating mRNA stability, transport and local translation in neurons. It has been demonstrated that zinc ions are able to induce aggregation of endogenous TDP-43 in cells. Moreover, zinc could bind to RNA binding domain of TDP-43 and induce the formation of amyloid-like aggregates ''in vitro.''


DNA repair

TDP-43 protein is a key element of the
non-homologous end joining Non-homologous end joining (NHEJ) is a pathway that repairs double-strand breaks in DNA. It is called "non-homologous" because the break ends are directly ligated without the need for a homologous template, in contrast to homology directed repair ...
(NHEJ) enzymatic pathway that repairs DNA
double-strand breaks DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. A weakened capacity for DNA repair is a risk factor for the development of cancer. DNA is constantly modified ...
(DSBs) in pluripotent
stem cell In multicellular organisms, stem cells are undifferentiated or partially differentiated cells that can change into various types of cells and proliferate indefinitely to produce more of the same stem cell. They are the earliest type of cell ...
-derived
motor neuron A motor neuron (or motoneuron), also known as efferent neuron is a neuron whose cell body is located in the motor cortex, brainstem or the spinal cord, and whose axon (fiber) projects to the spinal cord or outside of the spinal cord to directly o ...
s. TDP-43 is rapidly recruited to DSBs where it acts as a scaffold for the further recruitment of the
XRCC4 DNA repair protein XRCC4 (hXRCC4) also known as X-ray repair cross-complementing protein 4 is a protein that in humans is encoded by the ''XRCC4'' gene. ''XRCC4'' is also expressed in many other animals, fungi and plants. hXRCC4 is one of severa ...
- DNA ligase protein complex that then acts to seal the DNA breaks. In TDP-43 depleted human neural stem cell-derived motor neurons, as well as in sporadic
ALS Amyotrophic lateral sclerosis (ALS), also known as motor neuron disease (MND) or—in the United States—Lou Gehrig's disease (LGD), is a rare, terminal neurodegenerative disorder that results in the progressive loss of both upper and low ...
patients' spinal cord specimens there is significant DSB accumulation and reduced levels of NHEJ.


Clinical significance

A hyper-
phosphorylated In biochemistry, phosphorylation is described as the "transfer of a phosphate group" from a donor to an acceptor. A common phosphorylating agent (phosphate donor) is ATP and a common family of acceptor are alcohols: : This equation can be writt ...
,
ubiquitin Ubiquitin is a small (8.6  kDa) regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ''ubiquitously''. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 19 ...
ated and cleaved form of TDP-43—known as pathologic TDP43—is the major disease protein in
ubiquitin Ubiquitin is a small (8.6  kDa) regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ''ubiquitously''. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 19 ...
-positive, tau-, and
alpha-synuclein Alpha-synuclein (aSyn) is a protein that in humans is encoded by the ''SNCA'' gene. It is a neuronal protein involved in the regulation of synaptic vesicle trafficking and the release of neurotransmitters. Alpha-synuclein is abundant in the brai ...
-negative
frontotemporal dementia Frontotemporal dementia (FTD), also called frontotemporal degeneration disease or frontotemporal neurocognitive disorder, encompasses several types of dementia involving the progressive degeneration of the brain's frontal lobe, frontal and tempor ...
(FTLD-TDP, previously referred to as FTLD-U) and in
amyotrophic lateral sclerosis Amyotrophic lateral sclerosis (ALS), also known as motor neuron disease (MND) or—in the United States—Lou Gehrig's disease (LGD), is a rare, Terminal illness, terminal neurodegenerative disease, neurodegenerative disorder that results i ...
(ALS). Elevated levels of the TDP-43 protein have also been identified in individuals diagnosed with
chronic traumatic encephalopathy Chronic traumatic encephalopathy (CTE) is a neurodegenerative disease linked to repeated trauma to the head. The encephalopathy symptoms can include behavioral problems, mood problems, and problems with thinking. The disease often gets wor ...
, and has also been associated with ALS leading to the inference that athletes who have experienced multiple
concussion A concussion, also known as a mild traumatic brain injury (mTBI), is a head injury that temporarily affects brain functioning. Symptoms may include headache, dizziness, difficulty with thinking and concentration, sleep disturbances, a brief ...
s and other types of
head injury A head injury is any injury that results in trauma to the skull or brain. The terms ''traumatic brain injury'' and ''head injury'' are often used interchangeably in the medical literature. Because head injuries cover such a broad scope of inju ...
are at an increased risk for both encephalopathy and motor neuron disease (ALS). Abnormalities of TDP-43 also occur in an important subset of
Alzheimer's disease Alzheimer's disease (AD) is a neurodegenerative disease and the cause of 60–70% of cases of dementia. The most common early symptom is difficulty in remembering recent events. As the disease advances, symptoms can include problems wit ...
patients, correlating with clinical and neuropathologic features indexes. Misfolded TDP-43 is found in the brains of
older adults Old age is the range of ages for people nearing and surpassing life expectancy. People who are of old age are also referred to as: old people, elderly, elders, senior citizens, seniors or older adults. Old age is not a definite biological sta ...
over age 85 with
limbic-predominant age-related TDP-43 encephalopathy LATE is a term that describes a prevalent medical condition with impaired memory and thinking in advanced age, often culminating in the dementia clinical syndrome. In other words, the symptoms of LATE are similar to those of Alzheimer's disease ...
, (LATE), a form of dementia. New monoclonal antibodies, 2G11 and 2H1, have been developed to specify different TDP-43 inclusion types that occur across neurodegenerative diseases, without relying on hyper-phosphorylated epitopes. These antibodies were raised against an epitope within the RRM2 domain (amino acid residues 198–216). Mutations in the ''TARDBP'' gene are associated with neurodegenerative disorders including
frontotemporal lobar degeneration Frontotemporal lobar degeneration (FTLD) is a pathological process that occurs in frontotemporal dementia. It is characterized by atrophy in the frontal lobe and temporal lobe of the brain, with sparing of the parietal and occipital lobes. Com ...
and
amyotrophic lateral sclerosis Amyotrophic lateral sclerosis (ALS), also known as motor neuron disease (MND) or—in the United States—Lou Gehrig's disease (LGD), is a rare, Terminal illness, terminal neurodegenerative disease, neurodegenerative disorder that results i ...
(ALS). In particular, the TDP-43 mutants M337V and Q331K are being studied for their roles in ALS. While the aberrant mislocalization and cytoplasmic aggregation of TDP-43 characterizes FTLD with TDP-43 pathology (FTLD-TDP), recent work suggests the amyloid fibrils found in human FTLD-TDP brains are composed of transmembrane lysosomal protein TMEM106b rather than TDP-43. Cytoplasmic TDP-43 pathology is the dominant histopathological feature of multisystem proteinopathy. The N-terminal domain, which contributes importantly to the aggregation of the C-terminal region, has a novel structure with two negatively charged loops.. A recent study has demonstrated that cellular stress can trigger the abnormal cytoplasmic mislocalisation of TDP-43 in spinal motor neurons in vivo, providing insight into how TDP-43 pathology may develop in sporadic ALS patients.


Figures


References


Further reading

* * * * * * * * * * * * * * * *


External links


GeneReviews/NCBI/NIH/UW entry on TARDBP-Related Amyotrophic Lateral Sclerosis
* {{PDB Gallery, geneid=23435 DNA-binding proteins