HOME

TheInfoList



OR:

Titin (; also called connectin) is a
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
that in humans is encoded by the ''TTN''
gene In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protei ...
. The protein, which is over 1 μm in length, functions as a molecular spring that is responsible for the passive elasticity of
muscle Muscle is a soft tissue, one of the four basic types of animal tissue. There are three types of muscle tissue in vertebrates: skeletal muscle, cardiac muscle, and smooth muscle. Muscle tissue gives skeletal muscles the ability to muscle contra ...
. It comprises 244 individually folded
protein domain In molecular biology, a protein domain is a region of a protein's Peptide, polypeptide chain that is self-stabilizing and that Protein folding, folds independently from the rest. Each domain forms a compact folded Protein tertiary structure, thre ...
s connected by unstructured
peptide Peptides are short chains of amino acids linked by peptide bonds. A polypeptide is a longer, continuous, unbranched peptide chain. Polypeptides that have a molecular mass of 10,000 Da or more are called proteins. Chains of fewer than twenty am ...
sequences. These domains unfold when the protein is stretched and refold when the tension is removed. Titin is important in the contraction of
striated muscle tissue Striated muscle tissue is a muscle tissue that features repeating functional units called sarcomeres. Under the microscope, sarcomeres are visible along muscle fibers, giving a striated appearance to the tissue. The two types of striated muscle a ...
s. It connects the Z disc to the M line in the
sarcomere A sarcomere (Greek σάρξ ''sarx'' "flesh", μέρος ''meros'' "part") is the smallest functional unit of striated muscle tissue. It is the repeating unit between two Z-lines. Skeletal striated muscle, Skeletal muscles are composed of tubular ...
. The protein contributes to force transmission at the Z disc and resting tension in the I band region. It limits the range of motion of the sarcomere in tension, thus contributing to the passive stiffness of muscle. Variations in the sequence of titin between different types of striated muscle ( cardiac or
skeletal A skeleton is the structural frame that supports the body of most animals. There are several types of skeletons, including the exoskeleton, which is a rigid outer shell that holds up an organism's shape; the endoskeleton, a rigid internal fram ...
) have been correlated with differences in the mechanical properties of these muscles. Titin is the third most abundant protein in muscle (after
myosin Myosins () are a Protein family, family of motor proteins (though most often protein complexes) best known for their roles in muscle contraction and in a wide range of other motility processes in eukaryotes. They are adenosine triphosphate, ATP- ...
and
actin Actin is a family of globular multi-functional proteins that form microfilaments in the cytoskeleton, and the thin filaments in muscle fibrils. It is found in essentially all eukaryotic cells, where it may be present at a concentration of ...
), and an adult human contains approximately 0.5 kg of titin. With its length of ~27,000 to ~35,000
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
s (depending on the splice isoform), titin is the largest known
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
. Furthermore, the gene for titin contains the largest number of
exon An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequence ...
s (363) discovered in any single gene, as well as the longest single exon (17,106 bp).


Discovery

In 1954, Reiji Natori proposed the existence of an elastic structure in muscle fiber to account for the return to the resting state when muscles are stretched and then released. In 1977, Koscak Maruyama and coworkers isolated an elastic protein from muscle fiber that they called connectin. Two years later, Kuan Wang and coworkers identified a doublet band on electrophoresis gel corresponding to a high molecular weight, elastic protein that they named titin. In 1990, Siegfried Labeit isolated a partial cDNA clone of titin. Five years later, Labeit and Bernhard Kolmerer determined the cDNA sequence of human cardiac titin. In 2001, Labeit and colleagues determined the complete sequence of the human titin gene.


Genetics

The human gene encoding for titin is located on the long arm of chromosome 2 and contains 363 exons, which together code for 38,138
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
residues (4200 kDa). Within the gene are found a large number of PEVK (proline-glutamate-valine-lysine -abundant structural motifs) exons 84 to 99 nucleotides in length, which code for conserved 28- to 33-residue motifs that may represent structural units of the titin PEVK spring. The number of PEVK motifs in the titin gene appears to have increased during evolution, apparently modifying the genomic region responsible for titin's spring properties.


Isoforms

A number of titin isoforms are produced in different striated muscle tissues as a result of
alternative splicing Alternative splicing, alternative RNA splicing, or differential splicing, is an alternative RNA splicing, splicing process during gene expression that allows a single gene to produce different splice variants. For example, some exons of a gene ma ...
. All but one of these isoforms are in the range of ~27,000 to ~36,000 amino acid residues in length. The exception is the small cardiac novex-3 isoform, which is only 5,604 amino acid residues in length. The following table lists the known titin isoforms:


Structure

Titin is the largest known protein; its human variant consists of 34,350
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
s, with the
molecular weight A molecule is a group of two or more atoms that are held together by Force, attractive forces known as chemical bonds; depending on context, the term may or may not include ions that satisfy this criterion. In quantum physics, organic chemi ...
of the mature "canonical" isoform of the protein being approximately 3,816,030.05 Da. Its mouse homologue is even larger, comprising 35,213 amino acids with a molecular weight of 3,906,487.6 Da. It has a theoretical
isoelectric point The isoelectric point (pI, pH(I), IEP), is the pH at which a molecule carries no net electric charge, electrical charge or is electrically neutral in the statistical mean. The standard nomenclature to represent the isoelectric point is pH(I). Howe ...
of 6.02. The protein's
empirical Empirical evidence is evidence obtained through sense experience or experimental procedure. It is of central importance to the sciences and plays a role in various other fields, like epistemology and law. There is no general agreement on how t ...
chemical formula A chemical formula is a way of presenting information about the chemical proportions of atoms that constitute a particular chemical compound or molecule, using chemical element symbols, numbers, and sometimes also other symbols, such as pare ...
is C169,719H270,466N45,688O52,238S911. It has a theoretical instability index (II) of 42.38, classifying the protein as unstable. The protein's
in vivo Studies that are ''in vivo'' (Latin for "within the living"; often not italicized in English) are those in which the effects of various biological entities are tested on whole, living organisms or cells, usually animals, including humans, an ...
half-life Half-life is a mathematical and scientific description of exponential or gradual decay. Half-life, half life or halflife may also refer to: Film * Half-Life (film), ''Half-Life'' (film), a 2008 independent film by Jennifer Phang * ''Half Life: ...
, the time it takes for half of the amount of protein in a cell to break down after its synthesis in the cell, is predicted to be approximately 30 hours (in
mammalian A mammal () is a vertebrate animal of the Class (biology), class Mammalia (). Mammals are characterised by the presence of milk-producing mammary glands for feeding their young, a broad neocortex region of the brain, fur or hair, and three ...
reticulocytes). The Titin protein is located between the
myosin Myosins () are a Protein family, family of motor proteins (though most often protein complexes) best known for their roles in muscle contraction and in a wide range of other motility processes in eukaryotes. They are adenosine triphosphate, ATP- ...
thick filament and the Z disk. Titin consists primarily of a linear array of two types of modules, also referred to as
protein domain In molecular biology, a protein domain is a region of a protein's Peptide, polypeptide chain that is self-stabilizing and that Protein folding, folds independently from the rest. Each domain forms a compact folded Protein tertiary structure, thre ...
s (244 copies in total): type I fibronectin type III domain (132 copies) and type II immunoglobulin domain (112 copies). However, the exact number of these domains is different in different species. This linear array is further organized into two regions: * N-terminal I-band: acts as the elastic part of the molecule and is composed mainly of type II modules. More specifically the I-band contains two regions of tandem type II immunoglobulin domains on either side of a ''PEVK region'' that is rich in proline (P),
glutamate Glutamic acid (symbol Glu or E; known as glutamate in its anionic form) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a Essential amino acid, non-essential nutrient for humans, meaning that ...
(E),
valine Valine (symbol Val or V) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated −NH3+ form under biological conditions), an α- carboxylic acid group (which is in the deproton ...
(V) and
lysine Lysine (symbol Lys or K) is an α-amino acid that is a precursor to many proteins. Lysine contains an α-amino group (which is in the protonated form when the lysine is dissolved in water at physiological pH), an α-carboxylic acid group ( ...
(K). * C-terminal A-band: is thought to act as a protein-ruler and is composed of alternating type I (Fn3) and II (Ig) modules with super-repeat segments. These have been shown to align to the 43 nm axial repeats of myosin thick filaments with immunoglobulin domains correlating to myosin crowns. The C-terminal region also contains a serine kinase domain that is primarily known for adapting the muscle to mechanical strain. It is “stretch-sensitive” and helps repair overstretching of the sarcomere. The N-terminal (the Z-disc end) contains a "Z repeat" that recognizes Actinin alpha 2. The elasticity of the PEVK region has both entropic and enthalpic contributions and is characterized by a polymer persistence length and a stretch modulus. At low to moderate extensions PEVK elasticity can be modeled with a standard worm-like chain (WLC) model of entropic elasticity. At high extensions PEVK stretching can be modeled with a modified WLC model that incorporates enthalpic elasticity. The difference between low-and high- stretch elasticity is due to electrostatic stiffening and hydrophobic effects. Embedded between the PEVK and Ig residues are N2A domains.


Evolution

The titin domains have evolved from a common ancestor through many gene duplication events. Domain duplication was facilitated by the fact that most domains are encoded by single exons. Other giant sarcomeric proteins made out of Fn3/Ig repeats include obscurin and myomesin. Throughout evolution, titin mechanical strength appears to decrease through the loss of disulfide bonds as the organism becomes heavier. Titin A-band has homologs in invertebrates, such as twitchin (unc-22) and projectin, which also contain Ig and FNIII repeats and a protein kinase domain. The gene duplication events took place independently but were from the same ancestral Ig and FNIII domains. It is said that the protein titin was the first to diverge out of the family. ''Drosophila'' projectin, officially known as bent (''bt''), is associated with lethality by failing to escape the egg in some mutations as well as dominant changes in wing angles. ''Drosophila'' Titin, also known as Kettin or sallimus (''sls''), is kinase-free. It has roles in the elasticity of both muscle and chromosomes. It is homologous to vertebrate titin I-band and contains Ig PEVK domains, the many repeats being a hot target for splicing. There also exists a titin homologue, ''ttn-1'', in '' C. elegans''. It has a kinase domain, some Ig/Fn3 repeats, and PEVT repeats that are similarly elastic.


Function

Titin is a large abundant protein of striated muscle. Titin's primary functions are to stabilize the thick filament, center it between the thin filaments, prevent overstretching of the sarcomere, and to recoil the sarcomere like a spring after it is stretched. An N-terminal Z-disc region and a C-terminal M-line region bind to the Z-line and M-line of the
sarcomere A sarcomere (Greek σάρξ ''sarx'' "flesh", μέρος ''meros'' "part") is the smallest functional unit of striated muscle tissue. It is the repeating unit between two Z-lines. Skeletal striated muscle, Skeletal muscles are composed of tubular ...
, respectively, so that a single titin molecule spans half the length of a sarcomere. Titin also contains binding sites for muscle-associated proteins so it serves as an adhesion template for the assembly of contractile machinery in muscle cells. It has also been identified as a structural protein for
chromosome A chromosome is a package of DNA containing part or all of the genetic material of an organism. In most chromosomes, the very long thin DNA fibers are coated with nucleosome-forming packaging proteins; in eukaryotic cells, the most import ...
s. Considerable variability exists in the I-band, the M-line and the Z-disc regions of titin. Variability in the I-band region contributes to the differences in elasticity of different titin isoforms and, therefore, to the differences in elasticity of different muscle types. Of the many titin variants identified, five are described with complete transcript information available. Dominant mutation in TTN causes predisposition to
hernia A hernia (: hernias or herniae, from Latin, meaning 'rupture') is the abnormal exit of tissue or an organ (anatomy), organ, such as the bowel, through the wall of the cavity in which it normally resides. The term is also used for the normal Devel ...
s. Titin interacts with many sarcomeric proteins including: * Z line region: telethonin and alpha-actinin * I band region: calpain-3 and obscurin * M line region: myosin-binding protein C, calmodulin 1, CAPN3, and MURF1


Clinical relevance

Mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, ...
s anywhere within the unusually long sequence of this gene can cause premature stop codons or other defects. Titin mutations are associated with hereditary
myopathy In medicine, myopathy is a disease of the muscle in which the muscle fibers do not function properly. ''Myopathy'' means muscle disease ( Greek : myo- ''muscle'' + patheia '' -pathy'' : ''suffering''). This meaning implies that the primary defec ...
with early respiratory failure, early-onset myopathy with fatal
cardiomyopathy Cardiomyopathy is a group of primary diseases of the heart muscle. Early on there may be few or no symptoms. As the disease worsens, shortness of breath, feeling tired, and swelling of the legs may occur, due to the onset of heart failure. A ...
, core myopathy with heart disease, centronuclear myopathy, limb-girdle muscular dystrophy type 2J, familial dilated cardiomyopathy 9, hypertrophic cardiomyopathy and tibial muscular dystrophy. Further research also suggests that no genetically linked form of any dystrophy or myopathy can be safely excluded from being caused by a mutation on the TTN gene. Truncating mutations in dilated cardiomyopathy patients are most commonly found in the A region; although truncations in the upstream I region might be expected to prevent translation of the A region entirely,
alternative splicing Alternative splicing, alternative RNA splicing, or differential splicing, is an alternative RNA splicing, splicing process during gene expression that allows a single gene to produce different splice variants. For example, some exons of a gene ma ...
creates some transcripts that do not encounter the premature stop codon, ameliorating its effect. mRNA splicing factors such as RBM20 and SLM2 ( KHDRBS3) were shown to mediated alternative mRNA splicing of titin mRNA contributing to the development of
heart failure Heart failure (HF), also known as congestive heart failure (CHF), is a syndrome caused by an impairment in the heart's ability to Cardiac cycle, fill with and pump blood. Although symptoms vary based on which side of the heart is affected, HF ...
due to cardiomyopathies. Autoantibodies to titin are produced in patients with the autoimmune disease
Myasthenia gravis Myasthenia gravis (MG) is a long-term neuromuscular junction disease that leads to varying degrees of skeletal muscle weakness. The most commonly affected muscles are those of the eyes, face, and swallowing. It can result in double vision, ...
.


Interactions

Titin has been shown to interact with: * ANK1, * ANKRD1, * ANKRD23 * CAPN3, * FHL2, * OBSCN, * TCAP, and * TRIM63.


Linguistic significance

The name titin is derived from the Greek
Titan Titan most often refers to: * Titan (moon), the largest moon of Saturn * Titans, a race of deities in Greek mythology Titan or Titans may also refer to: Arts and entertainment Fictional entities Fictional locations * Titan in fiction, fictiona ...
(a giant deity, anything of great size). Titin also has the longest IUPAC name of a protein. The full chemical name of the human canonical form of titin, which starts '' methionyl...'' and ends ''...
isoleucine Isoleucine (symbol Ile or I) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depro ...
'', contains 189,819 letters and is sometimes stated to be the longest word in the English language, or of any language. However, lexicographers regard formal names of
chemical compound A chemical compound is a chemical substance composed of many identical molecules (or molecular entities) containing atoms from more than one chemical element held together by chemical bonds. A molecule consisting of atoms of only one element ...
s as ''verbal
formulae In science, a formula is a concise way of expressing information symbolically, as in a mathematical formula or a ''chemical formula''. The informal use of the term ''formula'' in science refers to the general construct of a relationship betwe ...
'' rather than English words.


See also

* PKZILLA-1 - new largest known protein with 45,212 amino acids


References


Further reading

* * * * * * * * *


External links


GeneReviews/NIH/NCBI/UW entry on Familial Hypertrophic Cardiomyopathy Overview

GeneReviews/NCBI/NIH/UW entry on Udd Distal Myopathy, Tibial Muscular Dystrophy, Udd Myopathy

GeneReviews/NIH/NCBI/UW entry on Salih Myopathy or Early-Onset Myopathy with Fatal Cardiomyopathy

InterPro domain organization of titin
{{NLM content Structural proteins Genes on human chromosome 2 EC 2.7.11 Muscular system