Protein tertiary structure
   HOME

TheInfoList



OR:

Protein tertiary structure is the three dimensional shape of a
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
. The tertiary structure will have a single polypeptide chain "backbone" with one or more protein secondary structures, the
protein domain In molecular biology, a protein domain is a region of a protein's polypeptide chain that is self-stabilizing and that folds independently from the rest. Each domain forms a compact folded three-dimensional structure. Many proteins consist of ...
s.
Amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha a ...
side chain In organic chemistry and biochemistry, a side chain is a chemical group that is attached to a core part of the molecule called the "main chain" or backbone. The side chain is a hydrocarbon branching element of a molecule that is attached to a ...
s may interact and bond in a number of ways. The interactions and bonds of side chains within a particular protein determine its tertiary structure. The protein tertiary structure is defined by its
atom Every atom is composed of a nucleus and one or more electrons bound to the nucleus. The nucleus is made of one or more protons and a number of neutrons. Only the most common variety of hydrogen has no neutrons. Every solid, liquid, gas, ...
ic coordinates. These coordinates may refer either to a protein domain or to the entire tertiary structure.Branden C. and Tooze J. "Introduction to Protein Structure" Garland Publishing, New York. 1990 and 1991. A number of tertiary structures may fold into a
quaternary structure Protein quaternary structure is the fourth (and highest) classification level of protein structure. Protein quaternary structure refers to the structure of proteins which are themselves composed of two or more smaller protein chains (also refe ...
.Kyte, J. "Structure in Protein Chemistry." Garland Publishing, New York. 1995.


History

The science of the tertiary structure of proteins has progressed from one of
hypothesis A hypothesis (plural hypotheses) is a proposed explanation for a phenomenon. For a hypothesis to be a scientific hypothesis, the scientific method requires that one can test it. Scientists generally base scientific hypotheses on previous obse ...
to one of detailed definition. Although
Emil Fischer Hermann Emil Louis Fischer (; 9 October 1852 – 15 July 1919) was a German chemist and 1902 recipient of the Nobel Prize in Chemistry. He discovered the Fischer esterification. He also developed the Fischer projection, a symbolic way of draw ...
had suggested proteins were made of polypeptide chains and amino acid side chains, it was Dorothy Maud Wrinch who incorporated
geometry Geometry (; ) is, with arithmetic, one of the oldest branches of mathematics. It is concerned with properties of space such as the distance, shape, size, and relative position of figures. A mathematician who works in the field of geometry is ...
into the prediction of
protein structure Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single amino acid monom ...
s. Wrinch demonstrated this with the ''Cyclol'' model, the first prediction of the structure of a
globular protein In biochemistry, globular proteins or spheroproteins are spherical ("globe-like") proteins and are one of the common protein types (the others being fibrous, disordered and membrane proteins). Globular proteins are somewhat water-soluble (formi ...
. Contemporary methods are able to determine, without prediction, tertiary structures to within 5 Å (0.5 nm) for small proteins (<120 residues) and, under favorable conditions, confident secondary structure predictions.


Determinants


Stability of native states


Thermostability

A protein folded into its native state or native conformation typically has a lower
Gibbs free energy In thermodynamics, the Gibbs free energy (or Gibbs energy; symbol G) is a thermodynamic potential that can be used to calculate the maximum amount of work that may be performed by a thermodynamically closed system at constant temperature and ...
(a combination of
enthalpy Enthalpy , a property of a thermodynamic system, is the sum of the system's internal energy and the product of its pressure and volume. It is a state function used in many measurements in chemical, biological, and physical systems at a constant ...
and
entropy Entropy is a scientific concept, as well as a measurable physical property, that is most commonly associated with a state of disorder, randomness, or uncertainty. The term and the concept are used in diverse fields, from classical thermodynam ...
) than the unfolded conformation. A protein will tend towards low-energy conformations, which will determine the protein's fold in the cellular environment. Because many similar conformations will have similar energies, protein structures are dynamic, fluctuating between these similar structures.
Globular protein In biochemistry, globular proteins or spheroproteins are spherical ("globe-like") proteins and are one of the common protein types (the others being fibrous, disordered and membrane proteins). Globular proteins are somewhat water-soluble (formi ...
s have a core of
hydrophobic In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water. Hydrophobic molecules tend to be nonpolar and, t ...
amino acid residues and a surface region of
water Water (chemical formula ) is an Inorganic compound, inorganic, transparent, tasteless, odorless, and Color of water, nearly colorless chemical substance, which is the main constituent of Earth's hydrosphere and the fluids of all known living ...
-exposed, charged,
hydrophilic A hydrophile is a molecule or other molecular entity that is attracted to water molecules and tends to be dissolved by water.Liddell, H.G. & Scott, R. (1940). ''A Greek-English Lexicon'' Oxford: Clarendon Press. In contrast, hydrophobes are ...
residues. This arrangement may stabilise interactions within the tertiary structure. For example, in
secrete 440px Secretion is the movement of material from one point to another, such as a secreted chemical substance from a cell or gland. In contrast, excretion is the removal of certain substances or waste products from a cell or organism. The classical ...
d proteins, which are not bathed in
cytoplasm In cell biology, the cytoplasm is all of the material within a eukaryotic cell, enclosed by the cell membrane, except for the cell nucleus. The material inside the nucleus and contained within the nuclear membrane is termed the nucleoplasm. ...
,
disulfide bond In biochemistry, a disulfide (or disulphide in British English) refers to a functional group with the structure . The linkage is also called an SS-bond or sometimes a disulfide bridge and is usually derived by the coupling of two thiol groups. In ...
s between cysteine residues help to maintain the tertiary structure. There is a commonality of stable tertiary structures seen in proteins of diverse function and diverse
evolution Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation ...
. For example, the
TIM barrel The TIM barrel (triose-phosphate isomerase), also known as an alpha/beta barrel, is a conserved protein fold consisting of eight alpha helices (α-helices) and eight parallel beta strands (β-strands) that alternate along the peptide backbone ...
, named for the enzyme
triosephosphateisomerase Triose-phosphate isomerase (TPI or TIM) is an enzyme () that catalyzes the reversible interconversion of the triose phosphate isomers dihydroxyacetone phosphate and D-glyceraldehyde 3-phosphate. TPI plays an important role in glycolysis and ...
, is a common tertiary structure as is the highly stable, dimeric,
coiled coil A coiled coil is a structural motif in proteins in which 2–7 alpha-helices are coiled together like the strands of a rope. (Dimers and trimers are the most common types.) Many coiled coil-type proteins are involved in important biological fu ...
structure. Hence, proteins may be classified by the structures they hold. Databases of proteins which use such a classification include ''
SCOP A ( or ) was a poet as represented in Old English poetry. The scop is the Old English counterpart of the Old Norse ', with the important difference that "skald" was applied to historical persons, and scop is used, for the most part, to designa ...
'' and '' CATH''.


Kinetic traps

Folding kinetics may trap a protein in a high-
energy In physics, energy (from Ancient Greek: ἐνέργεια, ''enérgeia'', “activity”) is the quantitative property that is transferred to a body or to a physical system, recognizable in the performance of work and in the form of hea ...
conformation, i.e. a high-energy intermediate conformation blocks access to the lowest-energy conformation. The high-energy conformation may contribute to the function of the protein. For example, the influenza
hemagglutinin In molecular biology, hemagglutinins (or ''haemagglutinin'' in British English) (from the Greek , 'blood' + Latin , 'glue') are receptor-binding membrane fusion glycoproteins produced by viruses in the '' Paramyxoviridae'' family. Hemagglutinins a ...
protein is a single polypeptide chain which when activated, is proteolytically cleaved to form two polypeptide chains. The two chains are held in a high-energy conformation. When the local pH drops, the protein undergoes an energetically favorable conformational rearrangement that enables it to penetrate the host
cell membrane The cell membrane (also known as the plasma membrane (PM) or cytoplasmic membrane, and historically referred to as the plasmalemma) is a biological membrane that separates and protects the interior of all cells from the outside environment ( ...
.


Metastability

Some tertiary protein structures may exist in long-lived states that are not the expected most stable state. For example, many
serpins Serpins are a superfamily of proteins with similar structures that were first identified for their protease inhibition activity and are found in all kingdoms of life. The acronym serpin was originally coined because the first serpins to be id ...
(serine protease inhibitors) show this
metastability In chemistry and physics, metastability denotes an intermediate energetic state within a dynamical system other than the system's state of least energy. A ball resting in a hollow on a slope is a simple example of metastability. If the ball i ...
. They undergo a conformational change when a loop of the protein is cut by a
protease A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalyzes (increases reaction rate or "speeds up") proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the ...
.


Chaperone proteins

It is commonly assumed that the native state of a protein is also the most thermodynamically stable and that a protein will reach its native state, given its
chemical kinetics Chemical kinetics, also known as reaction kinetics, is the branch of physical chemistry that is concerned with understanding the rates of chemical reactions. It is to be contrasted with chemical thermodynamics, which deals with the direction in ...
, before it is translated. Protein chaperones within the cytoplasm of a cell assist a newly synthesised polypeptide to attain its native state. Some chaperone proteins are highly specific in their function, for example,
protein disulfide isomerase Protein disulfide isomerase (), or PDI, is an enzyme in the endoplasmic reticulum (ER) in eukaryotes and the periplasm of bacteria that catalyzes the formation and breakage of disulfide bonds between cysteine residues within proteins as the ...
; others are general in their function and may assist most globular proteins, for example, the
prokaryotic A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
GroEL GroEL is a protein which belongs to the chaperonin family of molecular chaperones, and is found in many bacteria. It is required for the proper folding of many proteins. To function properly, GroEL requires the lid-like cochaperonin protein com ...
/
GroES Heat shock 10 kDa protein 1 (Hsp10), also known as chaperonin 10 (cpn10) or early-pregnancy factor (EPF), is a protein that in humans is encoded by the ''HSPE1'' gene. The homolog in '' E. coli'' is GroES that is a chaperonin which usually works ...
system of proteins and the homologous
eukaryotic Eukaryotes () are organisms whose Cell (biology), cells have a cell nucleus, nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the ...
heat shock protein Heat shock proteins (HSP) are a family of proteins produced by cells in response to exposure to stressful conditions. They were first described in relation to heat shock, but are now known to also be expressed during other stresses including expo ...
s (the Hsp60/Hsp10 system).


Cytoplasmic environment

Prediction of protein tertiary structure relies on knowing the protein's primary structure and comparing the possible predicted tertiary structure with known tertiary structures in
protein data bank The Protein Data Bank (PDB) is a database for the three-dimensional structural data of large biological molecules, such as proteins and nucleic acids. The data, typically obtained by X-ray crystallography, NMR spectroscopy, or, increasingly, ...
s. This only takes into account the cytoplasmic environment present at the time of protein synthesis to the extent that a similar cytoplasmic environment may also have influenced the structure of the proteins recorded in the protein data bank.


Ligand binding

The structure of a protein, for example an
enzyme Enzymes () are proteins that act as biological catalysts by accelerating chemical reactions. The molecules upon which enzymes may act are called substrates, and the enzyme converts the substrates into different molecules known as products ...
, may change upon binding of its natural ligands, for example a cofactor. In this case, the structure of the protein bound to the ligand is known as holo structure, of the unbound protein as apo structure. Structure stabilized by the formation of weak bonds between amino acid side chains - Determined by the folding of the polypeptide chain on itself (nonpolar residues are located inside the protein, while polar residues are mainly located outside) - Envelopment of the protein brings the protein closer and relates a-to located in distant regions of the sequence - Acquisition of the tertiary structure leads to the formation of pockets and sites suitable for the recognition and the binding of specific molecules (biospecificity)


Determination

The knowledge of the tertiary structure of soluble
globular protein In biochemistry, globular proteins or spheroproteins are spherical ("globe-like") proteins and are one of the common protein types (the others being fibrous, disordered and membrane proteins). Globular proteins are somewhat water-soluble (formi ...
s is more advanced than that of
membrane protein Membrane proteins are common proteins that are part of, or interact with, biological membranes. Membrane proteins fall into several broad categories depending on their location. Integral membrane proteins are a permanent part of a cell membrane ...
s because the former are easier to study with available technology.


X-ray crystallography

X-ray crystallography X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angles ...
is the most common tool used to determine
protein structure Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single amino acid monom ...
. It provides high resolution of the structure but it does not give information about protein's conformational flexibility.


NMR

Protein NMR Nuclear magnetic resonance spectroscopy of proteins (usually abbreviated protein NMR) is a field of structural biology in which NMR spectroscopy is used to obtain information about the structure and dynamics of proteins, and also nucleic acids, and ...
gives comparatively lower resolution of protein structure. It is limited to smaller proteins. However, it can provide information about conformational changes of a protein in solution.


Cryogenic electron microscopy

Cryogenic electron microscopy Cryogenic electron microscopy (cryo-EM) is a cryomicroscopy technique applied on samples cooled to cryogenic temperatures. For biological specimens, the structure is preserved by embedding in an environment of vitreous ice. An aqueous sample so ...
(cryo-EM) can give information about both a protein's tertiary and quaternary structure. It is particularly well-suited to large proteins and symmetrical complexes of
protein subunits In structural biology, a protein subunit is a polypeptide chain or single protein molecule that assembles (or "''coassembles''") with others to form a protein complex. Large assemblies of proteins such as viruses often use a small number of ty ...
.


Dual polarisation interferometry

Dual polarisation interferometry Dual-polarization interferometry (DPI) is an analytical technique that probes molecular layers adsorbed to the surface of a waveguide using the evanescent wave of a laser beam. It is used to measure the conformational change in proteins, or othe ...
provides complementary information about surface captured proteins. It assists in determining structure and conformation changes over time.


Projects


Prediction algorithm

The
Folding@home Folding@home (FAH or F@h) is a volunteer computing project aimed to help scientists develop new therapeutics for a variety of diseases by the means of simulating protein dynamics. This includes the process of protein folding and the movements ...
project at Stanford University is a
distributed computing A distributed system is a system whose components are located on different networked computers, which communicate and coordinate their actions by passing messages to one another from any system. Distributed computing is a field of computer sci ...
research effort which uses approximately 5 petaFLOPS (≈10 x86 petaFLOPS) of available computing. It aims to find an
algorithm In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for performing ...
which will consistently predict protein tertiary and quaternary structures given the protein's amino acid sequence and its cellular conditions."Folding@home."
Stanford University. Accessed 18 December 2013.
"Folding@home – FAQ"
Stanford University. Accessed 18 December 2013.
"Folding@home – Science."
Stanford University.
A list of software for protein tertiary structure prediction can be found at List of protein structure prediction software.


Protein aggregation diseases

Protein aggregation In molecular biology, protein aggregation is a phenomenon in which intrinsically-disordered or mis-folded proteins aggregate (i.e., accumulate and clump together) either intra- or extracellularly. Protein aggregates have been implicated in a wi ...
diseases such as Alzheimer's disease and
Huntington's disease Huntington's disease (HD), also known as Huntington's chorea, is a neurodegenerative disease that is mostly inherited. The earliest symptoms are often subtle problems with mood or mental abilities. A general lack of coordination and an uns ...
and prion diseases such as
bovine spongiform encephalopathy Bovine spongiform encephalopathy (BSE), commonly known as mad cow disease, is an incurable and invariably fatal neurodegenerative disease of cattle. Symptoms include abnormal behavior, trouble walking, and weight loss. Later in the course of t ...
can be better understood by constructing (and reconstructing)
disease model ''Medical model'' is the term coined by psychiatrist R. D. Laing in his ''The Politics of the Family and Other Essays'' (1971), for the "set of procedures in which all doctors are trained". It includes complaint, history, physical examinati ...
s. This is done by causing the disease in laboratory animals, for example, by administering a
toxin A toxin is a naturally occurring organic poison produced by metabolic activities of living cells or organisms. Toxins occur especially as a protein or conjugated protein. The term toxin was first used by organic chemist Ludwig Brieger (1849 ...
, such as
MPTP MPTP (1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine) is a prodrug to the neurotoxin MPP+, which causes permanent symptoms of Parkinson's disease by destroying dopaminergic neurons in the substantia nigra of the brain. It has been used to study d ...
to cause Parkinson's disease, or through
genetic manipulation Genetic engineering, also called genetic modification or genetic manipulation, is the modification and manipulation of an organism's genes using technology. It is a set of technologies used to change the genetic makeup of cells, including ...
.
Protein structure prediction Protein structure prediction is the inference of the three-dimensional structure of a protein from its amino acid sequence—that is, the prediction of its secondary and tertiary structure from primary structure. Structure prediction is different ...
is a new way to create disease models, which may avoid the use of animals.


Protein Tertiary Structure Retrieval Project (CoMOGrad)

Matching patterns in tertiary structure of a given protein to huge number of known protein tertiary structures and retrieve most similar ones in ranked order is in the heart of many research areas like function prediction of novel proteins, study of evolution, disease diagnosis, drug discovery, antibody design etc. The CoMOGrad project at BUET is a research effort to device an extremely fast and much precise method for protein tertiary structure retrieval and develop online tool based on research outcome.


See also

* Folding (chemistry) * I-TASSER *
Nucleic acid tertiary structure Nucleic acid tertiary structure is the three-dimensional shape of a nucleic acid polymer. RNA and DNA molecules are capable of diverse functions ranging from molecular recognition to catalysis. Such functions require a precise three-dimension ...
* Protein contact map * Proteopedia *
Structural biology Structural biology is a field that is many centuries old which, and as defined by the Journal of Structural Biology, deals with structural analysis of living material (formed, composed of, and/or maintained and refined by living cells) at every le ...
* Structural motif *
Protein tandem repeats An array of protein tandem repeats is defined as several (at least two) adjacent copies having the same or similar sequence motifs. These periodic sequences are generated by internal duplications in both coding and non-coding genomic sequences. Re ...


References


External links


Protein Data BankDisplay, analyse and superimpose protein 3D structuresDisplay, analyse and superimpose protein 3D structuresWWW-based course teaching elementary protein bioinformaticsCritical Assessment of Structure Prediction (CASP)Structural Classification of Proteins (SCOP)CATH Protein Structure ClassificationDALI/FSSP software and database of superposed protein structuresTOPOFIT-DB Invariant Structural Cores between proteins
* PDBWiki
PDBWiki Home Page
– a website for community annotation of PDB structures. {{DEFAULTSORT:Tertiary Structure Protein structure 3