
In
molecular biology
Molecular biology is the branch of biology that seeks to understand the molecular basis of biological activity in and between cells, including biomolecular synthesis, modification, mechanisms, and interactions. The study of chemical and phys ...
, protein aggregation is a phenomenon in which
intrinsically-disordered or mis-folded
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
s aggregate (i.e., accumulate and clump together) either intra- or extracellularly.
Protein aggregates have been implicated in a wide variety of diseases known as
amyloidoses
Amyloidosis is a group of diseases in which abnormal proteins, known as amyloid fibrils, build up in tissue. There are several non-specific and vague signs and symptoms associated with amyloidosis. These include fatigue, peripheral edema, weight ...
, including
ALS,
Alzheimer's,
Parkinson's and
prion
Prions are misfolded proteins that have the ability to transmit their misfolded shape onto normal variants of the same protein. They characterize several fatal and transmissible neurodegenerative diseases in humans and many other animals. It ...
disease.
After synthesis, proteins typically
fold
Fold, folding or foldable may refer to:
Arts, entertainment, and media
* ''Fold'' (album), the debut release by Australian rock band Epicure
*Fold (poker), in the game of poker, to discard one's hand and forfeit interest in the current pot
*Above ...
into a particular
three-dimensional conformation that is the most
thermodynamically favorable: their
native state. This folding process is driven by the
hydrophobic effect
The hydrophobic effect is the observed tendency of nonpolar substances to aggregate in an aqueous solution and exclude water molecules. The word hydrophobic literally means "water-fearing", and it describes the segregation of water and nonpola ...
: a tendency for
hydrophobic
In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water.
Hydrophobic molecules tend to be nonpolar and, ...
(water-fearing) portions of the protein to shield themselves from the
hydrophilic
A hydrophile is a molecule or other molecular entity that is attracted to water molecules and tends to be dissolved by water.Liddell, H.G. & Scott, R. (1940). ''A Greek-English Lexicon'' Oxford: Clarendon Press.
In contrast, hydrophobes are ...
(water-loving) environment of the cell by burying into the interior of the protein. Thus, the exterior of a protein is typically hydrophilic, whereas the interior is typically hydrophobic.
Protein structures are stabilized by
non-covalent interactions and
disulfide bonds between two
cysteine
Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula . The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile.
When present as a deprotonated catalytic residue, s ...
residues. The non-covalent interactions include
ionic interactions and weak
van der Waals interactions
A van is a type of road vehicle used for transporting goods or people. Depending on the type of van, it can be bigger or smaller than a pickup truck and SUV, and bigger than a common car. There is some varying in the scope of the word across th ...
. Ionic interactions form between an anion and a cation and form
salt bridges that help stabilize the protein. Van der Waals interactions include nonpolar interactions (i.e.
London dispersion force) and polar interactions (i.e.
hydrogen bond
In chemistry, a hydrogen bond (or H-bond) is a primarily electrostatic force of attraction between a hydrogen (H) atom which is covalently bound to a more electronegative "donor" atom or group (Dn), and another electronegative atom bearing ...
s,
dipole-dipole bond). These play an important role in a protein's
secondary structure
Protein secondary structure is the three dimensional form of ''local segments'' of proteins. The two most common secondary structural elements are alpha helices and beta sheets, though beta turns and omega loops occur as well. Secondary struct ...
, such as forming an
alpha helix
The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues earl ...
or a
beta sheet
The beta sheet, (β-sheet) (also β-pleated sheet) is a common motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a gen ...
, and tertiary structure. Interactions between amino acid residues in a specific protein are very important in that protein's final structure.
When there are changes in the non-covalent interactions, as may happen with a change in the amino acid sequence, the protein is susceptible to misfolding or unfolding. In these cases, if the cell does not assist the protein in re-folding, or
degrade the unfolded protein, the unfolded/misfolded protein may aggregate, in which the exposed hydrophobic portions of the protein may interact with the exposed hydrophobic patches of other proteins.
There are three main types of protein aggregates that may form: amorphous aggregates,
oligomer
In chemistry and biochemistry, an oligomer () is a molecule that consists of a few repeating units which could be derived, actually or conceptually, from smaller molecules, monomers.Quote: ''Oligomer molecule: A molecule of intermediate relat ...
s, and
amyloid
Amyloids are aggregates of proteins characterised by a fibrillar morphology of 7–13 nm in diameter, a beta sheet (β-sheet) secondary structure (known as cross-β) and ability to be stained by particular dyes, such as Congo red. In the hu ...
fibrils.
Causes
Protein aggregation can occur due to a variety of causes. There are four classes that these causes can be categorized into, which are detailed below.
Mutations
Mutation
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, m ...
s that occur in the DNA sequence may or may not affect the amino acid sequence of the protein. When the sequence is affected, a different amino acid may change the interactions between the side chains that affect the folding of the protein. This can lead to exposed hydrophobic regions of the protein that aggregate with the same misfolded/unfolded protein or a different protein.
In addition to mutations in the affected proteins themselves, protein aggregation could also be caused indirectly through mutations in proteins in regulatory pathways such as the refolding pathway (molecular
chaperones) or the
ubiquitin-proteasome pathway
Proteasomes are protein complexes which degrade unneeded or damaged proteins by proteolysis, a chemical reaction that breaks peptide bonds. Enzymes that help such reactions are called proteases.
Proteasomes are part of a major mechanism by whi ...
(ubiquitin ligases).
Chaperones help with protein refolding by providing a safe environment for the protein to fold. Ubiquitin ligases target proteins for degradation through ubiquitin modification.
Problems with protein synthesis
Protein aggregation can be caused by problems that occur during
transcription or
translation
Translation is the communication of the Meaning (linguistic), meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The ...
. During transcription, DNA is copied into mRNA, forming a strand of pre-mRNA that undergoes
RNA processing to form mRNA.
During translation,
ribosomes and
tRNA
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino a ...
help translate the mRNA sequence into an amino acid sequence.
If problems arise during either step, making an incorrect mRNA strand and/or an incorrect amino acid sequence, this can cause the protein to misfold, leading to protein aggregation.
Environmental stresses
Environmental stresses such as extreme temperatures and pH or
oxidative stress
Oxidative stress reflects an imbalance between the systemic manifestation of reactive oxygen species and a biological system's ability to readily detoxify the reactive intermediates or to repair the resulting damage. Disturbances in the normal re ...
can also lead to protein aggregation.
One such disease is
cryoglobulinemia.
Extreme temperatures can weaken and destabilize the non-covalent interactions between the amino acid residues. pHs outside of the protein's pH range can change the protonation state of the amino acids, which can increase or decrease the non-covalent interactions. This can also lead to less stable interactions and result in protein unfolding.
Oxidative stress can be caused by radicals such as
reactive oxygen species
In chemistry, reactive oxygen species (ROS) are highly reactive chemicals formed from diatomic oxygen (). Examples of ROS include peroxides, superoxide, hydroxyl radical, singlet oxygen, and alpha-oxygen.
The reduction of molecular oxygen ...
(ROS). These unstable radicals can attack the amino acid residues, leading to oxidation of side chains (e.g.
aromatic
In chemistry, aromaticity is a chemical property of cyclic (ring-shaped), ''typically'' planar (flat) molecular structures with pi bonds in resonance (those containing delocalized electrons) that gives increased stability compared to sat ...
side chains,
methionine side chains) and/or cleavage of the polypeptide bonds. This can affect the non-covalent interactions that hold the protein together correctly, which can cause protein destabilization, and may cause the protein to unfold.
Aging
Cells have mechanisms that can refold or degrade protein aggregates. However, as cells age, these control mechanisms are weakened and the cell is less able to resolve the aggregates.
The hypothesis that protein aggregation is a causative process in aging is testable now since some models of delayed aging are in hand. If the development of protein aggregates was an aging independent process, slowing down aging will show no
effect on the rate of
proteotoxicity over time. However, if aging is associated
with decline in the activity of protective mechanisms against proteotoxicity,
the slow aging models would show reduced aggregation and proteotoxicity. To
address this problem several toxicity assays have been done in ''C. elegans''.
These studies indicated that reducing the activity of insulin/IGF signaling
(IIS), a prominent aging regulatory pathway protects from
neurodegeneration-linked toxic protein aggregation. The validity of this approach
has been tested and confirmed in mammals as reducing the activity of the IGF-1
signaling pathway protected Alzheimer's model mice from the behavioral and
biochemical impairments associated with the disease.
Aggregate localization
Several studies have shown that cellular responses to protein aggregation are well-regulated and organized. Protein aggregates localize to specific areas in the cell, and research has been done on these localizations in prokaryotes (E.coli) and eukaryotes (yeast, mammalian cells).
Bacteria
The aggregates in bacteria asymmetrically end up at one of the poles of the cell, the "older pole." After the cell divides, the daughter cells with the older pole gets the protein aggregate and grows more slowly than daughter cells without the aggregate. This provides a natural selection mechanism for reducing protein aggregates in the bacterial population.
Yeast

Most of the protein aggregates in yeast cells get refolded by molecular chaperones. However, some aggregates, such as the oxidatively damaged proteins or the proteins marked for degradation, cannot be refolded. Rather, there are two compartments that they can end up in. Protein aggregates can be localized at the Juxtanuclear quality-control compartment (
JUNQ), which is near the nuclear membrane, or at the Insoluble Protein deposit (
IPOD
The iPod is a discontinued series of portable media players and multi-purpose mobile devices designed and marketed by Apple Inc. The first version was released on October 23, 2001, about months after the Macintosh version of iTunes ...
), near the vacuole in yeast cells.
Protein aggregates localize at JUNQ when they are ubiquitinated and targeted for degradation. The aggregated and insoluble proteins localize at IPOD as a more permanent deposition. There is evidence that the proteins here may be removed by autophagy.
These two pathways work together in that the proteins tend to come to the IPOD when the proteasome pathway is being overworked.
Mammalian cells
In mammalian cells, these protein aggregates are termed "aggresomes" and they are formed when the cell is diseased. This is because aggregates tend to form when there are
heterologous proteins present in the cell, which can arise when the cell is mutated. Different mutates of the same protein may form aggresomes of different morphologies, ranging from diffuse dispersion of soluble species to large puncta, which in turn bear different pathogenicity. The E3 ubiquitin ligase is able to recognize misfolded proteins and ubiquinate them. HDAC6 can then bind to the ubiquitin and the motor protein
dynein
Dyneins are a family of cytoskeletal motor proteins that move along microtubules in cells. They convert the chemical energy stored in ATP to mechanical work. Dynein transports various cellular cargos, provides forces and displacements importa ...
to bring the marked aggregates to the microtubule organizing center (
MTOC). There, they pack together into a sphere that surrounds the MTOC. They bring over chaperones and proteasomes and activate autophagy.
Elimination
There are two main protein quality control systems in the cell that are responsible for eliminating protein aggregates. Misfolded proteins can get refolded by the bi-chaperone system or degraded by the ubiquitin proteasome system or autophagy.
Refolding
The bi-chaperone system utilizes the Hsp70 (DnaK-DnaJ-GrpE in E. coli and Ssa1-Ydj1/Sis1-Sse1/Fe1 in yeast) and Hsp100 (ClpB in E. coli and Hsp104 in yeast) chaperones for protein disaggregation and refolding.
Hsp70 interacts with the protein aggregates and recruits Hsp100. Hsp70 stabilizes an activated Hsp100. Hsp100 proteins have aromatic pore loops that are used for threading activity to disentangle single polypeptides. This threading activity can be initiated at the N-terminus, C-terminus or in the middle of the polypeptide. The polypeptide gets translocated through Hsp100 in a series of steps, utilizing an ATP at each step.
The polypeptide unfolds and is then allowed to refold either by itself or with the help of heat shock proteins.
Degradation
Misfolded proteins can be eliminated through the ubiquitin-proteasome system (
UPS). This consists of an E1-E2-E3 pathway that ubiquinates proteins to mark them for degradation. In eukaryotes, the proteins get degraded by the 26S proteasome. In mammalian cells, the E3 ligase, carboxy-terminal Hsp70 interacting protein (CHIP), targets Hsp70-bound proteins. In yeast, the E3 ligases Doa10 and Hrd1 have similar functions on
endoplasmic reticulum
The endoplasmic reticulum (ER) is, in essence, the transportation system of the eukaryotic cell, and has many other important functions such as protein folding. It is a type of organelle made up of two subunits – rough endoplasmic reticulum ( ...
proteins.
On the molecular level, degradation rate of aggregates vary from protein to protein due to their different internal environments, and thus different accessibility for protease molecules.

Misfolded proteins can also be eliminated through autophagy, in which the protein aggregates are delivered to the lysosome.
Toxicity
Although it has been thought that the mature protein aggregates themselves are toxic, evidence suggests that it is in fact immature protein aggregates that are most toxic.
The hydrophobic patches of these aggregates can interact with other components of the cell and damage them. The hypotheses are that the toxicity of protein aggregates is related to mechanisms of the sequestration of cellular components, the generation of reactive oxygen species and the binding to specific receptors in the membrane or through the disruption of membranes. A quantitative assay has been used to determine that higher molecular weight species are responsible for the membrane permeation.
It is known that protein aggregates in vitro can destabilize artificial phospholipid bilayers, leading to permeabilization of the membrane.
In biomanufacturing
Protein aggregation is also a common phenomenon in the biopharmaceutical manufacturing process, which may pose risks to patients via generating adverse immune responses.
[{{Cite journal, last1=Vázquez-Rey, first1=María, last2=Lang, first2=Dietmar, date=2011-04-07, title=Aggregates in monoclonal antibody manufacturing processes, volume=108, pages=1494-1508, doi=10.1002/bit.23155]
See also
*
Amyloid
Amyloids are aggregates of proteins characterised by a fibrillar morphology of 7–13 nm in diameter, a beta sheet (β-sheet) secondary structure (known as cross-β) and ability to be stained by particular dyes, such as Congo red. In the hu ...
*
Proteopathy
*
Amyloidosis
Amyloidosis is a group of diseases in which abnormal proteins, known as amyloid fibrils, build up in tissue. There are several non-specific and vague signs and symptoms associated with amyloidosis. These include fatigue, peripheral edema, weig ...
*
JUNQ and IPOD
*
Protein aggregation predictors
References
Alzheimer's disease
Neurodegenerative disorders
Neurological disorders
Structural proteins