
A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an
enzyme
An enzyme () is a protein that acts as a biological catalyst by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different mol ...
that
catalyzes
Catalysis () is the increase in rate of a chemical reaction due to an added substance known as a catalyst (). Catalysts are not consumed by the reaction and remain unchanged after it. If the reaction is rapid and the catalyst recycles quick ...
proteolysis
Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Protein degradation is a major regulatory mechanism of gene expression and contributes substantially to shaping mammalian proteomes. Uncatalysed, the hydrolysis o ...
, breaking down
proteins
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, re ...
into smaller
polypeptide
Peptides are short chains of amino acids linked by peptide bonds. A polypeptide is a longer, continuous, unbranched peptide chain. Polypeptides that have a molecular mass of 10,000 Da or more are called proteins. Chains of fewer than twenty ...
s or single
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
s, and spurring the formation of new protein products.
They do this by cleaving the
peptide bonds
In organic chemistry, a peptide bond is an amide type of covalent chemical bond linking two consecutive alpha-amino acids from C1 (carbon number one) of one alpha-amino acid and N2 (nitrogen number two) of another, along a peptide or protein chai ...
within proteins by
hydrolysis
Hydrolysis (; ) is any chemical reaction in which a molecule of water breaks one or more chemical bonds. The term is used broadly for substitution reaction, substitution, elimination reaction, elimination, and solvation reactions in which water ...
, a reaction where
water
Water is an inorganic compound with the chemical formula . It is a transparent, tasteless, odorless, and Color of water, nearly colorless chemical substance. It is the main constituent of Earth's hydrosphere and the fluids of all known liv ...
breaks
bonds. Proteases are involved in numerous biological pathways, including
digestion
Digestion is the breakdown of large insoluble food compounds into small water-soluble components so that they can be absorbed into the blood plasma. In certain organisms, these smaller substances are absorbed through the small intestine into th ...
of ingested proteins,
protein catabolism (breakdown of old proteins),
and
cell signaling
In biology, cell signaling (cell signalling in British English) is the Biological process, process by which a Cell (biology), cell interacts with itself, other cells, and the environment. Cell signaling is a fundamental property of all Cell (biol ...
.
In the absence of functional accelerants, proteolysis would be very slow, taking hundreds of
year
A year is a unit of time based on how long it takes the Earth to orbit the Sun. In scientific use, the tropical year (approximately 365 Synodic day, solar days, 5 hours, 48 minutes, 45 seconds) and the sidereal year (about 20 minutes longer) ...
s. Proteases can be found in all forms of life and
virus
A virus is a submicroscopic infectious agent that replicates only inside the living Cell (biology), cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are ...
es. They have independently
evolved multiple times, and different classes of protease can perform the same reaction by completely different
catalytic mechanism
Enzyme catalysis is the increase in the rate of a process by an "enzyme", a biological molecule. Most enzymes are proteins, and most such processes are chemical reactions. Within the enzyme, generally catalysis occurs at a localized site, calle ...
s.
Classification
Based on catalytic residue
Proteases can be classified into seven broad groups:
*
Serine protease
Serine proteases (or serine endopeptidases) are enzymes that cleave peptide bonds in proteins. Serine serves as the nucleophilic amino acid at the (enzyme's) active site.
They are found ubiquitously in both eukaryotes and prokaryotes. Serin ...
s - using a
serine
Serine
(symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
alcohol
Alcohol may refer to:
Common uses
* Alcohol (chemistry), a class of compounds
* Ethanol, one of several alcohols, commonly known as alcohol in everyday life
** Alcohol (drug), intoxicant found in alcoholic beverages
** Alcoholic beverage, an alco ...
*
Cysteine protease
Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad.
Discovered by Gopal Chu ...
s - using a
cysteine
Cysteine (; symbol Cys or C) is a semiessential proteinogenic amino acid with the chemical formula, formula . The thiol side chain in cysteine enables the formation of Disulfide, disulfide bonds, and often participates in enzymatic reactions as ...
thiol
In organic chemistry, a thiol (; ), or thiol derivative, is any organosulfur compound of the form , where R represents an alkyl or other organic substituent. The functional group itself is referred to as either a thiol group or a sulfhydryl grou ...
*
Threonine proteases - using a
threonine
Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form when dissolved in water), a carboxyl group (which is in the deprotonated −COO− ...
secondary alcohol
In chemistry, an alcohol (), is a type of organic compound that carries at least one hydroxyl () functional group bound to a Saturated and unsaturated compounds, saturated carbon atom. Alcohols range from the simple, like methanol and ethanol ...
*
Aspartic protease
Aspartic proteases (also "aspartyl proteases", "aspartic endopeptidases") are a catalytic type of protease enzymes that use an activated water molecule bound to one or more aspartate residues for catalysis of their peptide substrates. In general, ...
s - using an
aspartate
Aspartic acid (symbol Asp or D; the ionic form is known as aspartate), is an α-amino acid that is used in the biosynthesis of proteins. The L-isomer of aspartic acid is one of the 22 proteinogenic amino acids, i.e., the building blocks of protein ...
carboxylic acid
In organic chemistry, a carboxylic acid is an organic acid that contains a carboxyl group () attached to an Substituent, R-group. The general formula of a carboxylic acid is often written as or , sometimes as with R referring to an organyl ...
*
Glutamic protease
Glutamic proteases are a group of proteolytic enzymes containing a glutamic acid residue within the active site. This type of protease was first described in 2004 and became the sixth catalytic type of protease. Members of this group of protease h ...
s - using a
glutamate
Glutamic acid (symbol Glu or E; known as glutamate in its anionic form) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a Essential amino acid, non-essential nutrient for humans, meaning that ...
carboxylic acid
In organic chemistry, a carboxylic acid is an organic acid that contains a carboxyl group () attached to an Substituent, R-group. The general formula of a carboxylic acid is often written as or , sometimes as with R referring to an organyl ...
*
Metalloproteases - using a
metal
A metal () is a material that, when polished or fractured, shows a lustrous appearance, and conducts electrical resistivity and conductivity, electricity and thermal conductivity, heat relatively well. These properties are all associated wit ...
, usually
zinc
Zinc is a chemical element; it has symbol Zn and atomic number 30. It is a slightly brittle metal at room temperature and has a shiny-greyish appearance when oxidation is removed. It is the first element in group 12 (IIB) of the periodic tabl ...
*
Asparagine peptide lyases - using an
asparagine
Asparagine (symbol Asn or N) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depro ...
to perform an
elimination reaction
An elimination reaction is a type of organic reaction in which two substituents are removed from a molecule in either a one- or two-step mechanism. The one-step mechanism is known as the E2 reaction, and the two-step mechanism is known as the E1 r ...
(not requiring water)
Proteases were first grouped into 84 families according to their evolutionary relationship in 1993, and classified under four catalytic types:
serine
Serine
(symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
,
cysteine
Cysteine (; symbol Cys or C) is a semiessential proteinogenic amino acid with the chemical formula, formula . The thiol side chain in cysteine enables the formation of Disulfide, disulfide bonds, and often participates in enzymatic reactions as ...
,
aspartic, and
metallo
Metallo () is the name of different supervillains appearing in American comic books published by DC Comics, commonly as an List of Superman enemies, adversary of Superman. All versions of the character are powered by kryptonite and are partially ...
proteases. The
threonine
Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form when dissolved in water), a carboxyl group (which is in the deprotonated −COO− ...
and
glutamic proteases were not described until 1995 and 2004 respectively. The mechanism used to cleave a
peptide bond
In organic chemistry, a peptide bond is an amide type of covalent chemical bond linking two consecutive alpha-amino acids from C1 (carbon number one) of one alpha-amino acid and N2 (nitrogen number two) of another, along a peptide or protein cha ...
involves making an
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
residue that has the cysteine and threonine (proteases) or a water molecule (aspartic, glutamic and metalloproteases) nucleophilic so that it can attack the peptide
carbonyl
In organic chemistry, a carbonyl group is a functional group with the formula , composed of a carbon atom double bond, double-bonded to an oxygen atom, and it is divalent at the C atom. It is common to several classes of organic compounds (such a ...
group. One way to make a
nucleophile
In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they are ...
is by a
catalytic triad
A catalytic triad is a set of three coordinated amino acid residues that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, aminoac ...
, where a
histidine
Histidine (symbol His or H) is an essential amino acid that is used in the biosynthesis of proteins. It contains an Amine, α-amino group (which is in the protonated –NH3+ form under Physiological condition, biological conditions), a carboxylic ...
residue is used to activate
serine
Serine
(symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
,
cysteine
Cysteine (; symbol Cys or C) is a semiessential proteinogenic amino acid with the chemical formula, formula . The thiol side chain in cysteine enables the formation of Disulfide, disulfide bonds, and often participates in enzymatic reactions as ...
, or
threonine
Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form when dissolved in water), a carboxyl group (which is in the deprotonated −COO− ...
as a nucleophile. This is not an evolutionary grouping, however, as the nucleophile types have
evolved convergently in different
superfamilies, and some superfamilies show divergent evolution to multiple different nucleophiles. Metalloproteases, aspartic, and glutamic proteases utilize their active site residues to activate a water molecule, which then attacks the scissile bond.
Peptide lyases
A seventh catalytic type of proteolytic enzymes,
asparagine peptide lyase, was described in 2011. Its proteolytic mechanism is unusual since, rather than
hydrolysis
Hydrolysis (; ) is any chemical reaction in which a molecule of water breaks one or more chemical bonds. The term is used broadly for substitution reaction, substitution, elimination reaction, elimination, and solvation reactions in which water ...
, it performs an
elimination reaction
An elimination reaction is a type of organic reaction in which two substituents are removed from a molecule in either a one- or two-step mechanism. The one-step mechanism is known as the E2 reaction, and the two-step mechanism is known as the E1 r ...
.
During this reaction, the catalytic
asparagine
Asparagine (symbol Asn or N) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depro ...
forms a cyclic chemical structure that cleaves itself at asparagine residues in proteins under the right conditions. Given its fundamentally different mechanism, its inclusion as a peptidase may be debatable.
Based on evolutionary phylogeny
An up-to-date classification of protease evolutionary
superfamilies is found in the MEROPS database.
In this database, proteases are classified firstly by 'clan' (
superfamily) based on structure, mechanism and catalytic residue order (e.g. the
PA clan where P indicates a mixture of nucleophile families). Within each 'clan', proteases are classified into
families
Family (from ) is a group of people related either by consanguinity (by recognized birth) or affinity (by marriage or other relationship). It forms the basis for social order. Ideally, families offer predictability, structure, and safety as ...
based on sequence similarity (e.g. the S1 and C3 families within the PA clan). Each family may contain many hundreds of related proteases (e.g.
trypsin
Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the dig ...
,
elastase,
thrombin
Prothrombin (coagulation factor II) is encoded in the human by the F2-gene. It is proteolytically cleaved during the clotting process by the prothrombinase enzyme complex to form thrombin.
Thrombin (Factor IIa) (, fibrose, thrombase, throm ...
and
streptogrisin within the S1 family).
Currently more than 50 clans are known, each indicating an independent evolutionary origin of proteolysis.
Based on optimal pH
Alternatively, proteases may be classified by the optimal
pH in which they are active:
*''Acid proteases''
*''Neutral proteases'' involved in
type 1 hypersensitivity. Here, it is released by
mast cell
A mast cell (also known as a mastocyte or a labrocyte) is a resident cell of connective tissue that contains many granules rich in histamine and heparin. Specifically, it is a type of granulocyte derived from the myeloid stem cell that is a p ...
s and causes activation of
complement and
kinins.
This group includes the
calpains.
*''
Basic proteases'' (or ''alkaline proteases'')
Enzymatic function and mechanism

Proteases are involved in
digesting long protein chains into shorter fragments by splitting the
peptide bonds
In organic chemistry, a peptide bond is an amide type of covalent chemical bond linking two consecutive alpha-amino acids from C1 (carbon number one) of one alpha-amino acid and N2 (nitrogen number two) of another, along a peptide or protein chai ...
that link
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
residues. Some detach the terminal amino acids from the protein chain (
exopeptidases, such as
aminopeptidase
Aminopeptidases are enzymes that catalyze the cleavage of amino acids from the N-terminus (beginning), of proteins or peptides. They are found in many organisms; in the cell, they are found in many organelles, in the cytosol (internal cellular f ...
s,
carboxypeptidase A); others attack internal peptide bonds of a protein (
endopeptidases
Endopeptidase or endoproteinase are proteolytic peptidases that break peptide bonds of nonterminal amino acids (i.e. within the molecule), in contrast to exopeptidases, which break peptide bonds from end-pieces of terminal amino acids. For this ...
, such as
trypsin
Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the dig ...
,
chymotrypsin,
pepsin
Pepsin is an endopeptidase that breaks down proteins into smaller peptides and amino acids. It is one of the main digestive enzymes in the digestive systems of humans and many other animals, where it helps digest the proteins in food. Pe ...
,
papain
Papain, also known as papaya proteinase I, is a cysteine protease () enzyme present in papaya (''Carica papaya'') and mountain papaya (''Vasconcellea cundinamarcensis''). It is the namesake member of the papain-like protease family.
It has wi ...
,
elastase).
Catalysis
Catalysis
Catalysis () is the increase in rate of a chemical reaction due to an added substance known as a catalyst (). Catalysts are not consumed by the reaction and remain unchanged after it. If the reaction is rapid and the catalyst recycles quick ...
is achieved by one of two mechanisms:
*Aspartic, glutamic, and metallo-proteases activate a water molecule, which performs a nucleophilic attack on the peptide bond to hydrolyze it.
*Serine, threonine, and cysteine proteases use a nucleophilic residue (usually in a
catalytic triad
A catalytic triad is a set of three coordinated amino acid residues that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, aminoac ...
). That residue performs a nucleophilic attack to
covalent
A covalent bond is a chemical bond that involves the sharing of electrons to form electron pairs between atoms. These electron pairs are known as shared pairs or bonding pairs. The stable balance of attractive and repulsive forces between atom ...
ly link the protease to the substrate protein, releasing the first half of the product. This covalent acyl-enzyme intermediate is then hydrolyzed by activated water to complete catalysis by releasing the second half of the product and regenerating the free enzyme
Specificity
Proteolysis can be highly
promiscuous
Promiscuity is the practice of engaging in sexual activity frequently with different partners or being indiscriminate in the choice of sexual partners. The term can carry a moral judgment. A common example of behavior viewed as promiscuous by man ...
such that a wide range of protein substrates are hydrolyzed. This is the case for digestive enzymes such as
trypsin
Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the dig ...
, which have to be able to cleave the array of proteins ingested into smaller peptide fragments. Promiscuous proteases typically bind to a single
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
on the substrate and so only have specificity for that residue. For example,
trypsin
Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the dig ...
is specific for the sequences ...K\... or ...R\... ('\'=cleavage site).
Conversely some proteases are highly specific and only cleave substrates with a certain sequence. Blood clotting (such as
thrombin
Prothrombin (coagulation factor II) is encoded in the human by the F2-gene. It is proteolytically cleaved during the clotting process by the prothrombinase enzyme complex to form thrombin.
Thrombin (Factor IIa) (, fibrose, thrombase, throm ...
) and viral polyprotein processing (such as
TEV protease) requires this level of specificity in order to achieve precise cleavage events. This is achieved by proteases having a long binding cleft or tunnel with several pockets that bind to specified residues. For example,
TEV protease is specific for the sequence ...ENLYFQ\S... ('\'=cleavage site).
Degradation and autolysis
Proteases, being themselves proteins, are cleaved by other protease molecules, sometimes of the same variety. This acts as a method of regulation of protease activity. Some proteases are less active after autolysis (e.g.
TEV protease) whilst others are more active (e.g.
trypsinogen).
Biodiversity of proteases
Proteases occur in all organisms, from
prokaryote
A prokaryote (; less commonly spelled procaryote) is a unicellular organism, single-celled organism whose cell (biology), cell lacks a cell nucleus, nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Ancient Gree ...
s to
eukaryote
The eukaryotes ( ) constitute the Domain (biology), domain of Eukaryota or Eukarya, organisms whose Cell (biology), cells have a membrane-bound cell nucleus, nucleus. All animals, plants, Fungus, fungi, seaweeds, and many unicellular organisms ...
s to
viruses
A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are found in almo ...
. These enzymes are involved in a multitude of physiological reactions from simple digestion of food proteins to highly regulated cascades (e.g., the
blood-clotting
cascade, the
complement system
The complement system, also known as complement cascade, is a part of the humoral, innate immune system and enhances (complements) the ability of antibodies and phagocytic cells to clear microbes and damaged cells from an organism, promote inf ...
,
apoptosis
Apoptosis (from ) is a form of programmed cell death that occurs in multicellular organisms and in some eukaryotic, single-celled microorganisms such as yeast. Biochemistry, Biochemical events lead to characteristic cell changes (Morphology (biol ...
pathways, and the invertebrate prophenoloxidase-activating cascade). Proteases can either break specific peptide bonds (''limited proteolysis''), depending on the
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
sequence of a protein, or completely break down a peptide to amino acids (''unlimited proteolysis''). The activity can be a destructive change (abolishing a protein's function or digesting it to its principal components), it can be an activation of a function, or it can be a signal in a signalling pathway.
Plants
Plant genomes encode hundreds of proteases, largely of unknown function. Those with known function are largely involved in
developmental regulation. Plant proteases also play a role in regulation of
photosynthesis
Photosynthesis ( ) is a system of biological processes by which photosynthetic organisms, such as most plants, algae, and cyanobacteria, convert light energy, typically from sunlight, into the chemical energy necessary to fuel their metabo ...
.
Animals
Proteases are used throughout an organism for various metabolic processes. Acid proteases secreted into the stomach (such as
pepsin
Pepsin is an endopeptidase that breaks down proteins into smaller peptides and amino acids. It is one of the main digestive enzymes in the digestive systems of humans and many other animals, where it helps digest the proteins in food. Pe ...
) and serine proteases present in the
duodenum
The duodenum is the first section of the small intestine in most vertebrates, including mammals, reptiles, and birds. In mammals, it may be the principal site for iron absorption.
The duodenum precedes the jejunum and ileum and is the shortest p ...
(
trypsin
Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the dig ...
and
chymotrypsin) enable the digestion of protein in food. Proteases present in blood serum (
thrombin
Prothrombin (coagulation factor II) is encoded in the human by the F2-gene. It is proteolytically cleaved during the clotting process by the prothrombinase enzyme complex to form thrombin.
Thrombin (Factor IIa) (, fibrose, thrombase, throm ...
,
plasmin
Plasmin is an important enzyme () present in blood that degrades many blood plasma proteins, including fibrin thrombus, clots. The degradation of fibrin is termed fibrinolysis. In humans, the plasmin protein (in the zymogen form of plasminogen) i ...
,
Hageman factor, etc.) play an important role in blood-clotting, as well as lysis of the clots, and the correct action of the immune system. Other proteases are present in leukocytes (
elastase,
cathepsin G) and play several different roles in metabolic control. Some
snake venoms are also proteases, such as
pit viper
The Crotalinae, commonly known as pit vipers,Mehrtens JM (1987). ''Living Snakes of the World in Color''. New York: Sterling Publishers. 480 pp. . or pit adders, are a subfamily (biology), subfamily of Viperidae, vipers found in Asia and the ...
haemotoxin and interfere with the victim's blood clotting cascade. Proteases determine the lifetime of other proteins playing important physiological roles like hormones, antibodies, or other enzymes. This is one of the fastest "switching on" and "switching off" regulatory mechanisms in the physiology of an organism.
By a complex cooperative action, proteases can catalyze
cascade
Cascade, or Cascading may refer to:
Science and technology Science
* Air shower (physics), a cascade (particle shower) of subatomic particles and ionized nuclei
** Particle shower, a cascade of secondary particles produced as the result of a high ...
reactions, which result in rapid and efficient amplification of an organism's response to a physiological signal.
Bacteria
Bacteria
Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
secrete proteases to
hydrolyse the peptide bonds in proteins and therefore break the proteins down into their constituent
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
s. Bacterial and fungal proteases are particularly important to the global
carbon
Carbon () is a chemical element; it has chemical symbol, symbol C and atomic number 6. It is nonmetallic and tetravalence, tetravalent—meaning that its atoms are able to form up to four covalent bonds due to its valence shell exhibiting 4 ...
and
nitrogen
Nitrogen is a chemical element; it has Symbol (chemistry), symbol N and atomic number 7. Nitrogen is a Nonmetal (chemistry), nonmetal and the lightest member of pnictogen, group 15 of the periodic table, often called the Pnictogen, pnictogens. ...
cycles in the recycling of proteins, and such activity tends to be regulated by nutritional signals in these organisms. The net impact of nutritional regulation of protease activity among the thousands of species present in soil can be observed at the overall microbial community level as proteins are broken down in response to carbon, nitrogen, or sulfur limitation.
Bacteria contain proteases responsible for general protein quality control (e.g. the AAA+
proteasome
Proteasomes are essential protein complexes responsible for the degradation of proteins by proteolysis, a chemical reaction that breaks peptide bonds. Enzymes that help such reactions are called proteases. Proteasomes are found inside all e ...
) by degrading
unfolded or misfolded proteins.
A secreted bacterial protease may also act as an exotoxin, and be an example of a
virulence factor
Virulence factors (preferably known as pathogenicity factors or effectors in botany) are cellular structures, molecules and regulatory systems that enable microbial pathogens (bacteria, viruses, fungi, and protozoa) to achieve the following:
* c ...
in bacterial
pathogenesis
In pathology, pathogenesis is the process by which a disease or disorder develops. It can include factors which contribute not only to the onset of the disease or disorder, but also to its progression and maintenance. The word comes .
Descript ...
(for example,
exfoliative toxin). Bacterial exotoxic proteases destroy extracellular structures.
Viruses
The genomes of some
viruses
A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are found in almo ...
encode one massive
polyprotein, which needs a protease to cleave this into functional units (e.g. the
hepatitis C virus
The hepatitis C virus (HCV) is a small (55–65 nm in size), enveloped, positive-sense single-stranded RNA virus of the family ''Flaviviridae''. The hepatitis C virus is the cause of hepatitis C and some cancers such as liver cancer ( hepatoc ...
and the
picornavirus
Picornaviruses are a group of related Viral envelope, nonenveloped RNA viruses which infect vertebrates including fish, mammals, and birds. They are viruses that represent a large family of small, Positive-sense single-stranded RNA virus, positi ...
es). These proteases (e.g.
TEV protease) have high specificity and only cleave a very restricted set of substrate sequences. They are therefore a common target for
protease inhibitors.
Archaea
Archaea
Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
use proteases to regulate various cellular processes from
cell-signaling,
metabolism
Metabolism (, from ''metabolē'', "change") is the set of life-sustaining chemical reactions in organisms. The three main functions of metabolism are: the conversion of the energy in food to energy available to run cellular processes; the co ...
,
secretion
Secretion is the movement of material from one point to another, such as a secreted chemical substance from a cell or gland. In contrast, excretion is the removal of certain substances or waste products from a cell or organism. The classical mec ...
and protein quality control.
Only two ATP-dependent proteases are found in archaea: the membrane associated LonB protease and a soluble
20S proteosome complex .
Tumours
Proteases are associated with
cancer progression due to their ability to degrade
extracellular matrices, which facilitates
invasion
An invasion is a Offensive (military), military offensive of combatants of one geopolitics, geopolitical Legal entity, entity, usually in large numbers, entering territory (country subdivision), territory controlled by another similar entity, ...
and
metastasis
Metastasis is a pathogenic agent's spreading from an initial or primary site to a different or secondary site within the host's body; the term is typically used when referring to metastasis by a cancerous tumor. The newly pathological sites, ...
; these enzymes target a diversity of substrates and favour all steps of tumour production; some proteases have
tumour-suppressive effects, associated with more than 30 different enzymes that belong to three distinct protease classes.
Uses
The field of protease research is enormous. Since 2004, approximately 8000
papers related to this field were published each year. Proteases are used in industry,
medicine
Medicine is the science and Praxis (process), practice of caring for patients, managing the Medical diagnosis, diagnosis, prognosis, Preventive medicine, prevention, therapy, treatment, Palliative care, palliation of their injury or disease, ...
and as a basic biological research tool.
Digestive proteases are part of many
laundry detergent
Laundry detergent is a type of detergent (cleaning agent) used for cleaning dirty laundry (clothes). Laundry detergent is manufactured in powder (washing powder) and liquid form.
While powdered and liquid detergents hold roughly equal share of ...
s and are also used extensively in the bread industry in
bread improver. A variety of proteases are used medically both for their native function (e.g. controlling blood clotting) or for completely artificial functions (''e.g.'' for the targeted degradation of pathogenic proteins). Highly specific proteases such as
TEV protease and
thrombin
Prothrombin (coagulation factor II) is encoded in the human by the F2-gene. It is proteolytically cleaved during the clotting process by the prothrombinase enzyme complex to form thrombin.
Thrombin (Factor IIa) (, fibrose, thrombase, throm ...
are commonly used to cleave
fusion proteins and
affinity tags in a controlled fashion.
Protease-containing plant-solutions called ''
vegetarian rennet'' have been in use for hundreds of years in
Europe
Europe is a continent located entirely in the Northern Hemisphere and mostly in the Eastern Hemisphere. It is bordered by the Arctic Ocean to the north, the Atlantic Ocean to the west, the Mediterranean Sea to the south, and Asia to the east ...
and the
Middle East
The Middle East (term originally coined in English language) is a geopolitical region encompassing the Arabian Peninsula, the Levant, Turkey, Egypt, Iran, and Iraq.
The term came into widespread usage by the United Kingdom and western Eur ...
for making
kosher and halal Cheeses. Vegetarian rennet from ''
Withania coagulans'' has been in use for thousands of years as a
Ayurvedic remedy for digestion and diabetes in the Indian subcontinent. It is also used to make
Paneer.
Inhibitors
The activity of proteases is inhibited by
protease inhibitors. One example of protease inhibitors is the
serpin
Serpins are a superfamily of proteins with similar structures that were first identified for their protease inhibition activity and are found in all kingdoms of life. The acronym serpin was originally coined because the first serpins to be ...
superfamily. It includes
alpha 1-antitrypsin
Alpha-1 antitrypsin or α1-antitrypsin (A1AT, α1AT, A1A, or AAT) is a protein belonging to the serpin superfamily. It is encoded in humans by the ''SERPINA1'' gene. A protease inhibitor, it is also known as alpha1–proteinase inhibitor (A1P ...
(which protects the body from excessive effects of its own
inflammatory proteases),
alpha 1-antichymotrypsin (which does likewise),
C1-inhibitor (which protects the body from excessive protease-triggered activation of its own
complement system
The complement system, also known as complement cascade, is a part of the humoral, innate immune system and enhances (complements) the ability of antibodies and phagocytic cells to clear microbes and damaged cells from an organism, promote inf ...
),
antithrombin (which protects the body from excessive
coagulation
Coagulation, also known as clotting, is the process by which blood changes from a liquid to a gel, forming a thrombus, blood clot. It results in hemostasis, the cessation of blood loss from a damaged vessel, followed by repair. The process of co ...
),
plasminogen activator inhibitor-1
Plasminogen activator inhibitor-1 (PAI-1) also known as endothelial plasminogen activator inhibitor (serpin E1) is a protein that in humans is encoded by the ''SERPINE1'' gene. Elevated PAI-1 is a risk factor for thrombosis and atherosclerosis.
...
(which protects the body from inadequate coagulation by blocking protease-triggered
fibrinolysis
Fibrinolysis is a process that prevents blood clots from growing and becoming problematic. Primary fibrinolysis is a normal body process, while secondary fibrinolysis is the breakdown of clots due to a medicine, a medical disorder, or some other c ...
), and
neuroserpin.
Natural protease inhibitors include the family of
lipocalin proteins, which play a role in cell regulation and differentiation.
Lipophilic
Lipophilicity (from Greek language, Greek λίπος "fat" and :wikt:φίλος, φίλος "friendly") is the ability of a chemical compound to dissolve in fats, oils, lipids, and non-polar solvents such as hexane or toluene. Such compounds are c ...
ligands, attached to lipocalin proteins, have been found to possess tumor protease inhibiting properties. The natural
protease inhibitors are not to be confused with the
protease inhibitors used in antiretroviral therapy. Some
virus
A virus is a submicroscopic infectious agent that replicates only inside the living Cell (biology), cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are ...
es, with
HIV/AIDS
The HIV, human immunodeficiency virus (HIV) is a retrovirus that attacks the immune system. Without treatment, it can lead to a spectrum of conditions including acquired immunodeficiency syndrome (AIDS). It is a Preventive healthcare, pr ...
among them, depend on proteases in their reproductive cycle. Thus,
protease inhibitors are developed as
antiviral therapeutic agents.
Other natural protease inhibitors are used as defense mechanisms. Common examples are the
trypsin inhibitor
A trypsin inhibitor (TI) is a protein and a type of serine protease inhibitor ( serpin) that reduces the biological activity of trypsin by controlling the activation and catalytic reactions of proteins. Trypsin is an enzyme involved in the breakdow ...
s found in the seeds of some plants, most notable for humans being soybeans, a major food crop, where they act to discourage predators. Raw soybeans are
toxic
Toxicity is the degree to which a chemical substance or a particular mixture of substances can damage an organism. Toxicity can refer to the effect on a whole organism, such as an animal, bacterium, or plant, as well as the effect on a subst ...
to many animals, including humans, until the protease inhibitors they contain have been denatured.
See also
*
Ligase
In biochemistry, a ligase is an enzyme that can catalyze the joining ( ligation) of two molecules by forming a new chemical bond. This is typically via hydrolysis of a small pendant chemical group on one of the molecules, typically resulting i ...
*Protease
**
cysteine-
**
serine-
**
threonine-
**
aspartic-
**
glutamic-
**
metallo-
*
PA clan
*
Convergent evolution
Convergent evolution is the independent evolution of similar features in species of different periods or epochs in time. Convergent evolution creates analogous structures that have similar form or function but were not present in the last comm ...
*
Proteolysis
Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Protein degradation is a major regulatory mechanism of gene expression and contributes substantially to shaping mammalian proteomes. Uncatalysed, the hydrolysis o ...
*
Catalytic triad
A catalytic triad is a set of three coordinated amino acid residues that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, aminoac ...
*
The Proteolysis Map
*
Proteases in angiogenesis Angiogenesis is the process of forming new blood vessels from existing blood vessels, formed in vasculogenesis. It is a highly complex process involving extensive interplay between cells, soluble factors, and the extracellular matrix (ECM). Angioge ...
*
Intramembrane proteases
*
Protease inhibitor (pharmacology)
Protease inhibitors (PIs) are medications that act by interfering with enzymes that cleave proteins. Some of the most well known are antiviral drugs widely used to treat HIV/AIDS, hepatitis C and COVID-19. These protease inhibitors prevent vir ...
*
Protease inhibitor (biology)
In biology and biochemistry, protease inhibitors, or antiproteases, are molecules that inhibit the function of proteases (enzymes that aid proteolysis, the breakdown of proteins). Many naturally occurring protease inhibitors are proteins.
In medi ...
*
TopFIND - database of protease specificity, substrates, products and inhibitors
*
MEROPS - Database of protease evolutionary groups
References
External links
International Proteolysis SocietyMEROPS - the peptidase database
Protease cutting predictor(see als
)
Proteolysis MAP from Center for Proteolytic PathwaysProteolysis Cut Site database - curated expert annotation from usersProtease cut sites graphical interfaceTopFIND protease database covering cut sites, substrates and protein termini*
{{Authority control
Proteases,
*
Post-translational modification