Serpins
   HOME

TheInfoList



OR:

Serpins are a superfamily of
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
s with similar structures that were first identified for their protease inhibition activity and are found in all kingdoms of life. The acronym serpin was originally coined because the first serpins to be identified act on chymotrypsin-like
serine protease Serine proteases (or serine endopeptidases) are enzymes that cleave peptide bonds in proteins. Serine serves as the nucleophilic amino acid at the (enzyme's) active site. They are found ubiquitously in both eukaryotes and prokaryotes. Serin ...
s (serine protease inhibitors). They are notable for their unusual mechanism of action, in which they irreversibly inhibit their target
protease A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalysis, catalyzes proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the formation of new protein products ...
by undergoing a large
conformational change In biochemistry, a conformational change is a change in the shape of a macromolecule, often induced by environmental factors. A macromolecule is usually flexible and dynamic. Its shape can change in response to changes in its environment or othe ...
to disrupt the target's
active site In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate, the ''binding s ...
. This contrasts with the more common
competitive Competition is a rivalry where two or more parties strive for a common goal which cannot be shared: where one's gain is the other's loss (an example of which is a zero-sum game). Competition can arise between entities such as organisms, indi ...
mechanism for protease inhibitors that bind to and block access to the protease active site. Protease inhibition by serpins controls an array of biological processes, including
coagulation Coagulation, also known as clotting, is the process by which blood changes from a liquid to a gel, forming a thrombus, blood clot. It results in hemostasis, the cessation of blood loss from a damaged vessel, followed by repair. The process of co ...
and
inflammation Inflammation (from ) is part of the biological response of body tissues to harmful stimuli, such as pathogens, damaged cells, or irritants. The five cardinal signs are heat, pain, redness, swelling, and loss of function (Latin ''calor'', '' ...
, and consequently these proteins are the target of
medical research Medical research (or biomedical research), also known as health research, refers to the process of using scientific methods with the aim to produce knowledge about human diseases, the prevention and treatment of illness, and the promotion of ...
. Their unique conformational change also makes them of interest to the
structural biology Structural biology deals with structural analysis of living material (formed, composed of, and/or maintained and refined by living cells) at every level of organization. Early structural biologists throughout the 19th and early 20th centuries we ...
and
protein folding Protein folding is the physical process by which a protein, after Protein biosynthesis, synthesis by a ribosome as a linear chain of Amino acid, amino acids, changes from an unstable random coil into a more ordered protein tertiary structure, t ...
research communities. The conformational-change mechanism confers certain advantages, but it also has drawbacks: serpins are vulnerable to
mutations In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, mitosi ...
that can result in serpinopathies such as
protein misfolding In medicine, proteinopathy ( 'pref''. protein -pathy 'suff''. disease proteinopathies ''pl''.; proteinopathic ''adj''), or proteopathy, protein conformational disorder, or protein misfolding disease, is a class of diseases in which certain prote ...
and the formation of inactive long-chain
polymer A polymer () is a chemical substance, substance or material that consists of very large molecules, or macromolecules, that are constituted by many repeat unit, repeating subunits derived from one or more species of monomers. Due to their br ...
s. Serpin
polymerisation In polymer chemistry, polymerization (American English), or polymerisation (British English), is a process of reacting monomer molecules together in a chemical reaction to form polymer chains or three-dimensional networks. There are many form ...
not only reduces the amount of active inhibitor, but also leads to accumulation of the polymers, causing
cell death Cell death is the event of a biological cell ceasing to carry out its functions. This may be the result of the natural process of old cells dying and being replaced by new ones, as in programmed cell death, or may result from factors such as di ...
and
organ failure Organ dysfunction is a condition where an organ does not perform its expected function. Organ failure is organ dysfunction to such a degree that normal homeostasis cannot be maintained without external clinical intervention or life support. It i ...
. Although most serpins control
proteolytic Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Protein degradation is a major regulatory mechanism of gene expression and contributes substantially to shaping mammalian proteomes. Uncatalysed, the hydrolysis o ...
cascades, some proteins with a serpin structure are not
enzyme inhibitor An enzyme inhibitor is a molecule that binds to an enzyme and blocks its Enzyme activity, activity. Enzymes are proteins that speed up chemical reactions necessary for life, in which Substrate (biochemistry), substrate molecules are converted ...
s, but instead perform diverse functions such as storage (as in
egg white Egg white is the clear liquid (also called the albumen or the glair/glaire) contained within an egg. In chickens, it is formed from the layers of secretions of the anterior section of the hen's oviduct during the passage of the egg. It forms a ...
ovalbumin Ovalbumin (abbreviated OVA) is the main protein found in egg white, making up approximately 55% of the total protein. Ovalbumin displays sequence and three-dimensional homology to the serpin superfamily, but unlike most serpins it is not a serine ...
), transport as in hormone
carriage proteins A membrane transport protein is a membrane protein involved in the movement of ions, small molecules, and macromolecules, such as another protein, across a biological membrane. Transport proteins are integral membrane proteins, integral transmembr ...
(
thyroxine-binding globulin Thyroxine-binding globulin (TBG) is a globulin protein encoded by the ''SERPINA7'' gene in humans. TBG binds thyroid hormones in circulation. It is one of three transport proteins (along with transthyretin and serum albumin) responsible for ...
,
cortisol-binding globulin Transcortin, also known as corticosteroid-binding globulin (CBG) or serpin A6, is a protein produced in the liver in animals. In humans it is encoded by the SERPINA6 gene. It is an alpha-globulin. Function This gene encodes an alpha-globulin pr ...
) and molecular chaperoning ( HSP47). The term ''serpin'' is used to describe these members as well, despite their non-inhibitory function, since they are evolutionarily related.


History

Protease inhibitory activity in blood plasma was first reported in the late 1800s, but it was not until the 1950s that the serpins
antithrombin Antithrombin (AT) is a small glycoprotein that inactivates several enzymes of the coagulation system. It is a 464-amino-acid protein produced by the liver. It contains three disulfide bonds and a total of four possible glycosylation sites. α-An ...
and
alpha 1-antitrypsin Alpha-1 antitrypsin or α1-antitrypsin (A1AT, α1AT, A1A, or AAT) is a protein belonging to the serpin superfamily. It is encoded in humans by the ''SERPINA1'' gene. A protease inhibitor, it is also known as alpha1–proteinase inhibitor (A1P ...
were isolated, with the subsequent recognition of their close family homology in 1979. That they belonged to a new protein family became apparent on their further alignment with the non-inhibitory egg-white protein
ovalbumin Ovalbumin (abbreviated OVA) is the main protein found in egg white, making up approximately 55% of the total protein. Ovalbumin displays sequence and three-dimensional homology to the serpin superfamily, but unlike most serpins it is not a serine ...
, to give what was initially called the ''alpha1-antitrypsin-antithrombin III-ovalbumin superfamily of serine proteinase inhibitors,'' but was subsequently succinctly renamed as the Serpins. The initial characterisation of the new family centred on alpha1-antitrypsin, a serpin present in high concentration in blood plasma, the common
genetic disorder A genetic disorder is a health problem caused by one or more abnormalities in the genome. It can be caused by a mutation in a single gene (monogenic) or multiple genes (polygenic) or by a chromosome abnormality. Although polygenic disorders ...
of which was shown to cause a predisposition to the lung disease
emphysema Emphysema is any air-filled enlargement in the body's tissues. Most commonly emphysema refers to the permanent enlargement of air spaces (alveoli) in the lungs, and is also known as pulmonary emphysema. Emphysema is a lower respiratory tract di ...
and to liver
cirrhosis Cirrhosis, also known as liver cirrhosis or hepatic cirrhosis, chronic liver failure or chronic hepatic failure and end-stage liver disease, is a chronic condition of the liver in which the normal functioning tissue, or parenchyma, is replaced ...
. The identification of the S and Z mutations responsible for the genetic deficiency and the subsequent sequence alignments of alpha1-antitrypsin and antithrombin in 1982 led to the recognition of the close homologies of the active sites of the two proteins, centred on a methionine in alpha1-antitrypsin as an inhibitor of tissue elastase and on arginine in antithrombin as an inhibitor of thrombin. The critical role of the active centre residue in determining the specificity of inhibition of serpins was unequivocally confirmed by the finding that a natural mutation of the active centre methionine  in alpha1-antitrypsin to an arginine, as in antithrombin, resulted in a severe bleeding disorder. This active-centre specificity of inhibition was also evident in the many other families of protease inhibitors but the serpins differed from them in being much larger proteins and also in possessing what was soon apparent as an inherent ability to undergo a change in shape. The nature of this conformational change was revealed with the determination in 1984 of the first crystal
structure A structure is an arrangement and organization of interrelated elements in a material object or system, or the object or system so organized. Material structures include man-made objects such as buildings and machines and natural objects such as ...
of a serpin, that of post-cleavage alpha1-antitrypsin. This together with the subsequent solving of the structure of native (uncleaved) ovalbumin indicated that the inhibitory mechanism of the  serpins involved a remarkable conformational shift,  with the movement of the exposed peptide loop containing the reactive site and its incorporation as a middle strand in the main beta-pleated sheet that characterises the serpin molecule. Early evidence of the essential role of this loop movement in the inhibitory mechanism came from the finding that even minor aberrations in the amino acid residues that form the hinge of the movement in antithrombin resulted in thrombotic disease. Ultimate confirmation of the linked displacement of the target protease by this loop movement was provided  in 2000 by the structure of the post-inhibitory complex of alpha1-antitrypsin with trypsin, showing how the displacement results in the deformation and inactivation of the attached protease. Subsequent structural studies have revealed an additional advantage of the conformational mechanism in allowing the subtle modulation of inhibitory activity, as notably seen at tissue level with the functionally diverse serpins in human plasma. Over 1000 serpins have now been identified, including 36 human proteins, as well as molecules in all kingdoms of life—
animals Animals are multicellular, eukaryotic organisms in the biological kingdom Animalia (). With few exceptions, animals consume organic material, breathe oxygen, have myocytes and are able to move, can reproduce sexually, and grow from a ...
,
plants Plants are the eukaryotes that form the kingdom Plantae; they are predominantly photosynthetic. This means that they obtain their energy from sunlight, using chloroplasts derived from endosymbiosis with cyanobacteria to produce sugars f ...
,
fungi A fungus (: fungi , , , or ; or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and mold (fungus), molds, as well as the more familiar mushrooms. These organisms are classified as one ...
,
bacteria Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
, and
archaea Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
—and some
viruses A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are found in almo ...
. The central feature of all is a tightly conserved framework, which allows the precise alignment of their key structural and functional components based on the template structure of alpha1-antitrypsin. In the 2000s, a systematic nomenclature was introduced in order to categorise members of the serpin superfamily based on their evolutionary relationships. Serpins are therefore the largest and most diverse superfamily of protease inhibitors.


Activity

Most serpins are
protease A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalysis, catalyzes proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the formation of new protein products ...
inhibitors, targeting extracellular,
chymotrypsin Chymotrypsin (, chymotrypsins A and B, alpha-chymar ophth, avazyme, chymar, chymotest, enzeon, quimar, quimotrase, alpha-chymar, alpha-chymotrypsin A, alpha-chymotrypsin) is a digestive enzyme component of pancreatic juice acting in the duodenu ...
-like
serine protease Serine proteases (or serine endopeptidases) are enzymes that cleave peptide bonds in proteins. Serine serves as the nucleophilic amino acid at the (enzyme's) active site. They are found ubiquitously in both eukaryotes and prokaryotes. Serin ...
s. These proteases possess a
nucleophilic In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they a ...
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
residue in a
catalytic triad A catalytic triad is a set of three coordinated amino acid residues that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, aminoac ...
in their
active site In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate, the ''binding s ...
. Examples include
thrombin Prothrombin (coagulation factor II) is encoded in the human by the F2-gene. It is proteolytically cleaved during the clotting process by the prothrombinase enzyme complex to form thrombin. Thrombin (Factor IIa) (, fibrose, thrombase, throm ...
,
trypsin Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the dig ...
, and human neutrophil elastase. Serpins act as irreversible, suicide inhibitors by trapping an intermediate of the protease's catalytic mechanism. Some serpins inhibit other protease classes, typically
cysteine protease Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad. Discovered by Gopal Chu ...
s, and are termed "cross-class inhibitors". These enzymes differ from serine proteases in that they use a nucleophilic
cysteine Cysteine (; symbol Cys or C) is a semiessential proteinogenic amino acid with the chemical formula, formula . The thiol side chain in cysteine enables the formation of Disulfide, disulfide bonds, and often participates in enzymatic reactions as ...
residue, rather than a
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
, in their active site. Nonetheless, the enzymatic chemistry is similar, and the mechanism of inhibition by serpins is the same for both classes of protease. Examples of cross-class inhibitory serpins include serpin B4 a
squamous cell carcinoma Squamous-cell carcinoma (SCC), also known as epidermoid carcinoma, comprises a number of different types of cancer that begin in squamous cells. These cells form on the surface of the skin, on the lining of hollow organs in the body, and on the ...
antigen 1 (SCCA-1) and the avian serpin myeloid and erythroid nuclear termination stage-specific protein (MENT), which both inhibit papain-like
cysteine protease Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad. Discovered by Gopal Chu ...
s.


Biological function and localization


Protease inhibition

Approximately two-thirds of human serpins perform extracellular roles, inhibiting proteases in the bloodstream in order to modulate their activities. For example, extracellular serpins regulate the proteolytic cascades central to
blood clotting Coagulation, also known as clotting, is the process by which blood changes from a liquid to a gel, forming a thrombus, blood clot. It results in hemostasis, the cessation of blood loss from a damaged vessel, followed by repair. The process of co ...
(antithrombin), the inflammatory and immune responses (antitrypsin, antichymotrypsin, and
C1-inhibitor C1-inhibitor (C1-inh, C1 esterase inhibitor) is a protease inhibitor belonging to the serpin superfamily. Its main function is the inhibition of the complement system (C1r, C1s) to prevent spontaneous activation but also as the major regulator ...
) and tissue remodelling (PAI-1). By inhibiting signalling cascade proteases, they can also affect
development Development or developing may refer to: Arts *Development (music), the process by which thematic material is reshaped * Photographic development *Filmmaking, development phase, including finance and budgeting * Development hell, when a proje ...
. The table of human serpins (below) provides examples of the range of functions performed by human serpin, as well as some of the diseases that result from serpin deficiency. The protease targets of intracellular inhibitory serpins have been difficult to identify, since many of these molecules appear to perform overlapping roles. Further, many human serpins lack precise functional equivalents in model organisms such as the mouse. Nevertheless, an important function of intracellular serpins may be to protect against the inappropriate activity of proteases inside the cell. For example, one of the best-characterised human intracellular serpins is Serpin B9, which inhibits the
cytotoxic Cytotoxicity is the quality of being toxic to cells. Examples of toxic agents are toxic metals, toxic chemicals, microbe neurotoxins, radiation particles and even specific neurotransmitters when the system is out of balance. Also some types of dr ...
granule protease
granzyme B Granzyme B (GrB) is one of the serine protease granzymes most commonly found in the granules of natural killer cells (NK cells) and cytotoxic T cells. It is secreted by these cells along with the pore forming protein perforin to mediate apoptos ...
. In doing so, Serpin B9 may protect against inadvertent release of granzyme B and premature or unwanted activation of
cell death Cell death is the event of a biological cell ceasing to carry out its functions. This may be the result of the natural process of old cells dying and being replaced by new ones, as in programmed cell death, or may result from factors such as di ...
pathways. Some
virus A virus is a submicroscopic infectious agent that replicates only inside the living Cell (biology), cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are ...
es use serpins to disrupt protease functions in their host. The
cowpox Cowpox is an infectious disease caused by Cowpox virus (CPXV). It presents with large blisters in the skin, a fever and swollen glands, historically typically following contact with an infected cow, though in the last several decades more often ...
viral serpin CrmA (cytokine response modifier A) is used in order to avoid inflammatory and
apoptotic Apoptosis (from ) is a form of programmed cell death that occurs in multicellular organisms and in some eukaryotic, single-celled microorganisms such as yeast. Biochemical events lead to characteristic cell changes ( morphology) and death. These ...
responses of infected host cells. CrmA increases infectivity by suppressing its host's inflammatory response through inhibition of IL-1 and IL-18 processing by the cysteine protease
caspase Caspases (cysteine-aspartic proteases, cysteine aspartases or cysteine-dependent aspartate-directed proteases) are a family of protease enzymes playing essential roles in programmed cell death. They are named caspases due to their specific cyste ...
-1. In
eukaryotes The eukaryotes ( ) constitute the domain of Eukaryota or Eukarya, organisms whose cells have a membrane-bound nucleus. All animals, plants, fungi, seaweeds, and many unicellular organisms are eukaryotes. They constitute a major group of ...
, a plant serpin inhibits both
metacaspase Metacaspases are members of the C14 class of cysteine proteases and thus related to caspases, orthocaspases and paracaspases. The metacaspases are arginine/lysine-specific, in contrast to caspases, which are aspartate-specific. Structure and Phy ...
s and a papain-like cysteine protease.


Non-inhibitory roles

Non-inhibitory extracellular serpins also perform a wide array of important roles.
Thyroxine-binding globulin Thyroxine-binding globulin (TBG) is a globulin protein encoded by the ''SERPINA7'' gene in humans. TBG binds thyroid hormones in circulation. It is one of three transport proteins (along with transthyretin and serum albumin) responsible for ...
and transcortin transport the hormones
thyroxine Thyroxine, also known as T4, is a hormone produced by the thyroid gland. It is the primary form of thyroid hormone found in the blood and acts as a prohormone of the more active thyroid hormone, triiodothyronine (T3). Thyroxine and its acti ...
and
cortisol Cortisol is a steroid hormone in the glucocorticoid class of hormones and a stress hormone. When used as medication, it is known as hydrocortisone. Cortisol is produced in many animals, mainly by the ''zona fasciculata'' of the adrenal corte ...
, respectively. The non-inhibitory serpin
ovalbumin Ovalbumin (abbreviated OVA) is the main protein found in egg white, making up approximately 55% of the total protein. Ovalbumin displays sequence and three-dimensional homology to the serpin superfamily, but unlike most serpins it is not a serine ...
is the most abundant protein in
egg white Egg white is the clear liquid (also called the albumen or the glair/glaire) contained within an egg. In chickens, it is formed from the layers of secretions of the anterior section of the hen's oviduct during the passage of the egg. It forms a ...
. Its exact function is unknown, but it is thought to be a
storage protein Storage proteins serve as biological reserves of metal ions and amino acids, used by organisms. They are found in plant seeds, egg whites, and milk. Ferritin is an example of a storage protein that stores iron. Iron is a component of heme, which ...
for the developing foetus. Heat shock serpin 47 is a chaperone, essential for proper
folding Fold, folding or foldable may refer to: Arts, entertainment, and media * ''Fold'' (album), the debut release by Australian rock band Epicure * Fold (poker), in the game of poker, to discard one's hand and forfeit interest in the current pot *Abov ...
of
collagen Collagen () is the main structural protein in the extracellular matrix of the connective tissues of many animals. It is the most abundant protein in mammals, making up 25% to 35% of protein content. Amino acids are bound together to form a trip ...
. It acts by stabilising collagen's
triple helix In the fields of geometry and biochemistry, a triple helix (: triple helices) is a set of three congruent geometrical helices with the same axis, differing by a translation along the axis. This means that each of the helices keeps the same distan ...
whilst it is being processed in the
endoplasmic reticulum The endoplasmic reticulum (ER) is a part of a transportation system of the eukaryote, eukaryotic cell, and has many other important functions such as protein folding. The word endoplasmic means "within the cytoplasm", and reticulum is Latin for ...
. Some serpins are both protease inhibitors and perform additional roles. For example, the nuclear cysteine protease inhibitor MENT, in
bird Birds are a group of warm-blooded vertebrates constituting the class (biology), class Aves (), characterised by feathers, toothless beaked jaws, the Oviparity, laying of Eggshell, hard-shelled eggs, a high Metabolism, metabolic rate, a fou ...
s also acts as a chromatin remodelling molecule in a bird's
red blood cell Red blood cells (RBCs), referred to as erythrocytes (, with -''cyte'' translated as 'cell' in modern usage) in academia and medical publishing, also known as red cells, erythroid cells, and rarely haematids, are the most common type of blood cel ...
s.


Structure

All serpins share a common
structure A structure is an arrangement and organization of interrelated elements in a material object or system, or the object or system so organized. Material structures include man-made objects such as buildings and machines and natural objects such as ...
(or fold), despite their varied functions. All typically have three β-sheets (named A, B and C) and eight or nine
α-helices An alpha helix (or α-helix) is a sequence of amino acids in a protein that are twisted into a coil (a helix). The alpha helix is the most common structural arrangement in the secondary structure of proteins. It is also the most extreme type of l ...
(named hA–hI). The most significant regions to serpin function are the A-sheet and the reactive centre loop (RCL). The A-sheet includes two β-strands that are in a parallel orientation with a region between them called the 'shutter', and upper region called the 'breach'. The RCL forms the initial interaction with the target protease in inhibitory molecules. Structures have been solved showing the RCL either fully exposed or partially inserted into the A-sheet, and serpins are thought to be in
dynamic equilibrium In chemistry, a dynamic equilibrium exists once a reversible reaction occurs. Substances initially transition between the reactants and products at different rates until the forward and backward reaction rates eventually equalize, meaning the ...
between these two states. The RCL also only makes temporary interactions with the rest of the structure, and is therefore highly flexible and exposed to the solvent. The serpin structures that have been determined cover several different conformations, which has been necessary for the understanding of their multiple-step mechanism of action.
Structural biology Structural biology deals with structural analysis of living material (formed, composed of, and/or maintained and refined by living cells) at every level of organization. Early structural biologists throughout the 19th and early 20th centuries we ...
has therefore played a central role in the understanding of serpin function and biology.


Conformational change and inhibitory mechanism

Inhibitory serpins do not inhibit their target proteases by the typical
competitive Competition is a rivalry where two or more parties strive for a common goal which cannot be shared: where one's gain is the other's loss (an example of which is a zero-sum game). Competition can arise between entities such as organisms, indi ...
(''lock-and-key'') mechanism used by most small
protease inhibitors Protease inhibitors (PIs) are medications that act by interfering with protease, enzymes that cleave proteins. Some of the most well known are antiviral drugs widely used to treat HIV/AIDS, hepatitis C and COVID-19. These protease inhibitors pre ...
(e.g. Kunitz-type inhibitors). Instead, serpins use an unusual
conformational change In biochemistry, a conformational change is a change in the shape of a macromolecule, often induced by environmental factors. A macromolecule is usually flexible and dynamic. Its shape can change in response to changes in its environment or othe ...
, which disrupts the structure of the protease and prevents it from completing catalysis. The conformational change involves the RCL moving to the opposite end of the protein and inserting into β-sheet A, forming an extra antiparallel β-strand. This converts the serpin from a stressed state, to a lower-energy relaxed state (S to R transition).
Serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
and
cysteine protease Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad. Discovered by Gopal Chu ...
s catalyse peptide bond cleavage by a two-step process. Initially, the catalytic residue of the active site triad performs a
nucleophilic In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they a ...
attack on the peptide bond of the substrate. This releases the new
N-terminus The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide, referring to the free amine group (-NH2) located at the end of a polypeptide. Within a peptide, the amin ...
and forms a covalent
ester In chemistry, an ester is a compound derived from an acid (either organic or inorganic) in which the hydrogen atom (H) of at least one acidic hydroxyl group () of that acid is replaced by an organyl group (R). These compounds contain a distin ...
-bond between the enzyme and the substrate. This covalent complex between enzyme and substrate is called an acyl-enzyme intermediate. For standard substrates, the ester bond is
hydrolysed Hydrolysis (; ) is any chemical reaction in which a molecule of water breaks one or more chemical bonds. The term is used broadly for substitution, elimination, and solvation reactions in which water is the nucleophile. Biological hydrolysis ...
and the new
C-terminus The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal tail, carboxy tail, C-terminal end, or COOH-terminus) is the end of an amino acid chain (protein Proteins are large biomolecules and macromolecules that comp ...
is released to complete catalysis. However, when a serpin is cleaved by a protease, it rapidly undergoes the S to R transition before the acyl-enzyme intermediate is hydrolysed. The efficiency of inhibition depends on fact that the relative kinetic rate of the conformational change is several orders of magnitude faster than hydrolysis by the protease. Since the RCL is still covalently attached to the protease via the ester bond, the S to R transition pulls protease from the top to the bottom of the serpin and distorts the catalytic triad. The distorted protease can only hydrolyse the acyl enzyme intermediate extremely slowly and so the protease remains covalently attached for days to weeks. Serpins are classed as irreversible inhibitors and as suicide inhibitors since each serpin protein permanently inactivates a single protease, and can only function once.


Allosteric activation

The conformational mobility of serpins provides a key advantage over static lock-and-key protease inhibitors. In particular, the function of inhibitory serpins can be
regulated Regulation is the management of complex systems according to a set of rules and trends. In systems theory, these types of rules exist in various fields of biology and society, but the term has slightly different meanings according to context. Fo ...
by
allosteric In the fields of biochemistry and pharmacology an allosteric regulator (or allosteric modulator) is a substance that binds to a site on an enzyme or receptor distinct from the active site, resulting in a conformational change that alters the p ...
interactions with specific cofactors. The X-ray crystal structures of
antithrombin Antithrombin (AT) is a small glycoprotein that inactivates several enzymes of the coagulation system. It is a 464-amino-acid protein produced by the liver. It contains three disulfide bonds and a total of four possible glycosylation sites. α-An ...
, heparin cofactor II, MENT and murine antichymotrypsin reveal that these serpins adopt a conformation wherein the first two amino acids of the RCL are inserted into the top of the A
β-sheet The beta sheet (β-sheet, also β-pleated sheet) is a common structural motif, motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone chain, backbon ...
. The partially inserted conformation is important because co-factors are able to conformationally switch certain partially inserted serpins into a fully expelled form. This conformational rearrangement makes the serpin a more effective inhibitor. The archetypal example of this situation is antithrombin, which circulates in plasma in a partially inserted relatively inactive state. The primary specificity determining residue (the P1 arginine) points toward the body of the serpin and is unavailable to the protease. Upon binding a high-affinity pentasaccharide sequence within long-chain
heparin Heparin, also known as unfractionated heparin (UFH), is a medication and naturally occurring glycosaminoglycan. Heparin is a blood anticoagulant that increases the activity of antithrombin. It is used in the treatment of myocardial infarction, ...
, antithrombin undergoes a conformational change, RCL expulsion, and exposure of the P1 arginine. The heparin pentasaccharide-bound form of antithrombin is, thus, a more effective inhibitor of
thrombin Prothrombin (coagulation factor II) is encoded in the human by the F2-gene. It is proteolytically cleaved during the clotting process by the prothrombinase enzyme complex to form thrombin. Thrombin (Factor IIa) (, fibrose, thrombase, throm ...
and
factor Xa Coagulation factor X (), or Stuart factor, is an enzyme of the coagulation cascade, encoded in humans by ''F10'' gene. It is a serine endopeptidase (protease group S1, PA clan). Factor X is synthesized in the liver and requires vitamin K for ...
. Furthermore, both of these coagulation proteases also contain binding sites (called exosites) for heparin. Heparin, therefore, also acts as a template for binding of both protease and serpin, further dramatically accelerating the interaction between the two parties. After the initial interaction, the final serpin complex is formed and the heparin moiety is released. This interaction is physiologically important. For example, after injury to the blood vessel wall, heparin is exposed, and antithrombin is activated to control the clotting response. Understanding of the molecular basis of this interaction enabled the development of
Fondaparinux Fondaparinux (trade name Arixtra) is an anticoagulant medication chemically related to low molecular weight heparins. It is marketed by Viatris. A generic version developed by Alchemia is marketed within the US by Dr. Reddy's Laboratories. Med ...
, a synthetic form of Heparin pentasaccharide used as an anti-clotting drug.


Latent conformation

Certain serpins spontaneously undergo the S to R transition without having been cleaved by a protease, to form a conformation termed the latent state. Latent serpins are unable to interact with proteases and so are no longer protease inhibitors. The conformational change to latency is not exactly the same as the S to R transition of a cleaved serpin. Since the RCL is still intact, the first strand of the C-sheet has to peel off to allow full RCL insertion. Regulation of the latency transition can act as a control mechanism in some serpins, such as PAI-1. Although PAI-1 is produced in the inhibitory S conformation, it "auto-inactivates" by changing to the latent state unless it is bound to the cofactor
vitronectin Vitronectin (VTN or VN) is a glycoprotein of the hemopexin family which is synthesized and excreted by the liver, and abundantly found in serum, the extracellular matrix and bone. In humans it is encoded by the ''VTN'' gene. Vitronectin bind ...
. Similarly, antithrombin can also spontaneously convert to the latent state, as an additional modulation mechanism to its allosteric activation by heparin. Finally, the N-terminus of , a serpin from '' Thermoanaerobacter tengcongensis'', is required to lock the molecule in the native inhibitory state. Disruption of interactions made by the N-terminal region results in spontaneous conformational change of this serpin to the latent conformation.


Conformational change in non-inhibitory functions

Certain non-inhibitory serpins also use the serpin conformational change as part of their function. For example, the native (S) form of
thyroxine-binding globulin Thyroxine-binding globulin (TBG) is a globulin protein encoded by the ''SERPINA7'' gene in humans. TBG binds thyroid hormones in circulation. It is one of three transport proteins (along with transthyretin and serum albumin) responsible for ...
has high affinity for thyroxine, whereas the cleaved (R) form has low affinity. Similarly, transcortin has higher affinity for cortisol when in its native (S) state, than its cleaved (R) state. Thus, in these serpins, RCL cleavage and the S to R transition has been commandeered to allow for ligand release, rather than protease inhibition. In some serpins, the S to R transition can activate
cell signalling In biology, cell signaling (cell signalling in British English) is the process by which a cell interacts with itself, other cells, and the environment. Cell signaling is a fundamental property of all cellular life in both prokaryotes and eukary ...
events. In these cases, a serpin that has formed a complex with its target protease, is then recognised by a receptor. The binding event then leads to downstream signalling by the receptor. The S to R transition is therefore used to alert cells to the presence of protease activity. This differs from the usual mechanism whereby serpins affect signalling simply by inhibiting proteases involved in a signalling cascade.


Degradation

When a serpin inhibits a target protease, it forms a permanent complex, which needs to be disposed of. For extracellular serpins, the final serpin-enzyme complexes are rapidly cleared from circulation. One mechanism by which this occurs in mammals is via the low-density lipoprotein receptor-related protein ( LRP), which binds to inhibitory complexes made by antithrombin, PA1-1, and neuroserpin, causing cellular uptake. Similarly, the ''
Drosophila ''Drosophila'' (), from Ancient Greek δρόσος (''drósos''), meaning "dew", and φίλος (''phílos''), meaning "loving", is a genus of fly, belonging to the family Drosophilidae, whose members are often called "small fruit flies" or p ...
'' necrotic serpin is degraded in the
lysosome A lysosome () is a membrane-bound organelle that is found in all mammalian cells, with the exception of red blood cells (erythrocytes). There are normally hundreds of lysosomes in the cytosol, where they function as the cell’s degradation cent ...
after being trafficked into the cell by the Lipophorin Receptor-1 (homologous to the mammalian
LDL receptor The low-density lipoprotein receptor (LDL-R) is a mosaic protein of 839 amino acids (after removal of 21-amino acid signal peptide) that mediates the endocytosis of cholesterol-rich low-density lipoprotein (LDL). It is a cell-surface receptor ...
family).


Disease and serpinopathies

Serpins are involved in a wide array of physiological functions, and so mutations in genes encoding them can cause a range of diseases. Mutations that change the activity, specificity or aggregation properties of serpins all affect how they function. The majority of serpin-related diseases are the result of serpin polymerisation into aggregates, though several other types of disease-linked mutations also occur. The disorder
alpha-1 antitrypsin deficiency Alpha-1 antitrypsin deficiency (A1AD or AATD) is a genetic disorder that may result in lung disease or liver disease. Onset of lung problems is typically between 20 and 50 years of age. This may result in shortness of breath, wheezing, or an inc ...
is one of the most common
hereditary disease A genetic disorder is a health problem caused by one or more abnormalities in the genome. It can be caused by a mutation in a single gene (monogenic) or multiple genes (polygenic) or by a chromosome abnormality. Although polygenic disorders are ...
s.


Inactivity or absence

Since the stressed serpin fold is high-energy, mutations can cause them to incorrectly change into their lower-energy conformations (e.g. relaxed or latent) before they have correctly performed their inhibitory role. Mutations that affect the rate or the extent of RCL insertion into the A-sheet can cause the serpin to undergo its S to R conformational change before having engaged a protease. Since a serpin can only make this conformational change once, the resulting misfired serpin is inactive and unable to properly control its target protease. Similarly, mutations that promote inappropriate transition to the monomeric latent state cause disease by reducing the amount of active inhibitory serpin. For example, the disease-linked antithrombin variants ''wibble'' and ''wobble'', both promote formation of the latent state. The structure of the disease-linked mutant of antichymotrypsin (L55P) revealed another, inactive "δ-conformation". In the δ-conformation, four residues of the RCL are inserted into the top of β-sheet A. The bottom half of the sheet is filled as a result of one of the α-helices (the F-helix) partially switching to a β-strand conformation, completing the β-sheet hydrogen bonding. It is unclear whether other serpins can adopt this conformer, and whether this conformation has a functional role, but it is speculated that the δ-conformation may be adopted by Thyroxine-binding globulin during thyroxine release. The non-inhibitory proteins related to serpins can also cause diseases when mutated. For example, mutations in SERPINF1 cause
osteogenesis imperfecta Osteogenesis imperfecta (; OI), colloquially known as brittle bone disease, is a group of genetic disorders that all result in bones that bone fracture, break easily. The range of symptoms—on the skeleton as well as on the body's other Or ...
type VI in humans. In the absence of a required serpin, the protease that it normally would regulate is over-active, leading to pathologies. Consequently, simple deficiency of a serpin (e.g. a
null mutation A null allele is a nonfunctional allele (a variant of a gene) caused by a genetic mutation. Such mutations can cause a complete lack of production of the associated gene product or a product that does not function properly; in either case, the al ...
) can result in disease.
Gene knockout Gene knockouts (also known as gene deletion or gene inactivation) are a widely used genetic engineering technique that involves the gene targeting, targeted removal or inactivation of a specific gene within an organism's genome. This can be done t ...
s, particularly in
mice A mouse (: mice) is a small rodent. Characteristically, mice are known to have a pointed snout, small rounded ears, a body-length scaly tail, and a high breeding rate. The best known mouse species is the common house mouse (''Mus musculus' ...
, are used experimentally to determine the normal functions of serpins by the effect of their absence.


Specificity change

In some rare cases, a single amino acid change in a serpin's RCL alters its specificity to target the wrong protease. For example, the Antitrypsin-Pittsburgh mutation (M358R) causes the
α1-antitrypsin Alpha-1 antitrypsin or α1-antitrypsin (A1AT, α1AT, A1A, or AAT) is a protein belonging to the serpin superfamily. It is encoded in humans by the ''SERPINA1'' gene. A protease inhibitor, it is also known as alpha1–proteinase inhibitor (A1PI ...
serpin to inhibit thrombin, causing a
bleeding Bleeding, hemorrhage, haemorrhage or blood loss, is blood escaping from the circulatory system from damaged blood vessels. Bleeding can occur internally, or externally either through a natural opening such as the mouth, nose, ear, urethr ...
disorder.


Polymerisation and aggregation

The majority of serpin diseases are due to
protein aggregation In molecular biology, protein aggregation is a phenomenon in which intrinsically disordered proteins, intrinsically-disordered or mis-folded proteins aggregate (i.e., accumulate and clump together) either intra- or extracellularly. Protein aggre ...
and are termed "serpinopathies". Serpins are vulnerable to disease-causing mutations that promote formation of misfolded polymers due to their inherently unstable structures. Well-characterised serpinopathies include α1-antitrypsin deficiency (alpha-1), which may cause familial
emphysema Emphysema is any air-filled enlargement in the body's tissues. Most commonly emphysema refers to the permanent enlargement of air spaces (alveoli) in the lungs, and is also known as pulmonary emphysema. Emphysema is a lower respiratory tract di ...
, and sometimes
liver cirrhosis Cirrhosis, also known as liver cirrhosis or hepatic cirrhosis, chronic liver failure or chronic hepatic failure and end-stage liver disease, is a chronic condition of the liver in which the normal functioning tissue, or parenchyma, is replaced ...
, certain familial forms of
thrombosis Thrombosis () is the formation of a Thrombus, blood clot inside a blood vessel, obstructing the flow of blood through the circulatory system. When a blood vessel (a vein or an artery) is injured, the body uses platelets (thrombocytes) and fib ...
related to
antithrombin deficiency Antithrombin III deficiency (abbreviated ATIII deficiency) is a deficiency of antithrombin III. This deficiency may be inherited or acquired. It is a rare hereditary disorder that generally comes to light when a patient suffers recurrent venou ...
, types 1 and 2
hereditary angioedema Hereditary angioedema (HAE) is a disorder that results in recurrent attacks of severe swelling. The swelling most commonly affects the arms, legs, face, intestinal tract, and airway. If the intestinal tract is affected, abdominal pain and vo ...
(HAE) related to deficiency of
C1-inhibitor C1-inhibitor (C1-inh, C1 esterase inhibitor) is a protease inhibitor belonging to the serpin superfamily. Its main function is the inhibition of the complement system (C1r, C1s) to prevent spontaneous activation but also as the major regulator ...
, and familial encephalopathy with neuroserpin inclusion bodies (FENIB; a rare type of
dementia Dementia is a syndrome associated with many neurodegenerative diseases, characterized by a general decline in cognitive abilities that affects a person's ability to perform activities of daily living, everyday activities. This typically invo ...
caused by neuroserpin polymerisation). Each monomer of the serpin aggregate exists in the inactive, relaxed conformation (with the RCL inserted into the A-sheet). The polymers are therefore hyperstable to temperature and unable to inhibit proteases. Serpinopathies therefore cause pathologies similarly to other proteopathies (e.g.
prion A prion () is a Proteinopathy, misfolded protein that induces misfolding in normal variants of the same protein, leading to cellular death. Prions are responsible for prion diseases, known as transmissible spongiform encephalopathy (TSEs), w ...
diseases) via two main mechanisms. First, the lack of active serpin results in uncontrolled protease activity and tissue destruction. Second, the hyperstable polymers themselves clog up the
endoplasmic reticulum The endoplasmic reticulum (ER) is a part of a transportation system of the eukaryote, eukaryotic cell, and has many other important functions such as protein folding. The word endoplasmic means "within the cytoplasm", and reticulum is Latin for ...
of cells that synthesize serpins, eventually resulting in cell death and tissue damage. In the case of antitrypsin deficiency, antitrypsin polymers cause the death of
liver cells A hepatocyte is a cell of the main parenchymal tissue of the liver. Hepatocytes make up 80% of the liver's mass. These cells are involved in: * Protein synthesis * Protein storage * Transformation of carbohydrates * Synthesis of cholesterol, bile ...
, sometimes resulting in liver damage and
cirrhosis Cirrhosis, also known as liver cirrhosis or hepatic cirrhosis, chronic liver failure or chronic hepatic failure and end-stage liver disease, is a chronic condition of the liver in which the normal functioning tissue, or parenchyma, is replaced ...
. Within the cell, serpin polymers are slowly removed via degradation in the endoplasmic reticulum. However, the details of how serpin polymers cause cell death remains to be fully understood. Physiological serpin polymers are thought to form via domain swapping events, where a segment of one serpin protein inserts into another. Domain-swaps occur when mutations or environmental factors interfere with the final stages of serpin folding to the native state, causing high-energy intermediates to misfold. Both dimer and trimer domain-swap structures have been solved. In the dimer (of antithrombin), the RCL and part of the A-sheet incorporates into the A-sheet of another serpin molecule. The domain-swapped trimer (of antitrypsin) forms via the exchange of an entirely different region of the structure, the B-sheet (with each molecule's RCL inserted into its own A-sheet). It has also been proposed that serpins may form domain-swaps by inserting the RCL of one protein into the A-sheet of another (A-sheet polymerisation). These domain-swapped dimer and trimer structures are thought to be the building blocks of the disease-causing polymer aggregates, but the exact mechanism is still unclear.


Therapeutic strategies

Several therapeutic approaches are in use or under investigation to treat the most common serpinopathy: antitrypsin deficiency. Antitrypsin augmentation therapy is approved for severe antitrypsin deficiency-related emphysema. In this therapy, antitrypsin is purified from the plasma of blood donors and administered intravenously (first marketed as Prolastin). To treat severe antitrypsin deficiency-related disease, lung and liver transplantation has proven effective. In animal models, gene targeting in
induced pluripotent stem cell Induced pluripotent stem cells (also known as iPS cells or iPSCs) are a type of pluripotent stem cell that can be generated directly from a somatic cell. The iPSC technology was pioneered by Shinya Yamanaka and Kazutoshi Takahashi in Kyoto, Jap ...
s has been successfully used to correct an antitrypsin polymerisation defect and to restore the ability of the mammalian liver to secrete active antitrypsin. Small molecules have also been developed that block antitrypsin polymerisation ''in vitro''.


Evolution

Serpins are the most widely distributed and largest superfamily of protease inhibitors. They were initially believed to be restricted to
eukaryote The eukaryotes ( ) constitute the Domain (biology), domain of Eukaryota or Eukarya, organisms whose Cell (biology), cells have a membrane-bound cell nucleus, nucleus. All animals, plants, Fungus, fungi, seaweeds, and many unicellular organisms ...
organisms, but have since been found in
bacteria Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
,
archaea Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
and some
viruses A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are found in almo ...
. It remains unclear whether prokaryote genes are the descendants of an ancestral prokaryotic serpin or the product of
horizontal gene transfer Horizontal gene transfer (HGT) or lateral gene transfer (LGT) is the movement of genetic material between organisms other than by the ("vertical") transmission of DNA from parent to offspring (reproduction). HGT is an important factor in the e ...
from eukaryotes. Most intracellular serpins belong to a single
phylogenetic In biology, phylogenetics () is the study of the evolutionary history of life using observable characteristics of organisms (or genes), which is known as phylogenetic inference. It infers the relationship among organisms based on empirical dat ...
clade, whether they come from plants or animals, indicating that the intracellular and extracellular serpins may have diverged before the plants and animals. Exceptions include the intracellular heat shock serpin HSP47, which is a chaperone essential for proper folding of
collagen Collagen () is the main structural protein in the extracellular matrix of the connective tissues of many animals. It is the most abundant protein in mammals, making up 25% to 35% of protein content. Amino acids are bound together to form a trip ...
, and cycles between the cis-Golgi and the
endoplasmic reticulum The endoplasmic reticulum (ER) is a part of a transportation system of the eukaryote, eukaryotic cell, and has many other important functions such as protein folding. The word endoplasmic means "within the cytoplasm", and reticulum is Latin for ...
. Protease-inhibition is thought to be the ancestral function, with non-inhibitory members the results of evolutionary neofunctionalisation of the structure. The S to R conformational change has also been adapted by some binding serpins to regulate affinity for their targets.


Distribution


Animal


Human

The human genome encodes 16 serpin clades, termed through , including 29 inhibitory and 7 non-inhibitory serpin proteins. The human serpin naming system is based upon a
phylogenetic In biology, phylogenetics () is the study of the evolutionary history of life using observable characteristics of organisms (or genes), which is known as phylogenetic inference. It infers the relationship among organisms based on empirical dat ...
analysis of approximately 500 serpins from 2001, with proteins named , where X is the clade of the protein and Y the number of the protein within that clade. The functions of human serpins have been determined by a combination of
biochemical Biochemistry, or biological chemistry, is the study of chemical processes within and relating to living organisms. A sub-discipline of both chemistry and biology, biochemistry may be divided into three fields: structural biology, enzymology, ...
studies, human
genetic disorder A genetic disorder is a health problem caused by one or more abnormalities in the genome. It can be caused by a mutation in a single gene (monogenic) or multiple genes (polygenic) or by a chromosome abnormality. Although polygenic disorders ...
s, and knockout mouse models.


Specialised mammalian serpins

Many
mammal A mammal () is a vertebrate animal of the Class (biology), class Mammalia (). Mammals are characterised by the presence of milk-producing mammary glands for feeding their young, a broad neocortex region of the brain, fur or hair, and three ...
ian serpins have been identified that share no obvious orthology with a human serpin counterpart. Examples include numerous
rodent Rodents (from Latin , 'to gnaw') are mammals of the Order (biology), order Rodentia ( ), which are characterized by a single pair of continuously growing incisors in each of the upper and Mandible, lower jaws. About 40% of all mammal specie ...
serpins (particularly some of the
murine The Old World rats and mice, part of the subfamily Murinae in the family Muridae, comprise at least 519 species. Members of this subfamily are called murines. In terms of species richness, this subfamily is larger than all mammal families excep ...
intracellular serpins) as well as the uterine serpins. The term uterine serpin refers to members of the serpin A clade that are encoded by the SERPINA14 gene. Uterine serpins are produced by the
endometrium The endometrium is the inner epithelium, epithelial layer, along with its mucous membrane, of the mammalian uterus. It has a basal layer and a functional layer: the basal layer contains stem cells which regenerate the functional layer. The funct ...
of a restricted group of mammals in the
Laurasiatheria Laurasiatheria (; "Laurasian beasts") is a superorder of Placentalia, placental mammals that groups together true insectivores (eulipotyphlans), bats (chiropterans), carnivorans, pangolins (Pholidota, pholidotes), even-toed ungulates (Artiodacty ...
clade under the influence of
progesterone Progesterone (; P4) is an endogenous steroid and progestogen sex hormone involved in the menstrual cycle, pregnancy, and embryogenesis of humans and other species. It belongs to a group of steroid hormones called the progestogens and is the ma ...
or
estrogen Estrogen (also spelled oestrogen in British English; see spelling differences) is a category of sex hormone responsible for the development and regulation of the female reproductive system and secondary sex characteristics. There are three ...
. They are probably not functional proteinase inhibitors and may function during pregnancy to inhibit maternal immune responses against the
conceptus A conceptus (from Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), ...
or to participate in transplacental transport.


Insect

The ''
Drosophila melanogaster ''Drosophila melanogaster'' is a species of fly (an insect of the Order (biology), order Diptera) in the family Drosophilidae. The species is often referred to as the fruit fly or lesser fruit fly, or less commonly the "vinegar fly", "pomace fly" ...
'' genome contains 29 serpin encoding genes. Amino acid sequence analysis has placed 14 of these serpins in serpin clade Q and three in serpin clade K with the remaining twelve classified as orphan serpins not belonging to any clade. The clade classification system is difficult to use for ''Drosophila'' serpins and instead a nomenclature system has been adopted that is based on the position of serpin genes on the ''Drosophila''
chromosome A chromosome is a package of DNA containing part or all of the genetic material of an organism. In most chromosomes, the very long thin DNA fibers are coated with nucleosome-forming packaging proteins; in eukaryotic cells, the most import ...
s. Thirteen of the ''Drosophila'' serpins occur as isolated genes in the genome (including Serpin-27A, see below), with the remaining 16 organised into five gene clusters that occur at chromosome positions 28D (2 serpins), 42D (5 serpins), 43A (4 serpins), 77B (3 serpins) and 88E (2 serpins). Studies on ''Drosophila'' serpins reveal that Serpin-27A inhibits the Easter protease (the final protease in the Nudel, Gastrulation Defective, Snake and Easter proteolytic cascade) and thus controls dorsoventral patterning. Easter functions to cleave Spätzle (a chemokine-type ligand), which results in toll-mediated signaling. As well as its central role in embryonic patterning, toll signaling is also important for the
innate immune response The innate immune system or nonspecific immune system is one of the two main immunity strategies in vertebrates (the other being the adaptive immune system). The innate immune system is an alternate defense strategy and is the dominant immune s ...
in insects. Accordingly, serpin-27A also functions to control the insect immune response. In ''
Tenebrio molitor Mealworms are the larval form of the yellow mealworm beetle, ''Tenebrio molitor'', a species of darkling beetle. The yellow mealworm beetle prefers a warmer climate and higher humidity. Male mealworm beetles release a sex pheromone to attract ...
'' (a large beetle), a protein (SPN93) comprising two discrete tandem serpin domains functions to regulate the toll proteolytic cascade. Serpins have been found in
tick Ticks are parasitic arachnids of the order Ixodida. They are part of the mite superorder Parasitiformes. Adult ticks are approximately 3 to 5 mm in length depending on age, sex, and species, but can become larger when engorged. Ticks a ...
saliva, suppressing
T lymphocyte T cells (also known as T lymphocytes) are an important part of the immune system and play a central role in the adaptive immune response. T cells can be distinguished from other lymphocytes by the presence of a T-cell receptor (TCR) on their cell ...
production and inhibiting expression of
TNF-α Tumor necrosis factor (TNF), formerly known as TNF-α, is a chemical messenger produced by the immune system that induces inflammation. TNF is produced primarily by activated macrophages, and induces inflammation by binding to its receptors o ...
,
IFN-γ Interferon gamma (IFNG or IFN-γ) is a dimerized soluble cytokine that is the only member of the type II class of interferons. The existence of this interferon, which early in its history was known as immune interferon, was described by E. F. ...
, and IL-6.


Nematode

The genome of the
nematode The nematodes ( or ; ; ), roundworms or eelworms constitute the phylum Nematoda. Species in the phylum inhabit a broad range of environments. Most species are free-living, feeding on microorganisms, but many are parasitic. Parasitic worms (h ...
worm '' C. elegans'' contains 9 serpins, all of which lack signal sequences and so are likely intracellular. However, only 5 of these serpins appear to function as protease inhibitors. One, SRP-6, performs a protective function and guards against stress-induced
calpain A calpain (; , ) is a protein belonging to the family of calcium-dependent, non-lysosomal cysteine proteases ( proteolytic enzymes) expressed ubiquitously in mammals and many other organisms. Calpains constitute the C2 family of protease clan C ...
-associated lysosomal disruption. Further, SRP-6 inhibits lysosomal cysteine proteases released after lysosomal rupture. Accordingly, worms lacking SRP-6 are sensitive to stress. Most notably, SRP-6 knockout worms die when placed in water (the hypo-osmotic stress lethal phenotype or Osl). It has therefore been suggested that lysosomes play a general and controllable role in determining cell fate.


Plant

Plant Plants are the eukaryotes that form the Kingdom (biology), kingdom Plantae; they are predominantly Photosynthesis, photosynthetic. This means that they obtain their energy from sunlight, using chloroplasts derived from endosymbiosis with c ...
serpins were amongst the first members of the superfamily that were identified. The serpin barley protein Z is highly abundant in barley grain, and one of the major protein components in beer. The genome of the model plant, ''
Arabidopsis thaliana ''Arabidopsis thaliana'', the thale cress, mouse-ear cress or arabidopsis, is a small plant from the mustard family (Brassicaceae), native to Eurasia and Africa. Commonly found along the shoulders of roads and in disturbed land, it is generally ...
'' contain 18 serpin-like genes, although only 8 of these are full-length serpin sequences. Plant serpins are potent inhibitors of mammalian chymotrypsin-like serine proteases ''in vitro'', the best-studied example being barley serpin Zx (BSZx), which is able to inhibit trypsin and chymotrypsin as well as several blood coagulation factors. However, close relatives of chymotrypsin-like serine proteases are absent in plants. The RCL of several serpins from wheat grain and rye contain poly-Q repeat sequences similar to those present in the
prolamin Prolamins are a group of plant storage proteins having a high proline amino acid content. They are found in plants, mainly in the seeds of cereal grains such as wheat ( gliadin), barley ( hordein), rye ( secalin), corn ( zein), sorghum ( kafiri ...
storage proteins of the endosperm. It has therefore been suggested that plant serpins may function to inhibit proteases from insects or microbes that would otherwise digest grain storage proteins. In support of this hypothesis, specific plant serpins have been identified in the phloem sap of pumpkin (CmPS-1) and cucumber plants. Although an inverse correlation between up-regulation of CmPS-1 expression and aphid survival was observed, ''in vitro'' feeding experiments revealed that recombinant CmPS-1 did not appear to affect insect survival. Alternative roles and protease targets for plant serpins have been proposed. The ''Arabidopsis'' serpin, AtSerpin1 (At1g47710; ), mediates set-point control over programmed cell death by targeting the 'Responsive to Desiccation-21' (RD21) papain-like cysteine protease. AtSerpin1 also inhibits
metacaspase Metacaspases are members of the C14 class of cysteine proteases and thus related to caspases, orthocaspases and paracaspases. The metacaspases are arginine/lysine-specific, in contrast to caspases, which are aspartate-specific. Structure and Phy ...
-like proteases ''in vitro''. Two other ''Arabidopsis'' serpins, AtSRP2 (At2g14540) and AtSRP3 (At1g64030) appear to be involved in responses to DNA damage.


Fungal

A single
fungal A fungus (: fungi , , , or ; or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and molds, as well as the more familiar mushrooms. These organisms are classified as one of the tradit ...
serpin has been characterized to date: from '' Piromyces'' spp. strain E2. ''Piromyces'' is a
genus Genus (; : genera ) is a taxonomic rank above species and below family (taxonomy), family as used in the biological classification of extant taxon, living and fossil organisms as well as Virus classification#ICTV classification, viruses. In bino ...
of anaerobic fungi found in the gut of ruminants and is important for digesting plant material. is predicted to be inhibitory and contains two N-terminal dockerin domains in addition to its serpin domain. Dockerins are commonly found in proteins that localise to the fungal
cellulosome Cellulosomes are multi-enzyme extracellular complexes. Cellulosomes are associated with the cell surface and mediate cell attachment to insoluble substrates and degrade them to soluble products which are then absorbed. Cellulosome complexes are in ...
, a large extracellular multiprotein complex that breaks down cellulose. It is therefore suggested that may protect the cellulosome against plant proteases. Certain bacterial serpins similarly localize to the cellulosome.


Prokaryotic

Predicted serpin genes are sporadically distributed in
prokaryote A prokaryote (; less commonly spelled procaryote) is a unicellular organism, single-celled organism whose cell (biology), cell lacks a cell nucleus, nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Ancient Gree ...
s. ''In vitro'' studies on some of these molecules have revealed that they are able to inhibit proteases, and it is suggested that they function as inhibitors ''in vivo''. Several prokaryote serpins are found in
extremophile An extremophile () is an organism that is able to live (or in some cases thrive) in extreme environments, i.e., environments with conditions approaching or stretching the limits of what known life can adapt to, such as extreme temperature, press ...
s. Accordingly, and in contrast to mammalian serpins, these molecules possess elevated resistance to heat denaturation. The precise role of most bacterial serpins remains obscure, although ''
Clostridium thermocellum ''Acetivibrio thermocellus'' is an anaerobic, thermophilic bacterium. ''A. thermocellus'' has garnered research interest due to its cellulolytic and ethanologenic abilities, being capable of directly converting a cellulosic substrate into etha ...
'' serpin localises to the
cellulosome Cellulosomes are multi-enzyme extracellular complexes. Cellulosomes are associated with the cell surface and mediate cell attachment to insoluble substrates and degrade them to soluble products which are then absorbed. Cellulosome complexes are in ...
. It is suggested that the role of cellulosome-associated serpins may be to prevent unwanted protease activity against the cellulosome.


Viral

Serpins are also expressed by
virus A virus is a submicroscopic infectious agent that replicates only inside the living Cell (biology), cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are ...
es as a way to evade the host's immune defense. In particular, serpins expressed by pox viruses, including cow pox (vaccinia) and rabbit pox (myxoma), are of interest because of their potential use as novel therapeutics for immune and inflammatory disorders as well as transplant therapy. Serp1 suppresses the TLR-mediated innate immune response and allows indefinite cardiac
allograft Allotransplant (''allo-'' meaning "other" in Ancient Greek, Greek) is the Organ transplant, transplantation of cell (biology), cells, Biological tissue, tissues, or Organ (anatomy), organs to a recipient from a genetically non-identical donor of ...
survival in rats. Crma and Serp2 are both cross-class inhibitors and target both serine (granzyme B; albeit weakly) and cysteine proteases (caspase 1 and caspase 8). In comparison to their mammalian counterparts, viral serpins contain significant deletions of elements of secondary structure. Specifically, crmA lacks the D-helix as well as significant portions of the A- and E-helices.


References


External links

*
Merops protease inhibitor claudication (Family I4)
*
James Whisstock laboratory
at
Monash University Monash University () is a public university, public research university based in Melbourne, Victoria (state), Victoria, Australia. Named after World War I general Sir John Monash, it was founded in 1958 and is the second oldest university in the ...

Jim Huntington laboratory
at
University of Cambridge The University of Cambridge is a Public university, public collegiate university, collegiate research university in Cambridge, England. Founded in 1209, the University of Cambridge is the List of oldest universities in continuous operation, wo ...

Frank Church laboratory
at
University of North Carolina at Chapel Hill The University of North Carolina at Chapel Hill (UNC, UNC–Chapel Hill, or simply Carolina) is a public university, public research university in Chapel Hill, North Carolina, United States. Chartered in 1789, the university first began enrolli ...

Paul Declerck laboratory
at
Katholieke Universiteit Leuven KU Leuven (Katholieke Universiteit Leuven) is a Catholic research university in the city of Leuven, Belgium. Founded in 1425, it is the oldest university in Belgium and the oldest university in the Low Countries. In addition to its main camp ...

Tom Roberts laboratory
at
University of Sydney The University of Sydney (USYD) is a public university, public research university in Sydney, Australia. Founded in 1850, it is the oldest university in both Australia and Oceania. One of Australia's six sandstone universities, it was one of the ...

Robert Fluhr laboratory
at
Weizmann Institute of Science The Weizmann Institute of Science ( ''Machon Weizmann LeMada'') is a Public university, public research university in Rehovot, Israel, established in 1934, fourteen years before the State of Israel was founded. Unlike other List of Israeli uni ...

Peter Gettins laboratory
at
University of Illinois at Chicago The University of Illinois Chicago (UIC) is a public research university in Chicago, Illinois, United States. Its campus is in the Near West Side community area, adjacent to the Chicago Loop. The second campus established under the Universi ...
* {{Serpins * Protein families