N-linked Glycoprotein
   HOME

TheInfoList



OR:

''N''-linked glycosylation is the attachment of an
oligosaccharide An oligosaccharide (; ) is a carbohydrate, saccharide polymer containing a small number (typically three to ten) of monosaccharides (simple sugars). Oligosaccharides can have many functions including Cell–cell recognition, cell recognition and ce ...
, a carbohydrate consisting of several sugar molecules, sometimes also referred to as
glycan The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate ...
, to a nitrogen atom (the
amide In organic chemistry, an amide, also known as an organic amide or a carboxamide, is a chemical compound, compound with the general formula , where R, R', and R″ represent any group, typically organyl functional group, groups or hydrogen at ...
nitrogen of an
asparagine Asparagine (symbol Asn or N) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depro ...
(Asn) residue of a
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
), in a process called ''N''-glycosylation, studied in
biochemistry Biochemistry, or biological chemistry, is the study of chemical processes within and relating to living organisms. A sub-discipline of both chemistry and biology, biochemistry may be divided into three fields: structural biology, enzymology, a ...
. The resulting protein is called an N-linked glycan, or simply an N-glycan. This type of linkage is important for both the structure and function of many eukaryotic proteins. The ''N''-linked
glycosylation Glycosylation is the reaction in which a carbohydrate (or ' glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not ...
process occurs in
eukaryotes The eukaryotes ( ) constitute the domain of Eukaryota or Eukarya, organisms whose cells have a membrane-bound nucleus. All animals, plants, fungi, seaweeds, and many unicellular organisms are eukaryotes. They constitute a major group of ...
and widely in
archaea Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
, but very rarely in
bacteria Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
. The nature of ''N''-linked glycans attached to a glycoprotein is determined by the protein and the cell in which it is expressed. It also varies across
species A species () is often defined as the largest group of organisms in which any two individuals of the appropriate sexes or mating types can produce fertile offspring, typically by sexual reproduction. It is the basic unit of Taxonomy (biology), ...
. Different species synthesize different types of ''N''-linked glycans.


Energetics of bond formation

There are two types of bonds involved in a glycoprotein: bonds between the
saccharides A carbohydrate () is a biomolecule composed of carbon (C), hydrogen (H), and oxygen (O) atoms. The typical hydrogen-to-oxygen atomic ratio is 2:1, analogous to that of water, and is represented by the empirical formula (where ''m'' and ''n'' ma ...
residues in the glycan and the linkage between the glycan chain and the protein molecule. The sugar moieties are linked to one another in the glycan chain via glycosidic bonds. These bonds are typically formed between
carbon Carbon () is a chemical element; it has chemical symbol, symbol C and atomic number 6. It is nonmetallic and tetravalence, tetravalent—meaning that its atoms are able to form up to four covalent bonds due to its valence shell exhibiting 4 ...
s 1 and 4 of the sugar molecules. The formation of glycosidic bond is energetically unfavourable, therefore the reaction is coupled to the
hydrolysis Hydrolysis (; ) is any chemical reaction in which a molecule of water breaks one or more chemical bonds. The term is used broadly for substitution reaction, substitution, elimination reaction, elimination, and solvation reactions in which water ...
of two ATP molecules. On the other hand, the attachment of a glycan residue to a protein requires the recognition of a
consensus sequence In molecular biology and bioinformatics, the consensus sequence (or canonical sequence) is the calculated sequence of most frequent residues, either nucleotide or amino acid, found at each position in a sequence alignment. It represents the result ...
. ''N''-linked glycans are almost always attached to the
nitrogen Nitrogen is a chemical element; it has Symbol (chemistry), symbol N and atomic number 7. Nitrogen is a Nonmetal (chemistry), nonmetal and the lightest member of pnictogen, group 15 of the periodic table, often called the Pnictogen, pnictogens. ...
atom of an asparagine (Asn) side chain that is present as a part of Asn–X– Ser/ Thr consensus sequence, where X is any amino acid except
proline Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group but is rather a secondary amine. The secondary amine nitrogen is in the p ...
(Pro). In animal cells, the glycan attached to the asparagine is almost inevitably ''N''-acetylglucosamine (GlcNAc) in the β-configuration. This β-linkage is similar to glycosidic bond between the sugar moieties in the glycan structure as described above. Instead of being attached to a sugar
hydroxyl In chemistry, a hydroxy or hydroxyl group is a functional group with the chemical formula and composed of one oxygen atom covalently bonded to one hydrogen atom. In organic chemistry, alcohols and carboxylic acids contain one or more hydroxy ...
group, the
anomeric carbon In carbohydrate chemistry, a pair of anomers () is a pair of near-identical stereoisomers or diastereomers that differ at only the anomeric carbon, the carbon atom that bears the aldehyde or ketone functional group in the sugar's open-chain f ...
atom is attached to an amide nitrogen. The energy required for this linkage comes from the
hydrolysis Hydrolysis (; ) is any chemical reaction in which a molecule of water breaks one or more chemical bonds. The term is used broadly for substitution reaction, substitution, elimination reaction, elimination, and solvation reactions in which water ...
of a
pyrophosphate In chemistry, pyrophosphates are phosphorus oxyanions that contain two phosphorus atoms in a linkage. A number of pyrophosphate salts exist, such as disodium pyrophosphate () and tetrasodium pyrophosphate (), among others. Often pyrophosphates a ...
molecule.


Biosynthesis

The biosynthesis of ''N''-linked glycans occurs via three major steps: # Synthesis of dolichol-linked precursor oligosaccharide # En bloc transfer of precursor oligosaccharide to protein # Processing of the oligosaccharide Synthesis, en bloc transfer and initial trimming of precursor
oligosaccharide An oligosaccharide (; ) is a carbohydrate, saccharide polymer containing a small number (typically three to ten) of monosaccharides (simple sugars). Oligosaccharides can have many functions including Cell–cell recognition, cell recognition and ce ...
occurs in the
endoplasmic reticulum The endoplasmic reticulum (ER) is a part of a transportation system of the eukaryote, eukaryotic cell, and has many other important functions such as protein folding. The word endoplasmic means "within the cytoplasm", and reticulum is Latin for ...
(ER). Subsequent processing and modification of the oligosaccharide chain are carried out in the
Golgi apparatus The Golgi apparatus (), also known as the Golgi complex, Golgi body, or simply the Golgi, is an organelle found in most eukaryotic Cell (biology), cells. Part of the endomembrane system in the cytoplasm, it protein targeting, packages proteins ...
. The synthesis of glycoproteins is thus spatially separated in different cellular compartments. Therefore, the type of ''N''-glycan synthesized, depends on its accessibility to the different enzymes present within these cellular compartments. However, in spite of the diversity, all ''N''-glycans are synthesized through a common pathway with a common core glycan structure. The core glycan structure is essentially made up of two ''N''-acetyl glucosamine and three
mannose Mannose is a sugar with the formula , which sometimes is abbreviated Man. It is one of the monomers of the aldohexose series of carbohydrates. It is a C-2 epimer of glucose. Mannose is important in human metabolism, especially in the glycosylatio ...
residues. This core glycan is then elaborated and modified further, resulting in a diverse range of ''N''-glycan structures.


Synthesis of precursor oligosaccharide

The process of ''N''-linked glycosylation starts with the formation of dolichol-linked GlcNAc sugar. Dolichol is a
lipid Lipids are a broad group of organic compounds which include fats, waxes, sterols, fat-soluble vitamins (such as vitamins A, D, E and K), monoglycerides, diglycerides, phospholipids, and others. The functions of lipids include storing ...
molecule composed of repeating
isoprene Isoprene, or 2-methyl-1,3-butadiene, is a common volatile organic compound with the formula CH2=C(CH3)−CH=CH2. In its pure form it is a colorless volatile liquid. It is produced by many plants and animals (including humans) and its polymers ar ...
units. This molecule is found attached to the membrane of the ER. Sugar molecules are attached to the dolichol through a pyrophosphate linkage (one
phosphate Phosphates are the naturally occurring form of the element phosphorus. In chemistry, a phosphate is an anion, salt, functional group or ester derived from a phosphoric acid. It most commonly means orthophosphate, a derivative of orthop ...
was originally linked to dolichol, and the second phosphate came from the
nucleotide Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
sugar). The oligosaccharide chain is then extended through the addition of various sugar molecules in a stepwise manner to form a precursor oligosaccharide. The assembly of this precursor oligosaccharide occurs in two phases: Phase I and II. Phase I takes place on the
cytoplasmic The cytoplasm describes all the material within a eukaryotic or prokaryotic cell, enclosed by the cell membrane, including the organelles and excluding the nucleus in eukaryotic cells. The material inside the nucleus of a eukaryotic cell and ...
side of the ER and Phase II takes place on the luminal side of the ER. The precursor molecule, ready to be transferred to a protein, consists of two GlcNAc, nine mannose, and three
glucose Glucose is a sugar with the Chemical formula#Molecular formula, molecular formula , which is often abbreviated as Glc. It is overall the most abundant monosaccharide, a subcategory of carbohydrates. It is mainly made by plants and most algae d ...
molecules.


Transfer of glycan to protein

Once the precursor oligosaccharide is formed, the completed glycan is then transferred to the nascent
polypeptide Peptides are short chains of amino acids linked by peptide bonds. A polypeptide is a longer, continuous, unbranched peptide chain. Polypeptides that have a molecular mass of 10,000 Da or more are called proteins. Chains of fewer than twenty ...
in the lumen of the ER membrane. This reaction is driven by the energy released from the cleavage of the pyrophosphate bond between the dolichol-glycan molecule. There are three conditions to fulfill before a glycan is transferred to a nascent polypeptide: * Asparagine must be located in a specific consensus sequence in the
primary structure Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthe ...
(Asn–X–Ser or Asn–X–Thr or in rare instances Asn–X–Cys). * Asparagine must be located appropriately in the three-dimensional structure of the protein (Sugars are
polar molecules In chemistry, polarity is a separation of electric charge leading to a molecule or its chemical groups having an electric dipole moment, with a negatively charged end and a positively charged end. Polar molecules must contain one or more polar ...
and thus need to be attached to asparagine located on the surface of the protein and not buried within the protein) * Asparagine must be found in the luminal side of the endoplasmic reticulum for ''N''-linked glycosylation to be initiated. Target residues are either found in secretory proteins or in the regions of
transmembrane protein A transmembrane protein is a type of integral membrane protein that spans the entirety of the cell membrane. Many transmembrane proteins function as gateways to permit the transport of specific substances across the membrane. They frequently un ...
that face the lumen. Oligosaccharyltransferase is the enzyme responsible for the recognition of the consensus sequence and the transfer of the precursor glycan to a polypeptide acceptor which is being translated in the endoplasmic reticulum lumen. ''N''-linked glycosylation is, therefore, a co-translational event.


Processing of glycan

''N''-glycan processing is carried out in endoplasmic reticulum and the Golgi body. Initial trimming of the precursor molecule occurs in the ER and the subsequent processing occurs in the Golgi. Upon transferring the completed glycan onto the nascent polypeptide, two glucose residues are removed from the structure. Enzymes known as glycosidases remove some sugar residues. These enzymes can break glycosidic linkages by using a water molecule. These enzymes are exoglycosidases as they only work on
monosaccharide Monosaccharides (from Greek '' monos'': single, '' sacchar'': sugar), also called simple sugars, are the simplest forms of sugar and the most basic units (monomers) from which all carbohydrates are built. Chemically, monosaccharides are polyhy ...
residues located at the non-reducing end of the glycan. This initial trimming step is thought to act as a quality control step in the ER to monitor
protein folding Protein folding is the physical process by which a protein, after Protein biosynthesis, synthesis by a ribosome as a linear chain of Amino acid, amino acids, changes from an unstable random coil into a more ordered protein tertiary structure, t ...
. Once the protein is folded correctly, two glucose residues are removed by
glucosidase Glucosidases are the glycoside hydrolase enzymes categorized under the EC number 3.2.1. Function Alpha-glucosidases are enzymes involved in breaking down complex carbohydrates such as starch and glycogen into their monomers. They catalyze the ...
I and II. The removal of the final third glucose residue signals that the glycoprotein is ready for transit from the ER to the ''cis''-Golgi. ER mannosidase catalyses the removal of this final glucose. However, if the protein is not folded properly, the glucose residues are not removed and thus the glycoprotein can't leave the endoplasmic reticulum. A chaperone protein (
calnexin Calnexin (CNX) is a 67kDa integral protein (that appears variously as a 90kDa, 80kDa, or 75kDa band on western blotting depending on the source of the antibody) of the endoplasmic reticulum (ER). It consists of a large (50 kDa) N-terminal calciu ...
/ calreticulin) binds to the unfolded or partially folded protein to assist protein folding. The next step involves further addition and removal of sugar residues in the cis-Golgi. These modifications are catalyzed by glycosyltransferases and glycosidases respectively. In the ''cis''-Golgi, a series of mannosidases remove some or all of the four mannose residues in α-1,2 linkages. Whereas in the medial portion of the Golgi, glycosyltransferases add sugar residues to the core glycan structure, giving rise to the three main types of glycans: high mannose, hybrid and complex glycans. * High-mannose is, in essence, just two ''N''-acetylglucosamines with many mannose residues, often almost as many as are seen in the precursor oligosaccharides before it is attached to the protein. * Complex oligosaccharides are so named because they can contain almost any number of the other types of saccharides, including more than the original two ''N''-acetylglucosamines. * Hybrid oligosaccharides contain a mannose residues on one side of the branch, while on the other side a ''N''-acetylglucosamine initiates a complex branch. The order of addition of sugars to the growing glycan chains is determined by the substrate specificities of the enzymes and their access to the substrate as they move through
secretory pathway Secretion is the movement of material from one point to another, such as a secreted chemical substance from a cell (biology), cell or gland. In contrast, excretion is the removal of certain substances or waste products from a cell or organism. Th ...
. Thus, the organization of this machinery within a cell plays an important role in determining which glycans are made.


Enzymes in the Golgi

Golgi enzymes play a key role in determining the synthesis of the various types of glycans. The order of action of the enzymes is reflected in their position in the Golgi stack:


In archaea and prokaryotes

Similar ''N''-glycan biosynthesis pathway have been found in prokaryotes and Archaea. However, compared to eukaryotes, the final glycan structure in eubacteria and archaea does not seem to differ much from the initial precursor made in the endoplasmic reticulum. In eukaryotes, the original precursor oligosaccharide is extensively modified en route to the cell surface.


Function

''N''-linked glycans have intrinsic and extrinsic functions. Within the immune system, the ''N''-linked glycans on an immune cell's surface will help dictate that migration pattern of the cell, e.g. immune cells that migrate to the skin have specific glycosylations that favor homing to that site. The glycosylation patterns on the various immunoglobulins including IgE, IgM, IgD, IgA, and IgG bestow them with unique effector functions by altering their affinities for Fc and other immune receptors. Glycans may also be involved in "self" and "non self" discrimination, which may be relevant to the pathophysiology of various autoimmune diseases. In some cases, interaction between the N-glycan and the protein stabilizes the protein through complex electronic effects.


Clinical significance

Changes in ''N''-linked glycosylation has been associated with different diseases including
rheumatoid arthritis Rheumatoid arthritis (RA) is a long-term autoimmune disorder that primarily affects synovial joint, joints. It typically results in warm, swollen, and painful joints. Pain and stiffness often worsen following rest. Most commonly, the wrist and h ...
,
type 1 diabetes Type 1 diabetes (T1D), formerly known as juvenile diabetes, is an autoimmune disease that occurs when the body's immune system destroys pancreatic cells (beta cells). In healthy persons, beta cells produce insulin. Insulin is a hormone require ...
,
Crohn's disease Crohn's disease is a type of inflammatory bowel disease (IBD) that may affect any segment of the gastrointestinal tract. Symptoms often include abdominal pain, diarrhea, fever, abdominal distension, and weight loss. Complications outside of the ...
, and cancers. Mutations in eighteen genes involved in ''N''-linked glycosylation result in a variety of diseases, most of which involve the
nervous system In biology, the nervous system is the complex system, highly complex part of an animal that coordinates its behavior, actions and sense, sensory information by transmitting action potential, signals to and from different parts of its body. Th ...
.


Importance in therapeutic proteins

Many
therapeutic A therapy or medical treatment is the attempted remediation of a health problem, usually following a medical diagnosis. Both words, ''treatment'' and ''therapy'', are often abbreviated tx, Tx, or Tx. As a rule, each therapy has indications an ...
proteins in the market are
antibodies An antibody (Ab) or immunoglobulin (Ig) is a large, Y-shaped protein belonging to the immunoglobulin superfamily which is used by the immune system to identify and neutralize antigens such as bacteria and viruses, including those that caus ...
, which are ''N''-linked glycoproteins. For example,
Etanercept Etanercept, sold under the brand name Enbrel among others, is a biologic medical product that is used to treat autoimmune diseases by interfering with tumor necrosis factor (TNF), a soluble inflammatory cytokine, by acting as a TNF inhibitor. ...
,
Infliximab Infliximab, a chimeric monoclonal antibody, sold under the brand name Remicade among others, is a medication used to treat a number of autoimmune diseases. This includes Crohn's disease, ulcerative colitis, rheumatoid arthritis, ankylosing ...
and
Rituximab Rituximab, sold under the brand name Rituxan among others, is a monoclonal antibody medication used to treat certain autoimmune diseases and types of cancer. It is used for non-Hodgkin lymphoma, chronic lymphocytic leukemia (in children and ad ...
are ''N''-glycosylated therapeutic proteins. The importance of ''N''-linked glycosylation is becoming increasingly evident in the field of
pharmaceuticals Medication (also called medicament, medicine, pharmaceutical drug, medicinal product, medicinal drug or simply drug) is a drug used to diagnose, cure, treat, or prevent disease. Drug therapy ( pharmacotherapy) is an important part of the ...
. Although bacterial or yeast protein production systems have significant potential advantages such as high yield and low cost, problems arise when the protein of interest is a glycoprotein. Most prokaryotic expression systems such as '' E. coli'' cannot carry out
post-translational modifications In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translation (biolog ...
. On the other hand, eukaryotic expression hosts such as yeast and animal cells, have different glycosylation patterns. The proteins produced in these expression hosts are often not identical to human protein and thus, cause
immunogenic Immunogenicity is the ability of a foreign substance, such as an antigen, to provoke an immune response in the body of a human or other animal. It may be wanted or unwanted: * Wanted immunogenicity typically relates to vaccines, where the injectio ...
reactions in patients. For example, '' S.cerevisiae'' (yeast) often produce high-mannose glycans which are immunogenic. Non-human mammalian expression systems such as CHO or NS0 cells have the machinery required to add complex, human-type glycans. However, glycans produced in these systems can differ from glycans produced in humans, as they can be capped with both ''N''-glycolylneuraminic acid (Neu5Gc) and ''N''-acetylneuraminic acid (Neu5Ac), whereas human cells only produce glycoproteins containing ''N''-acetylneuraminic acid. Furthermore, animal cells can also produce glycoproteins containing the galactose-alpha-1,3-galactose epitope, which can induce serious allergenic reactions, including anaphylactic shock, in people who have
Alpha-gal allergy Alpha-gal syndrome (AGS), also known as alpha-gal allergy or mammalian meat allergy (MMA), is a type of acquired allergy characterized by a delayed onset of symptoms (3–8 hours) after ingesting mammalian meat. The condition results from past exp ...
. These drawbacks have been addressed by several approaches such as eliminating the pathways that produce these glycan structures through genetic knockouts. Furthermore, other expression systems have been genetically engineered to produce therapeutic glycoproteins with human-like ''N''-linked glycans. These include yeasts such as ''
Pichia pastoris ''Komagataella'' is a methylotrophic yeast within the order Saccharomycetales. It was found in the 1960s as ''Pichia pastoris'', with its feature of using methanol as a source of carbon and energy. In 1995, ''P. pastoris'' was reassigned into t ...
'', insect cell lines, green plants, and even bacteria.


See also

*
Glycosylation Glycosylation is the reaction in which a carbohydrate (or ' glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not ...
* ''O''-linked glycosylation *
Gene expression Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
* ''N''-Glycosyltransferase


References


External links


GlycoEP
In silico Platform for Prediction of ''N''-, ''O''- and ''C''-Glycosites in Eukaryotic Protein Sequences * {{cite journal , vauthors = Maverakis E, Kim K, Shimoda M, Gershwin ME, Patel F, Wilken R, Raychaudhuri S, Ruhaak LR, Lebrilla CB , title = Glycans in the immune system and The Altered Glycan Theory of Autoimmunity: a critical review , journal = Journal of Autoimmunity , volume = 57 , pages = 1–13 , date = February 2015 , pmid = 25578468 , pmc = 4340844 , doi = 10.1016/j.jaut.2014.12.002 Organic chemistry Biochemistry