Bioconjugation is a chemical strategy to form a stable
covalent link between two molecules, at least one of which is a
biomolecule.
Function
Recent advances in the understanding of biomolecules enabled their application to numerous fields like medicine and materials. Synthetically modified biomolecules can have diverse functionalities, such as tracking cellular events, revealing
enzyme
Enzymes () are proteins that act as biological catalysts by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different molecule ...
function, determining
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
biodistribution,
imaging
Imaging is the representation or reproduction of an object's form; especially a visual representation (i.e., the formation of an image).
Imaging technology is the application of materials and methods to create, preserve, or duplicate images.
...
specific
biomarkers, and delivering
drugs
A drug is any chemical substance that causes a change in an organism's physiology or psychology when consumed. Drugs are typically distinguished from food and substances that provide nutritional support. Consumption of drugs can be via inhalat ...
to targeted cells.
Bioconjugation is a crucial strategy that links these modified biomolecules with different
substrate
Substrate may refer to:
Physical layers
*Substrate (biology), the natural environment in which an organism lives, or the surface or medium on which an organism grows or is attached
** Substrate (locomotion), the surface over which an organism lo ...
s.
Synthesis
Synthesis of bioconjugates involves a variety of challenges, ranging from the simple and nonspecific use of a
fluorescent dye marker to the complex design of
antibody drug conjugates.
As a result, various bioconjugation reactions –
chemical reactions connecting two biomolecules together – have been developed to chemically modify proteins. Common types of bioconjugation reactions on proteins are coupling of
lysine, of
cysteine
Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula . The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile.
When present as a deprotonated catalytic residue, s ...
and of
tyrosine amino acid residues; and modification of
tryptophan amino acid residues and of the
N- and
C- terminus.
However, these reactions often lack
chemoselectivity and efficiency, because they depend on the presence of native amino acid residues, which are usually present in large quantities that hinder selectivity. There is an increasing need for chemical strategies that can effectively attach synthetic molecules site specifically to proteins. One strategy is to first install a unique
functional group
In organic chemistry, a functional group is a substituent or moiety in a molecule that causes the molecule's characteristic chemical reactions. The same functional group will undergo the same or similar chemical reactions regardless of the res ...
onto a protein, and then a
bioorthogonal or
click type reaction is used to couple a biomolecule with this unique functional group.
The bioorthogonal reactions targeting non-native functional groups are widely used in bioconjugation chemistry. Some important reactions are modification of
ketone
In organic chemistry, a ketone is a functional group with the structure R–C(=O)–R', where R and R' can be a variety of carbon-containing substituents. Ketones contain a carbonyl group –C(=O)– (which contains a carbon-oxygen double bon ...
and
aldehydes,
Staudinger ligation
The Staudinger reaction is a chemical reaction of an organic azide with a phosphine or phosphite produces an iminophosphorane. The reaction was discovered by and named after Hermann Staudinger. The reaction follows this stoichiometry:
:R3P + R' ...
with
organic azides, copper-catalyzed
Huisgen cycloaddition
The azide-alkyne Huisgen cycloaddition is a 1,3-dipolar cycloaddition between an azide and a terminal or internal alkyne to give a 1,2,3-triazole. Rolf Huisgen was the first to understand the scope of this organic reaction. American chemist Karl ...
of azides, and strain promoted Huisgen cycloaddition of azides.
Common bioconjugation reactions
The most common bioconjugations are coupling of a small molecule (such as
biotin or a fluorescent dye) to a protein, or protein-protein conjugations, such as the coupling of an antibody to an enzyme.
Other less common molecules used in bioconjugation are
oligosaccharides,
nucleic acids,
synthetic polymers such as
polyethylene glycol
Polyethylene glycol (PEG; ) is a polyether compound derived from petroleum with many applications, from industrial manufacturing to medicine. PEG is also known as polyethylene oxide (PEO) or polyoxyethylene (POE), depending on its molecular w ...
, and
carbon nanotubes
A scanning tunneling microscopy image of a single-walled carbon nanotube
Rotating single-walled zigzag carbon nanotube
A carbon nanotube (CNT) is a tube made of carbon with diameters typically measured in nanometers.
''Single-wall carbon nan ...
.
Antibody-drug conjugates
Antibody-drug conjugates or ADCs are a class of biopharmaceutical drugs designed as a targeted therapy for treating cancer. Unlike chemotherapy, ADCs are intended to target and kill tumor cells while sparing healthy cells. As of 2019, some 56 p ...
such as
Brentuximab vedotin and
Gemtuzumab ozogamicin are also examples of bioconjugation, and are an active area of research in the pharmaceutical industry. Recently, bioconjugation has also gained importance in
nanotechnology
Nanotechnology, also shortened to nanotech, is the use of matter on an atomic, molecular, and supramolecular scale for industrial purposes. The earliest, widespread description of nanotechnology referred to the particular technological goal o ...
applications such as bioconjugated
quantum dots.
Reactions of lysine residues
The
nucleophilic lysine residue is commonly targeted site in protein bioconjugation, typically through
amine
In chemistry, amines (, ) are compounds and functional groups that contain a basic nitrogen atom with a lone pair. Amines are formally derivatives of ammonia (), wherein one or more hydrogen atoms have been replaced by a substituent su ...
-reactive
''N''-hydroxysuccinimidyl (NHS) esters.
To obtain optimal number of
deprotonated lysine residues, the
pH of the
aqueous solution
An aqueous solution is a solution in which the solvent is water. It is mostly shown in chemical equations by appending (aq) to the relevant chemical formula. For example, a solution of table salt, or sodium chloride (NaCl), in water would ...
must be below the
pKa
PKA may refer to:
* Professionally known as:
** Pen name
** Stage persona
* p''K''a, the symbol for the acid dissociation constant at logarithmic scale
* Protein kinase A, a class of cAMP-dependent enzymes
* Pi Kappa Alpha, the North-American so ...
of the lysine
ammonium group, which is around 10.5, so the typical pH of the reaction is about 8 and 9. The common reagent for the coupling reaction is NHS-ester (shown in the first reaction below in Figure 1), which reacts with nucleophilic lysine through a lysine
acylation mechanism. Other similar reagents are
isocyanates and
isothiocyanates that undergo a similar mechanism (shown in the second and third reactions in Figure 1 below).
Benzoyl fluorides (shown in the last reaction below in Figure 1), which allows for lysine modification of proteins under mild conditions (low temperature, physiological
pH), were recently proposed as an alternative to classically used lysine specific reagents.
Reactions of cysteine residues
Because free cysteine rarely occurs on protein surface, it is an excellent choice for chemoselective modification. Under basic condition, the cysteine residues will be deprotonated to generate a
thiolate nucleophile, which will react with soft
electrophiles, such as
maleimides and
iodoacetamides (shown in the first two reactions in Figure 2 below). As a result, a
carbon-sulfur bond
Organosulfur compounds are organic compounds that contain sulfur. They are often associated with foul odors, but many of the sweetest compounds known are organosulfur derivatives, e.g., saccharin. Nature abounds with organosulfur compounds—sulf ...
is formed. Another modification of cysteine residues involves the formation of
disulfide bond
In biochemistry, a disulfide (or disulphide in British English) refers to a functional group with the structure . The linkage is also called an SS-bond or sometimes a disulfide bridge and is usually derived by the coupling of two thiol groups ...
(shown in the third reaction in Figure 2). The
reduced cysteine residues react with
exogenous disulfides, generating new a disulfide bond on the protein. An excess of disulfides is often used to drive the reaction, such as 2-thiopyridone and 3-carboxy-4-nitrothiophenol.
Electron-deficient
alkynes were demonstrated to selectively react with cysteine residues of proteins in the presence of other nucleophilic amino acid residues. Depending on the alkyne substitution, these reactions can produce either cleavable (when alkynone derivatives are used), or hydrolytically stable bioconjugates (when
3-arylpropiolonitriles
3-Arylpropiolonitriles (APN) belong to a class of electron-deficient alkyne derivatives substituted by two electron-withdrawing groups – a nitrile and an aryl moieties. Such activation results in improved selectivity towards highly reactive thio ...
are used; the last reaction below in Figure 2).
Reactions of tyrosine residues
Tyrosine residues are relatively unreactive; therefore they have not been a popular targets for bioconjugation. Recent development has shown that the tyrosine can be modified through
electrophilic aromatic substitution
Electrophilic aromatic substitution is an organic reaction in which an atom that is attached to an aromatic system (usually hydrogen) is replaced by an electrophile. Some of the most important electrophilic aromatic substitutions are aromatic ni ...
s (EAS) reactions, and it is selective for the
aromatic carbon adjacent to the
phenolic hydroxyl group.
This becomes particularly useful in the case that cysteine residues cannot be targeted. Specifically,
diazonium effectively couples with tyrosine residues (
diazonium salt shown as reagent in the first reaction in Figure 3 below), and an
electron withdrawing substituent in the 4-position of diazonium salt can effectively increase the efficiency of the reaction. Cyclic diazodicarboxyamide derivative like
4-Phenyl-1,2,4-triazole-3,5-dione (PTAD) were reported for selective bioconjugation on tyrosine residues (the second reaction in Figure 3 below).
A three-component
Mannich-type reaction with
aldehydes and
anilines (the last reaction in Figure 3) was also described to be relatively tyrosine-selective under mild optimised reaction conditions.
Reactions of N- and C- termini
Since natural amino acid residues are usually present in large quantities, it is often difficult to modify one single site. Strategies targeting the termini of protein have been developed, because they greatly enhanced the site selectivity of protein modification. One of the N- termini modifications involves the
functionalization of the terminal amino acid. The
oxidation
Redox (reduction–oxidation, , ) is a type of chemical reaction in which the oxidation states of substrate change. Oxidation is the loss of electrons or an increase in the oxidation state, while reduction is the gain of electrons or a ...
of N-terminal
serine and
threonine residues are able to generate N-terminal aldehyde, which can undergo further bioorthogonal reactions (shown in the first reaction in Figure 4). Another type of modification involves the
condensation of N-terminal cysteine with aldehyde, generating
thiazolidine that is stable at high pH (second reaction in Figure 4). Using
pyridoxal phosphate
Pyridoxal phosphate (PLP, pyridoxal 5'- phosphate, P5P), the active form of vitamin B6, is a coenzyme in a variety of enzymatic reactions. The International Union of Biochemistry and Molecular Biology has catalogued more than 140 PLP-depende ...
(PLP), several N-terminal amino acids can undergo
transamination
Transamination is a chemical reaction that transfers an amino group to a ketoacid to form new amino acids. This pathway is responsible for the deamination of most amino acids. This is one of the major degradation pathways which convert essentia ...
to yield N-terminal
aldehyde
In organic chemistry, an aldehyde () is an organic compound containing a functional group with the structure . The functional group itself (without the "R" side chain) can be referred to as an aldehyde but can also be classified as a formyl group ...
, such as
glycine
Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid ( carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinog ...
and
aspartic acid (third reaction in Figure 4).
An example of C-termini modification is the
native chemical ligation Native Chemical Ligation (NCL) is an important extension of the chemical ligation concept for constructing a larger polypeptide chain by the covalent condensation of two or more unprotected peptides segments. Native chemical ligation is the most e ...
(NCL), which is the coupling between a C-terminal
thioester and a N-terminal cysteine (Figure 5).
Bioorthogonal reactions
Modification of ketones and aldehydes
A ketone or aldehyde can be attached to a protein through the oxidation of N-terminal serine residues or transamination with PLP. Additionally, they can be introduced by incorporating
unnatural amino acid
In biochemistry, non-coded or non-proteinogenic amino acids are distinct from the 22 proteinogenic amino acids (21 in eukaryotesplus formylmethionine in eukaryotes with prokaryote organelles like mitochondria) which are naturally encoded in the ge ...
s via the
Tirrell method
Tirrell may refer to:
People Surname
*Albert Tirrell (1824–1880), man acquitted of murder by the sleepwalking defense
*Alf Tirrell (1894–1944), English footballer
*Charles Q. Tirrell (1844–1910), American politician
*David A. Tirrell (b ...
or
Schultz method.
They will then selectively condense with an alkoxyamine and a
hydrazine, producing
oxime and
hydrazone derivatives (shown in the first and second reactions, respectively, in Figure 6). This reaction is highly chemoselective in terms of protein bioconjugation, but the
reaction rate is slow. The mechanistic studies show that the
rate determining step is the
dehydration of
tetrahedral intermediate, so a mild
acidic
In computer science, ACID ( atomicity, consistency, isolation, durability) is a set of properties of database transactions intended to guarantee data validity despite errors, power failures, and other mishaps. In the context of databases, a s ...
solution is often employed to accelerate the dehydration step.

The introduction of nucleophilic catalyst can significantly enhance reaction rate (shown in Figure 7). For example, using
aniline as a nucleophilic catalyst, a less populated protonated
carbonyl
In organic chemistry, a carbonyl group is a functional group composed of a carbon atom double-bonded to an oxygen atom: C=O. It is common to several classes of organic compounds, as part of many larger functional groups. A compound containin ...
becomes a highly populated protonated
Schiff base.
In other words, it generates a high concentration of reactive electrophile. The oxime ligation can then occur readily, and it has been reported that the rate increased up to 400 times under mild acidic condition.
The key of this catalyst is that it can generate a reactive electrophile without competing with desired product.

Recent developments that exploit proximal functional groups have enabled hydrazone condensations to operate at 20 M
−1s
−1 at neutral pH while oxime condensations have been discovered which proceed at 500-10000 M
−1s
−1 at neutral pH without added catalysts.
Staudinger ligation with azides
The
Staudinger ligation
The Staudinger reaction is a chemical reaction of an organic azide with a phosphine or phosphite produces an iminophosphorane. The reaction was discovered by and named after Hermann Staudinger. The reaction follows this stoichiometry:
:R3P + R' ...
of azides and
phosphine has been used extensively in field of chemical biology. Because it is able to form a stable
amide bond
In organic chemistry, an amide, also known as an organic amide or a carboxamide, is a compound with the general formula , where R, R', and R″ represent organic groups or hydrogen atoms. The amide group is called a peptide bond when it is p ...
in living cells and animals, it has been applied to modification of
cell membrane
The cell membrane (also known as the plasma membrane (PM) or cytoplasmic membrane, and historically referred to as the plasmalemma) is a biological membrane that separates and protects the interior of all cells from the outside environment (the ...
,
''in vivo'' imaging, and other bioconjugation studies.

Contrasting with the classic Staudinger reaction, Staudinger ligation is a
second order reaction
In chemistry, the rate law or rate equation for a reaction is an equation that links the initial or forward reaction rate with the concentrations or pressures of the reactants and constant parameters (normally rate coefficients and partial reactio ...
in which the rate-limiting step is the formation of
phosphazide An ylide or ylid () is a electric charge, neutral Dipole, dipolar molecule containing a formally negatively charged atom (usually a carbanion) directly attached to a heteroatom with a formal positive charge (usually nitrogen, phosphorus or sulfur), ...
(specific reaction mechanism shown in Figure 9). The
triphenylphosphine
Triphenylphosphine (IUPAC name: triphenylphosphane) is a common organophosphorus compound with the formula P(C6H5)3 and often abbreviated to P Ph3 or Ph3P. It is widely used in the synthesis of organic and organometallic compounds. PPh3 exists ...
first reacts with the azide to yield an azaylide through a four-membered ring
transition state
In chemistry, the transition state of a chemical reaction is a particular configuration along the reaction coordinate. It is defined as the state corresponding to the highest potential energy along this reaction coordinate. It is often marked ...
, and then an intramolecular reaction leads to the
iminophosphorane intermediate, which will then give the amide-linkage under hydrolysis.
Huisgen cyclization of azides
= Copper catalyzed Huisgen cyclization of azides
=
Azide has become a popular target for chemoselective protein modification, because they are small in size and have a favorable
thermodynamic reaction potential. One such azide reactions is the
+2cycloaddition reaction with
alkyne, but the reaction requires high temperature and often gives mixtures of
regioisomers.

An improved reaction developed by chemist
Karl Barry Sharpless involves the copper (I) catalyst, which couples azide with terminal alkyne that only give 1,4 substituted 1,2,3 triazoles in high yields (shown below in Figure 11). The mechanistic study suggests a stepwise reaction.
The Cu (I) first couples with
acetylenes, and then it reacts with azide to generate a six-membered intermediate. The process is very robust that it occurs at pH ranging from 4 to 12, and copper (II) sulfate is often used as a catalyst in the presence of a
reducing agent.
= Strain promoted Huisgen cyclization of azides
=
Even though Staudinger ligation is a suitable bioconjugation in living cells without major toxicity, the phosphine's sensitivity to air oxidation and its poor
solubility
In chemistry, solubility is the ability of a substance, the solute, to form a solution with another substance, the solvent. Insolubility is the opposite property, the inability of the solute to form such a solution.
The extent of the solub ...
in water significantly hinder its efficiency. The copper (I) catalyzed azide-alkyne coupling has reasonable reaction rate and efficiency under physiological conditions, but copper poses significant
toxicity
Toxicity is the degree to which a chemical substance or a particular mixture of substances can damage an organism. Toxicity can refer to the effect on a whole organism, such as an animal, bacterium, or plant, as well as the effect on a subs ...
and sometimes interferes with protein functions in living cells. In 2004, chemist
Carolyn R. Bertozzi's lab developed a metal free
+2cycloaddition using strained cyclooctyne and azide. Cyclooctyne, which is the smallest stable cycloalkyne, can couple with azide through
+2cycloaddition, leading to two regioisomeric triazoles (Figure 12).
The reaction occurs readily at
room temperature
Colloquially, "room temperature" is a range of air temperatures that most people prefer for indoor settings. It feels comfortable to a person when they are wearing typical indoor clothing. Human comfort can extend beyond this range depending on ...
and therefore can be used to effectively modify living cells without negative effects. It has also been reported that the installation of
fluorine substituents on a cyclic alkyne can greatly accelerate the reaction rate.
Examples of Applied Bioconjugation Techniques
Growth Factors
Bioconjugation of TGF-β to iron oxide nanoparticles and its activation through magnetic hyperthermia in-vitro has been reported. This was done by using 1-(3-dimethylaminopropyl)ethylcarbodiimide combined with N-Hydroxysuccinimide to form primary amide bonds with the free primary amines on the growth factor. Carbon nanotubes have been successfully used in conjunction with bioconjugation to link TGF-β followed by an activation with near-infrared light. Typically, these reactions have involved the use of a crosslinker, but some of these add molecular space between the compound of interest and base material and in turn causes higher degrees of non-specific binding and unwanted reactivity.
See also
*
Immunofluorescence
*
Biomolecular engineering
*
Biotinylation
*
SpyTag/SpyCatcher
*
Unnatural amino acids
In biochemistry, non-coded or non-proteinogenic amino acids are distinct from the 22 proteinogenic amino acids (21 in eukaryotesplus formylmethionine in eukaryotes with prokaryote organelles like mitochondria) which are naturally encoded in the ge ...
* ''
Bioconjugate Chemistry'' journal
References
{{reflist, 30em
Biochemistry
Chemical bonding