Cell-free protein synthesis, also known as ''
in vitro
''In vitro'' (meaning in glass, or ''in the glass'') studies are performed with microorganisms, cells, or biological molecules outside their normal biological context. Colloquially called "test-tube experiments", these studies in biology and ...
''
protein synthesis
Protein biosynthesis (or protein synthesis) is a core biological process, occurring inside cells, balancing the loss of cellular proteins (via degradation or export) through the production of new proteins. Proteins perform a number of critical ...
or CFPS, is the production of
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
using
biological
Biology is the scientific study of life. It is a natural science with a broad scope but has several unifying themes that tie it together as a single, coherent field. For instance, all organisms are made up of cells that process hereditary ...
machinery in a
cell-free system A cell-free system is an ''in vitro'' tool widely used to study biological reactions that happen within cells apart from a full cell system, thus reducing the complex interactions typically found when working in a whole cell. Subcellular fractions ...
, that is, without the use of living
cells
Cell most often refers to:
* Cell (biology), the functional basic unit of life
Cell may also refer to:
Locations
* Monastic cell, a small room, hut, or cave in which a religious recluse lives, alternatively the small precursor of a monastery w ...
. The ''in vitro'' protein synthesis environment is not constrained by a
cell wall
A cell wall is a structural layer surrounding some types of cells, just outside the cell membrane. It can be tough, flexible, and sometimes rigid. It provides the cell with both structural support and protection, and also acts as a filtering mec ...
or
homeostasis
In biology, homeostasis (British English, British also homoeostasis) Help:IPA/English, (/hɒmɪə(ʊ)ˈsteɪsɪs/) is the state of steady internal, physics, physical, and chemistry, chemical conditions maintained by organism, living systems. Thi ...
conditions necessary to maintain cell viability.
Thus, CFPS enables direct access and control of the
translation
Translation is the communication of the Meaning (linguistic), meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The ...
environment which is advantageous for a number of applications including co-translational solubilisation of membrane proteins, optimisation of protein production, incorporation of non-natural amino acids, selective and site-specific labelling. Due to the open nature of the system, different expression conditions such as pH,
redox potential
Redox potential (also known as oxidation / reduction potential, ''ORP'', ''pe'', ''E_'', or E_) is a measure of the tendency of a chemical species to acquire electrons from or lose electrons to an electrode and thereby be reduced or oxidised respe ...
s, temperatures, and
chaperones can be screened. Since there is no need to maintain cell viability, toxic proteins can be produced.
Introduction
Common components of a cell-free reaction include a cell extract, an energy source, a supply of
amino acids,
cofactors
Cofactor may also refer to:
* Cofactor (biochemistry), a substance that needs to be present in addition to an enzyme for a certain reaction to be catalysed
* A domain parameter in elliptic curve cryptography, defined as the ratio between the orde ...
such as
magnesium
Magnesium is a chemical element with the symbol Mg and atomic number 12. It is a shiny gray metal having a low density, low melting point and high chemical reactivity. Like the other alkaline earth metals (group 2 of the periodic ...
, and the
DNA with the desired
genes
In biology, the word gene (from , ; "... Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
. A cell extract is obtained by
lysing the cell of interest and
centrifuging out the cell walls, DNA
genome
In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ...
, and other debris. The remains are the necessary cell machinery including
ribosomes
Ribosomes ( ) are macromolecular machines, found within all cells, that perform biological protein synthesis (mRNA translation). Ribosomes link amino acids together in the order specified by the codons of messenger RNA (mRNA) molecules to ...
,
aminoacyl-tRNA synthetases, translation initiation and
elongation factors
Elongation factors are a set of proteins that function at the ribosome, during protein synthesis, to facilitate translational elongation from the formation of the first to the last peptide bond of a growing polypeptide. Most common elongatio ...
,
nucleases
A nuclease (also archaically known as nucleodepolymerase or polynucleotidase) is an enzyme capable of cleaving the phosphodiester bonds between nucleotides of nucleic acids. Nucleases variously effect single and double stranded breaks in their t ...
, etc.
Two types of DNA can be used in CFPS:
plasmids
A plasmid is a small, extrachromosomal DNA molecule within a cell that is physically separated from chromosomal DNA and can replicate independently. They are most commonly found as small circular, double-stranded DNA molecules in bacteria; howev ...
and
linear expression templates
Linearity is the property of a mathematical relationship (''function'') that can be graphically represented as a straight line. Linearity is closely related to '' proportionality''. Examples in physics include rectilinear motion, the linear re ...
(LETs). Plasmids are circular, and only made inside cells. LETs can be made much more effectively via
PCR PCR or pcr may refer to:
Science
* Phosphocreatine, a phosphorylated creatine molecule
* Principal component regression, a statistical technique
Medicine
* Polymerase chain reaction
** COVID-19 testing, often performed using the polymerase chain r ...
, which replicates DNA much faster than raising cells in an
incubator. While LETs are easier and faster to make, plasmid yields are usually much higher in CFPS. Because of this, much research today is focused on optimizing CFPS LET yields to approach the yields of CFPS with plasmids.
An energy source is an important part of a cell-free reaction. Usually, a separate mixture containing the needed energy source, along with a supply of amino acids, is added to the extract for the reaction. Common sources are
phosphoenol pyruvate
Phosphoenolpyruvate (2-phosphoenolpyruvate, PEP) is the ester derived from the enol of pyruvate and phosphate. It exists as an anion. PEP is an important intermediate in biochemistry. It has the highest-energy phosphate bond found (−61.9 kJ/m ...
,
acetyl phosphate
In organic chemistry, acetyl is a functional group with the chemical formula and the structure . It is sometimes represented by the symbol Ac (not to be confused with the element actinium). In IUPAC nomenclature, acetyl is called ethanoyl ...
, and
creatine phosphate
Phosphocreatine, also known as creatine phosphate (CP) or PCr (Pcr), is a phosphorylated form of creatine that serves as a rapidly mobilizable reserve of high-energy phosphates in skeletal muscle, myocardium and the brain to recycle adenosine tri ...
.
Advantages and Applications
CFPS has many advantages over the traditional ''
in vivo
Studies that are ''in vivo'' (Latin for "within the living"; often not italicized in English) are those in which the effects of various biological entities are tested on whole, living organisms or cells, usually animals, including humans, and ...
'' synthesis of proteins. Most notably, a cell-free reaction, including extract preparation, usually takes 1 –2 days, whereas ''in vivo'' protein expression may take 1–2 weeks.
CFPS is an open reaction. The lack of cell wall allows direct manipulation of the chemical environment. Samples are easily taken, concentrations optimized, and the reaction can be monitored. In contrast, once DNA is inserted into live cells, the reaction cannot be accessed until it is over and the cells are lysed.
Another advantage to CFPS is the lack of concern for toxicity. Some desired proteins and labeled proteins are toxic to cells when synthesized.
Since live cells are not being used, the toxicity of the product protein is not a significant concern.
These advantages enable numerous applications.
A major application of CFPS is incorporation of unnatural amino acids into
protein structure
Protein structure is the molecular geometry, three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single ami ...
s (see
expanded genetic code
An expanded genetic code is an artificially modified genetic code in which one or more specific codons have been re-allocated to encode an amino acid that is not among the 22 common naturally-encoded proteinogenic amino acids.
The key prerequisit ...
). The openness of the reaction is ideal for inserting the modified
tRNAs
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino a ...
and unnatural amino acids required for such a reaction.
Synthetic biology has many other uses and is a bright future in fields such as
protein evolution
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respond ...
,
nanomachines
A molecular machine, nanite, or nanomachine is a molecular component that produces quasi-mechanical movements (output) in response to specific stimuli (input). In cellular biology, macromolecular machines frequently perform tasks essential for ...
,
nucleic acid circuits, and synthesis of
virus
A virus is a wikt:submicroscopic, submicroscopic infectious agent that replicates only inside the living Cell (biology), cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and ...
-like particles for
vaccines
A vaccine is a biological preparation that provides active acquired immunity to a particular infectious or malignant disease. The safety and effectiveness of vaccines has been widely studied and verified.[< ...]
and
drug therapy
Pharmacotherapy is therapy using pharmaceutical drugs, as distinguished from therapy using surgery (surgical therapy), radiation (radiation therapy), movement (physical therapy), or other modes. Among physicians, sometimes the term ''medical the ...
.
Limitations
One challenge associated with CFPS is the degradation of the DNA by
endogenous
Endogenous substances and processes are those that originate from within a living system such as an organism, tissue, or cell.
In contrast, exogenous substances and processes are those that originate from outside of an organism.
For example, ...
nucleases in the cell extract. This is particularly problematic with LETs. Cells have
endonucleases
Endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain. Some, such as deoxyribonuclease I, cut DNA relatively nonspecifically (without regard to sequence), while many, typically called restriction endonuclea ...
that attack random sites of a DNA strands; however, much more common are the
exonucleases
Exonucleases are enzymes that work by cleaving nucleotides one at a time from the end (exo) of a polynucleotide chain. A hydrolyzing reaction that breaks phosphodiester bonds at either the 3′ or the 5′ end occurs. Its close relative is th ...
which attack DNA from the ends. Since plasmids are circular and have no end to which the exonucleases may attach, they are not affected by the latter. LETs, however, are susceptible to both. Because of LET vulnerability, much research today is focused on optimizing CFPS LET yields to approach the yields of CFPS using plasmids.
One example of this improved protection with plasmids is use of the
bacteriophage lambda gam protein. Gam is an inhibitor of
RecBCD
Exodeoxyribonuclease V (EC 3.1.11.5, RecBCD, Exonuclease V, ''Escherichia coli'' exonuclease V, ''E. coli'' exonuclease V, gene recBC endoenzyme, RecBC deoxyribonuclease, gene recBC DNase, gene recBCD enzymes) is an enzyme of Escherichia coli, ''E ...
, an exonuclease found in ''
Escherichia coli
''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a Gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus '' Esc ...
'' (''E. coli''). With the use of gam, CFPS yields with LETs were greatly increased, and were comparable to CFPS yields with plasmids. PURE extracts can also be made, eliminating the concern of exonucleases. These extracts are expensive to make and are not currently an economical solution to the issue of exogenous DNA degradation.
Types of Cell-free systems
Common cell extracts in use today are made from ''E. coli'' (ECE),
rabbit
Rabbits, also known as bunnies or bunny rabbits, are small mammals in the family Leporidae (which also contains the hares) of the order Lagomorpha (which also contains the pikas). ''Oryctolagus cuniculus'' includes the European rabbit s ...
reticulocytes
Reticulocytes are immature red blood cells (RBCs). In the process of erythropoiesis (red blood cell formation), reticulocytes develop and mature in the bone marrow and then circulate for about a day in the blood stream before developing into mat ...
(RRL),
wheat germ
Cereal germ or Wheat germ:
The germ of a cereal is the reproductive part that germinates to grow into a plant; it is the embryo of the seed. Along with bran, germ is often a by-product of the milling that produces refined grain products. C ...
(WGE),
insect
Insects (from Latin ') are pancrustacean hexapod invertebrates of the class Insecta. They are the largest group within the arthropod phylum. Insects have a chitinous exoskeleton, a three-part body (head, thorax and abdomen), three pairs ...
cells (ICE) and Yeast ''Kluyveromyces'' (
the D2P system).
All of these extracts are commercially available.
ECE is the most popular lysate for several reasons. It is the most inexpensive extract and the least time intensive to create. Also, large amounts of ''E. coli'' are easily grown, and then easily lysed through use of a
homogenizer A homogenizer is a piece of laboratory or industrial equipment used for the homogenization of various types of material, such as tissue, plant, food, soil, and many others. Many different models have been developed using various physical technologie ...
or a
sonicator
A sonicator at the Weizmann Institute of Science during sonicationSonication is the act of applying sound energy to agitate particles in a sample, for various purposes such as the extraction of multiple compounds from plants, microalgae and seawe ...
.
ECE also provides the highest protein yields. However, high yield production can limit the complexity of the synthesized protein, particularly in
post-translational modification
Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribos ...
. In that regard, the lower efficient
eukaryotic
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bact ...
systems could be advantageous, provided that modifying
enzyme
Enzymes () are proteins that act as biological catalysts by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different molecule ...
systems have been maintained in the extracts.
Each eukaryotic system has their advantages and disadvantages. For example, WGE extract produces the highest yields of the three eukaryotic extracts; however, it is not as effective for some post-translational modifications such as
glycosylation
Glycosylation is the reaction in which a carbohydrate (or 'glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not ...
.
When choosing an extract, the type of post-translational modification, desired yields, and cost should be taken into account.
History
Cell-free protein synthesis has been used for over 60 years, and notably, the first elucidation of a
codon
The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links ...
was done by
Marshall Nirenberg
Marshall Warren Nirenberg (April 10, 1927 – January 15, 2010) was an American biochemist and geneticist. He shared a Nobel Prize in Physiology or Medicine in 1968 with Har Gobind Khorana and Robert W. Holley for "breaking the genetic code" an ...
and
Heinrich J. Matthaei in 1961 at the National Institutes of Health.
They used a cell-free system to translate a poly-
uracil
Uracil () (symbol U or Ura) is one of the four nucleobases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via two hydrogen bonds. In DNA, the uracil nucleobase is replaced ...
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
sequence (or UUUUU... in
biochemical
Biochemistry or biological chemistry is the study of chemical processes within and relating to living organisms. A sub-discipline of both chemistry and biology, biochemistry may be divided into three fields: structural biology, enzymology ...
terms) and discovered that the
polypeptide
Peptides (, ) are short chains of amino acids linked by peptide bonds. Long chains of amino acids are called proteins. Chains of fewer than twenty amino acids are called oligopeptides, and include dipeptides, tripeptides, and tetrapeptides ...
they had synthesized consisted of only the amino acid
phenylalanine
Phenylalanine (symbol Phe or F) is an essential α-amino acid with the formula . It can be viewed as a benzyl group substituted for the methyl group of alanine, or a phenyl group in place of a terminal hydrogen of alanine. This essential amino a ...
. They thereby deduced from this poly-phenylalanine that the codon UUU specified the amino-acid phenylalanine. Extending this work, Nirenberg and his coworkers were able to determine the nucleotide makeup of each codon.
See also
*
Nirenberg and Matthaei experiment
*
Polymerase chain reaction optimization
References
*
Further reading
*
{{DEFAULTSORT:Cell-Free Protein Synthesis
Cell biology
Synthetic biology
Protein biosynthesis