Genetic engineering techniques allow the modification of
animal
Animals are multicellular, eukaryotic organisms in the Biology, biological Kingdom (biology), kingdom Animalia (). With few exceptions, animals heterotroph, consume organic material, Cellular respiration#Aerobic respiration, breathe oxygen, ...
and
plant genomes. Techniques have been devised to insert, delete, and modify
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
at multiple levels, ranging from a specific
base pair
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
in a specific
gene
In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protei ...
to entire genes. There are a number of steps that are followed before a
genetically modified organism
A genetically modified organism (GMO) is any organism whose genetic material has been altered using genetic engineering techniques. The exact definition of a genetically modified organism and what constitutes genetic engineering varies, with ...
(GMO) is created. Genetic engineers must first choose what
gene
In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protei ...
they wish to insert, modify, or delete. The gene must then be isolated and incorporated, along with other genetic elements, into a suitable
vector
Vector most often refers to:
* Euclidean vector, a quantity with a magnitude and a direction
* Disease vector, an agent that carries and transmits an infectious pathogen into another living organism
Vector may also refer to:
Mathematics a ...
. This vector is then used to insert the gene into the host genome, creating a transgenic or edited organism.
The ability to genetically engineer organisms is built on years of research and discovery on gene function and manipulation. Important advances included the discovery of
restriction enzyme
A restriction enzyme, restriction endonuclease, REase, ENase or'' restrictase '' is an enzyme that cleaves DNA into fragments at or near specific recognition sites within molecules known as restriction sites. Restriction enzymes are one class o ...
s,
DNA ligases, and the development of
polymerase chain reaction
The polymerase chain reaction (PCR) is a method widely used to make millions to billions of copies of a specific DNA sample rapidly, allowing scientists to amplify a very small sample of DNA (or a part of it) sufficiently to enable detailed st ...
and
sequencing
In genetics and biochemistry, sequencing means to determine the primary structure (sometimes incorrectly called the primary sequence) of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succ ...
.
Added genes are often accompanied by
promoter and
terminator regions as well as a
selectable marker
A selectable marker is a gene introduced into cell (biology), cells, especially bacteria or cells in cell culture, culture, which confers one or more traits suitable for artificial selection. They are a type of reporter gene used in laboratory micr ...
gene. The added gene may itself be modified to make it
express
Express, The Expresss or EXPRESS may refer to:
Arts, entertainment and media Film
* ''Express: Aisle to Glory'', a 1998 comedy short film featuring Kal Penn
* ''The Express: The Ernie Davis Story'', a 2008 film starring Dennis Quaid
* The Expre ...
more efficiently. This vector is then inserted into the host organism's genome. For animals, the gene is typically inserted into
embryonic stem cells, while in plants it can be inserted into any tissue that can be
cultured into a fully developed plant.
Tests are carried out on the modified organism to ensure stable integration,
inheritance
Inheritance is the practice of receiving private property, titles, debts, entitlements, privileges, rights, and obligations upon the death of an individual. The rules of inheritance differ among societies and have changed over time. Offi ...
and expression. First generation offspring are
heterozygous
Zygosity (the noun, zygote, is from the Greek "yoked," from "yoke") () is the degree to which both copies of a chromosome or gene have the same genetic sequence. In other words, it is the degree of similarity of the alleles in an organism.
Mos ...
, requiring them to be inbred to create the homozygous pattern necessary for stable inheritance. Homozygosity must be confirmed in second generation specimens.
Early techniques randomly inserted the genes into the genome. Advances allow targeting specific locations, which reduces unintended side effects. Early techniques relied on
meganucleases and
zinc finger nucleases. Since 2009 more accurate and easier systems to implement have been developed.
Transcription activator-like effector nuclease
Transcription activator-like effector nucleases (TALEN) are restriction enzymes that can be engineered to cut specific sequences of DNA. They are made by fusing a TAL effector DNA-binding domain to a DNA cleavage domain (a nuclease which cuts DN ...
s (TALENs) and the
Cas9-guideRNA system (adapted from
CRISPR
CRISPR (; acronym of clustered regularly interspaced short palindromic repeats) is a family of DNA sequences found in the genomes of prokaryotic organisms such as bacteria and archaea. Each sequence within an individual prokaryotic CRISPR is d ...
) are the two most common.
History
Many different discoveries and advancements led to the development of
genetic engineering
Genetic engineering, also called genetic modification or genetic manipulation, is the modification and manipulation of an organism's genes using technology. It is a set of Genetic engineering techniques, technologies used to change the genet ...
. Human-directed genetic manipulation began with the
domestication
Domestication is a multi-generational Mutualism (biology), mutualistic relationship in which an animal species, such as humans or leafcutter ants, takes over control and care of another species, such as sheep or fungi, to obtain from them a st ...
of plants and animals through
artificial selection
Selective breeding (also called artificial selection) is the process by which humans use animal breeding and plant breeding to selectively develop particular phenotypic traits (characteristics) by choosing which typically animal or plant ...
in about 12,000 BC.
Various techniques were developed to aid in breeding and selection.
Hybridization was one way rapid changes in an organism's genetic makeup could be introduced. Crop hybridization most likely first occurred when humans began growing genetically distinct individuals of related species in close proximity.
Some plants were able to be propagated by
vegetative cloning.
Genetic inheritance was first discovered by
Gregor Mendel
Gregor Johann Mendel Order of Saint Augustine, OSA (; ; ; 20 July 1822 – 6 January 1884) was an Austrian Empire, Austrian biologist, meteorologist, mathematician, Augustinians, Augustinian friar and abbot of St Thomas's Abbey, Brno, St. Thom ...
in 1865, following experiments crossing peas. In 1928
Frederick Griffith proved the existence of a "transforming principle" involved in inheritance, which was identified as DNA in 1944 by
Oswald Avery, Colin MacLeod, and Maclyn McCarty.
Frederick Sanger
Frederick Sanger (; 13 August 1918 – 19 November 2013) was a British biochemist who received the Nobel Prize in Chemistry twice.
He won the 1958 Chemistry Prize for determining the amino acid sequence of insulin and numerous other prote ...
developed a method for sequencing DNA in 1977, greatly increasing the genetic information available to researchers.
After discovering the existence and properties of
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
, tools had to be developed that allowed it to be manipulated. In 1970
Hamilton Smiths lab discovered
restriction enzymes, enabling scientists to isolate genes from an organism's genome.
DNA ligases, which join broken DNA together, were discovered earlier in 1967.
By combining the two enzymes it became possible to "cut and paste" DNA sequences to create
recombinant DNA
Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) that bring together genetic material from multiple sources, creating sequences that would not otherwise be fo ...
.
Plasmid
A plasmid is a small, extrachromosomal DNA molecule within a cell that is physically separated from chromosomal DNA and can replicate independently. They are most commonly found as small circular, double-stranded DNA molecules in bacteria and ...
s, discovered in 1952, became important
tools
A tool is an object that can extend an individual's ability to modify features of the surrounding environment or help them accomplish a particular task. Although many animals use simple tools, only human beings, whose use of stone tools dates ...
for transferring information between cells and
replicating DNA sequences.
Polymerase chain reaction
The polymerase chain reaction (PCR) is a method widely used to make millions to billions of copies of a specific DNA sample rapidly, allowing scientists to amplify a very small sample of DNA (or a part of it) sufficiently to enable detailed st ...
(PCR), developed by
Kary Mullis in 1983, allowed small sections of DNA to be amplified (replicated) and aided identification and isolation of genetic material.
As well as manipulating DNA, techniques had to be developed for its insertion into an organism's genome. Griffith's experiment had already shown that some bacteria had
the ability to naturally uptake and express foreign DNA. Artificial competence was induced in ''
Escherichia coli
''Escherichia coli'' ( )Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. is a gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus '' Escherichia'' that is commonly fo ...
'' in 1970 by treating them with
calcium chloride
Calcium chloride is an inorganic compound, a Salt (chemistry), salt with the chemical formula . It is a white crystalline solid at room temperature, and it is highly soluble in water. It can be created by neutralising hydrochloric acid with cal ...
solution (CaCl
2). Transformation using
electroporation was developed in the late 1980s, increasing the efficiency and bacterial range. In 1907 a bacterium that caused plant tumors, ''
Agrobacterium tumefaciens
''Agrobacterium tumefaciens'' (also known as ''Rhizobium radiobacter'') is the causal agent of crown gall disease (the formation of tumours) in over 140 species of eudicots. It is a rod-shaped, Gram-negative soil bacterium. Symptoms are cause ...
'', had been discovered. In the early 1970s it was found that this bacteria inserted its DNA into plants using a
Ti plasmid
A tumour inducing (Ti) plasmid is a plasmid found in pathogenic species of ''Agrobacterium'', including Agrobacterium tumefaciens, ''A. tumefaciens'', Rhizobium rhizogenes, ''A. rhizogenes'', ''A. rubi'' and Allorhizobium vitis, ''A. vitis''.
Ev ...
. By removing the genes in the plasmid that caused the tumor and adding in novel genes, researchers were able to infect plants with ''A. tumefaciens'' and let the bacteria insert their chosen DNA into the genomes of the plants.
Choosing target genes
The first step is to identify the target gene or genes to insert into the host organism. This is driven by the goal for the resultant organism. In some cases only one or two genes are affected. For more complex objectives entire
biosynthetic pathways involving multiple genes may be involved. Once found genes and other genetic information from a wide range of organisms can be inserted into bacteria for storage and modification, creating
genetically modified bacteria in the process. Bacteria are cheap, easy to grow,
clonal, multiply quickly, relatively easy to transform and can be stored at -80 °C almost indefinitely. Once a gene is isolated it can be stored inside the bacteria providing an unlimited supply for research.
Genetic screens can be carried out to determine potential genes followed by other tests that identify the best candidates. A simple screen involves randomly
mutating DNA with chemicals or radiation and then selecting those that display the desired trait. For organisms where mutation is not practical, scientists instead look for individuals among the population who present the characteristic through naturally-occurring mutations. Processes that look at a
phenotype
In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology (physical form and structure), its developmental processes, its biochemical and physiological propert ...
and then try and identify the gene responsible are called
forward genetics
Forward genetics is a molecular genetics approach of determining the genetic basis responsible for a phenotype. Forward genetics provides an unbiased approach because it relies heavily on identifying the genes or genetic factors that cause a partic ...
. The gene then needs to be
mapped by comparing the inheritance of the phenotype with known
genetic marker
A genetic marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify individuals or species. It can be described as a variation (which may arise due to mutation or alteration in the genomic loci) that can ...
s. Genes that are close together are likely to be inherited together.
Another option is
reverse genetics. This approach involves targeting a specific gene with a mutation and then observing what phenotype develops.
The mutation can be designed to
inactivate the gene or only allow it to become active under certain conditions.
Conditional mutations are useful for identifying genes that are normally lethal if non-functional. As genes with similar functions share similar sequences (
homologous) it is possible to predict the likely function of a gene by comparing its sequence to that of well-studied genes from
model organism
A model organism is a non-human species that is extensively studied to understand particular biological phenomena, with the expectation that discoveries made in the model organism will provide insight into the workings of other organisms. Mo ...
s.
The development of
microarrays
A microarray is a multiplex lab-on-a-chip. Its purpose is to simultaneously detect the expression of thousands of biological interactions. It is a two-dimensional array on a solid substrate—usually a glass slide or silicon thin-film cell� ...
,
transcriptome
The transcriptome is the set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells. The term can also sometimes be used to refer to all RNAs, or just mRNA, depending on the particular experiment. The ...
s and
genome sequencing
Whole genome sequencing (WGS), also known as full genome sequencing or just genome sequencing, is the process of determining the entirety of the DNA sequence of an organism's genome at a single time. This entails sequencing all of an organism's ...
has made it much easier to find desirable genes.
The bacteria ''
Bacillus thuringiensis'' was first discovered in 1901 as the causative agent in the death of
silkworms. Due to these insecticidal properties, the bacteria was used as a
biological insecticide, developed commercially in 1938. The
cry proteins were discovered to provide the insecticidal activity in 1956, and by the 1980s, scientists had successfully cloned the gene that encodes this protein and expressed it in plants. The gene that provides resistance to the herbicide
glyphosate
Glyphosate (IUPAC name: ''N''-(phosphonomethyl)glycine) is a broad-spectrum systemic herbicide and crop desiccant. It is an organophosphorus compound, specifically a phosphonate, which acts by EPSP inhibitor, inhibiting the plant enzyme 5-en ...
was found after seven years of searching in bacteria living in the outflow pipe of a
Monsanto
The Monsanto Company () was an American agrochemical and agricultural biotechnology corporation founded in 1901 and headquartered in Creve Coeur, Missouri. Monsanto's best-known product is Roundup, a glyphosate-based herbicide, developed ...
RoundUp manufacturing facility. In animals, the majority of genes used are
growth hormone
Growth hormone (GH) or somatotropin, also known as human growth hormone (hGH or HGH) in its human form, is a peptide hormone that stimulates growth, cell reproduction, and cell regeneration in humans and other animals. It is thus important in ...
genes.
Gene manipulation
All genetic engineering processes involve the modification of DNA. Traditionally DNA was isolated from the cells of organisms. Later, genes came to be
cloned from a DNA segment after the creation of a
DNA library
In molecular biology, a library is a collection of genetic material fragments that are stored and propagated in a population of microbes through the process of molecular cloning. There are different types of DNA libraries, including CDNA libra ...
or
artificially synthesised. Once isolated, additional genetic elements are added to the gene to allow it to be expressed in the host organism and to aid selection.
Extraction from cells
First the cell must be gently
opened, exposing the DNA without causing too much damage to it. The methods used vary depending on the type of cell. Once it is open, the DNA must be separated from the other cellular components. A ruptured cell contains proteins and other cell debris. By mixing with
phenol and/or chloroform, followed by
centrifuging, the nucleic acids can be separated from this debris into an upper
aqueous phase. This aqueous phase can be removed and further purified if necessary by repeating the phenol-chloroform steps. The nucleic acids can then be
precipitated from the aqueous solution using
ethanol
Ethanol (also called ethyl alcohol, grain alcohol, drinking alcohol, or simply alcohol) is an organic compound with the chemical formula . It is an Alcohol (chemistry), alcohol, with its formula also written as , or EtOH, where Et is the ps ...
or
isopropanol
Isopropyl alcohol (IUPAC name propan-2-ol and also called isopropanol or 2-propanol) is a colorless, flammable, organic compound with a pungent alcoholic odor.
Isopropyl alcohol, an organic polar molecule, is miscible in water, ethanol, an ...
. Any
RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
can be removed by adding a
ribonuclease that will degrade it. Many companies now sell kits that simplify the process.
Gene isolation
The gene researchers are looking to modify (known as the gene of interest) must be separated from the extracted DNA. If the sequence is not known then a common method is to break the DNA up with a random digestion method. This is usually accomplished using
restriction enzymes (enzymes that cut DNA). A partial
restriction digest cuts only some of the restriction sites, resulting in overlapping DNA fragment segments. The DNA fragments are put into individual
plasmid vectors and grown inside bacteria. Once in the bacteria the plasmid is copied as the
bacteria divides. To determine if a useful gene is present in a particular fragment, the DNA library is screened for the desired
phenotype
In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology (physical form and structure), its developmental processes, its biochemical and physiological propert ...
. If the phenotype is detected then it is possible that the bacteria contains the target gene.
If the gene does not have a detectable phenotype or a DNA library does not contain the correct gene, other methods must be used to isolate it. If the position of the gene can be determined using
molecular markers then
chromosome walking is one way to isolate the correct DNA fragment. If the gene expresses close
homology to a known gene in another species, then it could be isolated by searching for genes in the library that closely match the known gene.
For known DNA sequences, restriction enzymes that cut the DNA on either side of the gene can be used.
Gel electrophoresis
Gel electrophoresis is an electrophoresis method for separation and analysis of biomacromolecules (DNA, RNA, proteins, etc.) and their fragments, based on their size and charge through a gel. It is used in clinical chemistry to separate ...
then sorts the fragments according to length. Some gels can separate sequences that differ by a single
base-pair
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
. The DNA can be visualised by staining it with
ethidium bromide and photographing under
UV light. A
marker with fragments of known lengths can be laid alongside the DNA to estimate the size of each band. The DNA band at the correct size should contain the gene, where it can be excised from the gel.
Another technique to isolate genes of known sequences involves
polymerase chain reaction
The polymerase chain reaction (PCR) is a method widely used to make millions to billions of copies of a specific DNA sample rapidly, allowing scientists to amplify a very small sample of DNA (or a part of it) sufficiently to enable detailed st ...
(PCR). PCR is a powerful tool that can amplify a given sequence, which can then be isolated through gel electrophoresis. Its effectiveness drops with larger genes and it has the potential to introduce errors into the sequence.
It is possible to artificially
synthesise genes. Some synthetic sequences are available commercially, forgoing many of these early steps.
Modification
The gene to be inserted must be combined with other genetic elements in order for it to work properly. The gene can be modified at this stage for better expression or effectiveness. As well as the gene to be inserted most
constructs contain a
promoter and
terminator region as well as a
selectable marker
A selectable marker is a gene introduced into cell (biology), cells, especially bacteria or cells in cell culture, culture, which confers one or more traits suitable for artificial selection. They are a type of reporter gene used in laboratory micr ...
gene. The promoter region initiates
transcription of the gene and can be used to control the location and level of gene expression, while the terminator region ends transcription. A selectable marker, which in most cases confers
antibiotic resistance
Antimicrobial resistance (AMR or AR) occurs when microbes evolve mechanisms that protect them from antimicrobials, which are drugs used to treat infections. This resistance affects all classes of microbes, including bacteria (antibiotic resis ...
to the organism it is expressed in, is used to determine which cells are transformed with the new gene. The constructs are made using
recombinant DNA
Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) that bring together genetic material from multiple sources, creating sequences that would not otherwise be fo ...
techniques, such as
restriction digests,
ligations and
molecular cloning
Molecular cloning is a set of experimental methods in molecular biology that are used to assemble recombinant DNA molecules and to direct their DNA replication, replication within Host (biology), host organisms. The use of the word ''cloning'' re ...
.
Inserting DNA into the host genome
Once the gene is constructed it must be stably integrated into the genome of the target organism or exist as
extrachromosomal DNA
Extrachromosomal DNA (abbreviated ecDNA) is any DNA that is found off the chromosomes, either inside or outside the nucleus of a cell. Most DNA in an individual genome is found in chromosomes contained in the nucleus. Multiple forms of extrachrom ...
. There are a number of techniques available for inserting the gene into the host genome and they vary depending on the type of organism targeted. In multicellular
eukaryote
The eukaryotes ( ) constitute the Domain (biology), domain of Eukaryota or Eukarya, organisms whose Cell (biology), cells have a membrane-bound cell nucleus, nucleus. All animals, plants, Fungus, fungi, seaweeds, and many unicellular organisms ...
s, if the
transgene
A transgene is a gene that has been transferred naturally, or by any of a number of genetic engineering techniques, from one organism to another. The introduction of a transgene, in a process known as transgenesis, has the potential to change the ...
is incorporated into the host's
germline cells, the resulting host cell can pass the transgene to its
progeny. If the transgene is incorporated into
somatic cells, the transgene can not be inherited.
Transformation

Transformation is the direct alteration of a cell's genetic components by passing the genetic material through the
cell membrane
The cell membrane (also known as the plasma membrane or cytoplasmic membrane, and historically referred to as the plasmalemma) is a biological membrane that separates and protects the interior of a cell from the outside environment (the extr ...
. About 1% of bacteria are naturally able to
take up foreign DNA, but this ability can be induced in other bacteria.
Stressing the bacteria with a
heat shock or
electroporation can make the
cell membrane
The cell membrane (also known as the plasma membrane or cytoplasmic membrane, and historically referred to as the plasmalemma) is a biological membrane that separates and protects the interior of a cell from the outside environment (the extr ...
permeable to DNA that may then be incorporated into the genome or exist as extrachromosomal DNA. Typically the cells are incubated in a solution containing
divalent
In chemistry, the valence (US spelling) or valency (British spelling) of an atom is a measure of its combining capacity with other atoms when it forms chemical compounds or molecules. Valence is generally understood to be the number of chemica ...
cation
An ion () is an atom or molecule with a net electrical charge. The charge of an electron is considered to be negative by convention and this charge is equal and opposite to the charge of a proton, which is considered to be positive by convent ...
s (often
calcium chloride
Calcium chloride is an inorganic compound, a Salt (chemistry), salt with the chemical formula . It is a white crystalline solid at room temperature, and it is highly soluble in water. It can be created by neutralising hydrochloric acid with cal ...
) under cold conditions, before being exposed to a heat pulse (heat shock). Calcium chloride partially disrupts the cell membrane, which allows the recombinant DNA to enter the host cell. It is suggested that exposing the cells to divalent cations in cold condition may change or weaken the cell surface structure, making it more permeable to DNA. The heat-pulse is thought to create a thermal imbalance across the cell membrane, which forces the DNA to enter the cells through either cell pores or the damaged cell wall.
Electroporation is another method of promoting competence. In this method the cells are briefly shocked with an
electric field
An electric field (sometimes called E-field) is a field (physics), physical field that surrounds electrically charged particles such as electrons. In classical electromagnetism, the electric field of a single charge (or group of charges) descri ...
of 10-20
kV/cm, which is thought to create holes in the cell membrane through which the plasmid DNA may enter. After the electric shock, the holes are rapidly closed by the cell's membrane-repair mechanisms. Up-taken DNA can either integrate with the bacterials genome or, more commonly, exist as
extrachromosomal DNA
Extrachromosomal DNA (abbreviated ecDNA) is any DNA that is found off the chromosomes, either inside or outside the nucleus of a cell. Most DNA in an individual genome is found in chromosomes contained in the nucleus. Multiple forms of extrachrom ...
.

In plants the DNA is often inserted using
''Agrobacterium''-mediated recombination,
taking advantage of the ''Agrobacterium''s
T-DNA sequence that allows natural insertion of genetic material into plant cells. Plant tissue are cut into small pieces and soaked in a fluid containing suspended ''Agrobacterium''. The bacteria will attach to many of the plant cells exposed by the cuts. The bacteria uses
conjugation
Conjugation or conjugate may refer to:
Linguistics
*Grammatical conjugation, the modification of a verb from its basic form
*Emotive conjugation or Russell's conjugation, the use of loaded language
Mathematics
*Complex conjugation, the change o ...
to transfer a DNA segment called
T-DNA from its plasmid into the plant. The transferred DNA is piloted to the plant cell nucleus and integrated into the host plants genomic DNA.The plasmid T-DNA is integrated semi-randomly into the
genome
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
of the host cell.
By modifying the plasmid to express the gene of interest, researchers can insert their chosen gene stably into the plants genome. The only essential parts of the T-DNA are its two small (25 base pair) border repeats, at least one of which is needed for plant transformation.
The genes to be introduced into the plant are cloned into a
plant transformation vector that contains the T-DNA region of the
plasmid
A plasmid is a small, extrachromosomal DNA molecule within a cell that is physically separated from chromosomal DNA and can replicate independently. They are most commonly found as small circular, double-stranded DNA molecules in bacteria and ...
. An alternative method is
agroinfiltration.
Another method used to transform plant cells is
biolistics, where particles of gold or tungsten are coated with DNA and then shot into young plant cells or plant embryos. Some genetic material enters the cells and transforms them. This method can be used on plants that are not susceptible to ''Agrobacterium'' infection and also allows transformation of plant
plastids. Plants cells can also be transformed using electroporation, which uses an electric shock to make the cell membrane permeable to plasmid DNA. Due to the damage caused to the cells and DNA the transformation efficiency of biolistics and electroporation is lower than agrobacterial transformation.
Transfection
Transformation has a
different meaning in relation to animals, indicating progression to a cancerous state, so the process used to insert foreign DNA into animal cells is usually called transfection.
There are many ways to directly introduce DNA into animal cells ''
in vitro
''In vitro'' (meaning ''in glass'', or ''in the glass'') Research, studies are performed with Cell (biology), cells or biological molecules outside their normal biological context. Colloquially called "test-tube experiments", these studies in ...
''. Often these cells are
stem cell
In multicellular organisms, stem cells are undifferentiated or partially differentiated cells that can change into various types of cells and proliferate indefinitely to produce more of the same stem cell. They are the earliest type of cell ...
s that are used for
gene therapy
Gene therapy is Health technology, medical technology that aims to produce a therapeutic effect through the manipulation of gene expression or through altering the biological properties of living cells.
The first attempt at modifying human DNA ...
. Chemical based methods uses natural or synthetic compounds to form particles that facilitate the transfer of genes into cells.
These synthetic vectors have the ability to bind DNA and accommodate large genetic transfers.
One of the simplest methods involves using
calcium phosphate to bind the DNA and then exposing it to cultured cells. The solution, along with the DNA, is
encapsulated by the cells.
Liposomes and
polymer
A polymer () is a chemical substance, substance or material that consists of very large molecules, or macromolecules, that are constituted by many repeat unit, repeating subunits derived from one or more species of monomers. Due to their br ...
s can be used as vectors to deliver DNA into cultured animal cells. Positively charged liposomes bind with DNA, while polymers can designed that interact with DNA.
They form lipoplexes and polyplexes respectively, which are then up-taken by the cells. Other techniques include using electroporation and biolistics. In some cases, transfected cells may stably integrate external DNA into their own genome, this process is known as
stable transfection.
To create
transgenic animals the DNA must be inserted into viable embryos or eggs. This is usually accomplished using
microinjection
Microinjection is the use of a glass micropipette to inject a liquid substance at a microscopic or borderline macroscopic level. The target is often a living cell but may also include intercellular space. Microinjection is a simple mechanical pro ...
, where DNA is injected through the cell's
nuclear envelope
The nuclear envelope, also known as the nuclear membrane, is made up of two lipid bilayer membranes that in eukaryotic cells surround the nucleus, which encloses the genetic material.
The nuclear envelope consists of two lipid bilayer membran ...
directly into the
nucleus.
Superovulated fertilised eggs are collected at the single cell stage and cultured ''in vitro''. When the
pronuclei
A pronucleus (: pronuclei) denotes the Cell nucleus, nucleus found in either a sperm or egg cell during the process of Fertilisation, fertilization. The Spermatozoon, sperm cell undergoes a transformation into a pronucleus after entering the egg ...
from the sperm head and egg are visible through the
protoplasm the genetic material is injected into one of them. The
oocyte
An oocyte (, oöcyte, or ovocyte) is a female gametocyte or germ cell involved in reproduction. In other words, it is an immature ovum, or egg cell. An oocyte is produced in a female fetus in the ovary during female gametogenesis. The female ger ...
is then implanted in the
oviduct
The oviduct in vertebrates is the passageway from an ovary. In human females, this is more usually known as the fallopian tube. The eggs travel along the oviduct. These eggs will either be fertilized by spermatozoa to become a zygote, or will dege ...
of a
pseudopregnant animal.
Another method is Embryonic Stem Cell-Mediated Gene Transfer. The gene is transfected into
embryonic stem cell
Embryonic stem cells (ESCs) are Cell potency#Pluripotency, pluripotent stem cells derived from the inner cell mass of a blastocyst, an early-stage pre-Implantation (human embryo), implantation embryo. Human embryos reach the blastocyst stage 4� ...
s and then they are inserted into mouse
blastocyst
The blastocyst is a structure formed in the early embryonic development of mammals. It possesses an inner cell mass (ICM) also known as the ''embryoblast'' which subsequently forms the embryo, and an outer layer of trophoblast cells called the ...
s that are then implanted into foster mothers. The resulting offspring are
chimeric, and further mating can produce mice fully transgenic with the gene of interest.
Transduction
Transduction is the process by which foreign
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
is introduced into a cell by a
virus
A virus is a submicroscopic infectious agent that replicates only inside the living Cell (biology), cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are ...
or
viral vector
A viral vector is a modified virus designed to gene delivery, deliver genetic material into cell (biology), cells. This process can be performed inside an organism or in cell culture. Viral vectors have widespread applications in basic research, ...
. Genetically modified viruses can be used as
viral vector
A viral vector is a modified virus designed to gene delivery, deliver genetic material into cell (biology), cells. This process can be performed inside an organism or in cell culture. Viral vectors have widespread applications in basic research, ...
s to transfer target genes to another organism in
gene therapy
Gene therapy is Health technology, medical technology that aims to produce a therapeutic effect through the manipulation of gene expression or through altering the biological properties of living cells.
The first attempt at modifying human DNA ...
. First the virulent genes are removed from the virus and the target genes are inserted instead. The sequences that allow the virus to insert the genes into the host organism must be left intact. Popular virus vectors are developed from
retrovirus
A retrovirus is a type of virus that inserts a DNA copy of its RNA genome into the DNA of a host cell that it invades, thus changing the genome of that cell. After invading a host cell's cytoplasm, the virus uses its own reverse transcriptase e ...
es or
adenoviruses
Adenoviruses (members of the family ''Adenoviridae'') are medium-sized (90–100 nm), nonenveloped (without an outer lipid bilayer) viruses with an icosahedral nucleocapsid containing a double-stranded DNA genome. Their name derives from the ...
. Other viruses used as vectors include,
lentiviruses,
pox viruses and
herpes viruses. The type of virus used will depend on the cells targeted and whether the DNA is to be altered permanently or temporarily.
Regeneration
As often only a single cell is transformed with genetic material, the organism must be
regenerated from that single cell. In plants this is accomplished through the use of
tissue culture
Tissue culture is the growth of tissue (biology), tissues or cell (biology), cells in an artificial medium separate from the parent organism. This technique is also called micropropagation. This is typically facilitated via use of a liquid, semi-s ...
. Each plant species has different requirements for successful regeneration. If successful, the technique produces an adult plant that contains the
transgene
A transgene is a gene that has been transferred naturally, or by any of a number of genetic engineering techniques, from one organism to another. The introduction of a transgene, in a process known as transgenesis, has the potential to change the ...
in every cell. In animals it is necessary to ensure that the inserted DNA is present in the
embryonic stem cells.
Offspring can be screened for the gene. All offspring from the first generation are
heterozygous
Zygosity (the noun, zygote, is from the Greek "yoked," from "yoke") () is the degree to which both copies of a chromosome or gene have the same genetic sequence. In other words, it is the degree of similarity of the alleles in an organism.
Mos ...
for the inserted gene and must be inbred to produce a
homozygous
Zygosity (the noun, zygote, is from the Greek "yoked," from "yoke") () is the degree to which both copies of a chromosome or gene have the same genetic sequence. In other words, it is the degree of similarity of the alleles in an organism.
Mos ...
specimen. Bacteria consist of a single cell and reproduce clonally so regeneration is not necessary.
Selectable markers are used to easily differentiate transformed from untransformed cells.
Cells that have been successfully transformed with the DNA contain the marker gene, while those not transformed will not. By growing the cells in the presence of an antibiotic or chemical that
selects or marks the cells expressing that gene, it is possible to separate modified from unmodified cells. Another screening method involves a
DNA probe
In molecular biology, a hybridization probe (HP) is a fragment of DNA or RNA, usually 15–10000 nucleotides long, which can be radioactively or fluorescently labeled. HPs can be used to detect the presence of nucleotide sequences in analyzed ...
that sticks only to the inserted gene. These markers are usually present in the transgenic organism, although a number of strategies have been developed that can remove the selectable marker from the mature transgenic plant.
Confirmation
Finding that a recombinant organism contains the inserted genes is not usually sufficient to ensure that they will be appropriately expressed in the intended tissues. Further testing using PCR,
Southern hybridization, and
DNA sequencing
DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, thymine, cytosine, and guanine. The ...
is conducted to confirm that an organism contains the new gene. These tests can also confirm the chromosomal location and copy number of the inserted gene. Once confirmed methods that look for and measure the gene products (RNA and protein) are also used to assess gene expression, transcription, RNA processing patterns and expression and localization of protein product(s). These include
northern hybridisation, quantitative
RT-PCR,
Western blot
The western blot (sometimes called the protein immunoblot), or western blotting, is a widely used analytical technique in molecular biology and immunogenetics to detect specific proteins in a sample of tissue homogenate or extract. Besides detect ...
,
immunofluorescence
Immunofluorescence (IF) is a light microscopy-based technique that allows detection and localization of a wide variety of target biomolecules within a cell or tissue at a quantitative level. The technique utilizes the binding specificity of anti ...
,
ELISA
The enzyme-linked immunosorbent assay (ELISA) (, ) is a commonly used analytical biochemistry assay, first described by Eva Engvall and Peter Perlmann in 1971. The assay is a solid-phase type of enzyme immunoassay (EIA) to detect the presence of ...
and phenotypic analysis. When appropriate, the organism's offspring are studied to confirm that the transgene and associated
phenotype
In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology (physical form and structure), its developmental processes, its biochemical and physiological propert ...
are stably inherited.
Gene insertion targeting
Traditional methods of genetic engineering generally insert the new genetic material randomly within the host genome. This can impair or alter other genes within the organism. Methods were developed that inserted the new genetic material into
specific sites within an organism genome. Early methods that targeted genes at certain sites within a genome relied on
homologous recombination
Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in Cell (biology), cellular organi ...
(HR). By creating DNA constructs that contain a template that matches the targeted genome sequence, it is possible that the HR processes within the cell will insert the construct at the desired location. Using this method on
embryonic stem cell
Embryonic stem cells (ESCs) are Cell potency#Pluripotency, pluripotent stem cells derived from the inner cell mass of a blastocyst, an early-stage pre-Implantation (human embryo), implantation embryo. Human embryos reach the blastocyst stage 4� ...
s led to the development of
transgenic mice with targeted
knocked out. It has also been possible to
knock in genes or alter
gene expression
Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
patterns.
If a vital gene is knocked out it can prove lethal to the organism. In order to study the function of these genes,
site specific recombinases (SSR) were used. The two most common types are the
Cre-LoxP and
Flp-FRT systems.
Cre recombinase is an enzyme that removes DNA by homologous recombination between binding sequences known as Lox-P sites. The Flip-FRT system operates in a similar way, with the Flip recombinase recognizing FRT sequences. By crossing an organism containing the recombinase sites flanking the gene of interest with an organism that expresses the SSR under control of
tissue specific promoters, it is possible to knock out or switch on genes only in certain cells. This has also been used to remove marker genes from transgenic animals. Further modifications of these systems allowed researchers to induce recombination only under certain conditions, allowing genes to be knocked out or expressed at desired times or
stages of development.
Genome editing uses artificially engineered
nuclease
In biochemistry, a nuclease (also archaically known as nucleodepolymerase or polynucleotidase) is an enzyme capable of cleaving the phosphodiester bonds that link nucleotides together to form nucleic acids. Nucleases variously affect single and ...
s that create specific
double-stranded breaks at desired locations in the genome. The breaks are subject to cellular
DNA repair
DNA repair is a collection of processes by which a cell (biology), cell identifies and corrects damage to the DNA molecules that encode its genome. A weakened capacity for DNA repair is a risk factor for the development of cancer. DNA is cons ...
processes that can be exploited for targeted gene knock-out, correction or insertion at high frequencies. If a donor DNA containing the appropriate sequence (homologies) is present, then new genetic material containing the transgene will be integrated at the targeted site with high efficiency by
homologous recombination
Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in Cell (biology), cellular organi ...
.
There are four families of engineered nucleases:
meganucleases,
ZFNs,
transcription activator-like effector nuclease
Transcription activator-like effector nucleases (TALEN) are restriction enzymes that can be engineered to cut specific sequences of DNA. They are made by fusing a TAL effector DNA-binding domain to a DNA cleavage domain (a nuclease which cuts DN ...
s (TALEN), t
he CRISPR/Cas (clustered regularly interspaced short palindromic repeat/CRISPRassociated protein (e.g. CRISPR/Cas9). Among the four types, TALEN and CRISPR/Cas are the two most commonly used.
Recent advances have looked at combining multiple systems to exploit the best features of both (e.g. megaTAL that are a fusion of a TALE DNA binding domain and a meganuclease). Recent research has also focused on developing strategies to create gene knock-out or corrections without creating double stranded breaks (base editors).
[
]
Meganucleases and Zinc finger nucleases
Meganucleases were first used in 1988 in mammalian cells. Meganucleases are endodeoxyribonucleases that function as restriction enzymes with long recognition sites, making them more specific to their target site than other restriction enzyme
A restriction enzyme, restriction endonuclease, REase, ENase or'' restrictase '' is an enzyme that cleaves DNA into fragments at or near specific recognition sites within molecules known as restriction sites. Restriction enzymes are one class o ...
s. This increases their specificity and reduces their toxicity as they will not target as many sites within a genome. The most studied meganucleases are the LAGLIDADG family. While meganucleases are still quite susceptible to off-target binding, which makes them less attractive than other gene editing tools, their smaller size still makes them attractive particularly for viral vectorization perspectives.
Zinc-finger nucleases (ZFNs), used for the first time in 1996, are typically created through the fusion of Zinc-finger domains and the ''Fok''I nuclease domain. ZFNs have thus the ability to cleave DNA at target sites. By engineering the zinc finger domain to target a specific site within the genome, it is possible to edit the genomic sequence at the desired location
In geography, location or place is used to denote a region (point, line, or area) on Earth's surface. The term ''location'' generally implies a higher degree of certainty than ''place'', the latter often indicating an entity with an ambiguous bou ...
. ZFNs have a greater specificity, but still hold the potential to bind to non-specific sequences.. While a certain amount of off-target cleavage is acceptable for creating transgenic model organisms, they might not be optimal for all human gene therapy treatments.
TALEN and CRISPR
Access to the code governing the DNA recognition by transcription activator-like effectors (TALE) in 2009 opened the way to the development of a new class of efficient TAL-based gene editing tools. TALE, proteins secreted by the Xanthomonas plant pathogen, bind with great specificity to genes within the plant host and initiate transcription of the genes helping infection. Engineering TALE by fusing the DNA binding core to the ''Fok''I nuclease catalytic domain allowed creation of a new tool of designer nucleases, the TALE nuclease (TALEN). They have one of the greatest specificities of all the current engineered nucleases. Due to the presence of repeat sequences, they are difficult to construct through standard molecular biology procedure and rely on more complicated method of such as Golden gate cloning.[
In 2011, another major breakthrough technology was developed based on CRISPR/Cas (clustered regularly interspaced short palindromic repeat / CRISPR associated protein) systems that function as an adaptive immune system in bacteria and ]archaea
Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
. The CRISPR/Cas system allows bacteria and archaea to fight against invading viruses by cleaving viral DNA and inserting pieces of that DNA into their own genome. The organism then transcribes this DNA into RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
and combines this RNA with Cas9
Cas9 (CRISPR associated protein 9, formerly called Cas5, Csn1, or Csx12) is a 160 dalton (unit), kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utili ...
proteins to make double-stranded breaks in the invading viral DNA. The RNA serves as a guide RNA to direct the Cas9 enzyme to the correct spot in the virus DNA. By pairing Cas proteins with a designed guide RNA CRISPR/Cas9 can be used to induce double-stranded breaks at specific points within DNA sequences. The break gets repaired by cellular DNA repair enzymes, creating a small insertion/deletion type mutation in most cases. Targeted DNA repair is possible by providing a donor DNA template that represents the desired change and that is (sometimes) used for double-strand break repair by homologous recombination. It was later demonstrated that CRISPR/Cas9 can edit human cells in a dish. Although the early generation lacks the specificity of TALEN, the major advantage of this technology is the simplicity of the design. It also allows multiple sites to be targeted simultaneously, allowing the editing of multiple genes at once. CRISPR/Cpf1 is a more recently discovered system that requires a different guide RNA to create particular double-stranded breaks (leaves overhangs when cleaving the DNA) when compared to CRISPR/Cas9.[
CRISPR/Cas9 is efficient at gene disruption. The creation of HIV-resistant babies by Chinese researcher He Jiankui is perhaps the most famous example of gene disruption using this method. It is far less effective at gene correction. Methods of base editing are under development in which a “nuclease-dead” Cas 9 endonuclease or a related enzyme is used for gene targeting while a linked deaminase enzyme makes a targeted base change in the DNA. The most recent refinement of CRISPR-Cas9 is called Prime Editing. This method links a reverse transcriptase to an RNA-guided engineered nuclease that only makes single-strand cuts but no double-strand breaks. It replaces the portion of DNA next to the cut by the successive action of nuclease and reverse transcriptase, introducing the desired change from an RNA template.]
See also
* List of genetic engineering software: software to code the genetic modifications
* Mutagenesis (molecular biology technique)
Mutagenesis () is a process by which the genetic information of an organism is changed by the production of a mutation. It may occur spontaneously in nature, or as a result of exposure to mutagens. It can also be achieved experimentally using labo ...
References
{{Genetic engineering
Genetic engineering