Cpf1
   HOME

TheInfoList



OR:

Cas12a (CRISPR-associated protein 12a, previously known as Cpf1) is an RNA-guided endonuclease-exonuclease that forms an essential component of the
CRISPR CRISPR (; acronym of clustered regularly interspaced short palindromic repeats) is a family of DNA sequences found in the genomes of prokaryotic organisms such as bacteria and archaea. Each sequence within an individual prokaryotic CRISPR is d ...
systems found in some
bacteria Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
and
archaea Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
. In its natural context, Cas12a targets and destroys the genetic material of
viruses A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Viruses are found in almo ...
and other foreign
mobile genetic elements Mobile genetic elements (MGEs), sometimes called selfish genetic elements, are a type of genetic material that can move around within a genome, or that can be transferred from one species or replicon to another. MGEs are found in all organisms. In ...
, thereby protecting the host cell from infection. Like other Cas enzymes, Cas12a binds to a "guide"
RNA Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
(termed a crRNA, or CRISPR RNA) which targets it to a DNA sequence in a specific and programmable matter. In the host organism, the crRNA contains a constant region that is recognized by the Cas12a protein and a "spacer" region that is complementary to a piece of foreign nucleic acid (e.g. a portion of a
phage A bacteriophage (), also known informally as a phage (), is a virus that infects and replicates within bacteria. The term is derived . Bacteriophages are composed of proteins that encapsulate a DNA or RNA genome, and may have structures tha ...
genome) that previously infected the cell. As with
Cas9 Cas9 (CRISPR associated protein 9, formerly called Cas5, Csn1, or Csx12) is a 160 dalton (unit), kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utili ...
and other Cas proteins, the programmable DNA-targeting activity of Cas12a makes it a useful tool for biotechnology and biological research applications. By modifying the spacer sequence in the crRNA, researchers can target Cas12a to specific DNA sequences, allowing for highly targeted modifications of DNA. Cas12a is distinguished from Cas9 by a its single RuvC endonuclease active site, its 5'
protospacer adjacent motif A protospacer adjacent motif (PAM) is a 2–6-base pair DNA sequence immediately following the DNA sequence targeted by the Cas9 nuclease in the CRISPR bacterial adaptive immune system. The PAM is a component of the invading virus or plasmid, but i ...
preference, and its formation of sticky rather than blunt ends at the cut site; these and other differences may make it more suitable for certain applications. Beyond its use in
basic research Basic research, also called pure research, fundamental research, basic science, or pure science, is a type of scientific research with the aim of improving scientific theories for better understanding and prediction of natural or other phenome ...
, CRISPR-Cas12a could have applications in the treatment of genetic illnesses and in implementing
gene drive A gene drive is a natural process and technology of genetic engineering that propagates a particular suite of genes throughout a population by altering the probability that a specific allele will be transmitted to offspring (instead of the Mende ...
s.


History

Cas12a, a Class II Type V CRISPR-associated nuclease, was characterized in 2015 and was formerly known as Cpf1. This nuclease is found in the CRISPR-Cpf1 system of bacteria such as ''Francisella novicida''. The initial designation, derived from a TIGRFAMs protein family definition established in 2012, reflected the prevalence of this CRISPR-Cas subtype in the ''Prevotella'' and ''Francisella'' lineages. Cas12a exhibits several key distinctions from Cas9: it generates staggered cuts in double-stranded DNA, in contrast to the blunt ends produced by Cas9; it relies on a 'T-rich' protospacer adjacent motif (PAM) (typically 5'-TTTV-3', where V is A, C, or G), offering alternative targeting sites compared to the 'G-rich' PAMs (typically 5'-NGG-3') favored by Cas9; and it requires only a CRISPR RNA (crRNA) for effective targeting, whereas Cas9 necessitates both a crRNA and a ''trans''-activating crRNA (tracrRNA). These differences may confer certain advantages to Cas12a over Cas9. For instance, the smaller size of Cas12a's crRNAs makes them well-suited for multiplexed genome editing, allowing for a greater number of guide RNAs to be packaged into a single vector compared to Cas9's single guide RNAs (sgRNAs). The sticky 5′ overhangs generated by Cas12a can also be utilized for DNA assembly with higher target specificity than traditional restriction enzyme cloning. Furthermore, Cas12a cleaves DNA 18–23 base pairs downstream from the PAM site. This characteristic ensures that the recognition sequence remains intact after repair, enabling Cas12a to perform multiple rounds of DNA cleavage. Conversely, Cas9 cleaves only 3 base pairs upstream of the PAM site, leading to indel mutations from the non-homologous end joining (NHEJ) pathway that can disrupt the recognition sequence, thereby preventing subsequent rounds of cutting. A notable feature of Cas12a, in contrast to Cas9, is its ability to remain bound to its target DNA after cleavage and subsequently cleave other single-stranded DNA molecules non-discriminately . This property, termed "collateral cleavage" or "trans-cleavage" activity, has been harnessed for the development of various diagnostic technologies.


Comparison with Cas9

While classified as a Class II Type V CRISPR-associated nuclease, Cas12a lacks the NHN endonuclease domain found in Cas9 and instead relies on its RuvC-like domain for sequential DNA cleavage, resulting in the production of staggered ends. Unlike Cas9, Cas12a possesses a unique zinc finger-like domain, which may contribute to DNA binding or structural stability. A significant advantage of Cas12a over Cas9 is its inherent RNase activity within the WED domain, enabling the self-processing of precursor crRNAs (pre-crRNAs) into mature crRNAs. This eliminates the requirement for a ''trans''-activating crRNA (tracrRNA), simplifying guide RNA design and facilitating multiplexed editing through the packaging of multiple crRNAs into a single vector.


Multiplex genome editing

Multiplex genome editing is a powerful technique that allows for the simultaneous editing of multiple genes. Cas12a is particularly useful in this context due to features that simplify its application compared to Cas9. For instance, Cas12a requires only a short CRISPR RNA (crRNA) to target and edit genes. These crRNAs can be easily multiplexed by packaging them together into a single delivery system, enabling the simultaneous editing of multiple genomic loci . Furthermore, Cas12a and Cas9 exhibit differences in their protospacer adjacent motif (PAM) requirements. While Cas9 typically looks for G-rich sequences, Cas12a targets T-rich sequences. This difference expands the range of targetable genomic sites, allowing Cas12a to edit genes in regions where Cas9 may not be efficient, such as AT-rich areas prevalent in plant genomes, providing greater flexibility for researchers. The ability of Cas12a to deliver its own crRNA (CRISPR RNA) without the need for additional components like the tracrRNA required by Cas9 further simplifies the design and delivery of guide RNAs for multiplexed editing.


Description


Discovery

CRISPR-Cas12a was found by searching a published database of bacterial genetic sequences for promising bits of DNA. Its identification through
bioinformatics Bioinformatics () is an interdisciplinary field of science that develops methods and Bioinformatics software, software tools for understanding biological data, especially when the data sets are large and complex. Bioinformatics uses biology, ...
as a CRISPR system protein, its naming, and a
hidden Markov model A hidden Markov model (HMM) is a Markov model in which the observations are dependent on a latent (or ''hidden'') Markov process (referred to as X). An HMM requires that there be an observable process Y whose outcomes depend on the outcomes of X ...
(HMM) for its detection were provided in 2012 in a release of the
TIGRFAMs TIGRFAMs is a database of protein families designed to support manual and automated genome annotation. Each entry includes a multiple sequence alignment and hidden Markov model (HMM) built from the alignment. Sequences that score above the defined ...
database of
protein families A protein family is a group of evolutionarily related proteins. In many cases, a protein family has a corresponding gene family, in which each gene encodes a corresponding protein with a 1:1 relationship. The term "protein family" should not be c ...
. Cas12a appears in many bacterial species. The ultimate Cas12a endonuclease that was developed into a tool for genome editing was taken from one of the first 16 species known to harbor it. Two candidate enzymes from '' Acidaminococcus'' and ''
Lachnospiraceae The Lachnospiraceae are a family of obligately anaerobic, variably spore-forming bacteria in the order Eubacteriales that ferment diverse plant polysaccharides to short-chain fatty acids (butyrate, acetate) and alcohols (ethanol). These bacteria ...
'' display efficient genome-editing activity in human cells.


Classification

CRISPR-Cas systems are separated into two classes: Class I, in which several Cas proteins associate with a crRNA to build a functional endonuclease, and Class II, in which a single Cas endonuclease associates with a crRNA; Class II is further divided into Type II, Type V, and Type VI systems. Cas12a is identified as a Class II, Type V CRISPR-Cas system.


Naming

The acronym CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) refers to the invariant DNA sequences found in bacteria and archaea which encode Cas proteins and their crRNAs. Cas12a was originally known as Cpf1, an abbreviation of CRISPR and two
genera Genus (; : genera ) is a taxonomic rank above species and below family as used in the biological classification of living and fossil organisms as well as viruses. In binomial nomenclature, the genus name forms the first part of the binomial s ...
of bacteria where it appears,
Prevotella ''Prevotella'' is a genus of Gram-negative bacteria. ''Prevotella'' species are widely distributed across varied ecological habitats, with 57 characterized species spanning both human and other mammalian hosts. In mammals, this genus is notabl ...
and Francisella. It was renamed in 2015 after a broader rationalization of the names of Cas (CRISPR associated) proteins to correspond to their
sequence homology Sequence homology is the homology (biology), biological homology between DNA sequence, DNA, RNA sequence, RNA, or Protein primary structure, protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments ...
.


Structure

The Cas12a protein contains a mixed alpha/beta domain, a RuvC-like endonuclease domain (broken into two non-contiguous segments, RuvC-I and RuvC-II) similar to the RuvC domain of Cas9, and a
zinc finger A zinc finger is a small protein structural motif that is characterized by the coordination of one or more zinc ions (Zn2+) which stabilizes the fold. The term ''zinc finger'' was originally coined to describe the finger-like appearance of a ...
-like domain. Unlike Cas9, Cas12a does not have an HNH endonuclease domain, and the N-terminal region of Cas12a does not have an alpha-helical recognition lobe as seen in Cas9. The Cas12a loci encode
Cas1 CRISPR-associated protein 1 (cas1) is one of the two universally conserved proteins found in the CRISPR prokaryotic immune defense system. Cas1 is a metal-dependent DNA-specific endonuclease that produces double-stranded DNA fragments. Cas1 forms ...
,
Cas2 Cas2 is a protein associated with CRISPR that is involved with spacer acquisition. Representative cas2 proteins have been characterized as endonucleases that cleave single-stranded RNAs preferentially within U-rich regions, or as metal-dependen ...
and Cas4 proteins more similar to types I and III than from type II systems. Database searches suggest the abundance of Cas12a-family proteins in many bacterial species. Also unlike Cas9, Cas12a does not require a
tracrRNA In molecular biology, trans-activating CRISPR RNA (tracrRNA) is a small ''trans''-encoded RNA. It was first discovered by Emmanuelle Charpentier in her study of the human pathogen ''Streptococcus pyogenes'', a type of bacteria that causes harm to h ...
(which in natural CRISPR systems must base-pair with a separate crRNA before binding to a Cas protein), instead binding a single crRNA. Both Cas12a and its guide RNA are smaller than the protein and RNA components of the Cas9 system; the crRNA of Cas12a is approximately half as long as sgRNAs used with Cas9. This reduced size renders Cas12a more suitable for applications such as ''in vivo'' delivery via
adeno-associated virus Adeno-associated viruses (AAV) are small viruses that infect humans and some other primate species. They belong to the genus '' Dependoparvovirus'', which in turn belongs to the family ''Parvoviridae''. They are small (approximately 26 nm in ...
(AAV), which have limited DNA packaging capacity due to their small capsids. The Cas12a-crRNA complex cleaves target DNA or RNA by identification of a
protospacer adjacent motif A protospacer adjacent motif (PAM) is a 2–6-base pair DNA sequence immediately following the DNA sequence targeted by the Cas9 nuclease in the CRISPR bacterial adaptive immune system. The PAM is a component of the invading virus or plasmid, but i ...
(PAM) 5'-YTN-3' (where "Y" is a pyrimidine and "N" is any
nucleobase Nucleotide bases (also nucleobases, nitrogenous bases) are nitrogen-containing biological compounds that form nucleosides, which, in turn, are components of nucleotides, with all of these monomers constituting the basic building blocks of nuc ...
), in contrast to the G-rich PAM targeted by Cas9. After identification of PAM, Cas12a introduces a sticky-end-like DNA double- stranded break of 4 or 5 nucleotides overhang.


Mechanism

The CRISPR-Cas12a system consist of a Cas12a enzyme and a guide RNA that finds and positions the complex at the correct spot on the double helix to cleave target DNA. CRISPR-Cas12a systems activity has three stages: * Adaptation: Cas1 and Cas2 proteins facilitate the adaptation of small fragments of DNA into the CRISPR array. * Formation of crRNAs: processing of pre-cr-RNAs producing of mature crRNAs to guide the Cas protein. * Interference: the Cas12a is bound to a crRNA to form a binary complex to identify and cleave a target DNA sequence. The crRNA-Cas12a complex searches dsDNA for a 3-6nt 5'
protospacer adjacent motif A protospacer adjacent motif (PAM) is a 2–6-base pair DNA sequence immediately following the DNA sequence targeted by the Cas9 nuclease in the CRISPR bacterial adaptive immune system. The PAM is a component of the invading virus or plasmid, but i ...
(PAM). Once a PAM is found, the protein locally denatures the dsDNA and searches for complementarity between the crRNA spacer and the ssDNA protospacer. Sufficient complementarity will trigger RuvC activity and the RuvC active site will then cut the non-target strand and then the target strand, ultimately generating a staggered dsDNA break with 5' ssDNA overhangs (cis cleavage).


Cas9 vs. Cas12a

Cas9 requires two
RNA Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
molecules to cut DNA while Cas12a needs one. The proteins also cut DNA at different places, offering researchers more options when selecting an editing site. Cas9 cuts both strands in a DNA molecule at the same position, leaving behind
blunt ends DNA ends refer to the properties of the ends of linear DNA molecules, which in molecular biology are described as "sticky" or "blunt" based on the shape of the complementary strands at the terminus. In sticky ends, one strand is longer than the ...
. Cas12a leaves one strand longer than the other, creating
sticky ends DNA ends refer to the properties of the ends of linear DNA molecules, which in molecular biology are described as "sticky" or "blunt" based on the shape of the complementary strands at the terminus. In sticky ends, one strand is longer than the o ...
. The sticky ends have different properties than blunt ends during
non-homologous end joining Non-homologous end joining (NHEJ) is a pathway that repairs double-strand breaks in DNA. It is called "non-homologous" because the break ends are directly ligated without the need for a homologous template, in contrast to homology directed repair ...
or homologous repair of DNA, which confers certain advantages to Cas12a when attempting gene insertions, compared to Cas9. Although the CRISPR-Cas9 system can efficiently disable genes, it is challenging to insert genes or generate a knock-in. Cas12a lacks
tracrRNA In molecular biology, trans-activating CRISPR RNA (tracrRNA) is a small ''trans''-encoded RNA. It was first discovered by Emmanuelle Charpentier in her study of the human pathogen ''Streptococcus pyogenes'', a type of bacteria that causes harm to h ...
, utilizes a T-rich PAM and cleaves DNA via a staggered DNA DSB. In summary, important differences between Cas12a and Cas9 systems are that Cas12a: * Recognizes different PAMs, enabling new targeting possibilities. * Creates 4-5 nt long sticky ends, instead of blunt ends produced by Cas9, enhancing the efficiency of genetic insertions and specificity during NHEJ or HDR. * Cuts target DNA further away from PAM, further away from the Cas9 cutting site, enabling new possibilities for cleaving the DNA.


Origin

Cas12 endonucleases ultimately likely evolved from the ''TnpB'' endonuclease of IS200/IS605-family transposons. ''TnpB'', not yet "domesticated" into the CRISPR immune system, are themselves able to perform RNA-guided cleavage using a ''OmegaRNA'' template system.


Tools

Multiple aspects influence target efficiency and specificity when using CRISPR, including guide RNA design. Many design models and CRISPR-Cas software tools for optimal design of guide RNA have been developed. These include SgRNA designer, CRISPR MultiTargeter, SSFinder. In addition, commercial antibodies are available for use to detect Cas12a protein.


Intellectual property

CRISPR-Cas9 is subject to
Intellectual property Intellectual property (IP) is a category of property that includes intangible creations of the human intellect. There are many types of intellectual property, and some countries recognize more than others. The best-known types are patents, co ...
disputes while CRISPR-Cas12a does not have the same issues.


Notes


References

{{DEFAULTSORT:CRISPR Cpf1 Genetic engineering Enzymes Genome editing