HOME

TheInfoList



OR:

Mutational signatures are characteristic combinations of mutation types arising from specific
mutagenesis Mutagenesis () is a process by which the genetic information of an organism is changed by the production of a mutation. It may occur spontaneously in nature, or as a result of exposure to mutagens. It can also be achieved experimentally using l ...
processes such as
DNA replication In molecular biology, DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule. DNA replication occurs in all living organisms acting as the most essential part for biological inherita ...
infidelity, exogenous and endogenous genotoxin exposures, defective
DNA repair DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA da ...
pathways, and DNA enzymatic editing. The term is used for two distinct concepts, often conflated: mutagen signatures and tumor signatures. Its original use, mutagen signature, referred to a pattern of mutations made in the laboratory by a known mutagen and not made by other mutagens – unique to the mutagen as a human signature is unique to the signer. Uniqueness allows the mutagen to be deduced from a cell's mutations Later, the phrase referred to a pattern of mutations characteristic of a tumor type, although usually not unique to the tumor type nor to a mutagen. If a tumor mutational signature matches a unique mutagen mutational signature, it is valid to deduce the carcinogen exposure or mutagenesis process that occurred in the patient's distant past. Increasingly refined tumor signatures are becoming assignable to mutagen signatures. Deciphering mutational signatures in
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal bl ...
provides insight into the biological mechanisms involved in
carcinogenesis Carcinogenesis, also called oncogenesis or tumorigenesis, is the formation of a cancer, whereby normal cells are transformed into cancer cells. The process is characterized by changes at the cellular, genetic, and epigenetic levels and abnor ...
and normal somatic
mutagenesis Mutagenesis () is a process by which the genetic information of an organism is changed by the production of a mutation. It may occur spontaneously in nature, or as a result of exposure to mutagens. It can also be achieved experimentally using l ...
. Mutational signatures have shown their applicability in cancer treatment and cancer prevention. Advances in the fields of oncogenomics have enabled the development and use of molecularly
targeted therapy Targeted therapy or molecularly targeted therapy is one of the major modalities of medical treatment ( pharmacotherapy) for cancer, others being hormonal therapy and cytotoxic chemotherapy. As a form of molecular medicine, targeted therapy block ...
, but such therapies historically focused on inhibition of oncogenic drivers (e.g. ''EGFR'' gain-of-function mutation and EGFR inhibitor treatment in colorectal cancer). More recently, mutational signatures profiling has proven successful in guiding oncological management and use of targeted therapies (e.g.
immunotherapy Immunotherapy or biological therapy is the treatment of disease by activating or suppressing the immune system. Immunotherapies designed to elicit or amplify an immune response are classified as ''activation immunotherapies,'' while immunotherap ...
in
mismatch repair DNA mismatch repair (MMR) is a system for recognizing and repairing erroneous insertion, deletion, and mis-incorporation of bases that can arise during DNA replication and recombination, as well as repairing some forms of DNA damage. Mismatch ...
deficient of diverse cancer types,
platinum Platinum is a chemical element with the symbol Pt and atomic number 78. It is a dense, malleable, ductile, highly unreactive, precious, silverish-white transition metal. Its name originates from Spanish , a diminutive of "silver". Pla ...
and PARP inhibitor to exploit
synthetic lethality Synthetic lethality is defined as a type of genetic interaction where the combination of two genetic events results in cell death or death of an organism. Although the foregoing explanation is wider than this, it is common when referring to synthet ...
in
homologous recombination Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in cellular organisms but may be ...
deficient
breast cancer Breast cancer is cancer that develops from breast tissue. Signs of breast cancer may include a lump in the breast, a change in breast shape, dimpling of the skin, milk rejection, fluid coming from the nipple, a newly inverted nipple, or ...
).


General concepts


Mechanisms – overview

:*
DNA replication In molecular biology, DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule. DNA replication occurs in all living organisms acting as the most essential part for biological inherita ...
infidelity :**
DNA proofreading The term proofreading is used in genetics to refer to the error-correcting processes, first proposed by John Hopfield and Jacques Ninio, involved in DNA replication, immune system specificity, enzyme-substrate recognition among many other processes ...
is the process by which
DNA polymerase A DNA polymerase is a member of a family of enzymes that catalyze the synthesis of DNA molecules from nucleoside triphosphates, the molecular precursors of DNA. These enzymes are essential for DNA replication and usually work in groups to create ...
excises an incorrectly incorporated nucleotide via
exonuclease Exonucleases are enzymes that work by cleaving nucleotides one at a time from the end (exo) of a polynucleotide chain. A hydrolyzing reaction that breaks phosphodiester bonds at either the 3′ or the 5′ end occurs. Its close relative is t ...
enzymatic reaction. Inability of
DNA polymerase A DNA polymerase is a member of a family of enzymes that catalyze the synthesis of DNA molecules from nucleoside triphosphates, the molecular precursors of DNA. These enzymes are essential for DNA replication and usually work in groups to create ...
to correct these replication errors leads to progressive accumulation of mutations through successive cell
mitosis In cell biology, mitosis () is a part of the cell cycle in which replicated chromosomes are separated into two new nuclei. Cell division by mitosis gives rise to genetically identical cells in which the total number of chromosomes is maint ...
. :* Genotoxins :** Endogenous cellular (e.g. spontaneous 5-methylcytosine
deamination Deamination is the removal of an amino group from a molecule. Enzymes that catalyse this reaction are called deaminases. In the human body, deamination takes place primarily in the liver, however it can also occur in the kidney. In situations o ...
leads to C>T
transition (genetics) Transition, in genetics and molecular biology, refers to a point mutation that changes a purine nucleotide to another purine ( A ↔ G), or a pyrimidine nucleotide to another pyrimidine ( C ↔ T). Approximately two out of three single nucl ...
) mutations (see
DNA damage (naturally occurring) DNA damage is an alteration in the chemical structure of DNA, such as a break in a strand of DNA, a nucleobase missing from the backbone of DNA, or a chemically changed base such as 8-OHdG. DNA damage can occur naturally or via environmental fa ...
) :** Exogenous/
carcinogens A carcinogen is any substance, radionuclide, or radiation that promotes carcinogenesis (the formation of cancer). This may be due to the ability to damage the genome or to the disruption of cellular metabolic processes. Several radioactive substan ...
:***
Ultraviolet Ultraviolet (UV) is a form of electromagnetic radiation with wavelength from 10 nm (with a corresponding frequency around 30  PHz) to 400 nm (750  THz), shorter than that of visible light, but longer than X-rays. UV radiati ...
radiation: UVB radiation causes direct
DNA damage DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA da ...
and is a known risk factor for
skin cancer Skin cancers are cancers that arise from the skin. They are due to the development of abnormal cells that have the ability to invade or spread to other parts of the body. There are three main types of skin cancers: basal-cell skin cancer (BC ...
(e.g. melanoma) :*** Alkylating antineoplastic agents: This group of
chemotherapy Chemotherapy (often abbreviated to chemo and sometimes CTX or CTx) is a type of cancer treatment that uses one or more anti-cancer drugs (chemotherapeutic agents or alkylating agents) as part of a standardized chemotherapy regimen. Chemothe ...
agents adds
alkyl group In organic chemistry, an alkyl group is an alkane missing one hydrogen. The term ''alkyl'' is intentionally unspecific to include many possible substitutions. An acyclic alkyl has the general formula of . A cycloalkyl is derived from a cycloalk ...
to DNA, which causes
crosslinking of DNA In genetics, crosslinking of DNA occurs when various exogenous or endogenous agents react with two nucleotides of DNA, forming a covalent linkage between them. This crosslink can occur within the same strand (intrastrand) or between opposite stra ...
and interferes with
DNA replication In molecular biology, DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule. DNA replication occurs in all living organisms acting as the most essential part for biological inherita ...
and
DNA repair DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA da ...
.
Cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal bl ...
cells are most impacted because of their high
mitosis In cell biology, mitosis () is a part of the cell cycle in which replicated chromosomes are separated into two new nuclei. Cell division by mitosis gives rise to genetically identical cells in which the total number of chromosomes is maint ...
rate. :***
Tobacco Tobacco is the common name of several plants in the genus '' Nicotiana'' of the family Solanaceae, and the general term for any product prepared from the cured leaves of these plants. More than 70 species of tobacco are known, but the ch ...
: Tobacco contains several
carcinogens A carcinogen is any substance, radionuclide, or radiation that promotes carcinogenesis (the formation of cancer). This may be due to the ability to damage the genome or to the disruption of cellular metabolic processes. Several radioactive substan ...
which are harmful to DNA, including
polycyclic aromatic hydrocarbons A polycyclic aromatic hydrocarbon (PAH) is a class of organic compounds that is composed of multiple aromatic rings. The simplest representative is naphthalene, having two aromatic rings and the three-ring compounds anthracene and phenanthrene. P ...
,
acrolein Acrolein (systematic name: propenal) is the simplest unsaturated aldehyde. It is a colorless liquid with a piercing, acrid smell. The smell of burnt fat (as when cooking oil is heated to its smoke point) is caused by glycerol in the burning fa ...
,
nitrosamines In organic chemistry, nitrosamines (or more formally ''N''-Nitrosamines) are organic compounds with the chemical structure , where R is usually an alkyl group. They feature a nitroso group () bonded to a deprotonated amine. Most nitrosamines a ...
,
cyanide Cyanide is a naturally occurring, rapidly acting, toxic chemical that can exist in many different forms. In chemistry, a cyanide () is a chemical compound that contains a functional group. This group, known as the cyano group, consists of ...
and others (see
health effects of tobacco Tobacco use has predominantly negative effects on human health and concern about health effects of tobacco has a long history. Research has focused primarily on cigarette smoking. Tobacco smoke contains more than 70 chemicals that cause ca ...
) :* DNA repair deficiency :**
Homologous recombination Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in cellular organisms but may be ...
deficiency (HRD): DNA
double-strand break DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA dam ...
requires
homologous recombination Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in cellular organisms but may be ...
mechanism for accurate repair of breakpoints. :**
DNA mismatch repair DNA mismatch repair (MMR) is a system for recognizing and repairing erroneous insertion, deletion, and mis-incorporation of bases that can arise during DNA replication and recombination, as well as repairing some forms of DNA damage. Mismatch ...
(MMR) deficiency: The mismatch repair machinery recognizes and repairs erroneous base pair insertion, deletion or mis-incorporation. :* Enzymatic DNA editing :** Cytidine deaminase enzymes: This family of enzymes are part of the
innate immune system The innate, or nonspecific, immune system is one of the two main immunity strategies (the other being the adaptive immune system) in vertebrates. The innate immune system is an older evolutionary defense strategy, relatively speaking, and is th ...
and are involved in the control of
retroviruses A retrovirus is a type of virus that inserts a DNA copy of its RNA genome into the DNA of a host cell that it invades, thus changing the genome of that cell. Once inside the host cell's cytoplasm, the virus uses its own reverse transcriptase ...
and
transposons A transposable element (TE, transposon, or jumping gene) is a nucleic acid sequence in DNA that can change its position within a genome, sometimes creating or reversing mutations and altering the cell's genetic identity and genome size. Trans ...
elements (including endogenous retroviruses). These enzymes ( cytidine deaminase/CDA, activation-induced cytidine deaminase and
APOBEC image:Apobec.J.Steinfeld.D.png, 300px, upExample of a member of the APOBEC family, APOBEC-2. A cytidine deaminase from ''Homo sapiens''.; ; rendered usinPyMOL APOBEC ("apolipoprotein B mRNA editing enzyme, catalytic polypeptide") is a family o ...
protein family) actively cause
cytidine Cytidine (symbol C or Cyd) is a nucleoside molecule that is formed when cytosine is attached to a ribose ring (also known as a ribofuranose) via a β-N1-glycosidic bond. Cytidine is a component of RNA. It is a white water-soluble solid. which is ...
deamination Deamination is the removal of an amino group from a molecule. Enzymes that catalyse this reaction are called deaminases. In the human body, deamination takes place primarily in the liver, however it can also occur in the kidney. In situations o ...
and therefore introduce C>T
transition (genetics) Transition, in genetics and molecular biology, refers to a point mutation that changes a purine nucleotide to another purine ( A ↔ G), or a pyrimidine nucleotide to another pyrimidine ( C ↔ T). Approximately two out of three single nucl ...
mutations.


Genomic data

Cancer mutational signatures analyses require genomic data from cancer genome sequencing with paired-normal DNA sequencing in order to create the tumor mutation catalog (mutation types and counts) of a specific tumor. Different types of
mutations In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, mitosi ...
(e.g. single nucleotide variants, indels, structural variants) can be used individually or in combination to model mutational signatures in cancer.


Types of mutations: base substitutions

There are six classes of base substitution: C>A, C>G, C>T, T>A, T>C, T>G. The G>T substitution is considered equivalent to the C>A substitution because it is not possible to differentiate on which DNA strand (forward or reverse) the substitution initially occurred. Both the C>A and G>T substitutions are therefore counted as part of the "C>A" class. For the same reason the G>C, G>A, A>T, A>G and A>C mutations are counted as part of the "C>G", "C>T", "T>A", "T>C" and "T>G" classes respectively. Taking the information from the 5' and 3' adjacent bases (also called flanking base pairs or trinucleotide context) lead to 96 possible mutation types (e.g. A >A, A >A, etc.). The mutation catalog of a tumor is created by categorizing each single nucleotide variant (SNV) (synonyms: base-pair substitution or substitution
point mutation A point mutation is a genetic mutation where a single nucleotide base is changed, inserted or deleted from a DNA or RNA sequence of an organism's genome. Point mutations have a variety of effects on the downstream protein product—consequences ...
) in one of the 96 mutation types and counting the total number of substitutions for each of these 96 mutation types (see figure).


Tumor mutation catalog

Once the mutation catalog (e.g. counts for each of the 96 mutation types) of a tumor is obtained, there are two approaches to decipher the contributions of different mutational signatures to tumor genomic landscape: ** The mutation catalog of the tumor is compared to a reference mutation catalogue, or mutational signatures reference dataset, such as the 21 Signatures of Mutational Processes in Human Cancer from the Catalogue of Somatic Mutation In Cancer
COSMIC
database. ** ''De novo'' mutational signatures modelling can be accomplished using statistical methods such as
non-negative matrix factorization Non-negative matrix factorization (NMF or NNMF), also non-negative matrix approximation is a group of algorithms in multivariate analysis and linear algebra where a matrix is factorized into (usually) two matrices and , with the property that ...
to identify potential novel mutational processes. Identifying the contributions of diverse mutational signatures to
carcinogenesis Carcinogenesis, also called oncogenesis or tumorigenesis, is the formation of a cancer, whereby normal cells are transformed into cancer cells. The process is characterized by changes at the cellular, genetic, and epigenetic levels and abnor ...
provides insight into tumor biology and can offer opportunities for
targeted therapy Targeted therapy or molecularly targeted therapy is one of the major modalities of medical treatment ( pharmacotherapy) for cancer, others being hormonal therapy and cytotoxic chemotherapy. As a form of molecular medicine, targeted therapy block ...
.


Types of mutations: indels

Signature 3, seen in
homologous recombination Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in cellular organisms but may be ...
(HR) deficient tumour, is associated with increased burden of large
indels Indel is a molecular biology term for an insertion or deletion of bases in the genome of an organism. It is classified among small genetic variations, measuring from 1 to 10 000 base pairs in length, including insertion and deletion events that ...
(up to 50 nucleotides) with overlapping microhomology at the breakpoints. In such tumors, DNA
double-strand breaks DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA dam ...
are repaired by the imprecise repair mechanisms of
non-homologous end joining Non-homologous end joining (NHEJ) is a pathway that repairs double-strand breaks in DNA. NHEJ is referred to as "non-homologous" because the break ends are directly ligated without the need for a homologous template, in contrast to homology direct ...
(NHEJ) or microhomology-mediated end joining (MMEJ) instead of high fidelity HR repair. Signature 6, seen in tumors with
microsatellite instability Microsatellite instability (MSI) is the condition of genetic hypermutability (predisposition to mutation) that results from impaired DNA mismatch repair (MMR). The presence of MSI represents phenotypic evidence that MMR is not functioning normal ...
, also features enrichment of 1bp indels in nucleotide repeat regions.


Types of mutations: structural variants

Homologous recombination Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in cellular organisms but may be ...
deficiency leads to Signature 3 substitution pattern, but also to increase burden of structural variants. In the absence of
homologous recombination Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in cellular organisms but may be ...
,
non-homologous end joining Non-homologous end joining (NHEJ) is a pathway that repairs double-strand breaks in DNA. NHEJ is referred to as "non-homologous" because the break ends are directly ligated without the need for a homologous template, in contrast to homology direct ...
leads to large structural variants such as chromosomal translocations,
chromosomal inversions An inversion is a chromosome rearrangement in which a segment of a chromosome becomes inverted within its original position. An inversion occurs when a chromosome undergoes a two breaks within the chromosomal arm, and the segment between the two br ...
and
copy number variants Copy number variation (CNV) is a phenomenon in which sections of the genome are repeated and the number of repeats in the genome varies between individuals. Copy number variation is a type of structural variation: specifically, it is a type of ...
.


Mutational signatures

.


Age-related mutagenesis

Signature 1 features a predominance of C>T
transition (genetics) Transition, in genetics and molecular biology, refers to a point mutation that changes a purine nucleotide to another purine ( A ↔ G), or a pyrimidine nucleotide to another pyrimidine ( C ↔ T). Approximately two out of three single nucl ...
in the Np >T trinucleotide contexts and correlates with the age of patient at time of
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal bl ...
diagnosis. The underlying proposed biological mechanism is the spontaneous deamination of 5-methylcytosine. Signature 5 has a predominance of T>C substitutions in the ApTpN trinucleotide context with transcriptional strand bias.


Homologous recombination deficiency

Signature 3 displays high mutation counts of multiple mutation classes and is associated with
germline In biology and genetics, the germline is the population of a multicellular organism's cells that pass on their genetic material to the progeny (offspring). In other words, they are the cells that form the egg, sperm and the fertilised egg. They ...
and
somatic (biology) The term somatic - etymologically from the Ancient Greek words of "σωματικός" (sōmatikós, “bodily”) and σῶμα (sôma, “body”) - is often used in biology to refer to the cells of the body in contrast to the reproductive (germl ...
''
BRCA1 Breast cancer type 1 susceptibility protein is a protein that in humans is encoded by the ''BRCA1'' () gene. Orthologs are common in other vertebrate species, whereas invertebrate genomes may encode a more distantly related gene. ''BRCA1'' is a ...
'' and ''
BRCA2 ''BRCA2'' and BRCA2 () are a human gene and its protein product, respectively. The official symbol (BRCA2, italic for the gene, nonitalic for the protein) and the official name (originally breast cancer 2; currently BRCA2, DNA repair associated) ...
'' mutations in several
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal bl ...
types (e.g. breast, pancreatic, ovarian, prostate). This signature results from DNA
double-strand break DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA dam ...
repair deficiency (or homologous recombination deficiency). Signature 3 is associated with high burden of
indels Indel is a molecular biology term for an insertion or deletion of bases in the genome of an organism. It is classified among small genetic variations, measuring from 1 to 10 000 base pairs in length, including insertion and deletion events that ...
with microhomology at the breakpoints.


APOBEC enzymes

APOBEC3 family of cytidine deaminase enzymes respond to viral infections by editing viral genome, but the enzymatic activity of '' APOBEC3A'' and '' APOBEC3B'' has also been found to cause unwanted host genome editing and may even participate to oncogenesis in
human papillomavirus Human papillomavirus infection (HPV infection) is caused by a DNA virus from the '' Papillomaviridae'' family. Many HPV infections cause no symptoms and 90% resolve spontaneously within two years. In some cases, an HPV infection persists and r ...
-related cancers. Signature 2 and Signature 13 are enriched for C>T and C>G substitutions and are thought to arise from cytidine deaminase activity of the AID/
APOBEC image:Apobec.J.Steinfeld.D.png, 300px, upExample of a member of the APOBEC family, APOBEC-2. A cytidine deaminase from ''Homo sapiens''.; ; rendered usinPyMOL APOBEC ("apolipoprotein B mRNA editing enzyme, catalytic polypeptide") is a family o ...
enzymes family. A germline deletion polymorphism involving '' APOBEC3A'' and '' APOBEC3B'' is associated with high burden of Signature 2 and Signature 13 mutations. This polymorphism is considered to be of moderate penetrance (two-fold above background risk) for breast cancer risk. The exact roles and mechanisms underlying
APOBEC image:Apobec.J.Steinfeld.D.png, 300px, upExample of a member of the APOBEC family, APOBEC-2. A cytidine deaminase from ''Homo sapiens''.; ; rendered usinPyMOL APOBEC ("apolipoprotein B mRNA editing enzyme, catalytic polypeptide") is a family o ...
-mediated genome editing are not yet fully delineated, but activation-induced cytidine deaminase(AID)/
APOBEC image:Apobec.J.Steinfeld.D.png, 300px, upExample of a member of the APOBEC family, APOBEC-2. A cytidine deaminase from ''Homo sapiens''.; ; rendered usinPyMOL APOBEC ("apolipoprotein B mRNA editing enzyme, catalytic polypeptide") is a family o ...
complex is thought to be involved in host immune response to viral infections and lipid metabolism. Both Signature 2 and Signature 13 are feature cytosine to uracil substitutions due to cytidine deaminases. Signature 2 has a higher proportion of C >C substitutions and Signature 13 a higher proportion of T >G substitutions. '' APOBEC3A'' and '' APOBEC3B''-mediated mutagenesis preferentially involve the lagging DNA strand during replication.


Mismatch repair deficiency

Four
COSMIC Cosmic commonly refers to: * The cosmos, a concept of the universe Cosmic may also refer to: Media * ''Cosmic'' (album), an album by Bazzi * Afro/Cosmic music * "Cosmic", a song by Kylie Minogue from the album '' X'' * CosM.i.C, a member of ...
mutational signatures have been associated with
DNA mismatch repair DNA mismatch repair (MMR) is a system for recognizing and repairing erroneous insertion, deletion, and mis-incorporation of bases that can arise during DNA replication and recombination, as well as repairing some forms of DNA damage. Mismatch ...
deficiency and found in tumors with
microsatellite instability Microsatellite instability (MSI) is the condition of genetic hypermutability (predisposition to mutation) that results from impaired DNA mismatch repair (MMR). The presence of MSI represents phenotypic evidence that MMR is not functioning normal ...
: Signature 6, 15, 20 and 26. Loss of function ''
MLH1 DNA mismatch repair protein Mlh1 or MutL protein homolog 1 is a protein that in humans is encoded by the MLH1 gene located on chromosome 3. It is a gene commonly associated with hereditary nonpolyposis colorectal cancer. Orthologs of human MLH1 ...
'', ''
MSH2 DNA mismatch repair protein Msh2 also known as MutS homolog 2 or MSH2 is a protein that in humans is encoded by the ''MSH2'' gene, which is located on chromosome 2. MSH2 is a tumor suppressor gene and more specifically a caretaker gene that code ...
'', '' MSH6'' or ''
PMS2 Mismatch repair endonuclease PMS2 is an enzyme that in humans is encoded by the ''PMS2'' gene. Function This gene is one of the PMS2 gene family members which are found in clusters on chromosome 7. Human PMS2 related genes are located at bands ...
'' genes cause defective
DNA mismatch repair DNA mismatch repair (MMR) is a system for recognizing and repairing erroneous insertion, deletion, and mis-incorporation of bases that can arise during DNA replication and recombination, as well as repairing some forms of DNA damage. Mismatch ...
.


DNA proofreading

Signature 10 has a transcriptional bias and is enriched for C>A substitutions in the TpCpT context as well as T>G substitutions in the TpTpTp context. Signature 10 is associated with altered function of
DNA polymerase epsilon DNA polymerase epsilon is a member of the DNA polymerase family of enzymes found in eukaryotes Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong ...
, which result in deficient
DNA proofreading The term proofreading is used in genetics to refer to the error-correcting processes, first proposed by John Hopfield and Jacques Ninio, involved in DNA replication, immune system specificity, enzyme-substrate recognition among many other processes ...
activity. Both germline and somatic ''
POLE (gene) DNA polymerase epsilon catalytic subunit is an enzyme that in humans is encoded by the ''POLE'' gene. It is the central catalytic subunit of DNA polymerase epsilon DNA polymerase epsilon is a member of the DNA polymerase family of enzymes found ...
'' exonuclease domain mutations are associated with Signature 10.


Base excision repair

Somatic enrichment for
transversion Transversion, in molecular biology, refers to a point mutation in DNA in which a single (two ring) purine ( A or G) is changed for a (one ring) pyrimidine ( T or C), or vice versa. A transversion can be spontaneous, or it can be caused by ion ...
mutations (G:C>T:A) has been associated with
base excision repair Base excision repair (BER) is a cellular mechanism, studied in the fields of biochemistry and genetics, that repairs damaged DNA throughout the cell cycle. It is responsible primarily for removing small, non-helix-distorting base lesions from t ...
(BER) deficiency and linked to defective ''
MUTYH ''MUTYH'' (mutY DNA glycosylase) is a human gene that encodes a DNA glycosylase, MUTYH glycosylase. It is involved in oxidative DNA damage repair and is part of the base excision repair pathway. The enzyme excises adenine bases from the DNA backbo ...
'', a
DNA glycosylase DNA glycosylases are a family of enzymes involved in base excision repair, classified under EC number EC 3.2.2. Base excision repair is the mechanism by which damaged bases in DNA are removed and replaced. DNA glycosylases catalyze the first st ...
, in colorectal cancer. Direct
DNA oxidation DNA oxidation is the process of oxidative damage of deoxyribonucleic acid. As described in detail by Burrows et al., 8-oxo-2'-deoxyguanosine (8-oxo-dG) is the most common oxidative lesion observed in duplex DNA because guanine has a lower one-el ...
damage leads to the creation of
8-Oxoguanine 8-Oxoguanine (8-hydroxyguanine, 8-oxo-Gua, or OH8Gua) is one of the most common DNA lesions resulting from reactive oxygen species modifying guanine, and can result in a mismatched pairing with adenine resulting in G to T and C to A substitutions ...
, which if remains un-repaired, will lead to incorporation of
adenine Adenine () (symbol A or Ade) is a nucleobase (a purine derivative). It is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The three others are guanine, cytosine and thymine. Its derivativ ...
instead of
cytosine Cytosine () (symbol C or Cyt) is one of the four nucleobases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached ...
during DNA replication.
MUTYH ''MUTYH'' (mutY DNA glycosylase) is a human gene that encodes a DNA glycosylase, MUTYH glycosylase. It is involved in oxidative DNA damage repair and is part of the base excision repair pathway. The enzyme excises adenine bases from the DNA backbo ...
encodes the
adenine Adenine () (symbol A or Ade) is a nucleobase (a purine derivative). It is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The three others are guanine, cytosine and thymine. Its derivativ ...
glycosylase enzyme which excise the mismatched
adenine Adenine () (symbol A or Ade) is a nucleobase (a purine derivative). It is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The three others are guanine, cytosine and thymine. Its derivativ ...
from
8-Oxoguanine 8-Oxoguanine (8-hydroxyguanine, 8-oxo-Gua, or OH8Gua) is one of the most common DNA lesions resulting from reactive oxygen species modifying guanine, and can result in a mismatched pairing with adenine resulting in G to T and C to A substitutions ...
:
adenine Adenine () (symbol A or Ade) is a nucleobase (a purine derivative). It is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The three others are guanine, cytosine and thymine. Its derivativ ...
base pairing, therefore enabling
DNA repair DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA da ...
mechanisms involving '' OGG1'' (Oxoguanine glycosylase) and ''
NUDT1 2-hydroxy-dATP diphosphatase (, also known as oxidized purine nucleoside triphosphatase, or (2'-deoxy) ribonucleoside 5'-triphosphate pyrophosphohydrolase, or Nudix hydrolase 1 (NUDT1), or MutT homolog 1 (MTH1), or 7,8-dihydro-8-oxoguanine triphos ...
'' (Nudix hydrolase 1, also known as ''MTH1'', MutT homolog 1) to remove the damaged
8-Oxoguanine 8-Oxoguanine (8-hydroxyguanine, 8-oxo-Gua, or OH8Gua) is one of the most common DNA lesions resulting from reactive oxygen species modifying guanine, and can result in a mismatched pairing with adenine resulting in G to T and C to A substitutions ...
.


Exposures to exogenous genotoxins

Selected exogenous genotoxins/
carcinogens A carcinogen is any substance, radionuclide, or radiation that promotes carcinogenesis (the formation of cancer). This may be due to the ability to damage the genome or to the disruption of cellular metabolic processes. Several radioactive substan ...
and their
mutagen In genetics, a mutagen is a physical or chemical agent that permanently changes genetic material, usually DNA, in an organism and thus increases the frequency of mutations above the natural background level. As many mutations can cause cancer i ...
-induced DNA damage and repair mechanisms have been linked to specific molecular signatures.


Ultraviolet radiation (UV)

:Signature 7 has a predominance of C>T substitutions at sites of adjacent pyrimidines (adjacent C or T), with a particularly diagnostic subset being the CC>TT dinucleotide mutation. This pattern arises because the major UV-induced DNA photoproducts join two adjacent pyrimidines; the photoproduct is typically the cyclobutane pyrimidine dimer (CPD). Specificity for C>T appears to be due to the million-fold acceleration of C deamination when it is part of a CPD, with the resulting uracil acting as T. CPDs are repaired via transcription-coupled
nucleotide excision repair Nucleotide excision repair is a DNA repair mechanism. DNA damage occurs constantly because of chemicals (e.g. intercalating agents), radiation and other mutagens. Three excision repair pathways exist to repair single stranded DNA damage: Nucle ...
, causing a strong bias for C>T substitutions enriched on the untranscribed DNA strand. The regions of a tumor suppressor protein that are mutationally inactivated in sunlight-related skin cancers are the same as in cancers of organs not exposed to sunlight, but the nucleotide mutated is often shifted a few bases to a site where a CPD could form.
Ultraviolet Ultraviolet (UV) is a form of electromagnetic radiation with wavelength from 10 nm (with a corresponding frequency around 30  PHz) to 400 nm (750  THz), shorter than that of visible light, but longer than X-rays. UV radiati ...
radiation exposure is therefore the proposed underlying mutagenic mechanism of this signature. UV also illustrates a subtlety in interpreting a tumor signature as a mutagen signature: only three-quarters of mutations induced by UV in the laboratory are UV signature mutations because UV also triggers cellular oxidative processes. Therefore even if all mutations in a tumor were caused by UV from sunlight, one quarter of the mutations are expected to not be UV signature mutations. A second carcinogen needn't be invoked to explain those mutations, but a second mutational process is required. Identification of a UV signature in a tumor of unknown primary site is clinically important as it suggests a diagnosis of metastatic skin cancer and has important treatment implications.


Alkylating agents

:Signature 11 was identified in tumors previously exposed to Temozolamide, an
alkylating agent Alkylation is the transfer of an alkyl group from one molecule to another. The alkyl group may be transferred as an alkyl carbocation, a free radical, a carbanion, or a carbene (or their equivalents). Alkylating agents are reagents for effecting ...
. This signature is enriched for C>T substitutions on
guanine Guanine () (symbol G or Gua) is one of the four main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside is c ...
bases due to transcription-coupled
nucleotide excision repair Nucleotide excision repair is a DNA repair mechanism. DNA damage occurs constantly because of chemicals (e.g. intercalating agents), radiation and other mutagens. Three excision repair pathways exist to repair single stranded DNA damage: Nucle ...
. A strong transcriptional strand-bias is present in this signature.


Tobacco

:Both Signature 4 (
tobacco Tobacco is the common name of several plants in the genus '' Nicotiana'' of the family Solanaceae, and the general term for any product prepared from the cured leaves of these plants. More than 70 species of tobacco are known, but the ch ...
smoking,
lung cancer Lung cancer, also known as lung carcinoma (since about 98–99% of all lung cancers are carcinomas), is a malignant lung tumor characterized by uncontrolled cell growth in tissues of the lung. Lung carcinomas derive from transformed, malign ...
) and Signature 29 (
tobacco Tobacco is the common name of several plants in the genus '' Nicotiana'' of the family Solanaceae, and the general term for any product prepared from the cured leaves of these plants. More than 70 species of tobacco are known, but the ch ...
chewing, gingivo-buccal oral
squamous cell carcinoma Squamous-cell carcinomas (SCCs), also known as epidermoid carcinomas, comprise a number of different types of cancer that begin in squamous cells. These cells form on the surface of the skin, on the lining of hollow organs in the body, and on th ...
) display transcriptional strand-bias and enrichment for C>A substitutions, but their respective composition and patterns (proportion of each mutation types) differ slightly. :The proposed underlying mechanism of Signature 4 is the removal of DNA adducts (
tobacco Tobacco is the common name of several plants in the genus '' Nicotiana'' of the family Solanaceae, and the general term for any product prepared from the cured leaves of these plants. More than 70 species of tobacco are known, but the ch ...
benzo(a)pyrene Benzo 'a''yrene (B''a''P or B ) is a polycyclic aromatic hydrocarbon and the result of incomplete combustion of organic matter at temperatures between and . The ubiquitous compound can be found in coal tar, tobacco smoke and many foods, esp ...
covalently bounded to
guanine Guanine () (symbol G or Gua) is one of the four main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside is c ...
) by the transcription-coupled
nucleotide excision repair Nucleotide excision repair is a DNA repair mechanism. DNA damage occurs constantly because of chemicals (e.g. intercalating agents), radiation and other mutagens. Three excision repair pathways exist to repair single stranded DNA damage: Nucle ...
(NER) machinery.


Immunoglobulin gene hypermutation

Signature 9 has been identified in
chronic lymphocytic leukemia Chronic lymphocytic leukemia (CLL) is a type of cancer in which the bone marrow makes too many lymphocytes (a type of white blood cell). Early on, there are typically no symptoms. Later, non-painful lymph node swelling, feeling tired, fever, n ...
and malignant
B-cell lymphoma The B-cell lymphomas are types of lymphoma affecting B cells. Lymphomas are "blood cancers" in the lymph nodes. They develop more frequently in older adults and in immunocompromised individuals. B-cell lymphomas include both Hodgkin's lymphoma ...
and feature enrichment for T>G
transversion Transversion, in molecular biology, refers to a point mutation in DNA in which a single (two ring) purine ( A or G) is changed for a (one ring) pyrimidine ( T or C), or vice versa. A transversion can be spontaneous, or it can be caused by ion ...
events. It is thought to result from error-prone polymerase ('' POLH'' gene)-associated
mutagenesis Mutagenesis () is a process by which the genetic information of an organism is changed by the production of a mutation. It may occur spontaneously in nature, or as a result of exposure to mutagens. It can also be achieved experimentally using l ...
. Recently, polymerase error-prone synthesis signature has been linked to non-hematological cancers (e.g.
skin cancer Skin cancers are cancers that arise from the skin. They are due to the development of abnormal cells that have the ability to invade or spread to other parts of the body. There are three main types of skin cancers: basal-cell skin cancer (BC ...
) and was hypothesized to contribute to YCG motif
mutagenesis Mutagenesis () is a process by which the genetic information of an organism is changed by the production of a mutation. It may occur spontaneously in nature, or as a result of exposure to mutagens. It can also be achieved experimentally using l ...
and could partly explain the increase TC dinucleotides substitutions.


History

During the 1990s, Curtis Harris at the US National Cancer Institute and
Bert Vogelstein Bert Vogelstein (born 1949) is director of the Ludwig Center, Clayton Professor of Oncology and Pathology and a Howard Hughes Medical Institute investigator at The Johns Hopkins Medical School and Sidney Kimmel Comprehensive Cancer Center. A pi ...
at the Johns Hopkins Oncology Center in Baltimore reviewed data showing that different types of cancer had their own unique suite of mutations in p53, which were likely to have been caused by different agents, such as the chemicals in
tobacco smoke Tobacco smoke is a sooty aerosol produced by the incomplete combustion of tobacco during the smoking of cigarettes and other tobacco products. Temperatures in burning cigarettes range from about 400 °C between puffs to about 900 °C d ...
or
ultraviolet Ultraviolet (UV) is a form of electromagnetic radiation with wavelength from 10 nm (with a corresponding frequency around 30  PHz) to 400 nm (750  THz), shorter than that of visible light, but longer than X-rays. UV radiati ...
light from the sun. With the advent of
next-generation sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation s ...
,
Michael Stratton Sir Michael Rudolf Stratton, (born 22 June 1957) is a British clinical scientist and the third director of the Wellcome Trust Sanger Institute. He currently heads the Cancer Genome Project and is a leader of the International Cancer Genome Con ...
saw the potential for the technology to revolutionize our understanding of the genetic changes inside individual tumors, setting the
Wellcome Sanger Institute The Wellcome Sanger Institute, previously known as The Sanger Centre and Wellcome Trust Sanger Institute, is a non-profit British genomics and genetics research institute, primarily funded by the Wellcome Trust. It is located on the Wellcome G ...
's huge banks of DNA-sequencing machines in motion to read every single letter of DNA in a tumor. By 2009, Stratton and his team had produced the first whole cancer genome sequences. These were detailed maps showing all the genetic changes and mutations that had occurred within two individual cancers—a melanoma from the skin and a lung tumor. The melanoma and lung cancer genomes were powerful proof that the fingerprints of specific culprits could be seen in cancers with one major cause. These tumors still contained many mutations that could not be explained by ultraviolet light or tobacco smoking. The detective work became a lot more complicated for cancers with complex, multiple or even completely unknown origins. By way of analogy, imagine a forensic scientist dusting for fingerprints at a murder scene. The forensic scientist might strike it lucky and find a set of perfect prints on a windowpane or door handle that match a known killer. However, they are much more likely to uncover a mish-mash of fingerprints belonging to a whole range of folk—from the victim and potential suspects to innocent parties and police investigators—all laid on top of each other on all sorts of surfaces. This is very similar to cancer genomes where multiple mutational patterns are commonly overlaid one over another making the data incomprehensible. Fortunately, a PhD student of Stratton's,
Ludmil Alexandrov Ludmil B. Alexandrov is a Bulgarian-American scientist and an associate professor at the University of California, San Diego. Education Alexandrov received his PhD from University of Cambridge in 2014. Career and research Alexandrov is known ...
came up with a way of mathematically solving the problem. Alexandrov demonstrated that mutational patterns from individual mutagens found in a tumor can be distinguished from one another using a mathematical approach called
blind source separation Source separation, blind signal separation (BSS) or blind source separation, is the separation of a set of source signals from a set of mixed signals, without the aid of information (or with very little information) about the source signals or t ...
. The newly disentangled patterns of mutations were termed mutational signatures. In 2013, Alexandrov and Stratton published the first computational framework for deciphering mutational signatures from
cancer genomics Oncogenomics is a sub-field of genomics that characterizes cancer-associated genes. It focuses on genomic, epigenomic and transcript alterations in cancer. Cancer is a genetic disease caused by accumulation of DNA mutations and epigenetic alterati ...
data. Subsequently, they applied this framework to more than seven thousand cancer genomes creating the first comprehensive map of mutational signatures in human cancer. Currently, more than one hundred mutational signatures have been identified across the repertoire of human cancer. In April 2022 58 new mutational signatures were described.


See also

* Gene signature * Genomic signature *
Pan-cancer analysis Pan-cancer analysis aims to examine the similarities and differences among the genomic and cellular alterations found across diverse tumor types. International efforts have performed pan-cancer analysis on exomes and the whole genomes of cancers, ...
*
Whole genome sequencing Whole genome sequencing (WGS), also known as full genome sequencing, complete genome sequencing, or entire genome sequencing, is the process of determining the entirety, or nearly the entirety, of the DNA sequence of an organism's genome at a ...


Note list


References

{{Reflist Cancer research Genomics Mutagenesis