HOME

TheInfoList



OR:

Methylated DNA immunoprecipitation (MeDIP or mDIP) is a large-scale (
chromosome A chromosome is a package of DNA containing part or all of the genetic material of an organism. In most chromosomes, the very long thin DNA fibers are coated with nucleosome-forming packaging proteins; in eukaryotic cells, the most import ...
- or
genome A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
-wide) purification technique in molecular biology that is used to enrich for methylated DNA sequences. It consists of isolating methylated DNA fragments via an
antibody An antibody (Ab) or immunoglobulin (Ig) is a large, Y-shaped protein belonging to the immunoglobulin superfamily which is used by the immune system to identify and neutralize antigens such as pathogenic bacteria, bacteria and viruses, includin ...
raised against
5-methylcytosine 5-Methylcytosine (5mC) is a methylation, methylated form of the DNA base cytosine (C) that regulates gene Transcription (genetics), transcription and takes several other biological roles. When cytosine is methylated, the DNA maintains the same s ...
(5mC). This technique was first described by Weber M. ''et al.'' in 2005 and has helped pave the way for viable
methylome DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter, DNA methylation typically acts to ...
-level assessment efforts, as the purified fraction of methylated DNA can be input to high-throughput DNA detection methods such as high-resolution
DNA microarray A DNA microarray (also commonly known as a DNA chip or biochip) is a collection of microscopic DNA spots attached to a solid surface. Scientists use DNA microarrays to measure the expression levels of large numbers of genes simultaneously or t ...
s ( MeDIP-chip) or
next-generation sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation ...
(MeDIP-seq). Nonetheless, understanding of the methylome remains rudimentary; its study is complicated by the fact that, like other
epigenetic In biology, epigenetics is the study of changes in gene expression that happen without changes to the DNA sequence. The Greek prefix ''epi-'' (ἐπι- "over, outside of, around") in ''epigenetics'' implies features that are "on top of" or "in ...
properties, patterns vary from cell-type to cell-type.


Background

DNA methylation DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter (genetics), promoter, DNA methylati ...
, referring to the reversible methylation of the 5 position of cytosine by
methyltransferases Methyltransferases are a large group of enzymes that all Methylation, methylate their substrates but can be split into several subclasses based on their structural features. The most common class of methyltransferases is class I, all of which c ...
, is a major
epigenetic In biology, epigenetics is the study of changes in gene expression that happen without changes to the DNA sequence. The Greek prefix ''epi-'' (ἐπι- "over, outside of, around") in ''epigenetics'' implies features that are "on top of" or "in ...
modification in multicellular organisms. In mammals, this modification primarily occurs at
CpG sites The CpG sites or CG sites are regions of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the linear sequence of bases along its 5' → 3' direction. CpG sites occur with high frequency in genomic regions called CpG isl ...
, which in turn tend to cluster in regions called
CpG islands The CpG sites or CG sites are regions of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the linear sequence of bases along its 5' → 3' direction. CpG sites occur with high frequency in genomic regions called CpG isl ...
. There is a small fraction of CpG islands that can overlap or be in close proximity to promoter regions of transcription start sites. The modification may also occur at other sites, but methylation at either of these sites can repress gene expression by either interfering with the binding of
transcription factor In molecular biology, a transcription factor (TF) (or sequence-specific DNA-binding factor) is a protein that controls the rate of transcription (genetics), transcription of genetics, genetic information from DNA to messenger RNA, by binding t ...
s or modifying
chromatin Chromatin is a complex of DNA and protein found in eukaryote, eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important r ...
structure to a repressive state. Disease condition studies have largely fueled the effort in understanding the role of DNA methylation. Currently, the major research interest lies in investigating disease conditions such as
cancer Cancer is a group of diseases involving Cell growth#Disorders, abnormal cell growth with the potential to Invasion (cancer), invade or Metastasis, spread to other parts of the body. These contrast with benign tumors, which do not spread. Po ...
to identify regions of the DNA that has undergone extensive methylation changes. The genes contained in these regions are of functional interest as they may offer a mechanistic explanation to the underlying genetic causes of a disease. For instance, the abnormal methylation pattern of cancer cells was initially shown to be a mechanism through which
tumor suppressor A tumor suppressor gene (TSG), or anti-oncogene, is a gene that regulates a cell (biology), cell during cell division and replication. If the cell grows uncontrollably, it will result in cancer. When a tumor suppressor gene is mutated, it results ...
-like genes are silenced, although it was later observed that a much broader range of gene types are affected.


Other technologies

There are two approaches to methylation analysis: typing and profiling technologies. Typing technologies are targeted towards a small number of loci across many samples, and involve the use of techniques such as PCR,
restriction enzymes A restriction enzyme, restriction endonuclease, REase, ENase or'' restrictase '' is an enzyme that cleaves DNA into fragments at or near specific recognition sites within molecules known as restriction sites. Restriction enzymes are one class o ...
, and
mass spectrometry Mass spectrometry (MS) is an analytical technique that is used to measure the mass-to-charge ratio of ions. The results are presented as a ''mass spectrum'', a plot of intensity as a function of the mass-to-charge ratio. Mass spectrometry is used ...
. Profiling technologies such as MeDIP are targeted towards a
genome A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
- or
methylome DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter, DNA methylation typically acts to ...
-wide level assessment of methylation; this includes
restriction landmark genomic scanning Restriction landmark genomic scanning (RLGS) is a genome analysis method for rapid simultaneous visualization of thousands of landmarks, or restriction sites. Using a combination of restriction enzymes some of which are specific to DNA modificati ...
(RLGS), and bisulfite conversion-based methods, which rely on the treatment of DNA with
bisulfite The bisulfite ion (IUPAC-recommended nomenclature: hydrogensulfite) is the ion . Salts containing the ion are also known as "sulfite lyes". Sodium bisulfite is used interchangeably with sodium metabisulfite (Na2S2O5). Sodium metabisulfite diss ...
to convert unmethylated
cytosine Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
residues to
uracil Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
.


Limitations of other technologies

Other methods mapping and profiling the methylome have been effective but are not without their limitations that can affect resolution, level of throughput, or experimental variations. For instance, RLGS is limited by the number of restriction sites in genome that can be targets for the restriction enzyme; typically, a maximum of ~4100 landmarks can be assessed.
Bisulfite sequencing Bisulfite sequencing (also known as bisulphite sequencing) is the use of bisulfite treatment of DNA before routine sequencing to determine the pattern of methylation. DNA methylation was the first discovered epigenetic mark, and remains the mos ...
-based methods, despite possible single-nucleotide resolution, have a drawback: the conversion of unmethylated cytosine to uracil can be unstable. In addition, when bisulfite conversion is coupled with
DNA microarrays A DNA microarray (also commonly known as a DNA chip or biochip) is a collection of microscopic DNA spots attached to a solid surface. Scientists use DNA microarrays to measure the expression levels of large numbers of genes simultaneously or to ...
to detect bisulfite converted sites, the reduced sequence complexity of DNA is a problem. Microarrays capable of comprehensively profiling the whole-genome become difficult to design as fewer unique probes are available.


Methods

The following sections outline the method of MeDIP coupled with either high-resolution array hybridization or high-throughput sequencing. Each DNA detection method will also briefly describe post-laboratory processing and analysis. Different post-processing of the raw data is required depending on the technology used to identify the methylated sequences. This is analogous to data generated using
ChIP-chip ChIP-on-chip (also known as ChIP-chip) is a technology that combines chromatin immunoprecipitation ('ChIP') with DNA microarray (''"chip"''). Like regular Ch-IP, ChIP, ChIP-on-chip is used to investigate interactions between proteins and DNA ''in ...
and
ChIP-seq ChIP-sequencing, also known as ChIP-seq, is a method used to analyze protein interactions with DNA. ChIP-seq combines chromatin immunoprecipitation (ChIP) with Massively parallel signature sequencing, massively parallel DNA sequencing to identify t ...
.


Methylated DNA immunoprecipitation (MeDIP)

Genomic DNA is extracted (
DNA extraction The first isolation of deoxyribonucleic acid (DNA) was done in 1869 by Friedrich Miescher. DNA extraction is the process of isolating DNA from the cells of an organism isolated from a sample, typically a biological sample such as blood, saliva, ...
) from the cells and purified. The purified DNA is then subjected to sonication to shear it into random fragments. This sonication process is quick, simple, and avoids
restriction enzyme A restriction enzyme, restriction endonuclease, REase, ENase or'' restrictase '' is an enzyme that cleaves DNA into fragments at or near specific recognition sites within molecules known as restriction sites. Restriction enzymes are one class o ...
biases. The resulting fragments range from 300 to 1000 base pairs (bp) in length, although they are typically between 400 and 600 bp. The short length of these fragments is important in obtaining adequate resolution, improving the efficiency of the downstream step in immunoprecipitation, and reducing fragment-length effects or biases. Also, the size of the fragment affects the binding of 5-methyl-cytidine (5mC) antibody because the antibody needs more than just a single 5mC for efficient binding. To further improve binding affinity of the antibodies, the DNA fragments are denatured to produce single-stranded DNA. Following denaturation, the DNA is incubated with
monoclonal In biology, monoclonality refers to the state of a line of cells that have been derived from a single clonal origin. Thus, "monoclonal cells" can be said to form a single clone. The term ''monoclonal'' comes . The process of replication can occ ...
5mC antibodies. The classical
immunoprecipitation Immunoprecipitation (IP) is the technique of precipitating a protein antigen out of solution using an antibody that specifically binds to that particular protein. This process can be used to isolate and concentrate a particular protein from a sam ...
technique is then applied: magnetic beads conjugated to anti-mouse-
IgG Immunoglobulin G (IgG) is a type of antibody. Representing approximately 75% of serum antibodies in humans, IgG is the most common type of antibody found in blood circulation. IgG molecules are created and released by plasma B cells. Each IgG ant ...
are used to bind the anti-5mC antibodies, and unbound DNA is removed in the supernatant. To purify the DNA,
proteinase K In molecular biology, Proteinase K (, ''protease K'', ''endopeptidase K'', ''Tritirachium alkaline proteinase'', ''Tritirachium album serine proteinase'', ''Tritirachium album proteinase K'') is a broad-spectrum serine protease. The enzyme was ...
is added to digest the antibodies and release the DNA, which can be collected and prepared for DNA detection. For more details regarding the experimental steps see.


MeDIP and array-based hybridization (MeDIP-chip)

A fraction of the input DNA obtained after the sonication step above is labeled with
cyanine Cyanines, also referred to as tetramethylindo(di)-carbocyanines are a synthetic dye family belonging to the polymethine group. Although the name derives etymologically from terms for shades of blue, the cyanine family covers the electromagnetic s ...
-5 (Cy5; red) deoxy-cytosine-triphosphate while the methylated DNA, enriched after the immunoprecipitation step, is labeled with
cyanine Cyanines, also referred to as tetramethylindo(di)-carbocyanines are a synthetic dye family belonging to the polymethine group. Although the name derives etymologically from terms for shades of blue, the cyanine family covers the electromagnetic s ...
-3 (Cy3; green). The labeled DNA samples are cohybridized on a 2-channel, high-density genomic microarray to probe for presence and relative quantities. The purpose of this comparison is to identify sequences that show significant differences in hybridization levels, thereby confirming the sequence of interest is enriched. Array-based identification of MeDIP sequences are limited to the array design. As a result, the resolution is restricted to the probes in the array design. There are additional standard steps required in signal processing to correct for hybridization issues such as noise, as is the case with most array technologies. See for more details.


MeDIP and high-throughput sequencing (MeDIP-seq)

The MeDIP-seq approach, i.e. the coupling of MeDIP with next generation, short-read sequencing technologies such as 454 pyrosequencing or Illumina (Solexa), was first described by Down ''et al.'' in 2008. The high-throughput sequencing of the methylated DNA fragments produces a large number of short reads (36-50bp or 400 bp, depending on the technology). The short reads are aligned to a reference genome using alignment software such as Mapping and Assembly with Quality
Maq
, which uses a
Bayesian Thomas Bayes ( ; c. 1701 – 1761) was an English statistician, philosopher, and Presbyterian minister. Bayesian ( or ) may be either any of a range of concepts and approaches that relate to statistical methods based on Bayes' theorem Bayes ...
approach, along with base and mapping qualities to model error probabilities for the alignments. The reads can then be extended to represent the ~400 to 700 bp fragments from the sonication step. The coverage of these extended reads can be used to estimate the methylation level of the region. A
genome browser The completion of the human genome sequencing in the early 2000s was a turning point in genomics research. Scientists have conducted series of research into the activities of genes and the genome as a whole. The human genome contains around 3 billio ...
such a
Ensembl
can also be used to visualize the data. Validation of the approach to assess quality and accuracy of the data can be done with
quantitative PCR A real-time polymerase chain reaction (real-time PCR, or qPCR when used quantitatively) is a laboratory technique of molecular biology based on the polymerase chain reaction (PCR). It monitors the amplification of a targeted DNA molecule duri ...
. This is done by comparing a sequence from the MeDIP sample against an unmethylated control sequence. The samples are then run on a
gel A gel is a semi-solid that can have properties ranging from soft and weak to hard and tough. Gels are defined as a substantially dilute cross-linked system, which exhibits no flow when in the steady state, although the liquid phase may still d ...
and the band intensities are compared. The relative intensity serves as the guide for finding enrichment. The results can also be compared with MeDIP-chip results to help determine coverage needed.


Cell-free MeDIP-seq (cfMeDIP-seq)

Cell-free methylated DNA immunoprecipitation and high-throughput sequencing (cfMeDIP-seq) is a variation of the MeDIP-seq technique specifically adapted for profiling DNA methylation from low amounts of circulating cell-free DNA (cfDNA) or
cfRNA CFRNA ("The French-Romanian Company for Air Transport"; ; ) was a French Third Republic, French–Kingdom of Romania, Romanian airline, Portal:Aviation/Historical anniversaries/January in aviation/January 1, founded on 1 January 1920. Its name ch ...
found in bodily fluids such as plasma, urine, and cerebrospinal fluid. Standard MeDIP-seq protocols typically require higher DNA input amounts (e.g., 100 ng), which can be a limitation when working with the scarce quantities of cfDNA. cfMeDIP-seq addresses this challenge by employing optimized protocols that can handle significantly lower DNA input (typically 1–10 ng). A key adaptation often involves the addition of " filler DNA" (e.g., bacteriophage lambda DNA or synthetic DNA) to the low-input cfDNA sample before the immunoprecipitation step. This filler DNA acts as a carrier, improving the efficiency and specificity of the antibody-mediated capture of methylated cfDNA fragments. Some protocols also incorporate synthetic spike-in controls to help quantify methylation and correct for technical variations. Like standard MeDIP-seq, cfMeDIP-seq is a bisulfite-free method, which avoids the DNA degradation associated with bisulfite conversion, a process used in other methylation profiling techniques like whole-genome bisulfite sequencing (WGBS). This makes it advantageous for analyzing fragmented cfDNA. cfMeDIP-seq has emerged as a promising technique for
liquid biopsy A liquid biopsy, also known as fluid biopsy or fluid phase biopsy, is the sampling and analysis of non-solid biological tissue, primarily blood. Like traditional biopsy, this type of technique is mainly used as a diagnostic and monitoring tool for ...
applications, particularly in
cancer detection The objective of cancer screening is to detect cancer before symptoms appear, involving various methods such as blood tests, urine tests, DNA tests, and medical imaging. The purpose of screening is early cancer detection, to make the cancer easie ...
and
minimal residual disease Minimal residual disease (MRD), also known as molecular residual disease, is the medical condition in which small number of cancer cells persist in a patient either during or after treatment when the patient is in Remission (medicine), remission ...
monitoring. By analyzing the methylation patterns of cfDNA, it is possible to identify tumor-specific epigenetic signatures circulating in the bloodstream, offering a non-invasive approach for early detection, prognostication, and assessment of treatment response. Beyond
oncology Oncology is a branch of medicine that deals with the study, treatment, diagnosis, and prevention of cancer. A medical professional who practices oncology is an ''oncologist''. The name's Etymology, etymological origin is the Greek word ὄγ ...
, cfMeDIP-seq is also being explored for applications in prenatal diagnostics and cardiology. Bioinformatic pipelines specifically designed for cfMeDIP-seq data analysis have been developed to address the unique characteristics of cfDNA fragments and low input data.


Downstream bioinformatics analysis

The DNA methylation level estimations can be confounded by varying densities of methylated CpG sites across the genome when observing data generated by MeDIP. This can be problematic for analyzing CpG-poor (lower density) regions. One reason for this density issue is its effect on the efficiency of immunoprecipitation. In their study, Down ''et al.'' developed a tool to estimate absolute methylation levels from data generated by MeDIP by modeling the density of methylated CpG sites. This tool is called
Bayesian tool for methylation analysis (Batman) Bayesian tool for methylation analysis, also known as BATMAN, is a statistical tool for analysing methylated DNA immunoprecipitation (MeDIP) profiles. It can be applied to large datasets generated using either oligonucleotide arrays (MeDIP-chip) or ...
. The study reports the coverage of ~90% of all CpG sites in promoters, gene-coding regions, islands, and regulatory elements where methylation levels can be estimated; this is almost 20 times better coverage than any previous methods. Studies using MeDIP-seq or MeDIP-chip are both genome-wide approaches that have the common aim of obtaining the functional mapping of the methylome. Once regions of DNA methylation are identified, a number of bioinformatics analyses can be applied to answer certain biological questions. One obvious step is to investigate genes contained in these regions and investigate the functional significance of their repression. For example, silencing of tumour-suppressor genes in cancer can be attributed to DNA methylation. By identifying mutational events leading to hypermethylation and subsequent repression of known tumour-suppressor genes, one can more specifically characterize the contributing factors to the cause of the disease. Alternatively, one can identify genes that are known to be normally methylated but, as a result of some mutation event, is no longer silenced. Also, one can try and investigate and identify whether some epigenetic regulator has been affected such as DNA methyltransferase (DNMT); in these cases, enrichment may be more limited. Gene-set analysis (for example using tools like DAVID and GoSeq) has been shown to be severely biased when applied to high-throughput methylation data (e.g. MeDIP-seq and MeDIP-ChIP); it has been suggested that this can be corrected using sample label permutations or using a statistical model to control for differences in the numberes of CpG probes / CpG sites that target each gene.


Limitations of MeDIP

Limitations to take note when using MeDIP are typical experimental factors. This includes the quality and cross-reactivity of 5mC antibodies used in the procedure. Furthermore, DNA detection methods (i.e. array hybridization and high-throughput sequencing) typically involve well established limitations. Particularly for array-based procedures, as mentioned above, sequences being analyzed are limited to the specific array design used. Most typical limitations to high-throughput, next generation sequencing apply. The problem of alignment accuracy to repetitive regions in the genome will result in less accurate analysis of methylation in those regions. Also, as was mentioned above, short reads (e.g. 36-50bp from a
Illumina Genome Analyzer
represent a part of a sheared fragment when aligned to the genome; therefore, the exact methylation site can fall anywhere within a window that is a function of the fragment size. In this respect, bisulfite sequencing has much higher resolution (down to a single CpG site; single nucleotide level). However, this level of resolution may not be required for most applications, as the methylation status of CpG sites within < 1000 bp has been shown to be significantly correlated.


Applications of MeDIP

* Weber ''et al.'' 2005 determined that the inactive X-chromosome in females is hypermethylated on a chromosome wide level using MeDIP coupled with microarray. * Keshet ''et al.'' 2006 performed a study on colon and prostate cancer cells using MeDIP-chip. The result is a genome-wide analysis of genes lying in hypermethylated regions as well as conclude that there is an instructive mechanism of de novo methylation in cancer cells. * Zhang ''et al.'' 2006 obtained a high resolution methylome mapping in Arabidopsis using MeDIP-chip. * Novak ''et al.'' 2006 used the MeDIP-chip approach to investigate human breast cancer for methylation associated silencing and observed the inactivation of the HOXA gene cluster


See also

*
Epigenetics In biology, epigenetics is the study of changes in gene expression that happen without changes to the DNA sequence. The Greek prefix ''epi-'' (ἐπι- "over, outside of, around") in ''epigenetics'' implies features that are "on top of" or "in ...
*
Immunoprecipitation Immunoprecipitation (IP) is the technique of precipitating a protein antigen out of solution using an antibody that specifically binds to that particular protein. This process can be used to isolate and concentrate a particular protein from a sam ...
*
Methylome DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter, DNA methylation typically acts to ...
*
Restriction landmark genomic scanning Restriction landmark genomic scanning (RLGS) is a genome analysis method for rapid simultaneous visualization of thousands of landmarks, or restriction sites. Using a combination of restriction enzymes some of which are specific to DNA modificati ...
*
Bisulfite sequencing Bisulfite sequencing (also known as bisulphite sequencing) is the use of bisulfite treatment of DNA before routine sequencing to determine the pattern of methylation. DNA methylation was the first discovered epigenetic mark, and remains the mos ...


References

{{DEFAULTSORT:Methylated Dna Immunoprecipitation Molecular biology Epigenetics Genomics Protein methods