HOME

TheInfoList



OR:

Computational epigenetics uses statistical methods and mathematical modelling in
epigenetic In biology, epigenetics is the study of stable phenotypic changes (known as ''marks'') that do not involve alterations in the DNA sequence. The Greek prefix '' epi-'' ( "over, outside of, around") in ''epigenetics'' implies features that are " ...
research. Due to the recent explosion of epigenome datasets, computational methods play an increasing role in all areas of epigenetic research.


Definition

Research in computational epigenetics comprises the development and application of bioinformatics methods for solving epigenetic questions, as well as computational data analysis and theoretical modeling in the context of epigenetics. This includes modelling of the effects of histone and DNA CpG island methylation.


Current research areas


Epigenetic data processing and analysis

Various experimental techniques have been developed for genome-wide mapping of epigenetic information, the most widely used being
ChIP-on-chip ChIP-on-chip (also known as ChIP-chip) is a technology that combines chromatin immunoprecipitation ('ChIP') with DNA microarray (''"chip"''). Like regular ChIP, ChIP-on-chip is used to investigate interactions between proteins and DNA ''in vivo ...
,
ChIP-seq ChIP-sequencing, also known as ChIP-seq, is a method used to analyze protein interactions with DNA. ChIP-seq combines chromatin immunoprecipitation (ChIP) with massively parallel DNA sequencing to identify the binding sites of DNA-associated prote ...
and
bisulfite sequencing Bisulfite sequencing (also known as bisulphite sequencing) is the use of bisulfite treatment of DNA before routine sequencing to determine the pattern of methylation. DNA methylation was the first discovered epigenetic mark, and remains the mo ...
. All of these methods generate large amounts of data and require efficient ways of data processing and quality control by bioinformatic methods.


Epigenome prediction

A substantial amount of bioinformatic research has been devoted to the
prediction A prediction (Latin ''præ-'', "before," and ''dicere'', "to say"), or forecast, is a statement about a future event or data. They are often, but not always, based upon experience or knowledge. There is no universal agreement about the exac ...
of epigenetic information from characteristics of the
genome sequence In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
. Such predictions serve a dual purpose. First, accurate epigenome predictions can substitute for experimental data, to some degree, which is particularly relevant for newly discovered epigenetic mechanisms and for species other than human and mouse. Second, prediction algorithms build statistical models of epigenetic information from training data and can therefore act as a first step toward quantitative modeling of an epigenetic mechanism. Successful computational prediction of DNA and lysine methylation and acetylation has been achieved by combinations of various features.


Applications in cancer epigenetics

The important role of epigenetic defects for
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal bl ...
opens up new opportunities for improved diagnosis and therapy. These active areas of research give rise to two questions that are particularly amenable to bioinformatic analysis. First, given a list of genomic regions exhibiting epigenetic differences between tumor cells and controls (or between different disease subtypes), can we detect common patterns or find evidence of a functional relationship of these regions to cancer? Second, can we use bioinformatic methods in order to improve diagnosis and therapy by detecting and classifying important disease subtypes?


Emerging topics

The first wave of research in the field of computational epigenetics was driven by rapid progress of experimental methods for data generation, which required adequate computational methods for data processing and quality control, prompted epigenome prediction studies as a means of understanding the genomic distribution of epigenetic information, and provided the foundation for initial projects on
cancer epigenetics Cancer epigenetics is the study of epigenetic modifications to the DNA of cancer cells that do not involve a change in the nucleotide sequence, but instead involve a change in the way the genetic code is expressed. Epigenetic mechanisms are necess ...
. While these topics will continue to be major areas of research and the mere quantity of epigenetic data arising from epigenome projects poses a significant bioinformatic challenge, several additional topics are currently emerging. * Epigenetic regulatory circuitry: Reverse engineering the regulatory networks that read, write and execute epigenetic codes. * Population epigenetics: Distilling regulatory mechanisms from the integration of epigenome data with
gene expression Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. ...
profiles and
haplotype A haplotype ( haploid genotype) is a group of alleles in an organism that are inherited together from a single parent. Many organisms contain genetic material ( DNA) which is inherited from two parents. Normally these organisms have their DNA or ...
maps for a large sample from a heterogeneous population.'' * Evolutionary epigenetics: Learning about epigenome regulation in human (and its medical consequences) by cross-species comparisons. * Theoretical modeling: Testing our mechanistic and quantitative understanding of epigenetic mechanisms by
in silico In biology and other experimental sciences, an ''in silico'' experiment is one performed on computer or via computer simulation. The phrase is pseudo-Latin for 'in silicon' (correct la, in silicio), referring to silicon in computer chips. It ...
simulation. * Genome browsers: Developing a new blend of web services that enable biologists to perform sophisticated genome and epigenome analysis within an easy-to-use genome browser environment. * Medical epigenetics: Searching for epigenetic mechanisms that play a role in diseases other than cancer, as there is strong circumstantial evidence for epigenetic regulation being involved in
mental disorders A mental disorder, also referred to as a mental illness or psychiatric disorder, is a behavioral or mental pattern that causes significant distress or impairment of personal functioning. Such features may be persistent, relapsing and remitt ...
,
autoimmune diseases An autoimmune disease is a condition arising from an abnormal immune response to a functioning body part. At least 80 types of autoimmune diseases have been identified, with some evidence suggesting that there may be more than 100 types. Nearly a ...
and other complex diseases.


Epigenetics Databases

#
MethDB MethDB is a database for DNA methylation data. See also * DNA methylation DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the ...
Contains information on 19,905 DNA methylation content data and 5,382 methylation patterns for 48 species, 1,511 individuals, 198 tissues and cell lines and 79 phenotypes. # Pubmeth Contains over 5,000 records on methylated genes in various cancer types. # REBASE Contains over 22,000 DNA methyltransferases genes derived from GenBank. # DeepBlue Epigenomic Database contains epigenomic data from more than 60,000 experiments from different IHEC members files divided in many different epigenetic marks. DeepBlue also provides an API for access and process the data on the server. # MeInfoText Contains gene methylation information across 205 human cancer types. # MethPrimerDB Contains 259 primer sets from human, mouse and rat for DNA methylation analysis. # The Histone Database Contains 254 sequences from histone H1, 383 from histone H2, 311 from histone H2B, 1043 from histone H3 and 198 from histone H4, altogether representing at least 857 species. # ChromDB Contains 9,341 chromatin-associated proteins, including RNAi-associated proteins, for a broad range of organisms. # CREMOFAC Contains 1725 redundant and 720 non-redundant chromatin-remodeling factor sequences in eukaryotes. # The Krembil Family Epigenetics Laboratory Contains DNA methylation data of human chromosomes 21, 22, male germ cells and DNA methylation profiles in monozygotic and dizygotic twins. # MethyLogiX DNA methylation databaseMethylation Database
Contains DNA methylation data of human chromosomes 21 and 22, male germ cells and late-onset Alzheimer's disease.


Sources and further reading

* The original version of this article was based on a review paper on computational epigenetics that appeared in the January 2008 issue of the Bioinformatics journal: . This review paper provides >100 references to scientific papers and extensive background information. * Additional data has been updated and added, based on a review paper on computational epigenetics that appeared in the January 2010 issue of th
Bioinformation
journal:
Lim S.J., Tan T.W. and Tong, J.C. (2010) Computational epigenetics: the new scientific paradigm. Bioinformation, 4(7): 331-337
'. This review paper provides >129 references to scientific papers. It is published as
open access Open access (OA) is a set of principles and a range of practices through which research outputs are distributed online, free of access charges or other barriers. With open access strictly defined (according to the 2001 definition), or libre o ...
and can be downloaded freely from the publisher's web page
http://bioinformation.net/004/007000042010.pdf


References

{{Reflist Epigenetics Bioinformatics Biophysics Computational fields of study