Chemical Similarity
   HOME

TheInfoList



OR:

Chemical similarity (or molecular similarity) refers to the similarity of
chemical element A chemical element is a chemical substance whose atoms all have the same number of protons. The number of protons is called the atomic number of that element. For example, oxygen has an atomic number of 8: each oxygen atom has 8 protons in its ...
s,
molecule A molecule is a group of two or more atoms that are held together by Force, attractive forces known as chemical bonds; depending on context, the term may or may not include ions that satisfy this criterion. In quantum physics, organic chemi ...
s or
chemical compound A chemical compound is a chemical substance composed of many identical molecules (or molecular entities) containing atoms from more than one chemical element held together by chemical bonds. A molecule consisting of atoms of only one element ...
s with respect to either
structural A structure is an arrangement and organization of interrelated elements in a material object or system, or the object or system so organized. Material structures include man-made objects such as buildings and machines and natural objects such as ...
or functional qualities, i.e. the effect that the chemical compound has on
reaction Reaction may refer to a process or to a response to an action, event, or exposure. Physics and chemistry *Chemical reaction *Nuclear reaction *Reaction (physics), as defined by Newton's third law * Chain reaction (disambiguation) Biology and ...
partners in inorganic or biological settings. Biological effects and thus also similarity of effects are usually quantified using the
biological activity In pharmacology, biological activity or pharmacological activity describes the beneficial or adverse effects of a drug on living matter. When a drug is a complex chemical mixture, this activity is exerted by the substance's active ingredient or ...
of a compound. In general terms, function can be related to the
chemical activity In thermodynamics, activity (symbol ) is a measure of the "effective concentration" of a species in a mixture, in the sense that the species' chemical potential depends on the activity of a real solution in the same way that it would depend on conc ...
of compounds (among others). The notion of ''chemical similarity'' (or ''molecular similarity'') is one of the most important concepts in
cheminformatics Cheminformatics (also known as chemoinformatics) refers to the use of physical chemistry theory with computer and information science techniques—so called "'' in silico''" techniques—in application to a range of descriptive and prescriptive ...
. It plays an important role in modern approaches to predicting the properties of chemical compounds, designing chemicals with a predefined set of properties and, especially, in conducting drug design studies by screening large databases containing structures of available (or potentially available) chemicals. These studies are based on the similar property principle of Johnson and Maggiora, which states: ''similar compounds have similar properties''.


Similarity measures

Chemical similarity is often described as an inverse of a measure of distance in descriptor space. Examples for inverse distance measures are molecule kernels, that measure the structural similarity of chemical compounds.


Similarity search and virtual screening

The similarity-based virtual screening (a kind of ligand-based virtual screening) assumes that all compounds in a database that are similar to a query compound have similar biological activity. Although this hypothesis is not always valid, quite often the set of retrieved compounds is considerably enriched with actives. To achieve high efficacy of similarity-based screening of databases containing millions of compounds, molecular structures are usually represented by ''molecular screens'' (structural keys) or by fixed-size or variable-size ''molecular fingerprints''. Molecular screens and fingerprints can contain both 2D- and 3D-information. However, the 2D-fingerprints, which are a kind of binary fragment descriptors, dominate in this area. Fragment-based structural keys, like MDL keys, are sufficiently good for handling small and medium-sized chemical databases, whereas processing of large databases is performed with fingerprints having much higher information density. Fragment-based Daylight, BCI, and UNITY 2D (Tripos) fingerprints are the best known examples. The most popular
similarity measure In statistics and related fields, a similarity measure or similarity function or similarity metric is a real-valued function that quantifies the similarity between two objects. Although no single definition of a similarity exists, usually such mea ...
for comparing chemical structures represented by means of fingerprints is the Tanimoto (or Jaccard) coefficient ''T''. Two structures are usually considered similar if ''T'' > 0.85 (for Daylight fingerprints). However, it is a common misunderstanding that a similarity of ''T'' > 0.85 reflects similar bioactivities in general ("the 0.85 myth").


Chemical similarity network

The concept of chemical similarity can be expanded to consider chemical similarity
network theory In mathematics, computer science, and network science, network theory is a part of graph theory. It defines networks as Graph (discrete mathematics), graphs where the vertices or edges possess attributes. Network theory analyses these networks ...
, where descriptive network properties and
graph theory In mathematics and computer science, graph theory is the study of ''graph (discrete mathematics), graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of ''Vertex (graph ...
can be applied to analyze large chemical space, estimate chemical diversity and predict
drug target A biological target is anything within a living organism to which some other entity (like an endogenous ligand or a drug) is directed and/or binds, resulting in a change in its behavior or function. Examples of common classes of biological targets ...
. Recently, 3D chemical similarity networks based on 3D ligand conformation have also been developed, which can be used to identify scaffold hopping ligands.


See also

* Me-too compound *
Drug design Drug design, often referred to as rational drug design or simply rational design, is the invention, inventive process of finding new medications based on the knowledge of a biological target. The drug is most commonly an organic compound, organi ...
* Substructure search *
Ternary compound In inorganic chemistry and materials chemistry, a ternary compound or ternary phase is a chemical compound containing three different elements. While some ternary compounds are molecular, ''e.g.'' chloroform (), more typically ternary phases r ...


References


External links

*
Small Molecule Subgraph Detector (SMSD)
mdash; a Java-based software library for calculating Maximum Common Subgraph (MCS) between small molecules. This enables us to find similarity/distance between molecules. MCS is also used for screening drug like compounds by hitting molecules, which share common subgraph (substructure).
Kernel-based Similarity for Clustering, regression and QSAR Modeling

Brutus
mdash; a similarity analysis tool based on molecular interaction fields. {{DEFAULTSORT:Chemical Similarity Cheminformatics Drug discovery Chemistry