A chemical library or compound library is a collection of stored
chemicals
A chemical substance is a unique form of matter with constant chemical composition and characteristic properties. Chemical substances may take the form of a single element or chemical compounds. If two or more chemical substances can be combin ...
usually used ultimately in
high-throughput screening
High-throughput screening (HTS) is a method for scientific discovery especially used in drug discovery and relevant to the fields of biology, materials science and chemistry. Using robotics, data processing/control software, liquid handling device ...
or
industrial manufacture. The chemical library can consist in simple terms of a series of stored chemicals. Each chemical has associated information stored in some kind of database with information such as the
chemical structure
A chemical structure of a molecule is a spatial arrangement of its atoms and their chemical bonds. Its determination includes a chemist's specifying the molecular geometry and, when feasible and necessary, the electronic structure of the target m ...
, purity, quantity, and physiochemical characteristics of the compound.
Purpose
In
drug discovery
In the fields of medicine, biotechnology, and pharmacology, drug discovery is the process by which new candidate medications are discovered.
Historically, drugs were discovered by identifying the active ingredient from traditional remedies or ...
high-throughput screening
High-throughput screening (HTS) is a method for scientific discovery especially used in drug discovery and relevant to the fields of biology, materials science and chemistry. Using robotics, data processing/control software, liquid handling device ...
, it is desirable to screen a
drug target
A biological target is anything within a living organism to which some other entity (like an endogenous ligand or a drug) is directed and/or binds, resulting in a change in its behavior or function. Examples of common classes of biological targets ...
against a selection of chemicals that try to take advantage of as much of the appropriate
chemical space as possible. The
chemical space of all possible
chemical structure
A chemical structure of a molecule is a spatial arrangement of its atoms and their chemical bonds. Its determination includes a chemist's specifying the molecular geometry and, when feasible and necessary, the electronic structure of the target m ...
s is extraordinarily large. Most stored chemical libraries do not typically have a fully represented or sampled chemical space mostly because of storage and cost concerns. However, since many molecular interactions cannot be predicted, the wider the chemical space that is sampled by the chemical library, the better the chance that
high-throughput screening
High-throughput screening (HTS) is a method for scientific discovery especially used in drug discovery and relevant to the fields of biology, materials science and chemistry. Using robotics, data processing/control software, liquid handling device ...
will find a "hit"—a chemical with an appropriate interaction in a biological model that might be developed into a drug.
An example of a chemical library in drug discovery would be a series of chemicals known to inhibit kinases, or in industrial processes, a series of catalysts known to polymerize resins.
Generation of chemical libraries
Chemical libraries are usually generated for a specific goal and larger chemical libraries could be made of several groups of smaller libraries stored in the same location. In the
drug discovery
In the fields of medicine, biotechnology, and pharmacology, drug discovery is the process by which new candidate medications are discovered.
Historically, drugs were discovered by identifying the active ingredient from traditional remedies or ...
process for instance, a wide range of organic chemicals are needed to test against models of disease in
high-throughput screening
High-throughput screening (HTS) is a method for scientific discovery especially used in drug discovery and relevant to the fields of biology, materials science and chemistry. Using robotics, data processing/control software, liquid handling device ...
. Therefore, most of the chemical synthesis needed to generate chemical libraries in
drug discovery
In the fields of medicine, biotechnology, and pharmacology, drug discovery is the process by which new candidate medications are discovered.
Historically, drugs were discovered by identifying the active ingredient from traditional remedies or ...
is based on
organic chemistry
Organic chemistry is a subdiscipline within chemistry involving the science, scientific study of the structure, properties, and reactions of organic compounds and organic matter, organic materials, i.e., matter in its various forms that contain ...
. A company that is interested in screening for
kinase
In biochemistry, a kinase () is an enzyme that catalyzes the transfer of phosphate groups from high-energy, phosphate-donating molecules to specific substrates. This process is known as phosphorylation, where the high-energy ATP molecule don ...
inhibitors in
cancer
Cancer is a group of diseases involving Cell growth#Disorders, abnormal cell growth with the potential to Invasion (cancer), invade or Metastasis, spread to other parts of the body. These contrast with benign tumors, which do not spread. Po ...
may limit their chemical libraries and synthesis to just those types of chemicals known to have affinity for
ATP binding sites or
allosteric
In the fields of biochemistry and pharmacology an allosteric regulator (or allosteric modulator) is a substance that binds to a site on an enzyme or receptor distinct from the active site, resulting in a conformational change that alters the p ...
sites.
Generally, however, most chemical libraries focus on large groups of varied
organic chemical series where an
organic chemist
Organic chemistry is a subdiscipline within chemistry involving the science, scientific study of the structure, properties, and reactions of organic compounds and organic matter, organic materials, i.e., matter in its various forms that contain ...
can make many variations on the same
molecular scaffold or
molecular backbone. Sometimes chemicals can be purchased from outside vendors as well and included into an internal chemical library.
Depending upon their scope and design, chemical libraries can also be classified as diverse oriented,
Drug-like, Lead-like, peptide-mimetic, Natural Product-like,
Targeted against a specific family of biological targets such Kinases, GPCRs, Proteases, PPI etc. These chemical libraries are often used in
target based drug discovery (reverse pharmacology). Among the compound libraries should be annotated the Fragment Compound Libraries, which are mainly used for
Fragment-based lead discovery
Fragment-based lead discovery (FBLD) also known as fragment-based drug discovery (FBDD) is a method used for finding lead compounds as part of the drug discovery process. Fragments are small organic molecules which are small in size and low in mol ...
.
Design and optimization of chemical libraries
Chemical libraries are usually designed by chemists and
chemoinformatics scientists and synthesized by
organic chemistry
Organic chemistry is a subdiscipline within chemistry involving the science, scientific study of the structure, properties, and reactions of organic compounds and organic matter, organic materials, i.e., matter in its various forms that contain ...
and
medicinal chemistry
Medicinal or pharmaceutical chemistry is a scientific discipline at the intersection of chemistry and pharmacy involved with drug design, designing and developing pharmaceutical medication, drugs. Medicinal chemistry involves the identification, ...
. The method of chemical library generation usually depends on the project and there are many factors to consider when using rational methods to select screening compounds.
Typically, a range of chemicals is screened against a particular drug target or disease model, and the preliminary "hits", or chemicals that show the desired activity, are re-screened to verify their activity. Once they are qualified as a "hit" by their repeatability and activity, these particular chemicals are registered and analysed.
Chemoproteomics is a field of study that incorporates the use of chemical libraries to identify protein targets. Commonalities among the different chemical groups are studied as they are often reflective of a particular chemical subspace. Additional chemistry work may be needed to further optimize the chemical library in the active portion of the subspace. When it is needed, more synthesis is completed to extend out the chemical library in that particular subspace by generating more compounds that are very similar to the original hits. This new selection of compounds within this narrow range are further screened and then taken on to more sophisticated models for further validation in the
Drug Discovery Hit to Lead
Hit to lead (H2L) also known as lead generation is a stage in early drug discovery where small molecule hits from a high-throughput screening, high throughput screen (HTS) are evaluated and undergo limited optimization to identify promising lead co ...
process.
Storage and management
The "chemical space" of all possible organic chemicals is large and increases exponentially with the size of the molecule. Most chemical libraries do not typically have a fully represented
chemical space mostly because of storage and cost concerns.
Because of the expense and effort involved in chemical synthesis, the chemicals must be correctly stored and banked away for later use to prevent early degradation. Each chemical has a particular shelf life and storage requirement and in a good-sized chemical library, there is a timetable by which library chemicals are disposed of and replaced on a regular basis. Some chemicals are fairly unstable, radioactive, volatile or flammable and must be stored under careful conditions in accordance with safety standards such as
OSHA.
Most chemical libraries are managed with information technologies such as
barcoding and
relational database
A relational database (RDB) is a database based on the relational model of data, as proposed by E. F. Codd in 1970.
A Relational Database Management System (RDBMS) is a type of database management system that stores data in a structured for ...
s. Additionally,
robotics
Robotics is the interdisciplinary study and practice of the design, construction, operation, and use of robots.
Within mechanical engineering, robotics is the design and construction of the physical structures of robots, while in computer s ...
are necessary to fetch compounds in larger chemical libraries.
Because a chemical library's individual entries can easily reach up into the millions of compounds, the management of even modest-sized chemical libraries can be a full-time endeavor.
Compound management is one such field that attempts to manage and upkeep these chemical libraries as well as maximizing safety and effectiveness in their management.
See also
*
Compound management
*
Druglikeness
Druglikeness is a qualitative concept used in drug design for how "druglike" a substance is with respect to factors like bioavailability. A druglike molecule has properties such as:
* Solubility in both water and fat, as an orally administered d ...
*
Lead compound
A lead compound (, i.e. a "leading" compound, not to be confused with various compounds of the metallic element lead) in drug discovery is a chemical compound that has pharmacological or biological activity likely to be therapeutically useful, but ...
*
Scientific collection
*
Chemoproteomics
Further reading
Ian Yates. Compound Management comes of age. Drug Discovery World Spring 2003 p35-43Archer JR. History, evolution, and trends in compound management for high-throughput screening. Assay Drug Dev Technol. 2004 Dec;2(6):675-81Casey R. Designing Chemical Compound Libraries for Drug Discovery. Business Intelligence Network December 1, 2005.GLARE - A free open source software for combinatorial library design.
References
{{Reflist
Chemistry
Drug discovery
Bioinformatics
Cheminformatics