HOME

TheInfoList



OR:

Metadynamics (MTD; also abbreviated as METAD or MetaD) is a
computer simulation Computer simulation is the process of mathematical modelling, performed on a computer, which is designed to predict the behaviour of, or the outcome of, a real-world or physical system. The reliability of some mathematical models can be dete ...
method in
computational physics Computational physics is the study and implementation of numerical analysis to solve problems in physics for which a quantitative theory already exists. Historically, computational physics was the first application of modern computers in scienc ...
, chemistry and
biology Biology is the scientific study of life. It is a natural science with a broad scope but has several unifying themes that tie it together as a single, coherent field. For instance, all organisms are made up of cells that process hereditar ...
. It is used to
estimate Estimation (or estimating) is the process of finding an estimate or approximation, which is a value that is usable for some purpose even if input data may be incomplete, uncertain, or unstable. The value is nonetheless usable because it is de ...
the free energy and other
state function In the thermodynamics of equilibrium, a state function, function of state, or point function for a thermodynamic system is a mathematical function relating several state variables or state quantities (that describe equilibrium states of a syst ...
s of a system, where
ergodicity In mathematics, ergodicity expresses the idea that a point of a moving system, either a dynamical system or a stochastic process, will eventually visit all parts of the space that the system moves in, in a uniform and random sense. This implies t ...
is hindered by the form of the system's energy landscape. It was first suggested by
Alessandro Laio Alessandro is both a given name and a surname, the Italian form of the name Alexander. Notable people with the name include: People with the given name Alessandro * Alessandro Allori (1535–1607), Italian portrait painter * Alessandro Baricco ...
and Michele Parrinello in 2002 and is usually applied within
molecular dynamics Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of th ...
simulations. MTD closely resembles a number of recent methods such as adaptively biased molecular dynamics, adaptive reaction coordinate forces and local elevation umbrella sampling. More recently, both the original and well-tempered metadynamics were derived in the context of importance sampling and shown to be a special case of the adaptive biasing potential setting. MTD is related to the Wang–Landau sampling.


Introduction

The technique builds on a large number of related methods including (in a chronological order) the deflation, tunneling,
tabu search Tabu search is a metaheuristic search method employing local search methods used for mathematical optimization. It was created by Fred W. Glover in 1986 and formalized in 1989. Local (neighborhood) searches take a potential solution to a prob ...
, local elevation, conformational flooding, Engkvist-Karlström and Adaptive Biasing Force methods. Metadynamics has been informally described as "filling the free energy wells with computational sand".http://www.grs-sim.de/cms/upload/Carloni/Presentations/Marinelli.ppt The algorithm assumes that the system can be described by a few
collective variable In chemistry, a reaction coordinate is an abstract one-dimensional coordinate which represents progress along a reaction pathway. It is usually a geometric parameter that changes during the conversion of one or more molecular entities. In molec ...
s (CV). During the simulation, the location of the system in the space determined by the collective variables is calculated and a positive Gaussian potential is added to the real energy landscape of the system. In this way the system is discouraged to come back to the previous point. During the evolution of the simulation, more and more Gaussians sum up, thus discouraging more and more the system to go back to its previous steps, until the system explores the full energy landscape—at this point the modified free energy becomes a constant as a function of the collective variables which is the reason for the collective variables to start fluctuating heavily. At this point the energy landscape can be recovered as the opposite of the sum of all Gaussians. The time interval between the addition of two Gaussian functions, as well as the Gaussian height and Gaussian width, are tuned to optimize the ratio between accuracy and computational cost. By simply changing the size of the Gaussian, metadynamics can be fitted to yield very quickly a rough map of the energy landscape by using large Gaussians, or can be used for a finer grained description by using smaller Gaussians. Usually, the well-tempered metadynamics is used to change the Gaussian size adaptively. Also, the Gaussian width can be adapted with the adaptive Gaussian metadynamics. Metadynamics has the advantage, upon methods like adaptive umbrella sampling, of not requiring an initial estimate of the energy landscape to explore. However, it is not trivial to choose proper collective variables for a complex simulation. Typically, it requires several trials to find a good set of collective variables, but there are several automatic procedure proposed:
essential coordinates Essential or essentials may refer to: Biology * Essential amino acid, one that the organism cannot produce by itself Groups and organizations * EQ Media Group, formerly Essential Media Group, a global television production company * Essential ...
, Sketch-Map, and non-linear data-driven collective variables.


Multi-replica approach

Independent metadynamics simulations (replicas) can be coupled together to improve usability and parallel performance. There are several such methods proposed: the multiple walker MTD, the parallel tempering MTD, the bias-exchange MTD, and the collective-variable tempering MTD. The last three are similar to the parallel tempering method and use replica exchanges to improve sampling. Typically, the Metropolis–Hastings algorithm is used for replica exchanges, but the infinite swapping and Suwa-Todo algorithms give better replica exchange rates.


High-dimensional approach

Typical (single-replica) MTD simulations can include up to 3 CVs, even using the multi-replica approach, it is hard to exceed 8 CVs, in practice. This limitation comes from the bias potential, constructed by adding Gaussian functions (kernels). It is a special case of the kernel density estimator (KDE). The number of required kernels, for a constant KDE accuracy, increases exponentially with the number of dimensions. So MTD simulation length has to increase exponentially with the number of CVs to maintain the same accuracy of the bias potential. Also, the bias potential, for fast evaluation, is typically approximated with a regular grid. The required
memory Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remembered ...
to store the grid increases exponentially with the number of dimensions (CVs) too. A high-dimensional generalization of metadynamics is NN2B. It is based on two
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
algorithms: the
nearest-neighbor density estimator Nearest neighbor may refer to: * Nearest neighbor search in pattern recognition and in computational geometry * Nearest-neighbor interpolation for interpolating data * Nearest neighbor graph in geometry * Nearest neighbor function in probability ...
(NNDE) and the
artificial neural network Artificial neural networks (ANNs), usually simply called neural networks (NNs) or neural nets, are computing systems inspired by the biological neural networks that constitute animal brains. An ANN is based on a collection of connected units ...
(ANN). NNDE replaces KDE to estimate the updates of bias potential from short biased simulations, while ANN is used to approximate the resulting bias potential. ANN is a memory-efficient representation of high-dimensional functions, where derivatives (biasing forces) are effectively computed with the
backpropagation In machine learning, backpropagation (backprop, BP) is a widely used algorithm for training feedforward artificial neural networks. Generalizations of backpropagation exist for other artificial neural networks (ANNs), and for functions gener ...
algorithm. An alternative method, exploiting ANN for the adaptive bias potential, uses
mean potential force There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value ( magnitude and sign) of a given data set. For a data set, the ''ari ...
s for the estimation. This method is also a high-dimensional generalization of the Adaptive Biasing Force (ABF) method. Additionally, the training of ANN is improved using the Bayesian regularization, and the error of approximation can be inferred by training an ensemble of ANNs.


Recent developments

In 2020, an evolution of metadynamics was proposed, the ''on-the-fly probability enhanced sampling'' method (OPES), which is now the method of choice of Michele Parrinello's research group. The OPES method has only a few robust parameters, converges faster than metadynamics and has a straightforward reweighting scheme. OPES has been implemented in the PLUMED library since version 2.7.


Algorithm

Assume, we have a classical N-particle system with positions at \ (i \in 1...N) in the
Cartesian coordinates A Cartesian coordinate system (, ) in a plane is a coordinate system that specifies each point uniquely by a pair of numerical coordinates, which are the signed distances to the point from two fixed perpendicular oriented lines, measured i ...
(\vec r_i \in \mathbb^3). The particle interaction are described with a
potential Potential generally refers to a currently unrealized ability. The term is used in a wide variety of fields, from physics to the social sciences to indicate things that are in a state where they are able to change in ways ranging from the simple r ...
function V \equiv V(\). The potential function form (e.g. two local minima separated by a high-energy barrier) prevents an
ergodic In mathematics, ergodicity expresses the idea that a point of a moving system, either a dynamical system or a stochastic process, will eventually visit all parts of the space that the system moves in, in a uniform and random sense. This implies t ...
sampling with
molecular dynamics Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of th ...
or
Monte Carlo Monte Carlo (; ; french: Monte-Carlo , or colloquially ''Monte-Carl'' ; lij, Munte Carlu ; ) is officially an administrative area of the Principality of Monaco, specifically the ward of Monte Carlo/Spélugues, where the Monte Carlo Casino i ...
methods.


Original metadynamics

A general idea of MTD is to enhance the system sampling by discouraging revisiting of sampled states. It is achieved by augmenting the system Hamiltonian H with a bias potential V_\text: :H = T + V + V_\text. The bias potential is a function of
collective variables A collective is a group of entities that share or are motivated by at least one common issue or interest, or work together to achieve a common objective. Collectives can differ from cooperatives in that they are not necessarily focused upon an ...
(V_\text \equiv V_\text(\vec s\,)). A collective variable is a function of the particle positions (\vec s \equiv \vec s(\)). The bias potential is continuously updated by adding bias at rate \omega, where \vec s_t is an instantaneous collective variable value at time t : :\frac = \omega\, \delta(, \vec s - \vec s_t, ) . At infinitely long simulation time t_\text, the accumulated bias potential converges to free energy with opposite sign (and irrelevant constant C): :V_\text(\vec s\,) = \!\!\int_0^ \!\!\!\omega\, \delta(, \vec s - \vec s_t, )\; dt \quad\Rightarrow\quad F(\vec s\,) = -\!\!\!\!\lim_\!\! V_\text(\vec s\,) + C For a computationally efficient implementation, the update process is discretised into \tau time intervals (\lfloor\;\rfloor denotes the
floor function In mathematics and computer science, the floor function is the function that takes as input a real number , and gives as output the greatest integer less than or equal to , denoted or . Similarly, the ceiling function maps to the least ...
) and \delta-function is replaced with a localized positive
kernel function In operator theory, a branch of mathematics, a positive-definite kernel is a generalization of a positive-definite function or a positive-definite matrix. It was first introduced by James Mercer in the early 20th century, in the context of solving ...
K. The bias potential becomes a sum of the kernel functions centred at the instantaneous collective variable values \vec s_j at time \tau j : :V_\text(\vec s\,) \approx \tau \!\!\!\sum_^\!\! \omega\, K(, \vec s - \vec s_j, ) . Typically, the kernel is a multi-dimensional Gaussian function, whose covariance matrix has diagonal non-zero elements only: :V_\text(\vec s\,) \approx \tau \!\!\!\sum_^\!\! \omega \exp\!\! \left(\! -\frac \left, \frac \^2 \right) . The parameter \tau, \omega, and \vec \sigma are determined ''a priori'' and kept constant during the simulation.


Implementation

Below there is a
pseudocode In computer science, pseudocode is a plain language description of the steps in an algorithm or another system. Pseudocode often uses structural conventions of a normal programming language, but is intended for human reading rather than machine re ...
of MTD base on
molecular dynamics Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of th ...
(MD), where \ and \ are the N-particle system positions and velocities, respectively. The bias V_\text is updated every n = \tau/\Delta t MD steps, and its contribution to the system forces \ is \. set initial \ and \ set V_\text(\vec s\,) := 0 every MD step: compute CV values: \vec s_t := \vec s(\) every n MD steps: update bias potential: V_\text(\vec s\,) := V_\text(\vec s\,) + \tau \omega \exp\!\! \left(\! -\frac \left, \frac \^2 \right) compute atomic forces: \vec F_i := -\frac \overbrace^ propagate \ and \ by \Delta t


Free energy estimator

The finite size of the kernel makes the bias potential to fluctuate around a mean value. A converged free energy can be obtained by averaging the bias potential. The averaging is started from t_\text, when the motion along the collective variable becomes diffusive: :\bar F(\vec s) = - \frac \int^_ \!\!\!\!\!V_\text(\vec s, t)\, dt + C


Applications

Metadynamics has been used to study: *
protein folding Protein folding is the physical process by which a protein chain is translated to its native three-dimensional structure, typically a "folded" conformation by which the protein becomes biologically functional. Via an expeditious and reprodu ...
*
chemical reactions A chemical reaction is a process that leads to the IUPAC nomenclature for organic transformations, chemical transformation of one set of chemical substances to another. Classically, chemical reactions encompass changes that only involve the pos ...
*
molecular docking In the field of molecular modeling, docking is a method which predicts the preferred orientation of one molecule to a second when a ligand and a target are bound to each other to form a stable complex. Knowledge of the preferred orientation in tu ...
*
phase transitions In chemistry, thermodynamics, and other related fields, a phase transition (or phase change) is the physical process of transition between one state of a medium and another. Commonly the term is used to refer to changes among the basic states ...
. * encapsulation of DNA onto hydrophobic and hydrophilic single-walled carbon nanotubes.


Implementations


PLUMED

PLUMED is an
open-source Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use the source code, design documents, or content of the product. The open-source model is a decentralized sof ...
library A library is a collection of materials, books or media that are accessible for use and not just for display purposes. A library provides physical (hard copies) or digital access (soft copies) materials, and may be a physical location or a vi ...
implementing many MTD algorithms and
collective variable In chemistry, a reaction coordinate is an abstract one-dimensional coordinate which represents progress along a reaction pathway. It is usually a geometric parameter that changes during the conversion of one or more molecular entities. In molec ...
s. It has a flexible
object-oriented Object-oriented programming (OOP) is a programming paradigm based on the concept of " objects", which can contain data and code. The data is in the form of fields (often known as attributes or ''properties''), and the code is in the form of ...
design and can be interfaced with several MD programs (
AMBER Amber is fossilized tree resin that has been appreciated for its color and natural beauty since Neolithic times. Much valued from antiquity to the present as a gemstone, amber is made into a variety of decorative objects."Amber" (2004). In M ...
, GROMACS,
LAMMPS Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) is a molecular dynamics program from Sandia National Laboratories. LAMMPS makes use of Message Passing Interface (MPI) for parallel communication and is free and open-source softw ...
,
NAMD Nanoscale Molecular Dynamics (NAMD, formerly Not Another Molecular Dynamics Program) is computer software for molecular dynamics simulation, written using the Charm++ parallel programming model. It is noted for its parallel efficiency and is often ...
, Quantum ESPRESSO, DL_POLY_4, CP2K, and OpenMM).


Other

Other MTD implementations exist in th
Collective Variables Module
(for
LAMMPS Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) is a molecular dynamics program from Sandia National Laboratories. LAMMPS makes use of Message Passing Interface (MPI) for parallel communication and is free and open-source softw ...
,
NAMD Nanoscale Molecular Dynamics (NAMD, formerly Not Another Molecular Dynamics Program) is computer software for molecular dynamics simulation, written using the Charm++ parallel programming model. It is noted for its parallel efficiency and is often ...
, and GROMACS), ORAC, CP2K, and
Desmond Desmond or Desmond's may refer to: Arts and entertainment * ''Desmond'' (novel), 1792 novel by Charlotte Turner Smith * '' Desmond's'', 1990s British television sitcom Ireland * Kingdom of Desmond, medieval Irish kingdom * Earl of Desmond, Iris ...
.


External links


Introduction to metadynamics

PLUMED

Colvars module website (NAMD, LAMMPS, Gromacs)

Visual movie of metadynamics



See also

* Local elevation * Parallel tempering * Umbrella sampling


References

{{reflist, 2 Molecular dynamics Computational chemistry Theoretical chemistry