HOME

TheInfoList



OR:

In statistical mechanics, the Fokker–Planck equation is a partial differential equation that describes the
time evolution Time evolution is the change of state brought about by the passage of time, applicable to systems with internal state (also called ''stateful systems''). In this formulation, ''time'' is not required to be a continuous parameter, but may be disc ...
of the
probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) ca ...
of the velocity of a particle under the influence of drag forces and random forces, as in Brownian motion. The equation can be generalized to other observables as well. It is named after Adriaan Fokker and Max Planck, who described it in 1914 and 1917. It is also known as the Kolmogorov forward equation, after
Andrey Kolmogorov Andrey Nikolaevich Kolmogorov ( rus, Андре́й Никола́евич Колмого́ров, p=ɐnˈdrʲej nʲɪkɐˈlajɪvʲɪtɕ kəlmɐˈɡorəf, a=Ru-Andrey Nikolaevich Kolmogorov.ogg, 25 April 1903 – 20 October 1987) was a Sovi ...
, who independently discovered it in 1931. When applied to particle position distributions, it is better known as the Smoluchowski equation (after Marian Smoluchowski), and in this context it is equivalent to the
convection–diffusion equation The convection–diffusion equation is a combination of the diffusion and convection (advection) equations, and describes physical phenomena where particles, energy, or other physical quantities are transferred inside a physical system due to two ...
. The case with zero
diffusion Diffusion is the net movement of anything (for example, atoms, ions, molecules, energy) generally from a region of higher concentration to a region of lower concentration. Diffusion is driven by a gradient in Gibbs free energy or chemical ...
is the continuity equation. The Fokker–Planck equation is obtained from the master equation through Kramers–Moyal expansion. The first consistent microscopic derivation of the Fokker–Planck equation in the single scheme of classical and
quantum mechanics Quantum mechanics is a fundamental theory in physics that provides a description of the physical properties of nature at the scale of atoms and subatomic particles. It is the foundation of all quantum physics including quantum chemistry, ...
was performed by Nikolay Bogoliubov and Nikolay Krylov.


One dimension

In one spatial dimension ''x'', for an Itô process driven by the standard
Wiener process In mathematics, the Wiener process is a real-valued continuous-time stochastic process named in honor of American mathematician Norbert Wiener for his investigations on the mathematical properties of the one-dimensional Brownian motion. It is ...
W_t and described by the stochastic differential equation (SDE) dX_t = \mu(X_t, t) \,dt + \sigma(X_t, t) \,dW_t with
drift Drift or Drifts may refer to: Geography * Drift or ford (crossing) of a river * Drift, Kentucky, unincorporated community in the United States * In Cornwall, England: ** Drift, Cornwall, village ** Drift Reservoir, associated with the village ...
\mu(X_t, t) and
diffusion Diffusion is the net movement of anything (for example, atoms, ions, molecules, energy) generally from a region of higher concentration to a region of lower concentration. Diffusion is driven by a gradient in Gibbs free energy or chemical ...
coefficient D(X_t, t) = \sigma^2(X_t, t)/2, the Fokker–Planck equation for the probability density p(x, t) of the random variable X_t is \frac p(x, t) = -\frac\left mu(x, t) p(x, t)\right+ \frac\left (x, t) p(x, t)\right In the following, use \sigma = \sqrt. Define the infinitesimal generator \mathcal (the following can be found in Ref.): \mathcalp(X_t) = \lim_ \frac1\left(\mathbb\big (X_) \mid X_t = x \big- p(x)\right). The ''transition probability'' \mathbb_(x \mid x'), the probability of going from (t', x') to (t, x), is introduced here; the expectation can be written as \mathbb(p(X_) \mid X_t = x) = \int p(y) \, \mathbb_(y \mid x) \,dy. Now we replace in the definition of \mathcal, multiply by \mathbb_(x \mid x') and integrate over dx. The limit is taken on \int p(y) \int \mathbb_(y \mid x)\,\mathbb_(x \mid x') \,dx \,dy - \int p(x) \, \mathbb_(x \mid x') \,dx. Note now that \int \mathbb_(y \mid x) \, \mathbb_(x \mid x') \,dx = \mathbb_(y \mid x'), which is the Chapman–Kolmogorov theorem. Changing the dummy variable y to x, one gets \begin \int p(x) \lim_ \frac1 \left( \mathbb_(x \mid x') - \mathbb_(x \mid x') \right) \,dx, \end which is a time derivative. Finally we arrive to \int mathcalp(x)\mathbb_(x \mid x') \,dx = \int p(x) \, \partial_t \mathbb_(x \mid x') \,dx. From here, the Kolmogorov backward equation can be deduced. If we instead use the adjoint operator of \mathcal, \mathcal^\dagger, defined such that \int mathcalp(x)\mathbb_(x \mid x') \,dx = \int p(x) mathcal^\dagger \mathbb_(x \mid x')\,dx, then we arrive to the Kolmogorov forward equation, or Fokker–Planck equation, which, simplifying the notation p(x, t) = \mathbb_(x \mid x'), in its differential form reads \mathcal^\dagger p(x, t) = \partial_t p(x, t). Remains the issue of defining explicitly \mathcal. This can be done taking the expectation from the integral form of the Itô's lemma: \mathbb\big(p(X_t)\big) = p(X_0) + \mathbb\left(\int_0^t \left(\partial_t + \mu\partial_x + \frac\partial_x^2 \right) p(X_) \,dt'\right). The part that depends on dW_t vanished because of the martingale property. Then, for a particle subject to an Itô equation, using \mathcal = \mu\partial_x + \frac\partial_x^2, it can be easily calculated, using integration by parts, that \mathcal^\dagger = -\partial_x(\mu \cdot) + \frac12 \partial_x^2(\sigma^2 \cdot), which bring us to the Fokker–Planck equation: \partial_t p(x, t) = -\partial_x \big(\mu(x, t) \cdot p(x, t)\big) + \partial_x^2\left(\frac \, p(x,t)\right). While the Fokker–Planck equation is used with problems where the initial distribution is known, if the problem is to know the distribution at previous times, the Feynman–Kac formula can be used, which is a consequence of the Kolmogorov backward equation. The stochastic process defined above in the Itô sense can be rewritten within the Stratonovich convention as a Stratonovich SDE: dX_t = \left mu(X_t, t) - \frac \fracD(X_t, t)\right\,dt + \sqrt \circ dW_t. It includes an added noise-induced drift term due to diffusion gradient effects if the noise is state-dependent. This convention is more often used in physical applications. Indeed, it is well known that any solution to the Stratonovich SDE is a solution to the Itô SDE. The zero-drift equation with constant diffusion can be considered as a model of classical Brownian motion: \frac p(x, t) = D_0\frac\left (x, t)\right This model has discrete spectrum of solutions if the condition of fixed boundaries is added for \: p(0, t) = p(L, t) = 0, p(x, 0) = p_0(x). It has been shown that in this case an analytical spectrum of solutions allows deriving a local uncertainty relation for the coordinate-velocity phase volume: \Delta x \, \Delta v \geq D_0. Here D_0 is a minimal value of a corresponding diffusion spectrum D_j, while \Delta x and \Delta v represent the uncertainty of coordinate–velocity definition.


Higher dimensions

More generally, if d\mathbf_t = \boldsymbol(\mathbf_t,t)\,dt + \boldsymbol(\mathbf_t,t)\,d\mathbf_t, where \mathbf_t and \boldsymbol(\mathbf_t,t) are -dimensional random vectors, \boldsymbol(\mathbf_t,t) is an N \times M matrix and \mathbf_t is an ''M''-dimensional standard
Wiener process In mathematics, the Wiener process is a real-valued continuous-time stochastic process named in honor of American mathematician Norbert Wiener for his investigations on the mathematical properties of the one-dimensional Brownian motion. It is ...
, the probability density p(\mathbf,t) for \mathbf_t satisfies the Fokker–Planck equation \frac = -\sum_^N \frac \left \mu_i(\mathbf,t) p(\mathbf,t) \right+ \sum_^ \sum_^ \frac \left D_(\mathbf,t) p(\mathbf,t) \right with drift vector \boldsymbol = (\mu_1,\ldots,\mu_N) and diffusion
tensor In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects related to a vector space. Tensors may map between different objects such as vectors, scalars, and even other tensor ...
\mathbf = \frac \boldsymbol^\mathsf, i.e. D_(\mathbf,t) = \frac\sum_^M \sigma_(\mathbf,t) \sigma_(\mathbf,t). If instead of an Itô SDE, a Stratonovich SDE is considered, d\mathbf_t = \boldsymbol(\mathbf_t,t)\,dt + \boldsymbol(\mathbf_t,t)\circ d\mathbf_t, the Fokker–Planck equation will read: \frac = -\sum_^N \frac \left \mu_i(\mathbf,t) \, p(\mathbf,t) \right+ \frac \sum_^M \sum_^ \frac \left\


Examples


Wiener process

A standard scalar
Wiener process In mathematics, the Wiener process is a real-valued continuous-time stochastic process named in honor of American mathematician Norbert Wiener for his investigations on the mathematical properties of the one-dimensional Brownian motion. It is ...
is generated by the stochastic differential equation dX_t = dW_t. Here the drift term is zero and the diffusion coefficient is 1/2. Thus the corresponding Fokker–Planck equation is \frac = \frac \frac, which is the simplest form of a diffusion equation. If the initial condition is p(x,0) = \delta(x), the solution is p(x,t) = \frace^.


Ornstein–Uhlenbeck process

The Ornstein–Uhlenbeck process is a process defined as dX_t = -a X_t dt + \sigma dW_t. with a>0. Physically, this equation can be motivated as follows: a particle of mass m with velocity V_t moving in a medium, e.g., a fluid, will experience a friction force which resists motion whose magnitude can be approximated as being proportional to particle's velocity -a V_t with a = \mathrm . Other particles in the medium will randomly kick the particle as they collide with it and this effect can be approximated by a white noise term; \sigma (d W_t/dt) . Newton's second law is written as m \frac=-a V_t +\sigma \frac. Taking m = 1 for simplicity and changing the notation as V_t\rightarrow X_t leads to the familiar form dX_t = -a X_t dt + \sigma dW_t. The corresponding Fokker–Planck equation is \frac = a \frac\left(x \,p(x,t)\right) + \frac \frac, The stationary solution (\partial_t p = 0) is p_(x) = \sqrt e^.


Plasma physics

In plasma physics, the distribution function for a particle species s, p_s (\mathbf,\mathbf,t), takes the place of the
probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) ca ...
. The corresponding Boltzmann equation is given by \frac + \mathbf \cdot \boldsymbol p_s + \frac \left( \mathbf + \mathbf \times \mathbf \right) \cdot \boldsymbol_v p_s = -\frac \left(p_s \langle\Delta v_i\rangle\right) + \frac \frac \left(p_s \langle\Delta v_i \, \Delta v_j\rangle\right), where the third term includes the particle acceleration due to the Lorentz force and the Fokker–Planck term at the right-hand side represents the effects of particle collisions. The quantities \langle\Delta v_i\rangle and \langle\Delta v_i \, \Delta v_j\rangle are the average change in velocity a particle of type s experiences due to collisions with all other particle species in unit time. Expressions for these quantities are given elsewhere. If collisions are ignored, the Boltzmann equation reduces to the
Vlasov equation The Vlasov equation is a differential equation describing time evolution of the distribution function of plasma consisting of charged particles with long-range interaction, e.g. Coulomb. The equation was first suggested for description of plasma ...
.


Smoluchowski Diffusion Equation

The Smoluchowski Diffusion equation is the Fokker–Planck equation restricted to Brownian particles affected by an external force F(r). \partial_t P(r,t, r_0, t_0) = \nabla \cdot r_0, t_0) Where D is the diffusion constant and \beta = \frac. The importance of this equation is it allows for both the inclusion of the effect of temperature on the system of particles and a spatially dependent diffusion constant. Starting with the
Langevin Equation In physics, a Langevin equation (named after Paul Langevin) is a stochastic differential equation describing how a system evolves when subjected to a combination of deterministic and fluctuating ("random") forces. The dependent variables in a Lang ...
of a Brownian particle in external field F(r), where \gamma is the friction term, \xi is a fluctuating force on the particle, and \sigma is the amplitude of the fluctuation. m\ddot = - \gamma \dot + F(r) + \sigma \xi(t) At equilibrium the frictional force is much greater than the inertial force, \left\vert \gamma \dot \right\vert >> \left\vert m \ddot \right\vert. Therefore, the Langevin equation becomes, \gamma \dot = F(r) + \sigma \xi(t) Which generates the following Fokker–Planck equation, \partial_t P(r,t, r_0,t_0) = \left(\nabla^2\frac - \nabla \cdot \frac\right) P(r,t, r_0,t_0) Rearranging the Fokker–Planck equation, \partial_t P(r,t, r_0,t_0)= \nabla \cdot \left( \nabla D- \frac\right) P(r,t, r_0,t_0) Where D = \frac. Note, the diffusion coefficient may not necessarily be spatially independent if \sigma or \gamma are spatially dependent. Next, the total number of particles in any particular volume is given by, N_V (t, r_0, t_0) = \int\limits_V dr P(r,t, r_0,t_0) Therefore, the flux of particles can be determined by taking the time derivative of the number of particles in a given volume, plugging in the Fokker–Planck equation, and then applying
Gauss's Theorem In vector calculus, the divergence theorem, also known as Gauss's theorem or Ostrogradsky's theorem, reprinted in is a theorem which relates the ''flux'' of a vector field through a closed surface to the ''divergence'' of the field in the vol ...
. \partial_t N_V (t, r_0, t_0) = \int_V dV \nabla \cdot\left( \nabla D- \frac\right) P(r,t, r_0, t_0) = \int_ d\mathbf \cdot j(r,t, r_0, t_0) j(r,t, r_0, t_0) = \left( \nabla D- \frac\right)P(r,t, r_0, t_0) In equilibrium, it is assumed that the flux goes to zero. Therefore, Boltzmann statistics can be applied for the probability of a particles location at equilibrium, where F(r) = -\nabla U(r) is a conservative force and the probability of a particle being in a state r is given as P(r,t, r_0, t_0) = \frac. j(r,t, r_0, t_0) = \left( \nabla D- \frac\right)\frac = 0 \Rightarrow \nabla D = F(r)(\frac - D \beta) This relation is a realization of the fluctuation–dissipation theorem. Now applying \nabla \cdot \nabla to D P(r,t, r_0, t_0) and using the Fluctuation-dissipation theorem, \begin \nabla \cdot \nabla D P(r,t, r_0,t_0) &= \nabla \cdot D \nabla P(r,t, r_0,t_0)+ \nabla \cdot P(r,t, r_0,t_0) \nabla D \\ &=\nabla \cdot D \nabla P(r,t, r_0,t_0)+\nabla \cdot P(r,t, r_0,t_0) \frac - \nabla \cdot P(r,t, r_0,t_0) D \beta F(r) \end Rearranging, \Rightarrow \nabla \cdot \left( \nabla D- \frac\right)P(r,t, r_0,t_0)= \nabla \cdot D(\nabla-\beta F(r)) P(r,t, r_0,t_0) Therefore, the Fokker–Planck equation becomes the Smoluchowski equation, \partial_t P(r,t, r_0, t_0) = \nabla \cdot D (\nabla - \beta F(r)) P(r,t, r_0, t_0) for an arbitrary force F(r).


Computational considerations

Brownian motion follows the
Langevin equation In physics, a Langevin equation (named after Paul Langevin) is a stochastic differential equation describing how a system evolves when subjected to a combination of deterministic and fluctuating ("random") forces. The dependent variables in a Lang ...
, which can be solved for many different stochastic forcings with results being averaged (canonical ensemble in
molecular dynamics Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of th ...
). However, instead of this computationally intensive approach, one can use the Fokker–Planck equation and consider the probability p(\mathbf, t)\,d\mathbf of the particle having a velocity in the interval (\mathbf, \mathbf + d\mathbf) when it starts its motion with \mathbf_0 at time 0.


1-D Linear Potential Example


Theory

Starting with a linear potential of the form U(x) = cx the corresponding Smoluchowski equation becomes, \partial_t P(x,t, x_0, t_0) = \partial_x D (\partial_x + \beta c) P(x,t, x_0, t_0) Where the diffusion constant, D, is constant over space and time. The boundary conditions are such that the probability vanishes at x \rightarrow \pm \infin with an initial condition of the ensemble of particles starting in the same place, P(x,t, x_0,t_0)= \delta (x-x_0) . Defining \tau = D t and b = \beta c and applying the coordinate transformation, y = x +\tau b ,\ \ \ y_0= x_0 + \tau_0 b With P(x, t, , x_0, t_0) = q(y, \tau, y_0, \tau_0) the Smoluchowki equation becomes, \partial_\tau q(y, \tau, y_0, \tau_0) =\partial_y^2 q(y, \tau, y_0, \tau_0) Which is the free diffusion equation with solution, q(y, \tau, y_0, \tau_0)= \frac e^ And after transforming back to the original coordinates, P(x, t , x_0, t_0)= \frac \exp


Simulation

The simulation on the right was completed using a
Brownian dynamics Brownian dynamics (BD) can be used to describe the motion of molecules for example in molecular simulations or in reality. It is a simplified version of Langevin dynamics and corresponds to the limit where no average acceleration takes place. Thi ...
simulation. Starting with a Langevin equation for the system, m\ddot = - \gamma \dot -c + \sigma \xi(t) where \gamma is the friction term, \xi is a fluctuating force on the particle, and \sigma is the amplitude of the fluctuation. At equilibrium the frictional force is much greater than the inertial force, \left, \gamma \dot \ \gg \left, m \ddot \. Therefore, the Langevin equation becomes, \gamma \dot = -c + \sigma \xi(t) For the Brownian dynamic simulation the fluctuation force \xi(t) is assumed to be Gaussian with the amplitude being dependent of the temperature of the system \sigma = \sqrt. Rewriting the Langevin equation, \frac=-D \beta c + \sqrt\xi(t) where D = \frac is the Einstein relation. The integration of this equation was done using the Euler–Maruyama method to numerically approximate the path of this Brownian particle.


Solution

Being a partial differential equation, the Fokker–Planck equation can be solved analytically only in special cases. A formal analogy of the Fokker–Planck equation with the
Schrödinger equation The Schrödinger equation is a linear partial differential equation that governs the wave function of a quantum-mechanical system. It is a key result in quantum mechanics, and its discovery was a significant landmark in the development of th ...
allows the use of advanced operator techniques known from quantum mechanics for its solution in a number of cases. Furthermore, in the case of overdamped dynamics when the Fokker–Planck equation contains second partial derivatives with respect to all spatial variables, the equation can be written in the form of a master equation that can easily be solved numerically. In many applications, one is only interested in the steady-state probability distribution p_0(x), which can be found from \frac = 0. The computation of mean first passage times and splitting probabilities can be reduced to the solution of an ordinary differential equation which is intimately related to the Fokker–Planck equation.


Particular cases with known solution and inversion

In mathematical finance for volatility smile modeling of options via local volatility, one has the problem of deriving a diffusion coefficient (\mathbf_t,t) consistent with a probability density obtained from market option quotes. The problem is therefore an inversion of the Fokker–Planck equation: Given the density f(x,t) of the option underlying ''X'' deduced from the option market, one aims at finding the local volatility (\mathbf_t,t) consistent with ''f''. This is an inverse problem that has been solved in general by Dupire (1994, 1997) with a non-parametric solution. Brigo and Mercurio (2002, 2003) propose a solution in parametric form via a particular local volatility (\mathbf_t,t) consistent with a solution of the Fokker–Planck equation given by a
mixture model In statistics, a mixture model is a probabilistic model for representing the presence of subpopulations within an overall population, without requiring that an observed data set should identify the sub-population to which an individual observati ...
. More information is available also in Fengler (2008), Gatheral (2008), and Musiela and Rutkowski (2008).


Fokker–Planck equation and path integral

Every Fokker–Planck equation is equivalent to a path integral. The path integral formulation is an excellent starting point for the application of field theory methods. This is used, for instance, in critical dynamics. A derivation of the path integral is possible in a similar way as in quantum mechanics. The derivation for a Fokker–Planck equation with one variable x is as follows. Start by inserting a delta function and then integrating by parts: \begin \fracp & = - \frac \left D_1(x',t) p(x',t) \right+ \frac \left D_2(x',t) p(x',t) \right\\ pt& = \int_^\infty dx\left( \left D_\left( x,t\right) \frac+D_2 \left( x,t\right) \frac\right\delta\left( x' -x\right) \right) p\!\left( x,t\right). \end The x-derivatives here only act on the \delta-function, not on p(x,t). Integrate over a time interval \varepsilon, p(x', t + \varepsilon) =\int_^\infty \, dx\left(\left( 1+\varepsilon \left D_1(x,t) \frac \partial + D_2(x,t) \frac\rightright) \delta(x' - x) \right) p(x,t)+O(\varepsilon^2). Insert the Fourier integral \delta = \int_^ \frac e^ for the \delta-function, \begin p(x', t+\varepsilon) & = \int_^\infty dx \int_^ \frac \left(1+\varepsilon \left \tilde D_1(x,t) +\tilde^2 D_2(x,t) \right\right) e^p(x,t) +O(\varepsilon^2) \\ pt& =\int_^\infty dx \int_^ \frac\exp \left( \varepsilon \left -\tilde\frac\varepsilon + \tilde D_1(x,t) +\tilde^2 D_2(x,t) \right\right) p(x,t) +O(\varepsilon^2). \end This equation expresses p(x', t+\varepsilon) as functional of p(x,t). Iterating (t'-t)/\varepsilon times and performing the limit \varepsilon \rightarrow 0 gives a path integral with action S=\int dt\left \tilde D_1 (x,t) +\tilde^2 D_2 (x,t) -\tilde\frac \right The variables \tilde conjugate to x are called "response variables". Although formally equivalent, different problems may be solved more easily in the Fokker–Planck equation or the path integral formulation. The equilibrium distribution for instance may be obtained more directly from the Fokker–Planck equation.


See also

*
Kolmogorov backward equation In probability theory, Kolmogorov equations, including Kolmogorov forward equations and Kolmogorov backward equations, characterize continuous-time Markov processes. In particular, they describe how the probability that a continuous-time Markov pr ...
* Boltzmann equation *
Vlasov equation The Vlasov equation is a differential equation describing time evolution of the distribution function of plasma consisting of charged particles with long-range interaction, e.g. Coulomb. The equation was first suggested for description of plasma ...
* Master equation *
Mean-field game theory Mean-field game theory is the study of strategic decision making by small interacting agents in very large populations. It lies at the intersection of game theory with stochastic analysis and control theory. The use of the term "mean field" is insp ...
* Bogoliubov–Born–Green–Kirkwood–Yvon hierarchy of equations * Ornstein–Uhlenbeck process *
Convection–diffusion equation The convection–diffusion equation is a combination of the diffusion and convection (advection) equations, and describes physical phenomena where particles, energy, or other physical quantities are transferred inside a physical system due to two ...
* Klein–Kramers equation


Notes and references


Further reading

* * * * {{DEFAULTSORT:Fokker-Planck Equation Stochastic processes Equations Parabolic partial differential equations Max Planck Stochastic calculus Mathematical finance Transport phenomena