HOME

TheInfoList



OR:

In the
calculus of variations The calculus of variations (or variational calculus) is a field of mathematical analysis that uses variations, which are small changes in Function (mathematics), functions and functional (mathematics), functionals, to find maxima and minima of f ...
, a field of
mathematical analysis Analysis is the branch of mathematics dealing with continuous functions, limit (mathematics), limits, and related theories, such as Derivative, differentiation, Integral, integration, measure (mathematics), measure, infinite sequences, series ( ...
, the functional derivative (or variational derivative) relates a change in a functional (a functional in this sense is a function that acts on functions) to a change in a function on which the functional depends. In the calculus of variations, functionals are usually expressed in terms of an
integral In mathematics, an integral is the continuous analog of a Summation, sum, which is used to calculate area, areas, volume, volumes, and their generalizations. Integration, the process of computing an integral, is one of the two fundamental oper ...
of functions, their
arguments An argument is a series of sentences, statements, or propositions some of which are called premises and one is the conclusion. The purpose of an argument is to give reasons for one's conclusion via justification, explanation, and/or persua ...
, and their
derivative In mathematics, the derivative is a fundamental tool that quantifies the sensitivity to change of a function's output with respect to its input. The derivative of a function of a single variable at a chosen input value, when it exists, is t ...
s. In an integrand of a functional, if a function is varied by adding to it another function that is arbitrarily small, and the resulting integrand is expanded in powers of , the coefficient of in the first order term is called the functional derivative. For example, consider the functional J = \int_a^b L( \, x, f(x), f' \, ) \, dx \, , where . If is varied by adding to it a function , and the resulting integrand is expanded in powers of , then the change in the value of to first order in can be expressed as follows:According to , this notation is customary in physical literature. \begin \delta J &= \int_a^b \left( \frac \delta f(x) + \frac \frac \delta f(x) \right) \, dx \, \\ ex&= \int_a^b \left( \frac - \frac \frac \right) \delta f(x) \, dx \, + \, \frac (b) \delta f(b) \, - \, \frac (a) \delta f(a) \end where the variation in the derivative, was rewritten as the derivative of the variation , and
integration by parts In calculus, and more generally in mathematical analysis, integration by parts or partial integration is a process that finds the integral of a product of functions in terms of the integral of the product of their derivative and antiderivati ...
was used in these derivatives.


Definition

In this section, the functional differential (or variation or first variation) Called ''first variation'' in , ''variation'' or ''first variation'' in , ''variation'' or ''differential'' in and ''differential'' in . is defined. Then the functional derivative is defined in terms of the functional differential.


Functional differential

Suppose B is a
Banach space In mathematics, more specifically in functional analysis, a Banach space (, ) is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vectors and ...
and F is a functional defined on B. The differential of F at a point \rho\in B is the
linear functional In mathematics, a linear form (also known as a linear functional, a one-form, or a covector) is a linear mapIn some texts the roles are reversed and vectors are defined as linear maps from covectors to scalars from a vector space to its field of ...
\delta F rho,\cdot/math> on B defined. by the condition that, for all \phi\in B, F rho+\phi- F
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
= \delta F rho; \phi+ \varepsilon \left\, \phi\right\, where \varepsilon is a real number that depends on \, \phi\, in such a way that \varepsilon\to 0 as \, \phi\, \to 0. This means that \delta F rho,\cdot/math> is the
Fréchet derivative In mathematics, the Fréchet derivative is a derivative defined on normed spaces. Named after Maurice Fréchet, it is commonly used to generalize the derivative of a real-valued function of a single real variable to the case of a vector-valued f ...
of F at \rho. However, this notion of functional differential is so strong it may not exist,. and in those cases a weaker notion, like the
Gateaux derivative In mathematics, the Gateaux differential or Gateaux derivative is a generalization of the concept of directional derivative in differential calculus. Named after René Gateaux, it is defined for functions between locally convex topological vect ...
is preferred. In many practical cases, the functional differential is defined. as the directional derivative \begin \delta F rho,\phi&= \lim_\frac \\ ex&= \left \fracF[\rho+\varepsilon \phiright ">rho+\varepsilon_\phi.html" ;"title="\fracF[\rho+\varepsilon \phi">\fracF[\rho+\varepsilon \phiright . \end Note that this notion of the functional differential can even be defined without a norm.


Functional derivative

In many applications, the domain of the functional F is a space of differentiable functions \rho defined on some space \Omega and F is of the form F
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
= \int_\Omega L(x,\rho(x),D\rho(x))\,dx for some function L(x,\rho(x),D\rho(x)) that may depend on x, the value \rho(x) and the derivative D\rho(x). If this is the case and, moreover, \delta F[\rho,\phi] can be written as the integral of \phi times another function (denoted ) \delta F [\rho, \phi] = \int_\Omega \frac (x) \ \phi(x) \ dx then this function is called the functional derivative of at ... If F is restricted to only certain functions \rho (for example, if there are some boundary conditions imposed) then \phi is restricted to functions such that \rho+\varepsilon\phi continues to satisfy these conditions. Heuristically, \phi is the change in \rho, so we 'formally' have \phi = \delta\rho, and then this is similar in form to the
total differential In calculus, the differential represents the principal part of the change in a function y = f(x) with respect to changes in the independent variable. The differential dy is defined by dy = f'(x)\,dx, where f'(x) is the derivative of with resp ...
of a function F(\rho_1,\rho_2,\dots,\rho_n), dF = \sum_ ^n \frac \ d\rho_i , where \rho_1,\rho_2,\dots,\rho_n are independent variables. Comparing the last two equations, the functional derivative \delta F/\delta\rho(x) has a role similar to that of the partial derivative \partial F/\partial\rho_i, where the variable of integration x is like a continuous version of the summation index i.. One thinks of as the gradient of at the point , so the value measures how much the functional will change if the function is changed at the point . Hence the formula \int \frac(x) \phi(x) \; dx is regarded as the directional derivative at point \rho in the direction of \phi. This is analogous to vector calculus, where the inner product of a vector v with the gradient gives the directional derivative in the direction of v.


Properties

Like the derivative of a function, the functional derivative satisfies the following properties, where and are functionals: Here the notation \frac(x) \equiv \frac is introduced. * Linearity:. \frac = \lambda \frac + \mu \frac, where are constants. * Product rule:. \frac = \frac G
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
+ F
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
\frac \, , * Chain rules: **If is a functional and another functional, then \frac = \int dx \frac_\cdot\frac \ . **If is an ordinary differentiable function (local functional) , then this reduces to \frac = \frac \ \frac \ .


Determining functional derivatives

A formula to determine functional derivatives for a common class of functionals can be written as the integral of a function and its derivatives. This is a generalization of the
Euler–Lagrange equation In the calculus of variations and classical mechanics, the Euler–Lagrange equations are a system of second-order ordinary differential equations whose solutions are stationary points of the given action functional. The equations were discovered ...
: indeed, the functional derivative was introduced in
physics Physics is the scientific study of matter, its Elementary particle, fundamental constituents, its motion and behavior through space and time, and the related entities of energy and force. "Physical science is that department of knowledge whi ...
within the derivation of the
Lagrange Joseph-Louis Lagrange (born Giuseppe Luigi Lagrangiaprinciple of least action Action principles lie at the heart of fundamental physics, from classical mechanics through quantum mechanics, particle physics, and general relativity. Action principles start with an energy function called a Lagrangian describing the physical sy ...
in
Lagrangian mechanics In physics, Lagrangian mechanics is a formulation of classical mechanics founded on the d'Alembert principle of virtual work. It was introduced by the Italian-French mathematician and astronomer Joseph-Louis Lagrange in his presentation to the ...
(18th century). The first three examples below are taken from
density functional theory Density functional theory (DFT) is a computational quantum mechanical modelling method used in physics, chemistry and materials science to investigate the electronic structure (or nuclear structure) (principally the ground state) of many-body ...
(20th century), the fourth from
statistical mechanics In physics, statistical mechanics is a mathematical framework that applies statistical methods and probability theory to large assemblies of microscopic entities. Sometimes called statistical physics or statistical thermodynamics, its applicati ...
(19th century).


Formula

Given a functional F
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
= \int f( \boldsymbol, \rho(\boldsymbol), \nabla\rho(\boldsymbol) )\, d\boldsymbol, and a function \phi(\boldsymbol) that vanishes on the boundary of the region of integration, from a previous section
Definition A definition is a statement of the meaning of a term (a word, phrase, or other set of symbols). Definitions can be classified into two large categories: intensional definitions (which try to give the sense of a term), and extensional definitio ...
, \begin \int \frac \, \phi(\boldsymbol) \, d\boldsymbol & = \left \frac \int f( \boldsymbol, \rho + \varepsilon \phi, \nabla\rho+\varepsilon\nabla\phi )\, d\boldsymbol \right \\ & = \int \left( \frac \, \phi + \frac \cdot \nabla\phi \right) d\boldsymbol \\ & = \int \left \frac \, \phi + \nabla \cdot \left( \frac \, \phi \right) - \left( \nabla \cdot \frac \right) \phi \rightd\boldsymbol \\ & = \int \left \frac \, \phi - \left( \nabla \cdot \frac \right) \phi \rightd\boldsymbol \\ & = \int \left( \frac - \nabla \cdot \frac \right) \phi(\boldsymbol) \ d\boldsymbol \, . \end The second line is obtained using the
total derivative In mathematics, the total derivative of a function at a point is the best linear approximation near this point of the function with respect to its arguments. Unlike partial derivatives, the total derivative approximates the function with res ...
, where is a derivative of a scalar with respect to a vector.For a three-dimensional Cartesian coordinate system, \frac = \frac \mathbf + \frac \mathbf + \frac \mathbf\, , where \rho_x = \frac\, , \ \rho_y = \frac\, , \ \rho_z = \frac and \mathbf, \mathbf, \mathbf are unit vectors along the x, y, z axes. The third line was obtained by use of a product rule for divergence. The fourth line was obtained using the
divergence theorem In vector calculus, the divergence theorem, also known as Gauss's theorem or Ostrogradsky's theorem, reprinted in is a theorem relating the '' flux'' of a vector field through a closed surface to the ''divergence'' of the field in the volume ...
and the condition that \phi=0 on the boundary of the region of integration. Since \phi is also an arbitrary function, applying the
fundamental lemma of calculus of variations In mathematics, specifically in the calculus of variations, a variation of a function can be concentrated on an arbitrarily small interval, but not a single point. Accordingly, the necessary condition of extremum (functional derivative equal zero ...
to the last line, the functional derivative is \frac = \frac - \nabla \cdot \frac where and . This formula is for the case of the functional form given by at the beginning of this section. For other functional forms, the definition of the functional derivative can be used as the starting point for its determination. (See the example
Coulomb potential energy functional The coulomb (symbol: C) is the unit of electric charge in the International System of Units (SI). It is defined to be equal to the electric charge delivered by a 1 ampere current in 1 second, with the elementary charge ''e'' as a defining con ...
.) The above equation for the functional derivative can be generalized to the case that includes higher dimensions and higher order derivatives. The functional would be, F rho(\boldsymbol)= \int f( \boldsymbol, \rho(\boldsymbol), \nabla\rho(\boldsymbol), \nabla^\rho(\boldsymbol), \dots, \nabla^\rho(\boldsymbol))\, d\boldsymbol, where the vector , and is a tensor whose components are partial derivative operators of order , \left \nabla^ \right = \frac \qquad \qquad \text \quad \alpha_1, \alpha_2, \dots, \alpha_i = 1, 2, \dots , n \ . For example, for the case of three dimensions () and second order derivatives (), the tensor has components, \left \nabla^ \right = \frac where \alpha and \beta can be 1,2,3. An analogous application of the definition of the functional derivative yields \begin \frac & = \frac - \nabla \cdot \frac + \nabla^ \cdot \frac + \dots + (-1)^N \nabla^ \cdot \frac \\ & = \frac + \sum_^N (-1)^\nabla^ \cdot \frac \ . \end In the last two equations, the components of the tensor \frac are partial derivatives of with respect to partial derivatives of ''ρ'', \left \frac \right = \frac where \rho_ \equiv \frac , and the tensor scalar product is, \nabla^ \cdot \frac = \sum_^n \ \frac \ \frac \ . For example, for the case and , the tensor scalar product is, \nabla^ \cdot \frac = \sum_^3 \ \frac \, \frac , where \rho_ \equiv \frac .


Examples


Thomas–Fermi kinetic energy functional

The
Thomas–Fermi model The Thomas–Fermi (TF) model, named after Llewellyn Thomas and Enrico Fermi, is a quantum mechanical theory for the electronic structure of many-body systems developed semiclassically shortly after the introduction of the Schrödinger equa ...
of 1927 used a kinetic energy functional for a noninteracting uniform electron gas in a first attempt of
density-functional theory Density functional theory (DFT) is a computational quantum mechanics, quantum mechanical modelling method used in physics, chemistry and materials science to investigate the electronic structure (or nuclear structure) (principally the ground sta ...
of electronic structure: T_\mathrm
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
= C_\mathrm \int \rho^(\mathbf) \, d\mathbf \, . Since the integrand of does not involve derivatives of , the functional derivative of is,. \frac = C_\mathrm \frac = \frac C_\mathrm \rho^(\mathbf) \, .


Coulomb potential energy functional

The electron-nucleus potential energy is V
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
= \int \frac \ d\boldsymbol. Applying the definition of functional derivative, \begin \int \frac \ \phi(\boldsymbol) \ d\boldsymbol & = \left \frac \int \frac \ d\boldsymbol \right \\ ex& = \int \frac \ d\boldsymbol \, . \end So, \frac = \frac \ . The functional derivative of the classical part of the electron-electron interaction (often called Hartree energy) is J
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
= \frac\iint \frac\, d\mathbf d\mathbf' \, . From the definition of the functional derivative, \begin \int \frac \phi(\boldsymbol)d\boldsymbol & = \left \frac \, J[\rho + \varepsilon\phi\right ">rho_+_\varepsilon\phi.html" ;"title="\frac \, J[\rho + \varepsilon\phi">\frac \, J[\rho + \varepsilon\phi\right \\ & = \left [ \frac \, \left ( \frac\iint \frac \, d\boldsymbol d\boldsymbol' \right ) \right ]_ \\ & = \frac\iint \frac \, d\boldsymbol d\boldsymbol' + \frac\iint \frac \, d\boldsymbol d\boldsymbol' \\ \end The first and second terms on the right hand side of the last equation are equal, since and in the second term can be interchanged without changing the value of the integral. Therefore, \int \frac \phi(\boldsymbol)d\boldsymbol = \int \left ( \int \frac d\boldsymbol' \right ) \phi(\boldsymbol) d\boldsymbol and the functional derivative of the electron-electron Coulomb potential energy functional 'ρ''is,. \frac = \int \frac d\boldsymbol' \, . The second functional derivative is \frac = \frac \left ( \frac \right ) = \frac.


von Weizsäcker kinetic energy functional

In 1935 von Weizsäcker proposed to add a gradient correction to the Thomas-Fermi kinetic energy functional to make it better suit a molecular electron cloud: T_\mathrm
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
= \frac \int \frac d\mathbf = \int t_\mathrm(\mathbf) \ d\mathbf \, , where t_\mathrm \equiv \frac \frac \qquad \text \ \ \rho = \rho(\boldsymbol) \ . Using a previously derived
formula In science, a formula is a concise way of expressing information symbolically, as in a mathematical formula or a ''chemical formula''. The informal use of the term ''formula'' in science refers to the general construct of a relationship betwe ...
for the functional derivative, \begin \frac & = \frac - \nabla\cdot\frac \\ & = -\frac\frac - \left ( \frac \frac - \frac \frac \right ) \qquad \text \ \ \nabla^2 = \nabla \cdot \nabla \ , \end and the result is,. \frac = \ \ \, \frac\frac - \frac\frac \ .


Entropy

The
entropy Entropy is a scientific concept, most commonly associated with states of disorder, randomness, or uncertainty. The term and the concept are used in diverse fields, from classical thermodynamics, where it was first recognized, to the micros ...
of a discrete
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
is a functional of the
probability mass function In probability and statistics, a probability mass function (sometimes called ''probability function'' or ''frequency function'') is a function that gives the probability that a discrete random variable is exactly equal to some value. Sometimes i ...
. H (x)= -\sum_x p(x) \log p(x) Thus, \begin \sum_x \frac \, \phi(x) & = \left \frac H[p(x) + \varepsilon\phi(x)\right">(x)_+_\varepsilon\phi(x).html" ;"title="\frac H[p(x) + \varepsilon\phi(x)">\frac H[p(x) + \varepsilon\phi(x)\right\\ & = \left [- \, \frac \sum_x \, [p(x) + \varepsilon\phi(x)] \ \log [p(x) + \varepsilon\phi(x)] \right]_ \\ & = -\sum_x \, [1+\log p(x)] \ \phi(x) \, . \end Thus, \frac = -1-\log p(x).


Exponential

Let F varphi(x) e^. Using the delta function as a test function, \begin \frac & = \lim_\frac\\ & = \lim_\frac\\ & = e^\lim_\frac\\ & = e^\lim_\frac\\ & = e^g(y). \end Thus, \frac = g(y) F varphi(x) This is particularly useful in calculating the correlation functions from the partition function in
quantum field theory In theoretical physics, quantum field theory (QFT) is a theoretical framework that combines Field theory (physics), field theory and the principle of relativity with ideas behind quantum mechanics. QFT is used in particle physics to construct phy ...
.


Functional derivative of a function

A function can be written in the form of an integral like a functional. For example, \rho(\boldsymbol) = F
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
= \int \rho(\boldsymbol') \delta(\boldsymbol-\boldsymbol')\, d\boldsymbol'. Since the integrand does not depend on derivatives of ''ρ'', the functional derivative of ''ρ'' is, \frac \equiv \frac = \frac \, rho(\boldsymbol') \delta(\boldsymbol-\boldsymbol')= \delta(\boldsymbol-\boldsymbol').


Functional derivative of iterated function

The functional derivative of the iterated function f(f(x)) is given by: \frac = f'(f(x))\delta(x-y) + \delta(f(x)-y) and \frac = f'(f(f(x))(f'(f(x))\delta(x-y) + \delta(f(x)-y)) + \delta(f(f(x))-y) In general: \frac = f'( f^(x) ) \frac + \delta( f^(x) - y ) Putting in gives: \frac = - \frac


Using the delta function as a test function

In physics, it is common to use the
Dirac delta function In mathematical analysis, the Dirac delta function (or distribution), also known as the unit impulse, is a generalized function on the real numbers, whose value is zero everywhere except at zero, and whose integral over the entire real line ...
\delta(x-y) in place of a generic test function \phi(x), for yielding the functional derivative at the point y (this is a point of the whole functional derivative as a
partial derivative In mathematics, a partial derivative of a function of several variables is its derivative with respect to one of those variables, with the others held constant (as opposed to the total derivative, in which all variables are allowed to vary). P ...
is a component of the gradient): \frac=\lim_\frac. This works in cases when F rho(x)+\varepsilon f(x)/math> formally can be expanded as a series (or at least up to first order) in \varepsilon. The formula is however not mathematically rigorous, since F rho(x)+\varepsilon\delta(x-y)/math> is usually not even defined. The definition given in a previous section is based on a relationship that holds for all test functions \phi(x), so one might think that it should hold also when \phi(x) is chosen to be a specific function such as the
delta function In mathematical analysis, the Dirac delta function (or distribution), also known as the unit impulse, is a generalized function on the real numbers, whose value is zero everywhere except at zero, and whose integral over the entire real lin ...
. However, the latter is not a valid test function (it is not even a proper function). In the definition, the functional derivative describes how the functional F rho(x)/math> changes as a result of a small change in the entire function \rho(x). The particular form of the change in \rho(x) is not specified, but it should stretch over the whole interval on which x is defined. Employing the particular form of the perturbation given by the delta function has the meaning that \rho(x) is varied only in the point y. Except for this point, there is no variation in \rho(x).


Notes


Footnotes


References

*. *. *. *. *. *


External links

* {{Analysis in topological vector spaces Calculus of variations Differential calculus Differential operators Topological vector spaces Variational analysis