Hellmann–Feynman theorem
   HOME

TheInfoList



OR:

In
quantum mechanics Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
, the Hellmann–Feynman theorem relates the
derivative In mathematics, the derivative is a fundamental tool that quantifies the sensitivity to change of a function's output with respect to its input. The derivative of a function of a single variable at a chosen input value, when it exists, is t ...
of the total energy with respect to a parameter to the
expectation value In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first moment) is a generalization of the weighted average. Informally, the expected va ...
of the derivative of the
Hamiltonian Hamiltonian may refer to: * Hamiltonian mechanics, a function that represents the total energy of a system * Hamiltonian (quantum mechanics), an operator corresponding to the total energy of that system ** Dyall Hamiltonian, a modified Hamiltonian ...
with respect to that same parameter. According to the theorem, once the
spatial distribution A spatial distribution in statistics is the arrangement of a phenomenon across the Earth's surface and a graphical display of such an arrangement is an important tool in geographical and environmental statistics. A graphical display of a spatial ...
of the
electrons The electron (, or in nuclear reactions) is a subatomic particle with a negative one elementary charge, elementary electric charge. It is a fundamental particle that comprises the ordinary matter that makes up the universe, along with up qua ...
has been determined by solving the
Schrödinger equation The Schrödinger equation is a partial differential equation that governs the wave function of a non-relativistic quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after E ...
, all the forces in the system can be calculated using classical electrostatics. The theorem has been proven independently by many authors, including Paul Güttinger (1932),
Wolfgang Pauli Wolfgang Ernst Pauli ( ; ; 25 April 1900 – 15 December 1958) was an Austrian theoretical physicist and a pioneer of quantum mechanics. In 1945, after having been nominated by Albert Einstein, Pauli received the Nobel Prize in Physics "for the ...
(1933), Hans Hellmann (1937) and
Richard Feynman Richard Phillips Feynman (; May 11, 1918 – February 15, 1988) was an American theoretical physicist. He is best known for his work in the path integral formulation of quantum mechanics, the theory of quantum electrodynamics, the physics of t ...
(1939). The theorem states where *\hat_ is a Hermitian operator depending upon a continuous parameter \lambda\,, *, \psi_\lambda\rangle, is an
eigenstate In quantum physics, a quantum state is a mathematical entity that embodies the knowledge of a quantum system. Quantum mechanics specifies the construction, evolution, and measurement of a quantum state. The result is a prediction for the system re ...
(
eigenfunction In mathematics, an eigenfunction of a linear operator ''D'' defined on some function space is any non-zero function f in that space that, when acted upon by ''D'', is only multiplied by some scaling factor called an eigenvalue. As an equation, th ...
) of the
Hamiltonian Hamiltonian may refer to: * Hamiltonian mechanics, a function that represents the total energy of a system * Hamiltonian (quantum mechanics), an operator corresponding to the total energy of that system ** Dyall Hamiltonian, a modified Hamiltonian ...
, depending implicitly upon \lambda, *E_\, is the energy (eigenvalue) of the state , \psi_\lambda\rangle, i.e. \hat_, \psi_\lambda\rangle = E_, \psi_\lambda\rangle. Note that there is a breakdown of the Hellmann-Feynman theorem close to quantum critical points in the thermodynamic limit.


Proof

This proof of the Hellmann–Feynman theorem requires that the
wave function In quantum physics, a wave function (or wavefunction) is a mathematical description of the quantum state of an isolated quantum system. The most common symbols for a wave function are the Greek letters and (lower-case and capital psi (letter) ...
be an eigenfunction of the Hamiltonian under consideration; however, it is also possible to prove more generally that the theorem holds for non-eigenfunction wave functions which are stationary (partial derivative is zero) for all relevant variables (such as orbital rotations). The Hartree–Fock wavefunction is an important example of an approximate eigenfunction that still satisfies the Hellmann–Feynman theorem. Notable example of where the Hellmann–Feynman is not applicable is for example finite-order Møller–Plesset perturbation theory, which is not variational. The proof also employs an identity of normalized wavefunctions – that derivatives of the overlap of a wave function with itself must be zero. Using Dirac's
bra–ket notation Bra–ket notation, also called Dirac notation, is a notation for linear algebra and linear operators on complex vector spaces together with their dual space both in the finite-dimensional and infinite-dimensional case. It is specifically de ...
these two conditions are written as :\hat_, \psi_\lambda\rangle = E_, \psi_\lambda\rangle, :\langle\psi_\lambda, \psi_\lambda\rangle = 1 \Rightarrow \frac\langle\psi_\lambda, \psi_\lambda\rangle =0. The proof then follows through an application of the derivative
product rule In calculus, the product rule (or Leibniz rule or Leibniz product rule) is a formula used to find the derivatives of products of two or more functions. For two functions, it may be stated in Lagrange's notation as (u \cdot v)' = u ' \cdot v ...
to the
expectation value In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first moment) is a generalization of the weighted average. Informally, the expected va ...
of the Hamiltonian viewed as a function of \lambda: : \begin \frac &= \frac\langle\psi_\lambda, \hat_, \psi_\lambda\rangle \\ &=\bigg\langle\frac\bigg, \hat_\bigg, \psi_\lambda\bigg\rangle + \bigg\langle\psi_\lambda\bigg, \hat_\bigg, \frac\bigg\rangle + \bigg\langle\psi_\lambda\bigg, \frac\bigg, \psi_\lambda\bigg\rangle \\ &=E_\bigg\langle\frac\bigg, \psi_\lambda\bigg\rangle + E_\bigg\langle\psi_\lambda\bigg, \frac\bigg\rangle + \bigg\langle\psi_\lambda\bigg, \frac\bigg, \psi_\lambda\bigg\rangle \\ &=E_\frac\langle\psi_\lambda, \psi_\lambda\rangle + \bigg\langle\psi_\lambda\bigg, \frac\bigg, \psi_\lambda\bigg\rangle \\ &=\bigg\langle\psi_\lambda\bigg, \frac\bigg, \psi_\lambda\bigg\rangle. \end


Alternate proof

The Hellmann–Feynman theorem is actually a direct, and to some extent trivial, consequence of the variational principle (the Rayleigh–Ritz variational principle) from which the Schrödinger equation may be derived. This is why the Hellmann–Feynman theorem holds for wave-functions (such as the Hartree–Fock wave-function) that, though not eigenfunctions of the Hamiltonian, do derive from a variational principle. This is also why it holds, e.g., in
density functional theory Density functional theory (DFT) is a computational quantum mechanical modelling method used in physics, chemistry and materials science to investigate the electronic structure (or nuclear structure) (principally the ground state) of many-body ...
, e.g. in the adiabatic connection fluctuation dissipation theorem, which is not wave-function based and for which the standard derivation does not apply. According to the Rayleigh–Ritz variational principle, the eigenfunctions of the Schrödinger equation are stationary points of the functional (which is nicknamed ''Schrödinger functional'' for brevity): The eigenvalues are the values that the Schrödinger functional takes at the stationary points: where \psi_ satisfies the variational condition: By differentiating Eq. (3) using the
chain rule In calculus, the chain rule is a formula that expresses the derivative of the Function composition, composition of two differentiable functions and in terms of the derivatives of and . More precisely, if h=f\circ g is the function such that h ...
, the following equation is obtained: Due to the variational condition, Eq. (4), the second term in Eq. (5) vanishes. In one sentence, the Hellmann–Feynman theorem states that ''the derivative of the stationary values of a function(al) with respect to a parameter on which it may depend, can be computed from the explicit dependence only, disregarding the implicit one''. Because the Schrödinger functional can only depend explicitly on an external parameter through the Hamiltonian, Eq. (1) trivially follows.


Example applications


Molecular forces

The most common application of the Hellmann–Feynman theorem is the calculation of
intramolecular force An intramolecular force (from Latin ''intra-'' 'within') is any force that binds together the atoms making up a molecule. Intramolecular forces are stronger than the Intermolecular force, intermolecular forces that govern the interactions between ...
s in molecules. This allows for the calculation of equilibrium geometries – the nuclear coordinates where the forces acting upon the nuclei, due to the electrons and other nuclei, vanish. The parameter \lambda corresponds to the coordinates of the nuclei. For a molecule with 1\leq i\leq N electrons with coordinates \, and 1\leq\alpha\leq M nuclei, each located at a specified point \ and with nuclear charge Z_\alpha, the clamped nucleus Hamiltonian is :\hat=\hat + \hat - \sum_^\sum_^\frac + \sum_^\sum_^\frac. The x-component of the force acting on a given nucleus is equal to the negative of the derivative of the total energy with respect to that coordinate. Employing the Hellmann–Feynman theorem this is equal to :F_ = -\frac = -\bigg\langle\psi\bigg, \frac\bigg, \psi\bigg\rangle. Only two components of the Hamiltonian contribute to the required derivative – the electron-nucleus and nucleus-nucleus terms. Differentiating the Hamiltonian yields : \begin \frac &= \frac \left(- \sum_^\sum_^\frac + \sum_^\sum_^\frac\right), \\ &=-Z_\sum_^\frac +Z_\sum_^Z_\frac. \end Insertion of this in to the Hellmann–Feynman theorem returns the x-component of the force on the given nucleus in terms of the electronic density \rho(\mathbf) and the atomic coordinates and nuclear charges: :F_ = Z_\left(\int\mathrm\mathbf\ \rho(\mathbf)\frac - \sum_^Z_\frac\right). A comprehensive survey of similar applications of the Hellmann-Feynman theorem in quantum chemistry is given in B. M. Deb (ed.) ''The Force Concept in Chemistry'', Van Nostrand Rheinhold, 1981.


Expectation values

An alternative approach for applying the Hellmann–Feynman theorem is to promote a fixed or discrete parameter which appears in a Hamiltonian to be a continuous variable solely for the mathematical purpose of taking a derivative. Possible parameters are physical constants or discrete quantum numbers. As an example, the radial Schrödinger equation for a hydrogen-like atom is :\hat_=-\frac\left(\frac\left(r^\frac\right)-l(l+1)\right) -\frac, which depends upon the discrete
azimuthal quantum number In quantum mechanics, the azimuthal quantum number is a quantum number for an atomic orbital that determines its angular momentum operator, orbital angular momentum and describes aspects of the angular shape of the orbital. The azimuthal quantum ...
l. Promoting l to be a continuous parameter allows for the derivative of the Hamiltonian to be taken: :\frac = \frac(2l+1). The Hellmann–Feynman theorem then allows for the determination of the expectation value of \frac for hydrogen-like atoms: : \begin \bigg\langle\psi_\bigg, \frac\bigg, \psi_\bigg\rangle &= \frac\frac\bigg\langle\psi_\bigg, \frac\bigg, \psi_\bigg\rangle \\ &=\frac\frac\frac \\ &=\frac\frac\frac\frac \\ &=\frac\frac\frac \\ &=\frac. \end In order to compute the energy derivative, the way n depends on l has to be known. These quantum numbers are usually independent, but here the solutions must be varied so as to keep the number of nodes in the wavefunction fixed. The number of nodes is n-l-1, so \partial n/\partial l=1.


Van der Waals forces

In the end of Feynman's paper, he states that, " Van der Waals' forces can also be interpreted as arising from charge distributions with higher concentration between the nuclei. The Schrödinger perturbation theory for two interacting atoms at a separation R, large compared to the radii of the atoms, leads to the result that the charge distribution of each is distorted from central symmetry, a dipole moment of order 1/R^7 being induced in each atom. The negative charge distribution of each atom has its center of gravity moved slightly toward the other. It is not the interaction of these dipoles which leads to van der Waals's force, but rather the attraction of each nucleus for the distorted charge distribution of its ''own'' electrons that gives the attractive 1/R^7 force."


Hellmann–Feynman theorem for time-dependent wavefunctions

For a general time-dependent wavefunction satisfying the time-dependent
Schrödinger equation The Schrödinger equation is a partial differential equation that governs the wave function of a non-relativistic quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after E ...
, the Hellmann–Feynman theorem is not valid. However, the following identity holds: : \bigg\langle\Psi_\lambda(t)\bigg, \frac\bigg, \Psi_\lambda(t)\bigg\rangle = i \hbar \frac\bigg\langle\Psi_\lambda(t)\bigg, \frac\bigg\rangle For : i\hbar\frac=H_\lambda\Psi_\lambda(t)


Proof

The proof only relies on the Schrödinger equation and the assumption that partial derivatives with respect to λ and t can be interchanged. : \begin \bigg\langle\Psi_\lambda(t)\bigg, \frac\bigg, \Psi_\lambda(t)\bigg\rangle &= \frac\langle\Psi_\lambda(t), H_\lambda, \Psi_\lambda(t)\rangle - \bigg\langle\frac\bigg, H_\lambda\bigg, \Psi_\lambda(t)\bigg\rangle - \bigg\langle\Psi_\lambda(t)\bigg, H_\lambda\bigg, \frac\bigg\rangle \\ &= i\hbar \frac\bigg\langle\Psi_\lambda(t)\bigg, \frac\bigg\rangle - i\hbar\bigg\langle\frac\bigg, \frac\bigg\rangle + i\hbar\bigg\langle\frac\bigg, \frac\bigg\rangle \\ &= i\hbar \bigg\langle\Psi_\lambda(t)\bigg, \frac\bigg\rangle + i\hbar\bigg\langle\frac\bigg, \frac\bigg\rangle \\ &= i \hbar \frac\bigg\langle\Psi_\lambda(t)\bigg, \frac\bigg\rangle \end


References

{{DEFAULTSORT:Hellmann-Feynman theorem Eponymous theorems of physics Intermolecular forces Theorems in quantum mechanics Richard Feynman