In
particle physics
Particle physics or high-energy physics is the study of Elementary particle, fundamental particles and fundamental interaction, forces that constitute matter and radiation. The field also studies combinations of elementary particles up to the s ...
, the Dirac equation is a
relativistic wave equation
In physics, specifically relativistic quantum mechanics (RQM) and its applications to particle physics, relativistic wave equations predict the behavior of particles at high energies and velocities comparable to the speed of light. In the c ...
derived by British physicist
Paul Dirac
Paul Adrien Maurice Dirac ( ; 8 August 1902 – 20 October 1984) was an English mathematician and Theoretical physics, theoretical physicist who is considered to be one of the founders of quantum mechanics. Dirac laid the foundations for bot ...
in 1928. In its
free form, or including electromagnetic interactions, it describes all
spin-1/2
In quantum mechanics, spin is an intrinsic property of all elementary particles. All known fermions, the particles that constitute ordinary matter, have a spin of . The spin number describes how many symmetrical facets a particle has in one f ...
massive particles, called "Dirac particles", such as
electron
The electron (, or in nuclear reactions) is a subatomic particle with a negative one elementary charge, elementary electric charge. It is a fundamental particle that comprises the ordinary matter that makes up the universe, along with up qua ...
s and
quark
A quark () is a type of elementary particle and a fundamental constituent of matter. Quarks combine to form composite particles called hadrons, the most stable of which are protons and neutrons, the components of atomic nucleus, atomic nuclei ...
s for which
parity is a
symmetry
Symmetry () in everyday life refers to a sense of harmonious and beautiful proportion and balance. In mathematics, the term has a more precise definition and is usually used to refer to an object that is Invariant (mathematics), invariant und ...
. It is consistent with both the principles of
quantum mechanics
Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
and the theory of
special relativity
In physics, the special theory of relativity, or special relativity for short, is a scientific theory of the relationship between Spacetime, space and time. In Albert Einstein's 1905 paper, Annus Mirabilis papers#Special relativity,
"On the Ele ...
, and was the first theory to account fully for special relativity in the context of quantum mechanics. The equation is validated by its rigorous accounting of the observed
fine structure
In atomic physics, the fine structure describes the splitting of the spectral lines of atoms due to electron spin and relativistic corrections to the non-relativistic Schrödinger equation. It was first measured precisely for the hydrogen atom ...
of the
hydrogen spectrum
The emission spectrum of atomic hydrogen has been divided into a number of ''spectral series'', with wavelengths given by the Rydberg formula. These observed spectral lines are due to the electron making atomic electron transition, transitions b ...
and has become vital in the building of the
Standard Model
The Standard Model of particle physics is the Scientific theory, theory describing three of the four known fundamental forces (electromagnetism, electromagnetic, weak interaction, weak and strong interactions – excluding gravity) in the unive ...
.
The equation also implied the existence of a new form of matter, ''
antimatter
In modern physics, antimatter is defined as matter composed of the antiparticles (or "partners") of the corresponding subatomic particle, particles in "ordinary" matter, and can be thought of as matter with reversed charge and parity, or go ...
'', previously unsuspected and unobserved and which was experimentally confirmed several years later. It also provided a ''theoretical'' justification for the introduction of several component wave functions in
Pauli's
phenomenological theory of
spin. The wave functions in the Dirac theory are vectors of four
complex number
In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...
s (known as
bispinor
In physics, and specifically in quantum field theory, a bispinor is a mathematical construction that is used to describe some of the fundamental particles of nature, including quarks and electrons. It is a specific embodiment of a spinor, specifi ...
s), two of which resemble
the Pauli wavefunction in the non-relativistic limit, in contrast to the
Schrödinger equation
The Schrödinger equation is a partial differential equation that governs the wave function of a non-relativistic quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after E ...
, which described wave functions of only one complex value. Moreover, in the limit of zero mass, the Dirac equation reduces to the
Weyl equation
In physics, particularly in quantum field theory, the Weyl equation is a relativistic wave equation for describing massless spin-1/2 particles called Weyl fermions. The equation is named after Hermann Weyl. The Weyl fermions are one of the three ...
.
In the context of
quantum field theory
In theoretical physics, quantum field theory (QFT) is a theoretical framework that combines Field theory (physics), field theory and the principle of relativity with ideas behind quantum mechanics. QFT is used in particle physics to construct phy ...
, the Dirac equation is reinterpreted to describe quantum fields corresponding to spin-1/2 particles.
Dirac did not fully appreciate the importance of his results; however, the entailed explanation of spin as a consequence of the union of quantum mechanics and relativity—and the eventual discovery of the
positron
The positron or antielectron is the particle with an electric charge of +1''elementary charge, e'', a Spin (physics), spin of 1/2 (the same as the electron), and the same Electron rest mass, mass as an electron. It is the antiparticle (antimatt ...
—represents one of the great triumphs of
theoretical physics
Theoretical physics is a branch of physics that employs mathematical models and abstractions of physical objects and systems to rationalize, explain, and predict List of natural phenomena, natural phenomena. This is in contrast to experimental p ...
. This accomplishment has been described as fully on par with the works of
Newton,
Maxwell
Maxwell may refer to:
People
* Maxwell (surname), including a list of people and fictional characters with the name
** James Clerk Maxwell, mathematician and physicist
* Justice Maxwell (disambiguation)
* Maxwell baronets, in the Baronetage of N ...
, and
Einstein
Albert Einstein (14 March 187918 April 1955) was a German-born theoretical physicist who is best known for developing the theory of relativity. Einstein also made important contributions to quantum mechanics. His mass–energy equivalence f ...
before him. The equation has been deemed by some physicists to be the "real seed of modern physics".
The equation has also been described as the "centerpiece of relativistic quantum mechanics", with it also stated that "the equation is perhaps the most important one in all of quantum mechanics".
The Dirac equation is inscribed upon a plaque on the floor of
Westminster Abbey
Westminster Abbey, formally titled the Collegiate Church of Saint Peter at Westminster, is an Anglican church in the City of Westminster, London, England. Since 1066, it has been the location of the coronations of 40 English and British m ...
. Unveiled on 13 November 1995, the plaque commemorates Dirac's life.
The equation, in its
natural units
In physics, natural unit systems are measurement systems for which selected physical constants have been set to 1 through nondimensionalization of physical units. For example, the speed of light may be set to 1, and it may then be omitted, equa ...
formulation, is also prominently displayed in the auditorium at the ‘Paul A.M. Dirac’ Lecture Hall at the Patrick M.S. Blackett Institute (formerly The San Domenico Monastery) of the
Ettore Majorana Foundation and Centre for Scientific Culture in
Erice
Erice (; ) is a (municipality) contiguous with the provincial capital Trapani, in western Sicily. Its historic core occupies the site of the ancient city of Eryx, one of the most significant archaeological and religious centres in pre-Roman w ...
,
Sicily
Sicily (Italian language, Italian and ), officially the Sicilian Region (), is an island in the central Mediterranean Sea, south of the Italian Peninsula in continental Europe and is one of the 20 regions of Italy, regions of Italy. With 4. ...
.
History
The Dirac equation in the form originally proposed by
Dirac is:
where is the
wave function
In quantum physics, a wave function (or wavefunction) is a mathematical description of the quantum state of an isolated quantum system. The most common symbols for a wave function are the Greek letters and (lower-case and capital psi (letter) ...
for an
electron
The electron (, or in nuclear reactions) is a subatomic particle with a negative one elementary charge, elementary electric charge. It is a fundamental particle that comprises the ordinary matter that makes up the universe, along with up qua ...
of
rest mass
The invariant mass, rest mass, intrinsic mass, proper mass, or in the case of bound systems simply mass, is the portion of the total mass of an object or system of objects that is independent of the overall motion of the system. More precisely, ...
with
spacetime
In physics, spacetime, also called the space-time continuum, is a mathematical model that fuses the three dimensions of space and the one dimension of time into a single four-dimensional continuum. Spacetime diagrams are useful in visualiz ...
coordinates . are the components of the
momentum
In Newtonian mechanics, momentum (: momenta or momentums; more specifically linear momentum or translational momentum) is the product of the mass and velocity of an object. It is a vector quantity, possessing a magnitude and a direction. ...
, understood to be the
momentum operator in the
Schrödinger equation
The Schrödinger equation is a partial differential equation that governs the wave function of a non-relativistic quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after E ...
. is the
speed of light
The speed of light in vacuum, commonly denoted , is a universal physical constant exactly equal to ). It is exact because, by international agreement, a metre is defined as the length of the path travelled by light in vacuum during a time i ...
, and is the
reduced Planck constant
The Planck constant, or Planck's constant, denoted by h, is a fundamental physical constant of foundational importance in quantum mechanics: a photon's energy is equal to its frequency multiplied by the Planck constant, and the wavelength of a ...
; these fundamental
physical constant
A physical constant, sometimes fundamental physical constant or universal constant, is a physical quantity that cannot be explained by a theory and therefore must be measured experimentally. It is distinct from a mathematical constant, which has a ...
s reflect special relativity and quantum mechanics, respectively. and are
gamma matrices
In mathematical physics, the gamma matrices, \ \left\\ , also called the Dirac matrices, are a set of conventional matrices with specific anticommutation relations that ensure they generate a matrix representation of the Clifford algebra \ \mathr ...
.
Dirac's purpose in casting this equation was to explain the behavior of the relativistically moving electron, thus allowing the atom to be treated in a manner consistent with relativity. He hoped that the corrections introduced this way might have a bearing on the problem of
atomic spectra.
Up until that time, attempts to make the old quantum theory of the atom compatible with the theory of relativity—which were based on discretizing the
angular momentum
Angular momentum (sometimes called moment of momentum or rotational momentum) is the rotational analog of Momentum, linear momentum. It is an important physical quantity because it is a Conservation law, conserved quantity – the total ang ...
stored in the electron's possibly non-circular orbit of the
atomic nucleus
The atomic nucleus is the small, dense region consisting of protons and neutrons at the center of an atom, discovered in 1911 by Ernest Rutherford at the Department_of_Physics_and_Astronomy,_University_of_Manchester , University of Manchester ...
—had failed, and the new quantum mechanics of
Heisenberg,
Pauli,
Jordan
Jordan, officially the Hashemite Kingdom of Jordan, is a country in the Southern Levant region of West Asia. Jordan is bordered by Syria to the north, Iraq to the east, Saudi Arabia to the south, and Israel and the occupied Palestinian ter ...
,
Schrödinger, and Dirac himself had not developed sufficiently to treat this problem. Although Dirac's original intentions were satisfied, his equation had far deeper implications for the structure of matter and introduced new mathematical classes of objects that are now essential elements of fundamental physics.
The new elements in this equation are the four
matrices
Matrix (: matrices or matrixes) or MATRIX may refer to:
Science and mathematics
* Matrix (mathematics), a rectangular array of numbers, symbols or expressions
* Matrix (logic), part of a formula in prenex normal form
* Matrix (biology), the ...
, , and , and the four-component
wave function
In quantum physics, a wave function (or wavefunction) is a mathematical description of the quantum state of an isolated quantum system. The most common symbols for a wave function are the Greek letters and (lower-case and capital psi (letter) ...
. There are four components in because the evaluation of it at any given point in configuration space is a
bispinor
In physics, and specifically in quantum field theory, a bispinor is a mathematical construction that is used to describe some of the fundamental particles of nature, including quarks and electrons. It is a specific embodiment of a spinor, specifi ...
. It is interpreted as a superposition of a
spin-up electron, a spin-down electron, a spin-up positron, and a spin-down positron.
The matrices and are all
Hermitian {{Short description, none
Numerous things are named after the French mathematician Charles Hermite (1822–1901):
Hermite
* Cubic Hermite spline, a type of third-degree spline
* Gauss–Hermite quadrature, an extension of Gaussian quadrature me ...
and are
involutory:
and they all mutually
anti-commute:
These matrices and the form of the wave function have a deep mathematical significance. The algebraic structure represented by the
gamma matrices
In mathematical physics, the gamma matrices, \ \left\\ , also called the Dirac matrices, are a set of conventional matrices with specific anticommutation relations that ensure they generate a matrix representation of the Clifford algebra \ \mathr ...
had been created some 50 years earlier by the English mathematician
W. K. Clifford. In turn, Clifford's ideas had emerged from the mid-19th-century work of German mathematician
Hermann Grassmann
Hermann Günther Grassmann (, ; 15 April 1809 – 26 September 1877) was a German polymath known in his day as a linguist and now also as a mathematician. He was also a physicist, general scholar, and publisher. His mathematical work was littl ...
in his ''Lineare Ausdehnungslehre'' (''Theory of Linear Expansion'').
Making the Schrödinger equation relativistic
The Dirac equation is superficially similar to the Schrödinger equation for a massive
free particle
In physics, a free particle is a particle that, in some sense, is not bound by an external force, or equivalently not in a region where its potential energy varies. In classical physics, this means the particle is present in a "field-free" space. I ...
:
The left side represents the square of the momentum operator divided by twice the mass, which is the non-relativistic kinetic energy. Because relativity treats space and time as a whole, a relativistic generalization of this equation requires that space and time derivatives must enter symmetrically as they do in the
Maxwell equations that govern the behavior of light – the equations must be differentially of the ''same order'' in space and time. In relativity, the momentum and the energies are the space and time parts of a spacetime vector, the
four-momentum
In special relativity, four-momentum (also called momentum–energy or momenergy) is the generalization of the classical three-dimensional momentum to four-dimensional spacetime. Momentum is a vector in three dimensions; similarly four-momentum i ...
, and they are related by the relativistically invariant relation
which says that the
length of this four-vector is proportional to the rest mass . Substituting the operator equivalents of the energy and momentum from the Schrödinger theory produces the
Klein–Gordon equation
The Klein–Gordon equation (Klein–Fock–Gordon equation or sometimes Klein–Gordon–Fock equation) is a relativistic wave equation, related to the Schrödinger equation. It is named after Oskar Klein and Walter Gordon. It is second-order i ...
describing the propagation of waves, constructed from relativistically invariant objects,
with the wave function
being a relativistic scalar: a complex number that has the same numerical value in all frames of reference. Space and time derivatives both enter to second order. This has a telling consequence for the interpretation of the equation. Because the equation is second order in the time derivative, one must specify initial values both of the wave function itself and of its first time-derivative in order to solve definite problems. Since both may be specified more or less arbitrarily, the wave function cannot maintain its former role of determining the
probability density of finding the electron in a given state of motion. In the Schrödinger theory, the probability density is given by the positive definite expression
and this density is convected according to the probability current vector
with the conservation of probability current and density following from the continuity equation:
The fact that the density is
positive definite and convected according to this continuity equation implies that one may integrate the density over a certain domain and set the total to 1, and this condition will be maintained by the
conservation law
In physics, a conservation law states that a particular measurable property of an isolated physical system does not change as the system evolves over time. Exact conservation laws include conservation of mass-energy, conservation of linear momen ...
. A proper relativistic theory with a probability density current must also share this feature. To maintain the notion of a convected density, one must generalize the Schrödinger expression of the density and current so that space and time derivatives again enter symmetrically in relation to the scalar wave function. The Schrödinger expression can be kept for the current, but the probability density must be replaced by the symmetrically formed expression
which now becomes the 4th component of a spacetime vector, and the entire
probability 4-current density has the relativistically covariant expression
The continuity equation is as before. Everything is compatible with relativity now, but the expression for the density is no longer positive definite; the initial values of both and may be freely chosen, and the density may thus become negative, something that is impossible for a legitimate probability density. Thus, one cannot get a simple generalization of the Schrödinger equation under the naive assumption that the wave function is a relativistic scalar, and the equation it satisfies, second order in time.
Although it is not a successful relativistic generalization of the Schrödinger equation, this equation is resurrected in the context of
quantum field theory
In theoretical physics, quantum field theory (QFT) is a theoretical framework that combines Field theory (physics), field theory and the principle of relativity with ideas behind quantum mechanics. QFT is used in particle physics to construct phy ...
, where it is known as the
Klein–Gordon equation
The Klein–Gordon equation (Klein–Fock–Gordon equation or sometimes Klein–Gordon–Fock equation) is a relativistic wave equation, related to the Schrödinger equation. It is named after Oskar Klein and Walter Gordon. It is second-order i ...
, and describes a spinless particle field (e.g.
pi meson
In particle physics, a pion (, ) or pi meson, denoted with the Greek letter pi (), is any of three subatomic particles: , , and . Each pion consists of a quark and an antiquark and is therefore a meson. Pions are the lightest mesons and, more ...
or
Higgs boson
The Higgs boson, sometimes called the Higgs particle, is an elementary particle in the Standard Model of particle physics produced by the excited state, quantum excitation of the Higgs field,
one of the field (physics), fields in particl ...
). Historically, Schrödinger himself arrived at this equation before the one that bears his name but soon discarded it. In the context of quantum field theory, the indefinite density is understood to correspond to the ''charge'' density, which can be positive or negative, and not the probability density.
Dirac's coup
Dirac thus thought to try an equation that was ''first order'' in both space and time. He postulated an equation of the form
where the operators
must be independent of
for linearity and independent of
for space-time homogeneity. These constraints implied additional dynamical variables that the
operators will depend upon; from this requirement Dirac concluded that the operators would depend upon matrices, related to the Pauli matrices.
One could, for example, formally (i.e. by
abuse of notation
In mathematics, abuse of notation occurs when an author uses a mathematical notation in a way that is not entirely formally correct, but which might help simplify the exposition or suggest the correct intuition (while possibly minimizing errors an ...
, since it is not straightforward to take a
functional square root
In mathematics, a functional square root (sometimes called a half iterate) is a square root of a function with respect to the operation of function composition. In other words, a functional square root of a function is a function satisfying fo ...
of the sum of two differential operators) take the
relativistic expression for the energy
replace by its operator equivalent, expand the square root in an
infinite series
In mathematics, a series is, roughly speaking, an addition of infinitely many terms, one after the other. The study of series is a major part of calculus and its generalization, mathematical analysis. Series are used in most areas of mathemati ...
of derivative operators, set up an eigenvalue problem, then solve the equation formally by iterations. Most physicists had little faith in such a process, even if it were technically possible.
As the story goes, Dirac was staring into the fireplace at Cambridge, pondering this problem, when he hit upon the idea of taking the square root of the wave operator (see also
half derivative) thus:
On multiplying out the right side it is apparent that, in order to get all the cross-terms such as to vanish, one must assume
with
Dirac, who had just then been intensely involved with working out the foundations of Heisenberg's
matrix mechanics
Matrix mechanics is a formulation of quantum mechanics created by Werner Heisenberg, Max Born, and Pascual Jordan in 1925. It was the first conceptually autonomous and logically consistent formulation of quantum mechanics. Its account of quantum ...
, immediately understood that these conditions could be met if , , and are ''matrices'', with the implication that the wave function has ''multiple components''. This immediately explained the appearance of two-component wave functions in Pauli's phenomenological theory of
spin, something that up until then had been regarded as mysterious, even to Pauli himself. However, one needs at least matrices to set up a system with the properties required – so the wave function had ''four'' components, not two, as in the Pauli theory, or one, as in the bare Schrödinger theory. The four-component wave function represents a new class of mathematical object in physical theories that makes its first appearance here.
Given the factorization in terms of these matrices, one can now write down immediately an equation
with
to be determined. Applying again the matrix operator on both sides yields
Taking
shows that all the components of the wave function ''individually'' satisfy the relativistic energy–momentum relation. Thus the sought-for equation that is first-order in both space and time is
Setting
and because
, the Dirac equation is produced as written above.
Covariant form and relativistic invariance
To demonstrate the
relativistic invariance of the equation, it is advantageous to cast it into a form in which the space and time derivatives appear on an equal footing. New matrices are introduced as follows:
and the equation takes the form (remembering the definition of the covariant components of the
4-gradient and especially that )
where there is an
implied summation over the values of the twice-repeated index , and is the
4-gradient. In practice one often writes the
gamma matrices
In mathematical physics, the gamma matrices, \ \left\\ , also called the Dirac matrices, are a set of conventional matrices with specific anticommutation relations that ensure they generate a matrix representation of the Clifford algebra \ \mathr ...
in terms of 2 × 2 sub-matrices taken from the
Pauli matrices
In mathematical physics and mathematics, the Pauli matrices are a set of three complex matrices that are traceless, Hermitian, involutory and unitary. Usually indicated by the Greek letter sigma (), they are occasionally denoted by tau () ...
and the 2 × 2
identity matrix
In linear algebra, the identity matrix of size n is the n\times n square matrix with ones on the main diagonal and zeros elsewhere. It has unique properties, for example when the identity matrix represents a geometric transformation, the obje ...
. Explicitly the
standard representation is
The complete system is summarized using the
Minkowski metric
In physics, Minkowski space (or Minkowski spacetime) () is the main mathematical description of spacetime in the absence of general_relativity, gravitation. It combines inertial space and time manifolds into a four-dimensional model.
The model ...
on spacetime in the form
where the bracket expression
denotes the
anticommutator
In mathematics, the commutator gives an indication of the extent to which a certain binary operation fails to be commutative. There are different definitions used in group theory and ring theory.
Group theory
The commutator of two elements, ...
. These are the defining relations of a
Clifford algebra
In mathematics, a Clifford algebra is an algebra generated by a vector space with a quadratic form, and is a unital associative algebra with the additional structure of a distinguished subspace. As -algebras, they generalize the real number ...
over a pseudo-orthogonal 4-dimensional space with
metric signature
In mathematics, the signature of a metric tensor ''g'' (or equivalently, a real quadratic form thought of as a real symmetric bilinear form on a finite-dimensional vector space) is the number (counted with multiplicity) of positive, negative and z ...
. The specific Clifford algebra employed in the Dirac equation is known today as the
Dirac algebra
In mathematical physics, the Dirac algebra is the Clifford algebra \text_(\mathbb). This was introduced by the mathematical physicist P. A. M. Dirac in 1928 in developing the Dirac equation for spin- particles with a matrix representation of the ...
. Although not recognized as such by Dirac at the time the equation was formulated, in hindsight the introduction of this ''
geometric algebra
In mathematics, a geometric algebra (also known as a Clifford algebra) is an algebra that can represent and manipulate geometrical objects such as vectors. Geometric algebra is built out of two fundamental operations, addition and the geometric pr ...
'' represents an enormous stride forward in the development of quantum theory.
The Dirac equation may now be interpreted as an
eigenvalue
In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...
equation, where the rest mass is proportional to an eigenvalue of the
4-momentum operator, the
proportionality constant being the speed of light:
Using
(
is pronounced "d-slash"), according to Feynman slash notation, the Dirac equation becomes:
In practice, physicists often use units of measure such that , known as
natural units
In physics, natural unit systems are measurement systems for which selected physical constants have been set to 1 through nondimensionalization of physical units. For example, the speed of light may be set to 1, and it may then be omitted, equa ...
. The equation then takes the simple form
A foundational theorem states that if two distinct sets of matrices are given that both satisfy the
Clifford relations, then they are connected to each other by a
similarity transform:
If in addition the matrices are all
unitary
Unitary may refer to:
Mathematics
* Unitary divisor
* Unitary element
* Unitary group
* Unitary matrix
* Unitary morphism
* Unitary operator
* Unitary transformation
* Unitary representation
* Unitarity (physics)
* ''E''-unitary inverse semigr ...
, as are the Dirac set, then itself is
unitary
Unitary may refer to:
Mathematics
* Unitary divisor
* Unitary element
* Unitary group
* Unitary matrix
* Unitary morphism
* Unitary operator
* Unitary transformation
* Unitary representation
* Unitarity (physics)
* ''E''-unitary inverse semigr ...
;
The transformation is unique up to a multiplicative factor of absolute value 1. Let us now imagine a
Lorentz transformation
In physics, the Lorentz transformations are a six-parameter family of Linear transformation, linear coordinate transformation, transformations from a Frame of Reference, coordinate frame in spacetime to another frame that moves at a constant vel ...
to have been performed on the space and time coordinates, and on the derivative operators, which form a covariant vector. For the operator to remain invariant, the gammas must transform among themselves as a contravariant vector with respect to their spacetime index. These new gammas will themselves satisfy the Clifford relations, because of the orthogonality of the Lorentz transformation. By the previously mentioned foundational theorem, one may replace the new set by the old set subject to a unitary transformation. In the new frame, remembering that the rest mass is a relativistic scalar, the Dirac equation will then take the form
If the transformed spinor is defined as
then the transformed Dirac equation is produced in a way that demonstrates
manifest relativistic invariance:
Thus, settling on any unitary representation of the gammas is final, provided the spinor is transformed according to the unitary transformation that corresponds to the given Lorentz transformation.
The various representations of the Dirac matrices employed will bring into focus particular aspects of the physical content in the Dirac wave function. The representation shown here is known as the ''standard'' representation – in it, the wave function's upper two components go over into Pauli's 2 spinor wave function in the limit of low energies and small velocities in comparison to light.
The considerations above reveal the origin of the gammas in ''geometry'', hearkening back to Grassmann's original motivation; they represent a fixed basis of unit vectors in spacetime. Similarly, products of the gammas such as represent ''
oriented surface
In mathematics, orientability is a property of some topological spaces such as real vector spaces, Euclidean spaces, Surface (topology), surfaces, and more generally manifolds that allows a consistent definition of "clockwise" and "anticlockw ...
elements'', and so on. With this in mind, one can find the form of the unit volume element on spacetime in terms of the gammas as follows. By definition, it is
For this to be an invariant, the
epsilon symbol must be a
tensor
In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects associated with a vector space. Tensors may map between different objects such as vectors, scalars, and even other ...
, and so must contain a factor of , where is the
determinant
In mathematics, the determinant is a Scalar (mathematics), scalar-valued function (mathematics), function of the entries of a square matrix. The determinant of a matrix is commonly denoted , , or . Its value characterizes some properties of the ...
of the
metric tensor
In the mathematical field of differential geometry, a metric tensor (or simply metric) is an additional structure on a manifold (such as a surface) that allows defining distances and angles, just as the inner product on a Euclidean space allows ...
. Since this is negative, that factor is ''imaginary''. Thus
This matrix is given the special symbol , owing to its importance when one is considering improper transformations of space-time, that is, those that change the orientation of the basis vectors. In the standard representation, it is
This matrix will also be found to anticommute with the other four Dirac matrices:
It takes a leading role when questions of ''
parity'' arise because the volume element as a directed magnitude changes sign under a space-time reflection. Taking the positive square root above thus amounts to choosing a handedness convention on spacetime.
Comparison with related theories
Pauli theory
The necessity of introducing half-integer
spin goes back experimentally to the results of the
Stern–Gerlach experiment
In quantum physics, the Stern–Gerlach experiment demonstrated that the spatial orientation of angular momentum is quantization (physics), quantized. Thus an Atomic spacing, atomic-scale system was shown to have intrinsically quantum propertie ...
. A beam of atoms is run through a strong
inhomogeneous
Homogeneity and heterogeneity are concepts relating to the uniformity of a substance, process or image. A homogeneous feature is uniform in composition or character (i.e., color, shape, size, weight, height, distribution, texture, language, i ...
magnetic field
A magnetic field (sometimes called B-field) is a physical field that describes the magnetic influence on moving electric charges, electric currents, and magnetic materials. A moving charge in a magnetic field experiences a force perpendicular ...
, which then splits into parts depending on the
intrinsic angular momentum
Spin is an intrinsic form of angular momentum carried by elementary particles, and thus by composite particles such as hadrons, atomic nuclei, and atoms. Spin is quantized, and accurate models for the interaction with spin require relativistic ...
of the atoms. It was found that for
silver
Silver is a chemical element; it has Symbol (chemistry), symbol Ag () and atomic number 47. A soft, whitish-gray, lustrous transition metal, it exhibits the highest electrical conductivity, thermal conductivity, and reflectivity of any metal. ...
atoms, the beam was split in two; the
ground state
The ground state of a quantum-mechanical system is its stationary state of lowest energy; the energy of the ground state is known as the zero-point energy of the system. An excited state is any state with energy greater than the ground state ...
therefore could not be
integer
An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
, because even if the intrinsic angular momentum of the atoms were as small as possible, 1, the beam would be split into three parts, corresponding to atoms with . The conclusion is that silver atoms have net intrinsic angular momentum of .
Pauli set up a theory that explained this splitting by introducing a two-component wave function and a corresponding correction term in the
Hamiltonian
Hamiltonian may refer to:
* Hamiltonian mechanics, a function that represents the total energy of a system
* Hamiltonian (quantum mechanics), an operator corresponding to the total energy of that system
** Dyall Hamiltonian, a modified Hamiltonian ...
, representing a semi-classical coupling of this wave function to an applied magnetic field, as so in
SI units
The International System of Units, internationally known by the abbreviation SI (from French ), is the modern form of the metric system and the world's most widely used system of measurement. It is the only system of measurement with official st ...
: (Note that bold faced characters imply
Euclidean vectors in 3
dimensions, whereas the
Minkowski four-vector
In special relativity, a four-vector (or 4-vector, sometimes Lorentz vector) is an object with four components, which transform in a specific way under Lorentz transformations. Specifically, a four-vector is an element of a four-dimensional vect ...
can be defined as .)
Here and
represent the components of the
electromagnetic four-potential
An electromagnetic four-potential is a relativistic vector function from which the electromagnetic field can be derived. It combines both an electric scalar potential and a magnetic vector potential into a single four-vector.Gravitation, J.A. W ...
in their standard SI units, and the three sigmas are the
Pauli matrices
In mathematical physics and mathematics, the Pauli matrices are a set of three complex matrices that are traceless, Hermitian, involutory and unitary. Usually indicated by the Greek letter sigma (), they are occasionally denoted by tau () ...
. On squaring out the first term, a residual interaction with the magnetic field is found, along with the usual
classical Hamiltonian of a charged particle interacting with an applied field in
SI units
The International System of Units, internationally known by the abbreviation SI (from French ), is the modern form of the metric system and the world's most widely used system of measurement. It is the only system of measurement with official st ...
:
This Hamiltonian is now a matrix, so the Schrödinger equation based on it must use a two-component wave function. On introducing the external electromagnetic 4-vector potential into the Dirac equation in a similar way, known as
minimal coupling, it takes the form:
A second application of the Dirac operator will now reproduce the Pauli term exactly as before, because the spatial Dirac matrices multiplied by , have the same squaring and commutation properties as the Pauli matrices. What is more, the value of the
gyromagnetic ratio
In physics, the gyromagnetic ratio (also sometimes known as the magnetogyric ratio in other disciplines) of a particle or system is the ratio of its magnetic moment to its angular momentum, and it is often denoted by the symbol , gamma. Its SI u ...
of the electron, standing in front of Pauli's new term, is explained from first principles. This was a major achievement of the Dirac equation and gave physicists great faith in its overall correctness. There is more however. The Pauli theory may be seen as the low energy limit of the Dirac theory in the following manner. First the equation is written in the form of coupled equations for 2-spinors with the SI units restored:
so
Assuming the field is weak and the motion of the electron non-relativistic, the total energy of the electron is approximately equal to its
rest energy
The invariant mass, rest mass, intrinsic mass, proper mass, or in the case of bound systems simply mass, is the portion of the total mass of an object or system of objects that is independent of the overall motion of the system. More precisely, ...
, and the momentum going over to the classical value,
and so the second equation may be written
which is of order
Thus, at typical energies and velocities, the bottom components of the Dirac spinor in the standard representation are much suppressed in comparison to the top components. Substituting this expression into the first equation gives after some rearrangement
The operator on the left represents the particle's total energy reduced by its rest energy, which is just its classical
kinetic energy
In physics, the kinetic energy of an object is the form of energy that it possesses due to its motion.
In classical mechanics, the kinetic energy of a non-rotating object of mass ''m'' traveling at a speed ''v'' is \fracmv^2.Resnick, Rober ...
, so one can recover Pauli's theory upon identifying his 2-spinor with the top components of the Dirac spinor in the non-relativistic approximation. A further approximation gives the Schrödinger equation as the limit of the Pauli theory. Thus, the Schrödinger equation may be seen as the far non-relativistic approximation of the Dirac equation when one may neglect spin and work only at low energies and velocities. This also was a great triumph for the new equation, as it traced the mysterious that appears in it, and the necessity of a complex wave function, back to the geometry of spacetime through the Dirac algebra. It also highlights why the Schrödinger equation, although ostensibly in the form of a
diffusion equation
The diffusion equation is a parabolic partial differential equation. In physics, it describes the macroscopic behavior of many micro-particles in Brownian motion, resulting from the random movements and collisions of the particles (see Fick's l ...
, actually represents wave propagation.
It should be strongly emphasized that the entire Dirac spinor represents an ''irreducible'' whole. The separation, done here, of the Dirac spinor into large and small components depends on the low-energy approximation being valid. The components that were neglected above, to show that the Pauli theory can be recovered by a low-velocity approximation of Dirac's equation, are necessary to produce new phenomena observed in the relativistic regime – among them
antimatter
In modern physics, antimatter is defined as matter composed of the antiparticles (or "partners") of the corresponding subatomic particle, particles in "ordinary" matter, and can be thought of as matter with reversed charge and parity, or go ...
, and the
creation and
annihilation
In particle physics, annihilation is the process that occurs when a subatomic particle collides with its respective antiparticle to produce other particles, such as an electron colliding with a positron to produce two photons. The total energy a ...
of particles.
Weyl theory
In the massless case
, the Dirac equation reduces to the
Weyl equation
In physics, particularly in quantum field theory, the Weyl equation is a relativistic wave equation for describing massless spin-1/2 particles called Weyl fermions. The equation is named after Hermann Weyl. The Weyl fermions are one of the three ...
, which describes relativistic massless spin-1/2 particles.
The theory acquires a second
symmetry: see below.
Physical interpretation
Identification of observables
The critical physical question in a quantum theory is this: what are the physically
observable
In physics, an observable is a physical property or physical quantity that can be measured. In classical mechanics, an observable is a real-valued "function" on the set of all possible system states, e.g., position and momentum. In quantum ...
quantities defined by the theory? According to the postulates of quantum mechanics, such quantities are defined by
self-adjoint operators
In mathematics, a self-adjoint operator on a complex vector space ''V'' with inner product \langle\cdot,\cdot\rangle is a linear map ''A'' (from ''V'' to itself) that is its own adjoint. That is, \langle Ax,y \rangle = \langle x,Ay \rangle for al ...
that act on the
Hilbert space
In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
of possible states of a system. The eigenvalues of these operators are then the possible results of
measuring
Measurement is the quantification of attributes of an object or event, which can be used to compare with other objects or events.
In other words, measurement is a process of determining how large or small a physical quantity is as compared to ...
the corresponding physical quantity. In the Schrödinger theory, the simplest such object is the overall Hamiltonian, which represents the total energy of the system. To maintain this interpretation on passing to the Dirac theory, the Hamiltonian must be taken to be
where, as always, there is an
implied summation over the twice-repeated index . This looks promising, because one can see by inspection the rest energy of the particle and, in the case of , the energy of a charge placed in an electric potential . What about the term involving the vector potential? In classical electrodynamics, the energy of a charge moving in an applied potential is
Thus, the Dirac Hamiltonian is fundamentally distinguished from its classical counterpart, and one must take great care to correctly identify what is observable in this theory. Much of the apparently paradoxical behavior implied by the Dirac equation amounts to a misidentification of these observables.
Hole theory
The negative solutions to the equation are problematic, for it was assumed that the particle has a positive energy. Mathematically speaking, however, there seems to be no reason for us to reject the negative-energy solutions. Since they exist, they cannot simply be ignored, for once the interaction between the electron and the electromagnetic field is included, any electron placed in a positive-energy eigenstate would decay into negative-energy eigenstates of successively lower energy. Real electrons obviously do not behave in this way, or they would disappear by emitting energy in the form of
photon
A photon () is an elementary particle that is a quantum of the electromagnetic field, including electromagnetic radiation such as light and radio waves, and the force carrier for the electromagnetic force. Photons are massless particles that can ...
s.
To cope with this problem, Dirac introduced the hypothesis, known as hole theory, that the
vacuum
A vacuum (: vacuums or vacua) is space devoid of matter. The word is derived from the Latin adjective (neuter ) meaning "vacant" or "void". An approximation to such vacuum is a region with a gaseous pressure much less than atmospheric pressur ...
is the many-body quantum state in which all the negative-energy electron eigenstates are occupied. This description of the vacuum as a "sea" of electrons is called the
Dirac sea. Since the
Pauli exclusion principle
In quantum mechanics, the Pauli exclusion principle (German: Pauli-Ausschlussprinzip) states that two or more identical particles with half-integer spins (i.e. fermions) cannot simultaneously occupy the same quantum state within a system that o ...
forbids electrons from occupying the same state, any additional electron would be forced to occupy a positive-energy eigenstate, and positive-energy electrons would be forbidden from decaying into negative-energy eigenstates.
Dirac further reasoned that if the negative-energy eigenstates are incompletely filled, each unoccupied eigenstate – called a hole – would behave like a positively charged particle. The hole possesses a ''positive'' energy because energy is required to create a particle–hole pair from the vacuum. As noted above, Dirac initially thought that the hole might be the proton, but
Hermann Weyl
Hermann Klaus Hugo Weyl (; ; 9 November 1885 – 8 December 1955) was a German mathematician, theoretical physicist, logician and philosopher. Although much of his working life was spent in Zürich, Switzerland, and then Princeton, New Jersey, ...
pointed out that the hole should behave as if it had the same mass as an electron, whereas the proton is over 1800 times heavier. The hole was eventually identified as the
positron
The positron or antielectron is the particle with an electric charge of +1''elementary charge, e'', a Spin (physics), spin of 1/2 (the same as the electron), and the same Electron rest mass, mass as an electron. It is the antiparticle (antimatt ...
, experimentally discovered by
Carl Anderson in 1932.
It is not entirely satisfactory to describe the "vacuum" using an infinite sea of negative-energy electrons. The infinitely negative contributions from the sea of negative-energy electrons have to be canceled by an infinite positive "bare" energy and the contribution to the charge density and current coming from the sea of negative-energy electrons is exactly canceled by an infinite positive "
jellium" background so that the net electric charge density of the vacuum is zero. In
quantum field theory
In theoretical physics, quantum field theory (QFT) is a theoretical framework that combines Field theory (physics), field theory and the principle of relativity with ideas behind quantum mechanics. QFT is used in particle physics to construct phy ...
, a
Bogoliubov transformation on the
creation and annihilation operators
Creation operators and annihilation operators are Operator (mathematics), mathematical operators that have widespread applications in quantum mechanics, notably in the study of quantum harmonic oscillators and many-particle systems. An annihilatio ...
(turning an occupied negative-energy electron state into an unoccupied positive energy positron state and an unoccupied negative-energy electron state into an occupied positive energy positron state) allows us to bypass the Dirac sea formalism even though, formally, it is equivalent to it.
In certain applications of
condensed matter physics
Condensed matter physics is the field of physics that deals with the macroscopic and microscopic physical properties of matter, especially the solid and liquid State of matter, phases, that arise from electromagnetic forces between atoms and elec ...
, however, the underlying concepts of "hole theory" are valid. The sea of
conduction electron
In solid-state physics, the valence band and conduction band are the bands closest to the Fermi level, and thus determine the electrical conductivity of the solid. In nonmetals, the valence band is the highest range of electron energies in ...
s in an
electrical conductor
In physics and electrical engineering, a conductor is an object or type of material that allows the flow of charge (electric current) in one or more directions. Materials made of metal are common electrical conductors. The flow of negatively c ...
, called a
Fermi sea
The Fermi energy is a concept in quantum mechanics usually referring to the energy difference between the highest and lowest occupied single-particle states in a quantum system of non-interacting fermions at absolute zero temperature.
In a Fermi ga ...
, contains electrons with energies up to the
chemical potential
In thermodynamics, the chemical potential of a Chemical specie, species is the energy that can be absorbed or released due to a change of the particle number of the given species, e.g. in a chemical reaction or phase transition. The chemical potent ...
of the system. An unfilled state in the Fermi sea behaves like a positively charged electron, and although it too is referred to as an "electron hole", it is distinct from a positron. The negative charge of the Fermi sea is balanced by the positively charged ionic lattice of the material.
In quantum field theory
In
quantum field theories such as
quantum electrodynamics
In particle physics, quantum electrodynamics (QED) is the Theory of relativity, relativistic quantum field theory of electrodynamics. In essence, it describes how light and matter interact and is the first theory where full agreement between quant ...
, the Dirac field is subject to a process of
second quantization
Second quantization, also referred to as occupation number representation, is a formalism used to describe and analyze quantum many-body systems. In quantum field theory, it is known as canonical quantization, in which the fields (typically as ...
, which resolves some of the paradoxical features of the equation.
Mathematical formulation
In its modern formulation for field theory, the Dirac equation is written in terms of a
Dirac spinor
In quantum field theory, the Dirac spinor is the spinor that describes all known fundamental particles that are fermions, with the possible exception of neutrinos. It appears in the plane-wave solution to the Dirac equation, and is a certain comb ...
field
taking values in a complex vector space described concretely as
, defined on flat spacetime (
Minkowski space
In physics, Minkowski space (or Minkowski spacetime) () is the main mathematical description of spacetime in the absence of gravitation. It combines inertial space and time manifolds into a four-dimensional model.
The model helps show how a ...
)
. Its expression also contains
gamma matrices
In mathematical physics, the gamma matrices, \ \left\\ , also called the Dirac matrices, are a set of conventional matrices with specific anticommutation relations that ensure they generate a matrix representation of the Clifford algebra \ \mathr ...
and a parameter
interpreted as the mass, as well as other physical constants. Dirac first obtained his equation through a factorization of Einstein's energy-momentum-mass equivalence relation assuming a scalar product of momentum vectors determined by the metric tensor and quantized the resulting relation by associating momenta to their respective operators.
In terms of a field
, the Dirac equation is then
and in
natural units
In physics, natural unit systems are measurement systems for which selected physical constants have been set to 1 through nondimensionalization of physical units. For example, the speed of light may be set to 1, and it may then be omitted, equa ...
, with
Feynman slash notation
In the study of Dirac fields in quantum field theory, Richard Feynman introduced the convenient Feynman slash notation (less commonly known as the Dirac slash notation). If ''A'' is a covariant vector (i.e., a 1-form),
: \ \stackrel\ \gamma^ ...
,
The gamma matrices are a set of four
complex matrices (elements of
) that satisfy the defining ''anti''-commutation relations:
where
is the Minkowski metric element, and the indices
run over 0,1,2 and 3. These matrices can be realized explicitly under a choice of representation. Two common choices are the Dirac representation and the chiral representation. The Dirac representation is
where
are the
Pauli matrices
In mathematical physics and mathematics, the Pauli matrices are a set of three complex matrices that are traceless, Hermitian, involutory and unitary. Usually indicated by the Greek letter sigma (), they are occasionally denoted by tau () ...
.
For the chiral representation the
are the same, but
The slash notation is a compact notation for
where
is a four-vector (often it is the four-vector differential operator
). The summation over the index
is implied.
Alternatively the four coupled linear first-order
partial differential equation
In mathematics, a partial differential equation (PDE) is an equation which involves a multivariable function and one or more of its partial derivatives.
The function is often thought of as an "unknown" that solves the equation, similar to ho ...
s for the four quantities that make up the wave function can be written as a vector. In
Planck units
In particle physics and physical cosmology, Planck units are a system of units of measurement defined exclusively in terms of four universal physical constants: ''Speed of light, c'', ''Gravitational constant, G'', ''Reduced Planck constant, ħ ...
this becomes:
which makes it clearer that it is a set of four partial differential equations with four unknown functions.
(Note that the term is not preceded by because is imaginary.)
Dirac adjoint and the adjoint equation
The Dirac adjoint of the spinor field
is defined as
Using the property of gamma matrices (which follows straightforwardly from Hermicity properties of the
) that
one can derive the adjoint Dirac equation by taking the Hermitian conjugate of the Dirac equation and multiplying on the right by
:
where the partial derivative
acts from the right on
: written in the usual way in terms of a left action of the derivative, we have
Klein–Gordon equation
Applying
to the Dirac equation gives
That is, each component of the Dirac spinor field satisfies the
Klein–Gordon equation
The Klein–Gordon equation (Klein–Fock–Gordon equation or sometimes Klein–Gordon–Fock equation) is a relativistic wave equation, related to the Schrödinger equation. It is named after Oskar Klein and Walter Gordon. It is second-order i ...
.
Conserved current
A
conserved current
In physics a conserved current is a current, j^\mu, that satisfies the continuity equation \partial_\mu j^\mu=0. The continuity equation represents a conservation law, hence the name.
Indeed, integrating the continuity equation over a volume V, ...
of the theory is
Another approach to derive this expression is by variational methods, applying
Noether's theorem
Noether's theorem states that every continuous symmetry of the action of a physical system with conservative forces has a corresponding conservation law. This is the first of two theorems (see Noether's second theorem) published by the mat ...
for the global
symmetry to derive the conserved current
Solutions
Since the Dirac operator acts on 4-tuples of
square-integrable functions, its solutions should be members of the same
Hilbert space
In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
. The fact that the energies of the solutions do not have a lower bound is unexpected.
Plane-wave solutions
Plane-wave solutions are those arising from an ansatz
which models a particle with definite 4-momentum
where
For this ansatz, the Dirac equation becomes an equation for
:
After picking a representation for the gamma matrices
, solving this is a matter of solving a system of linear equations. It is a representation-free property of gamma matrices that the solution space is two-dimensional (see
here
Here may refer to:
Music
* ''Here'' (Adrian Belew album), 1994
* ''Here'' (Alicia Keys album), 2016
* ''Here'' (Cal Tjader album), 1979
* ''Here'' (Edward Sharpe album), 2012
* ''Here'' (Idina Menzel album), 2004
* ''Here'' (Merzbow album), ...
).
For example, in the chiral representation for
, the solution space is parametrised by a
vector
, with
where
and
is the Hermitian matrix square-root.
These plane-wave solutions provide a starting point for canonical quantization.
Lagrangian formulation
Both the Dirac equation and the Adjoint Dirac equation can be obtained from (varying) the action with a specific Lagrangian density that is given by:
If one varies this with respect to
one gets the adjoint Dirac equation. Meanwhile, if one varies this with respect to
one gets the Dirac equation.
In natural units and with the slash notation, the action is then
For this action, the conserved current
above arises as the conserved current corresponding to the global
symmetry through
Noether's theorem
Noether's theorem states that every continuous symmetry of the action of a physical system with conservative forces has a corresponding conservation law. This is the first of two theorems (see Noether's second theorem) published by the mat ...
for field theory. Gauging this field theory by changing the symmetry to a local, spacetime point dependent one gives gauge symmetry (really, gauge redundancy). The resultant theory is
quantum electrodynamics
In particle physics, quantum electrodynamics (QED) is the Theory of relativity, relativistic quantum field theory of electrodynamics. In essence, it describes how light and matter interact and is the first theory where full agreement between quant ...
or QED. See below for a more detailed discussion.
Lorentz invariance
The Dirac equation is invariant under Lorentz transformations, that is, under the action of the Lorentz group
or strictly
, the component connected to the identity.
For a Dirac spinor viewed concretely as taking values in
, the transformation under a Lorentz transformation
is given by a
complex matrix