Density matrix
   HOME

TheInfoList



OR:

In
quantum mechanics Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
, a density matrix (or density operator) is a matrix used in calculating the
probabilities Probability is a branch of mathematics and statistics concerning Event (probability theory), events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probab ...
of the outcomes of measurements performed on
physical system A physical system is a collection of physical objects under study. The collection differs from a set: all the objects must coexist and have some physical relationship. In other words, it is a portion of the physical universe chosen for analys ...
s. It is a generalization of the state vectors or wavefunctions: while those can only represent pure states, density matrices can also represent mixed states. These arise in quantum mechanics in two different situations: # when the preparation of a system can randomly produce different pure states, and thus one must deal with the statistics of possible preparations, and # when one wants to describe a physical system that is entangled with another, without describing their combined state. This case is typical for a system interacting with some environment (e.g. decoherence). In this case, the density matrix of an entangled system differs from that of an ensemble of pure states that, combined, would give the same statistical results upon measurement. Density matrices are thus crucial tools in areas of quantum mechanics that deal with mixed states, such as quantum statistical mechanics, open quantum systems and quantum information.


Definition and motivation

The density matrix is a representation of a
linear operator In mathematics, and more specifically in linear algebra, a linear map (also called a linear mapping, linear transformation, vector space homomorphism, or in some contexts linear function) is a mapping V \to W between two vector spaces that pr ...
called the density operator. The density matrix is obtained from the density operator by a choice of an orthonormal basis in the underlying space. In practice, the terms ''density matrix'' and ''density operator'' are often used interchangeably. Pick a basis with states , 0\rangle, , 1\rangle in a two-dimensional
Hilbert space In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
, then the density operator is represented by the matrix (\rho_) = \left( \begin \rho_ & \rho_ \\ \rho_ & \rho_ \end \right) = \left( \begin p_ & \rho_ \\ \rho^*_ & p_ \end \right) where the diagonal elements are
real number In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...
s that sum to one (also called populations of the two states , 0\rangle, , 1\rangle). The off-diagonal elements are complex conjugates of each other (also called coherences); they are restricted in magnitude by the requirement that (\rho_) be a positive semi-definite operator, see below. A density operator is a positive semi-definite, self-adjoint operator of trace one acting on the
Hilbert space In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
of the system. This definition can be motivated by considering a situation where some pure states , \psi_j\rangle (which are not necessarily orthogonal) are prepared with probability p_j each. This is known as an ''ensemble'' of pure states. The probability of obtaining projective measurement result m when using projectors \Pi_m is given by p(m) = \sum_j p_j \left\langle \psi_j\ \Pi_m \left, \psi_j\right\rangle = \operatorname \left \psi_j\right\rangle \left\langle \psi_j\\right) \right which makes the density operator, defined as \rho = \sum_j p_j \left, \psi_j \right\rangle \left\langle \psi_j\, a convenient representation for the state of this ensemble. It is easy to check that this operator is positive semi-definite, self-adjoint, and has trace one. Conversely, it follows from the
spectral theorem In linear algebra and functional analysis, a spectral theorem is a result about when a linear operator or matrix can be diagonalized (that is, represented as a diagonal matrix in some basis). This is extremely useful because computations involvin ...
that every operator with these properties can be written as \sum_j p_j \left, \psi_j\right\rangle \left\langle \psi_j\ for some states \left, \psi_j\right\rangle and coefficients p_j that are non-negative and add up to one. However, this representation will not be unique, as shown by the Schrödinger–HJW theorem. Another motivation for the definition of density operators comes from considering local measurements on entangled states. Let , \Psi\rangle be a pure entangled state in the composite Hilbert space \mathcal_1\otimes\mathcal_2. The probability of obtaining measurement result m when measuring projectors \Pi_m on the Hilbert space \mathcal_1 alone is given by p(m) = \left\langle \Psi\ \left(\Pi_m \otimes I\right) \left, \Psi\right\rangle = \operatorname \left \Psi\right\rangle \left\langle \Psi\ \right) \right where \operatorname_2 denotes the partial trace over the Hilbert space \mathcal_2. This makes the operator \rho = \operatorname_2 \left, \Psi\right\rangle\left\langle \Psi\ a convenient tool to calculate the probabilities of these local measurements. It is known as the reduced density matrix of , \Psi\rangle on subsystem 1. It is easy to check that this operator has all the properties of a density operator. Conversely, the Schrödinger–HJW theorem implies that all density operators can be written as \operatorname_2 \left, \Psi\right\rangle \left\langle \Psi\ for some state \left, \Psi\right\rangle .


Pure and mixed states

A pure quantum state is a state that can not be written as a probabilistic mixture, or convex combination, of other quantum states. There are several equivalent characterizations of pure states in the language of density operators. A density operator represents a pure state if and only if: * it can be written as an outer product of a state vector , \psi\rangle with itself, that is, \rho = , \psi \rangle \langle \psi, . * it is a projection, in particular of rank one. * it is idempotent, that is \rho = \rho^2. * it has purity one, that is, \operatorname(\rho^2) = 1. It is important to emphasize the difference between a probabilistic mixture (i.e. an ensemble) of quantum states and the superposition of two states. If an ensemble is prepared to have half of its systems in state , \psi_1 \rangle and the other half in , \psi_2 \rangle, it can be described by the density matrix: : \rho = \frac12\begin 1 & 0 \\ 0 & 1\end, where , \psi_1 \rangle and , \psi_2 \rangle are assumed orthogonal and of dimension 2, for simplicity. On the other hand, a quantum superposition of these two states with equal probability amplitudes results in the pure state , \psi \rangle = (, \psi_1 \rangle + , \psi_2 \rangle)/\sqrt, with density matrix : , \psi\rangle\langle\psi, = \frac12\begin 1 & 1 \\ 1 & 1\end. Unlike the probabilistic mixture, this superposition can display quantum interference. Geometrically, the set of density operators is a convex set, and the pure states are the extremal points of that set. The simplest case is that of a two-dimensional Hilbert space, known as a qubit. An arbitrary mixed state for a qubit can be written as a
linear combination In mathematics, a linear combination or superposition is an Expression (mathematics), expression constructed from a Set (mathematics), set of terms by multiplying each term by a constant and adding the results (e.g. a linear combination of ''x'' a ...
of the Pauli matrices, which together with the identity matrix provide a basis for 2 \times 2 self-adjoint matrices: : \rho = \frac\left(I + r_x \sigma_x + r_y \sigma_y + r_z \sigma_z\right), where the real numbers (r_x, r_y, r_z) are the coordinates of a point within the unit ball and : \sigma_x = \begin 0&1\\ 1&0 \end, \quad \sigma_y = \begin 0&-i\\ i&0 \end, \quad \sigma_z = \begin 1&0\\ 0&-1 \end . Points with r_x^2 + r_y^2 + r_z^2 = 1 represent pure states, while mixed states are represented by points in the interior. This is known as the Bloch sphere picture of qubit state space.


Example: light polarization

An example of pure and mixed states is light polarization. An individual
photon A photon () is an elementary particle that is a quantum of the electromagnetic field, including electromagnetic radiation such as light and radio waves, and the force carrier for the electromagnetic force. Photons are massless particles that can ...
can be described as having right or left circular polarization, described by the orthogonal quantum states , \mathrm\rangle and , \mathrm\rangle or a superposition of the two: it can be in any state \alpha, \mathrm\rangle+\beta, \mathrm\rangle (with , \alpha, ^2+, \beta, ^2=1), corresponding to
linear In mathematics, the term ''linear'' is used in two distinct senses for two different properties: * linearity of a '' function'' (or '' mapping''); * linearity of a '' polynomial''. An example of a linear function is the function defined by f(x) ...
, circular, or elliptical polarization. Consider now a vertically polarized photon, described by the state , \mathrm\rangle = (, \mathrm\rangle+, \mathrm\rangle)/\sqrt. If we pass it through a circular polarizer that allows either only , \mathrm\rangle polarized light, or only , \mathrm\rangle polarized light, half of the photons are absorbed in both cases. This may make it ''seem'' like half of the photons are in state , \mathrm\rangle and the other half in state , \mathrm\rangle, but this is not correct: if we pass (, \mathrm\rangle+, \mathrm\rangle)/\sqrt through a
linear polarizer In mathematics, the term ''linear'' is used in two distinct senses for two different properties: * linearity of a '' function'' (or '' mapping''); * linearity of a '' polynomial''. An example of a linear function is the function defined by f(x) ...
there is no absorption whatsoever, but if we pass either state , \mathrm\rangle or , \mathrm\rangle half of the photons are absorbed. Unpolarized light (such as the light from an incandescent light bulb) cannot be described as ''any'' state of the form \alpha, \mathrm\rangle+\beta, \mathrm\rangle (linear, circular, or elliptical polarization). Unlike polarized light, it passes through a polarizer with 50% intensity loss whatever the orientation of the polarizer; and it cannot be made polarized by passing it through any wave plate. However, unpolarized light ''can'' be described as a statistical ensemble, e. g. as each photon having either , \mathrm\rangle polarization or , \mathrm\rangle polarization with probability 1/2. The same behavior would occur if each photon had either vertical polarization , \mathrm\rangle or horizontal polarization , \mathrm \rangle with probability 1/2. These two ensembles are completely indistinguishable experimentally, and therefore they are considered the same mixed state. For this example of unpolarized light, the density operator equals : \rho = \frac , \mathrm\rangle \langle \mathrm, + \frac, \mathrm\rangle \langle \mathrm, = \frac , \mathrm\rangle \langle \mathrm, + \frac, \mathrm\rangle \langle \mathrm, = \frac12\begin 1 & 0 \\ 0 & 1\end. There are also other ways to generate unpolarized light: one possibility is to introduce uncertainty in the preparation of the photon, for example, passing it through a birefringent crystal with a rough surface, so that slightly different parts of the light beam acquire different polarizations. Another possibility is using entangled states: a radioactive decay can emit two photons traveling in opposite directions, in the quantum state (, \mathrm,\mathrm\rangle+, \mathrm,\mathrm\rangle)/\sqrt. The joint state of the two photons ''together'' is pure, but the density matrix for each photon individually, found by taking the partial trace of the joint density matrix, is completely mixed.


Equivalent ensembles and purifications

A given density operator does not uniquely determine which ensemble of pure states gives rise to it; in general there are infinitely many different ensembles generating the same density matrix. Those cannot be distinguished by any measurement. The equivalent ensembles can be completely characterized: let \ be an ensemble. Then for any complex matrix U such that U^\dagger U = I (a partial isometry), the ensemble \ defined by : \sqrt \left, \varphi_i \right\rangle = \sum_j U_ \sqrt \left, \psi_j \right\rangle will give rise to the same density operator, and all equivalent ensembles are of this form. A closely related fact is that a given density operator has infinitely many different purifications, which are pure states that generate the density operator when a partial trace is taken. Let : \rho = \sum_j p_j , \psi_j \rangle \langle \psi_j, be the density operator generated by the ensemble \, with states , \psi_j\rangle not necessarily orthogonal. Then for all partial isometries U we have that : , \Psi\rangle = \sum_j \sqrt , \psi_j \rangle U , a_j\rangle is a purification of \rho, where , a_j\rangle is an orthogonal basis, and furthermore all purifications of \rho are of this form.


Measurement

Let A be an observable of the system, and suppose the ensemble is in a mixed state such that each of the pure states \textstyle , \psi_j\rangle occurs with probability p_j. Then the corresponding density operator equals : \rho = \sum_j p_j , \psi_j \rangle \langle \psi_j, . The expectation value of the
measurement Measurement is the quantification of attributes of an object or event, which can be used to compare with other objects or events. In other words, measurement is a process of determining how large or small a physical quantity is as compared to ...
can be calculated by extending from the case of pure states: : \langle A \rangle = \sum_j p_j \langle \psi_j, A, \psi_j \rangle = \sum_j p_j \operatorname\left(, \psi_j \rangle \langle \psi_j, A \right) = \operatorname\left(\sum_j p_j , \psi_j \rangle \langle \psi_j, A\right) = \operatorname(\rho A), where \operatorname denotes trace. Thus, the familiar expression \langle A\rangle=\langle\psi, A, \psi\rangle for pure states is replaced by : \langle A \rangle = \operatorname( \rho A) for mixed states. Moreover, if A has spectral resolution : A = \sum _i a_i P_i, where P_i is the projection operator into the eigenspace corresponding to eigenvalue a_i, the post-measurement density operator is given by : \rho_i' = \frac when outcome ''i'' is obtained. In the case where the measurement result is not known the ensemble is instead described by : \; \rho ' = \sum_i P_i \rho P_i. If one assumes that the probabilities of measurement outcomes are linear functions of the projectors P_i, then they must be given by the trace of the projector with a density operator. Gleason's theorem shows that in Hilbert spaces of dimension 3 or larger the assumption of linearity can be replaced with an assumption of non-contextuality. This restriction on the dimension can be removed by assuming non-contextuality for POVMs as well, but this has been criticized as physically unmotivated.


Entropy

The von Neumann entropy S of a mixture can be expressed in terms of the eigenvalues of \rho or in terms of the trace and
logarithm In mathematics, the logarithm of a number is the exponent by which another fixed value, the base, must be raised to produce that number. For example, the logarithm of to base is , because is to the rd power: . More generally, if , the ...
of the density operator \rho. Since \rho is a positive semi-definite operator, it has a spectral decomposition such that \rho = \textstyle\sum_i \lambda_i , \varphi_i\rangle \langle\varphi_i, , where , \varphi_i\rangle are orthonormal vectors, \lambda_i \ge 0, and \textstyle \sum \lambda_i = 1. Then the entropy of a quantum system with density matrix \rho is : S = -\sum_i \lambda_i \ln\lambda_i = -\operatorname(\rho \ln\rho). This definition implies that the von Neumann entropy of any pure state is zero. If \rho_i are states that have support on orthogonal subspaces, then the von Neumann entropy of a convex combination of these states, : \rho = \sum_i p_i \rho_i, is given by the von Neumann entropies of the states \rho_i and the Shannon entropy of the probability distribution p_i: : S(\rho) = H(p_i) + \sum_i p_i S(\rho_i). When the states \rho_i do not have orthogonal supports, the sum on the right-hand side is strictly greater than the von Neumann entropy of the convex combination \rho. Given a density operator \rho and a projective measurement as in the previous section, the state \rho' defined by the convex combination : \rho' = \sum_i P_i \rho P_i, which can be interpreted as the state produced by performing the measurement but not recording which outcome occurred, has a von Neumann entropy larger than that of \rho, except if \rho = \rho'. It is however possible for the \rho' produced by a ''generalized'' measurement, or POVM, to have a lower von Neumann entropy than \rho..


Von Neumann equation for time evolution

Just as the Schrödinger equation describes how pure states evolve in time, the von Neumann equation (also known as the Liouville–von Neumann equation) describes how a density operator evolves in time. The von Neumann equation dictates that : i \hbar \frac \rho = , \rho, where the brackets denote a
commutator In mathematics, the commutator gives an indication of the extent to which a certain binary operation fails to be commutative. There are different definitions used in group theory and ring theory. Group theory The commutator of two elements, ...
. This equation only holds when the density operator is taken to be in the Schrödinger picture, even though this equation seems at first look to emulate the Heisenberg equation of motion in the
Heisenberg picture In physics, the Heisenberg picture or Heisenberg representation is a Dynamical pictures, formulation (largely due to Werner Heisenberg in 1925) of quantum mechanics in which observables incorporate a dependency on time, but the quantum state, st ...
, with a crucial sign difference: : i \hbar \frac A_\text = - , A_\text, where A_\text(t) is some ''Heisenberg picture'' operator; but in this picture the density matrix is ''not time-dependent'', and the relative sign ensures that the time derivative of the expected value \langle A \rangle comes out ''the same as in the Schrödinger picture''. If the Hamiltonian is time-independent, the von Neumann equation can be easily solved to yield : \rho(t) = e^ \rho(0) e^. For a more general Hamiltonian, if G(t) is the wavefunction propagator over some interval, then the time evolution of the density matrix over that same interval is given by : \rho(t) = G(t) \rho(0) G(t)^\dagger. If one enters the interaction picture, choosing to focus on some component H_1 of the Hamiltonian H = H_0 + H_1, the equation for the evolution of the interaction-picture density operator \rho_(t) possesses identical structure to the von Neumann equation, except the Hamiltonian must also be transformed into the new picture: : where .


Wigner functions and classical analogies

The density matrix operator may also be realized in phase space. Under the Wigner map, the density matrix transforms into the equivalent Wigner function, : W(x,p) \,\ \stackrel\ \, \frac \int_^\infty \psi^*(x + y) \psi(x - y) e^ \,dy. The equation for the time evolution of the Wigner function, known as Moyal equation, is then the Wigner-transform of the above von Neumann equation, : \frac = -\, where H(x,p) is the Hamiltonian, and \ is the Moyal bracket, the transform of the quantum
commutator In mathematics, the commutator gives an indication of the extent to which a certain binary operation fails to be commutative. There are different definitions used in group theory and ring theory. Group theory The commutator of two elements, ...
. The evolution equation for the Wigner function is then analogous to that of its classical limit, the Liouville equation of
classical physics Classical physics refers to physics theories that are non-quantum or both non-quantum and non-relativistic, depending on the context. In historical discussions, ''classical physics'' refers to pre-1900 physics, while '' modern physics'' refers to ...
. In the limit of a vanishing Planck constant \hbar, W(x,p,t) reduces to the classical Liouville probability density function in phase space.


Example applications

Density matrices are a basic tool of quantum mechanics, and appear at least occasionally in almost any type of quantum-mechanical calculation. Some specific examples where density matrices are especially helpful and common are as follows: *
Statistical mechanics In physics, statistical mechanics is a mathematical framework that applies statistical methods and probability theory to large assemblies of microscopic entities. Sometimes called statistical physics or statistical thermodynamics, its applicati ...
uses density matrices, most prominently to express the idea that a system is prepared at a nonzero temperature. Constructing a density matrix using a canonical ensemble gives a result of the form \rho = \exp(-\beta H)/Z(\beta), where \beta is the inverse temperature (k_ T)^ and H is the system's Hamiltonian. The normalization condition that the trace of \rho be equal to 1 defines the partition function to be Z(\beta) = \mathrm \exp(-\beta H). If the number of particles involved in the system is itself not certain, then a grand canonical ensemble can be applied, where the states summed over to make the density matrix are drawn from a Fock space. * Quantum decoherence theory typically involves non-isolated quantum systems developing entanglement with other systems, including measurement apparatuses. Density matrices make it much easier to describe the process and calculate its consequences. Quantum decoherence explains why a system interacting with an environment transitions from being a pure state, exhibiting superpositions, to a mixed state, an incoherent combination of classical alternatives. This transition is fundamentally reversible, as the combined state of system and environment is still pure, but for all practical purposes irreversible, as the environment is a very large and complex quantum system, and it is not feasible to reverse their interaction. Decoherence is thus very important for explaining the classical limit of quantum mechanics, but cannot explain wave function collapse, as all classical alternatives are still present in the mixed state, and wave function collapse selects only one of them. * Similarly, in quantum computation, quantum information theory, open quantum systems, and other fields where state preparation is noisy and decoherence can occur, density matrices are frequently used. Noise is often modelled via a depolarizing channel or an amplitude damping channel. Quantum tomography is a process by which, given a set of data representing the results of quantum measurements, a density matrix consistent with those measurement results is computed. * When analyzing a system with many electrons, such as an
atom Atoms are the basic particles of the chemical elements. An atom consists of a atomic nucleus, nucleus of protons and generally neutrons, surrounded by an electromagnetically bound swarm of electrons. The chemical elements are distinguished fr ...
or
molecule A molecule is a group of two or more atoms that are held together by Force, attractive forces known as chemical bonds; depending on context, the term may or may not include ions that satisfy this criterion. In quantum physics, organic chemi ...
, an imperfect but useful first approximation is to treat the electrons as
uncorrelated In probability theory and statistics, two real-valued random variables, X, Y, are said to be uncorrelated if their covariance, \operatorname ,Y= \operatorname Y- \operatorname \operatorname /math>, is zero. If two variables are uncorrelated, ther ...
or each having an independent single-particle wavefunction. This is the usual starting point when building the Slater determinant in the Hartree–Fock method. If there are N electrons filling the N single-particle wavefunctions , \psi_i\rangle and if only single-particle observables are considered, then their expectation values for the N-electron system can be computed using the density matrix \sum_^N , \psi_i\rangle \langle \psi_i, (the ''one-particle density matrix'' of the N-electron system).


C*-algebraic formulation of states

It is now generally accepted that the description of quantum mechanics in which all self-adjoint operators represent observables is untenable. For this reason, observables are identified with elements of an abstract C*-algebra ''A'' (that is one without a distinguished representation as an algebra of operators) and states are positive linear functionals on ''A''. However, by using the GNS construction, we can recover Hilbert spaces that realize ''A'' as a subalgebra of operators. Geometrically, a pure state on a C*-algebra ''A'' is a state that is an extreme point of the set of all states on ''A''. By properties of the GNS construction these states correspond to irreducible representations of ''A''. The states of the C*-algebra of compact operators ''K''(''H'') correspond exactly to the density operators, and therefore the pure states of ''K''(''H'') are exactly the pure states in the sense of quantum mechanics. The C*-algebraic formulation can be seen to include both classical and quantum systems. When the system is classical, the algebra of observables become an abelian C*-algebra. In that case the states become probability measures.


History

The formalism of density operators and matrices was introduced in 1927 by
John von Neumann John von Neumann ( ; ; December 28, 1903 – February 8, 1957) was a Hungarian and American mathematician, physicist, computer scientist and engineer. Von Neumann had perhaps the widest coverage of any mathematician of his time, in ...
and independently, but less systematically, by Lev Landau and later in 1946 by
Felix Bloch Felix Bloch (; ; 23 October 1905 – 10 September 1983) was a Swiss-American physicist who shared the 1952 Nobel Prize in Physics with Edward Mills Purcell "for their development of new methods for nuclear magnetic precision measurements and di ...
. Von Neumann introduced the density matrix in order to develop both quantum statistical mechanics and a theory of quantum measurements. The name density matrix itself relates to its classical correspondence to a phase-space
probability measure In mathematics, a probability measure is a real-valued function defined on a set of events in a σ-algebra that satisfies Measure (mathematics), measure properties such as ''countable additivity''. The difference between a probability measure an ...
(probability distribution of position and momentum) in classical
statistical mechanics In physics, statistical mechanics is a mathematical framework that applies statistical methods and probability theory to large assemblies of microscopic entities. Sometimes called statistical physics or statistical thermodynamics, its applicati ...
, which was introduced by Eugene Wigner in 1932. In contrast, the motivation that inspired Landau was the impossibility of describing a subsystem of a composite quantum system by a state vector.


See also

* Atomic electron transition * Density functional theory * Green–Kubo relations * Green's function (many-body theory) * Lindblad equation * Wigner quasi-probability distribution


Notes and references

{{Quantum mechanics topics Functional analysis Quantum information science Statistical mechanics Lev Landau