linear algebra Linear algebra is the branch of mathematics concerning linear equations such as :a_1x_1+\cdots +a_nx_n=b, linear maps such as :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matrix (mathemat ...

, a

square matrix In mathematics, a square matrix is a Matrix (mathematics), matrix with the same number of rows and columns. An ''n''-by-''n'' matrix is known as a square matrix of order Any two square matrices of the same order can be added and multiplied. Squ ...

A

is called diagonalizable or non-defective if it is similar to a

diagonal matrix In linear algebra, a diagonal matrix is a matrix in which the entries outside the main diagonal are all zero; the term usually refers to square matrices. Elements of the main diagonal can either be zero or nonzero. An example of a 2×2 diagon ...

. That is, if there exists an

invertible matrix In linear algebra, an invertible matrix (''non-singular'', ''non-degenarate'' or ''regular'') is a square matrix that has an inverse. In other words, if some other matrix is multiplied by the invertible matrix, the result can be multiplied by a ...

P

and a diagonal matrix

D

such that . This is equivalent to (Such

D

are not unique.) This property exists for any linear map: for a

finite-dimensional In mathematics, the dimension of a vector space ''V'' is the cardinality (i.e., the number of vectors) of a basis of ''V'' over its base field. p. 44, §2.36 It is sometimes called Hamel dimension (after Georg Hamel) or algebraic dimension to d ...

vector space In mathematics and physics, a vector space (also called a linear space) is a set (mathematics), set whose elements, often called vector (mathematics and physics), ''vectors'', can be added together and multiplied ("scaled") by numbers called sc ...

linear map In mathematics, and more specifically in linear algebra, a linear map (also called a linear mapping, linear transformation, vector space homomorphism, or in some contexts linear function) is a mapping V \to W between two vector spaces that p ...

T:V\to V

is called diagonalizable if there exists an

ordered basis Order, ORDER or Orders may refer to: * A socio-political or established or existing order, e.g. World order, Ancien Regime, Pax Britannica * Categorization, the process in which ideas and objects are recognized, differentiated, and understood * ...

V

consisting of

eigenvector In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by ...

s of

T

. These definitions are equivalent: if

T

has a

matrix Matrix (: matrices or matrixes) or MATRIX may refer to: Science and mathematics * Matrix (mathematics), a rectangular array of numbers, symbols or expressions * Matrix (logic), part of a formula in prenex normal form * Matrix (biology), the m ...

representation

A = PDP^

as above, then the column vectors of

P

form a basis consisting of eigenvectors of and the diagonal entries of

D

are the corresponding

eigenvalue In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...

s of with respect to this eigenvector basis,

T

is represented by Diagonalization is the process of finding the above

P

and and makes many subsequent computations easier. One can raise a diagonal matrix

D

to a power by simply raising the diagonal entries to that power. The

determinant In mathematics, the determinant is a Scalar (mathematics), scalar-valued function (mathematics), function of the entries of a square matrix. The determinant of a matrix is commonly denoted , , or . Its value characterizes some properties of the ...

of a diagonal matrix is simply the product of all diagonal entries. Such computations generalize easily to The geometric transformation represented by a diagonalizable matrix is an ''

inhomogeneous dilation In affine geometry, uniform scaling (or isotropic scaling) is a linear transformation that enlarges (increases) or shrinks (diminishes) objects by a '' scale factor'' that is the same in all directions ( isotropically). The result of uniform sc ...

'' (or ''anisotropic scaling''). That is, it can scale the space by a different amount in different directions. The direction of each eigenvector is scaled by a factor given by the corresponding eigenvalue. A square matrix that is not diagonalizable is called '' defective''. It can happen that a matrix

A

with

real Real may refer to: Currencies * Argentine real * Brazilian real (R$) * Central American Republic real * Mexican real * Portuguese real * Spanish real * Spanish colonial real Nature and science * Reality, the state of things as they exist, rathe ...

entries is defective over the real numbers, meaning that

A = PDP^

is impossible for any invertible

P

and diagonal

D

with real entries, but it is possible with

complex Complex commonly refers to: * Complexity, the behaviour of a system whose components interact in multiple ways so possible interactions are difficult to describe ** Complex system, a system composed of many components which may interact with each ...

entries, so that

A

is diagonalizable over the complex numbers. For example, this is the case for a generic

rotation matrix In linear algebra, a rotation matrix is a transformation matrix that is used to perform a rotation (mathematics), rotation in Euclidean space. For example, using the convention below, the matrix :R = \begin \cos \theta & -\sin \theta \\ \sin \t ...

. Many results for diagonalizable matrices hold only over an

algebraically closed field In mathematics, a field is algebraically closed if every non-constant polynomial in (the univariate polynomial ring with coefficients in ) has a root in . In other words, a field is algebraically closed if the fundamental theorem of algebra ...

(such as the complex numbers). In this case, diagonalizable matrices are

dense Density (volumetric mass density or specific mass) is the ratio of a substance's mass to its volume. The symbol most often used for density is ''ρ'' (the lower case Greek letter rho), although the Latin letter ''D'' (or ''d'') can also be use ...

in the space of all matrices, which means any defective matrix can be deformed into a diagonalizable matrix by a small

perturbation Perturbation or perturb may refer to: * Perturbation theory, mathematical methods that give approximate solutions to problems that cannot be solved exactly * Perturbation (geology), changes in the nature of alluvial deposits over time * Perturbati ...

; and the

Jordan–Chevalley decomposition In mathematics, specifically linear algebra, the Jordan–Chevalley decomposition, named after Camille Jordan and Claude Chevalley, expresses a linear operator in a unique way as the sum of two other linear operators which are simpler to understand ...

states that any matrix is uniquely the sum of a diagonalizable matrix and a

nilpotent matrix In linear algebra, a nilpotent matrix is a square matrix ''N'' such that :N^k = 0\, for some positive integer k. The smallest such k is called the index of N, sometimes the degree of N. More generally, a nilpotent transformation is a linear trans ...

. Over an algebraically closed field, diagonalizable matrices are equivalent to semi-simple matrices.

Definition

A square

n \times n

matrix

A

with entries in a

field Field may refer to: Expanses of open ground * Field (agriculture), an area of land used for agricultural purposes * Airfield, an aerodrome that lacks the infrastructure of an airport * Battlefield * Lawn, an area of mowed grass * Meadow, a grass ...

F

is called diagonalizable or nondefective if there exists an

n \times n

invertible matrix (i.e. an element of the

general linear group In mathematics, the general linear group of degree n is the set of n\times n invertible matrices, together with the operation of ordinary matrix multiplication. This forms a group, because the product of two invertible matrices is again inve ...

GL_''n''(''F'')),

P

, such that

P^AP

is a diagonal matrix.

Characterization

The fundamental fact about diagonalizable maps and matrices is expressed by the following: * An

n \times n

matrix

A

over a field

F

is diagonalizable

if and only if In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...

the sum of the

dimension In physics and mathematics, the dimension of a mathematical space (or object) is informally defined as the minimum number of coordinates needed to specify any point within it. Thus, a line has a dimension of one (1D) because only one coo ...

s of its eigenspaces is equal to

n

, which is the case if and only if there exists a

basis Basis is a term used in mathematics, finance, science, and other contexts to refer to foundational concepts, valuation measures, or organizational names; here, it may refer to: Finance and accounting * Adjusted basis, the net cost of an asse ...

F^n

consisting of eigenvectors of

A

. If such a basis has been found, one can form the matrix

P

having these

basis vectors In mathematics, a set of elements of a vector space is called a basis (: bases) if every element of can be written in a unique way as a finite linear combination of elements of . The coefficients of this linear combination are referred to as ...

as columns, and

P^AP

will be a diagonal matrix whose diagonal entries are the eigenvalues of

A

. The matrix

P

is known as a

modal matrix In linear algebra, the modal matrix is used in the diagonalization process involving eigenvalues and eigenvectors. Specifically the modal matrix M for the matrix A is the ''n'' × ''n'' matrix formed with the eigenvectors of A as columns in M. It ...

for

A

. * A linear map

T : V \to V

is diagonalizable if and only if the sum of the

s of its eigenspaces is equal to which is the case if and only if there exists a basis of

V

consisting of eigenvectors of

T

. With respect to such a basis,

T

will be represented by a diagonal matrix. The diagonal entries of this matrix are the eigenvalues of The following sufficient (but not necessary) condition is often useful. * An

n \times n

matrix

A

is diagonalizable over the field

F

if it has

n

distinct eigenvalues in i.e. if its

characteristic polynomial In linear algebra, the characteristic polynomial of a square matrix is a polynomial which is invariant under matrix similarity and has the eigenvalues as roots. It has the determinant and the trace of the matrix among its coefficients. The ...

has

n

distinct roots in however, the converse may be false. Consider

\begin 
-1 & 3 & -1 \\
-3 & 5 & -1 \\
-3 & 3 & 1 
\end,

which has eigenvalues 1, 2, 2 (not all distinct) and is diagonalizable with diagonal form ( similar to

\begin
1 & 0 & 0 \\
0 & 2 & 0 \\
0 & 0 & 2
\end

and change of basis matrix

P

\begin
1 & 1 & -1 \\
1 & 1 & 0 \\
1 & 0 & 3
\end.

The converse fails when

A

has an eigenspace of dimension higher than 1. In this example, the eigenspace of

A

associated with the eigenvalue 2 has dimension 2. * A linear map

T : V \to V

with

n = \dim(V)

is diagonalizable if it has

n

distinct eigenvalues, i.e. if its characteristic polynomial has

n

distinct roots in

F

. Let

A

be a matrix over If

A

is diagonalizable, then so is any power of it. Conversely, if

A

is invertible,

F

is algebraically closed, and

A^n

is diagonalizable for some

n

that is not an integer multiple of the characteristic of then

A

is diagonalizable. Proof: If

A^n

is diagonalizable, then

A

is annihilated by some polynomial which has no multiple root (since and is divided by the minimal polynomial of Over the complex numbers

\Complex

, almost every matrix is diagonalizable. More precisely: the set of complex

n \times n

matrices that are ''not'' diagonalizable over considered as a

subset In mathematics, a Set (mathematics), set ''A'' is a subset of a set ''B'' if all Element (mathematics), elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they a ...

of has

Lebesgue measure In measure theory, a branch of mathematics, the Lebesgue measure, named after French mathematician Henri Lebesgue, is the standard way of assigning a measure to subsets of higher dimensional Euclidean '-spaces. For lower dimensions or , it c ...

zero. One can also say that the diagonalizable matrices form a dense subset with respect to the

Zariski topology In algebraic geometry and commutative algebra, the Zariski topology is a topology defined on geometric objects called varieties. It is very different from topologies that are commonly used in real or complex analysis; in particular, it is not ...

: the non-diagonalizable matrices lie inside the vanishing set of the

discriminant In mathematics, the discriminant of a polynomial is a quantity that depends on the coefficients and allows deducing some properties of the zero of a function, roots without computing them. More precisely, it is a polynomial function of the coef ...

of the characteristic polynomial, which is a

hypersurface In geometry, a hypersurface is a generalization of the concepts of hyperplane, plane curve, and surface. A hypersurface is a manifold or an algebraic variety of dimension , which is embedded in an ambient space of dimension , generally a Euclidea ...

. From that follows also density in the usual (''strong'') topology given by a

norm Norm, the Norm or NORM may refer to: In academic disciplines * Normativity, phenomenon of designating things as good or bad * Norm (geology), an estimate of the idealised mineral content of a rock * Norm (philosophy), a standard in normative e ...

. The same is not true over The

expresses an operator as the sum of its semisimple (i.e., diagonalizable) part and its

nilpotent In mathematics, an element x of a ring (mathematics), ring R is called nilpotent if there exists some positive integer n, called the index (or sometimes the degree), such that x^n=0. The term, along with its sister Idempotent (ring theory), idem ...

part. Hence, a matrix is diagonalizable if and only if its nilpotent part is zero. Put in another way, a matrix is diagonalizable if each block in its

Jordan form \begin \lambda_1 1\hphantom\hphantom\\ \hphantom\lambda_1 1\hphantom\\ \hphantom\lambda_1\hphantom\\ \hphantom\lambda_2 1\hphantom\hphantom\\ \hphantom\hphantom\lambda_2\hphantom\\ \hphantom\lambda_3\hphantom\\ \hphantom\ddots\hphantom\\ ...

has no nilpotent part; i.e., each "block" is a one-by-one matrix.

Diagonalization

Consider the two following arbitrary bases

E = \

and

F = \

. Suppose that there exists a linear transformation represented by a matrix

A_E

which is written with respect to basis E. Suppose also that there exists the following eigen-equation:

A_E \boldsymbol_ = \lambda_i \boldsymbol_

The alpha eigenvectors are written also with respect to the E basis. Since the set F is both a set of eigenvectors for matrix A and it spans some arbitrary vector space, then we say that there exists a matrix

D_F

which is a diagonal matrix that is similar to

A_E

. In other words,

A_E

is a diagonalizable matrix if the matrix is written in the basis F. We perform the change of basis calculation using the transition matrix

S

, which changes basis from E to F as follows:

D_F = S_^F \ A_E  \ S_^

, where

S_^F

is the transition matrix from E-basis to F-basis. The inverse can then be equated to a new transition matrix

P

which changes basis from F to E instead and so we have the following relationship :

S_^ = P_^

Both

S

and

P

transition matrices are invertible. Thus we can manipulate the matrices in the following fashion:

\begin
    D = S \ A_ \ S^ \\
    D = P^ \ A_ \ P 
\end

The matrix

A_

will be denoted as

A

, which is still in the E-basis. Similarly, the diagonal matrix is in the F-basis. Diagonalization as rotation

If a matrix

A

can be diagonalized, that is, :

P^AP = \begin
  \lambda_1 &         0 &  \cdots &         0 \\
          0 & \lambda_2 &  \cdots &         0 \\
     \vdots &    \vdots & \ddots &    \vdots \\
          0 &         0 &  \cdots & \lambda_n
\end = D,

then: :

AP = P\begin
  \lambda_1 &         0 &  \cdots &         0 \\
          0 & \lambda_2 &  \cdots &         0 \\
     \vdots &    \vdots & \ddots &    \vdots \\
          0 &         0 &  \cdots & \lambda_n
\end.

The transition matrix S has the E-basis vectors as columns written in the basis F. Inversely, the inverse transition matrix P has F-basis vectors

\boldsymbol_i

written in the basis of E so that we can represent P in block matrix form in the following manner: :

P = \begin \boldsymbol_ & \boldsymbol_ & \cdots & \boldsymbol_ \end,

as a result we can write:

\begin
    A \begin \boldsymbol_ & \boldsymbol_ & \cdots & \boldsymbol_ \end =  \begin \boldsymbol_ & \boldsymbol_ & \cdots & \boldsymbol_ \endD.
\end

In block matrix form, we can consider the A-matrix to be a matrix of 1x1 dimensions whilst P is a 1xn dimensional matrix. The D-matrix can be written in full form with all the diagonal elements as an nxn dimensional matrix:

A \begin \boldsymbol_ & \boldsymbol_ & \cdots & \boldsymbol_ \end=  \begin \boldsymbol_ & \boldsymbol_ & \cdots & \boldsymbol_ \end
\begin
  \lambda_1 &         0 &  \cdots &         0 \\
          0 & \lambda_2 &  \cdots &         0 \\
     \vdots &    \vdots & \ddots &    \vdots \\
          0 &         0 &  \cdots & \lambda_n
\end.

Performing the above matrix multiplication we end up with the following result:

\begin
        A \begin \boldsymbol_1 & \boldsymbol_2 & \cdots & \boldsymbol_n \end =  \begin \lambda_1 \boldsymbol_1 & \lambda_2\boldsymbol_2 & \cdots & \lambda_n \boldsymbol_n  \end


\end

Taking each component of the block matrix individually on both sides, we end up with the following: :

A\boldsymbol_i = \lambda_i \boldsymbol_i \qquad (i=1,2,\dots,n).

So the column vectors of

P

are

right eigenvector In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a c ...

s of and the corresponding diagonal entry is the corresponding

. The invertibility of

P

also suggests that the eigenvectors are

linearly independent In the theory of vector spaces, a set of vectors is said to be if there exists no nontrivial linear combination of the vectors that equals the zero vector. If such a linear combination exists, then the vectors are said to be . These concep ...

and form a basis of This is the necessary and sufficient condition for diagonalizability and the canonical approach of diagonalization. The

row vector In linear algebra, a column vector with elements is an m \times 1 matrix consisting of a single column of entries, for example, \boldsymbol = \begin x_1 \\ x_2 \\ \vdots \\ x_m \end. Similarly, a row vector is a 1 \times n matrix for some , co ...

s of

P^

are the

left eigenvector In linear algebra, an eigenvector ( ) or characteristic vector is a Vector (mathematics and physics), vector that has its direction (geometry), direction unchanged (or reversed) by a given linear map, linear transformation. More precisely, an e ...

s of When a complex matrix

A\in\mathbb^

is a

Hermitian matrix In mathematics, a Hermitian matrix (or self-adjoint matrix) is a complex square matrix that is equal to its own conjugate transpose—that is, the element in the -th row and -th column is equal to the complex conjugate of the element in the ...

(or more generally a

normal matrix In mathematics, a complex square matrix is normal if it commutes with its conjugate transpose : :A \text \iff A^*A = AA^* . The concept of normal matrices can be extended to normal operators on infinite-dimensional normed spaces and to nor ...

), eigenvectors of

A

can be chosen to form an

orthonormal basis In mathematics, particularly linear algebra, an orthonormal basis for an inner product space V with finite Dimension (linear algebra), dimension is a Basis (linear algebra), basis for V whose vectors are orthonormal, that is, they are all unit vec ...

of and

P

can be chosen to be a

unitary matrix In linear algebra, an invertible complex square matrix is unitary if its matrix inverse equals its conjugate transpose , that is, if U^* U = UU^* = I, where is the identity matrix. In physics, especially in quantum mechanics, the conjugate ...

. If in addition,

A\in\mathbb^

is a real

symmetric matrix In linear algebra, a symmetric matrix is a square matrix that is equal to its transpose. Formally, Because equal matrices have equal dimensions, only square matrices can be symmetric. The entries of a symmetric matrix are symmetric with ...

, then its eigenvectors can be chosen to be an orthonormal basis of

\mathbb^n

and

P

can be chosen to be an

orthogonal matrix In linear algebra, an orthogonal matrix, or orthonormal matrix, is a real square matrix whose columns and rows are orthonormal vectors. One way to express this is Q^\mathrm Q = Q Q^\mathrm = I, where is the transpose of and is the identi ...

. For most practical work matrices are diagonalized numerically using computer software. Many algorithms exist to accomplish this.

Simultaneous diagonalization

A set of matrices is said to be ''simultaneously diagonalizable'' if there exists a single invertible matrix

P

such that

P^AP

is a diagonal matrix for every

A

in the set. The following theorem characterizes simultaneously diagonalizable matrices: A set of diagonalizable matrices commutes if and only if the set is simultaneously diagonalizable. The set of all

n \times n

diagonalizable matrices (over with

n > 1

is not simultaneously diagonalizable. For instance, the matrices :

\begin 1 & 0 \\ 0 & 0 \end \quad\text\quad \begin 1 & 1 \\ 0 & 0 \end

are diagonalizable but not simultaneously diagonalizable because they do not commute. A set consists of commuting

normal matrices Normal(s) or The Normal(s) may refer to: Film and television * ''Normal'' (2003 film), starring Jessica Lange and Tom Wilkinson * ''Normal'' (2007 film), starring Carrie-Anne Moss, Kevin Zegers, Callum Keith Rennie, and Andrew Airlie * ''Norma ...

if and only if it is simultaneously diagonalizable by a

; that is, there exists a unitary matrix

U

such that

U^ AU

is diagonal for every

A

in the set. In the language of

Lie theory In mathematics, the mathematician Sophus Lie ( ) initiated lines of study involving integration of differential equations, transformation groups, and contact (mathematics), contact of spheres that have come to be called Lie theory. For instance, ...

, a set of simultaneously diagonalizable matrices generates a

toral Lie algebra In mathematics, a toral subalgebra is a Lie subalgebra of a general linear Lie algebra all of whose elements are semisimple (or diagonalizable over an algebraically closed field). Equivalently, a Lie algebra is toral if it contains no nonzero nilp ...

Examples

Diagonalizable matrices

Involution Involution may refer to: Mathematics * Involution (mathematics), a function that is its own inverse * Involution algebra, a *-algebra: a type of algebraic structure * Involute, a construction in the differential geometry of curves * Exponentiati ...

s are diagonalizable over the reals (and indeed any field of characteristic not 2), with ±1 on the diagonal. * Finite order

endomorphism In mathematics, an endomorphism is a morphism from a mathematical object to itself. An endomorphism that is also an isomorphism is an automorphism. For example, an endomorphism of a vector space is a linear map , and an endomorphism of a g ...

s are diagonalizable over

\mathbb

(or any algebraically closed field where the characteristic of the field does not divide the order of the endomorphism) with

roots of unity In mathematics, a root of unity is any complex number that yields 1 when raised to some positive integer power . Roots of unity are used in many branches of mathematics, and are especially important in number theory, the theory of group char ...

on the diagonal. This follows since the minimal polynomial is separable, because the roots of unity are distinct. * Projections are diagonalizable, with 0s and 1s on the diagonal. * Real

symmetric matrices In linear algebra, a symmetric matrix is a square matrix that is equal to its transpose. Formally, Because equal matrices have equal dimensions, only square matrices can be symmetric. The entries of a symmetric matrix are symmetric with re ...

are diagonalizable by orthogonal matrices; i.e., given a real symmetric matrix

Q^AQ

is diagonal for some orthogonal matrix More generally, matrices are diagonalizable by

unitary matrices In linear algebra, an invertible complex square matrix is unitary if its matrix inverse equals its conjugate transpose , that is, if U^* U = UU^* = I, where is the identity matrix. In physics, especially in quantum mechanics, the conjugate ...

if and only if they are

normal Normal(s) or The Normal(s) may refer to: Film and television * ''Normal'' (2003 film), starring Jessica Lange and Tom Wilkinson * ''Normal'' (2007 film), starring Carrie-Anne Moss, Kevin Zegers, Callum Keith Rennie, and Andrew Airlie * ''Norma ...

. In the case of the real symmetric matrix, we see that so clearly

AA^ = A^A

holds. Examples of normal matrices are real symmetric (or skew-symmetric) matrices (e.g. covariance matrices) and

Hermitian matrices In mathematics, a Hermitian matrix (or self-adjoint matrix) is a complex square matrix that is equal to its own conjugate transpose—that is, the element in the -th row and -th column is equal to the complex conjugate of the element in the -th ...

(or skew-Hermitian matrices). See

spectral theorem In linear algebra and functional analysis, a spectral theorem is a result about when a linear operator or matrix can be diagonalized (that is, represented as a diagonal matrix in some basis). This is extremely useful because computations involvin ...

s for generalizations to infinite-dimensional vector spaces.

Matrices that are not diagonalizable

In general, a

is not diagonalizable over the reals, but all rotation matrices are diagonalizable over the complex field. Even if a matrix is not diagonalizable, it is always possible to "do the best one can", and find a matrix with the same properties consisting of eigenvalues on the leading diagonal, and either ones or zeroes on the superdiagonal – known as

Jordan normal form \begin \lambda_1 1\hphantom\hphantom\\ \hphantom\lambda_1 1\hphantom\\ \hphantom\lambda_1\hphantom\\ \hphantom\lambda_2 1\hphantom\hphantom\\ \hphantom\hphantom\lambda_2\hphantom\\ \hphantom\lambda_3\hphantom\\ \hphantom\ddots\hphantom\\ ...

. Some matrices are not diagonalizable over any field, most notably nonzero nilpotent matrices. This happens more generally if the algebraic and geometric multiplicities of an eigenvalue do not coincide. For instance, consider :

C = \begin 0 & 1 \\ 0 & 0 \end.

This matrix is not diagonalizable: there is no matrix

U

such that

U^CU

is a diagonal matrix. Indeed,

C

has one eigenvalue (namely zero) and this eigenvalue has algebraic multiplicity 2 and geometric multiplicity 1. Some real matrices are not diagonalizable over the reals. Consider for instance the matrix :

B = \left begin 0 & 1 \\ \!-1 & 0 \end\right

The matrix

B

does not have any real eigenvalues, so there is no real matrix

Q

such that

Q^BQ

is a diagonal matrix. However, we can diagonalize

B

if we allow complex numbers. Indeed, if we take :

Q = \begin 1 & i \\ i & 1 \end,

then

Q^BQ

is diagonal. It is easy to find that

B

is the rotation matrix which rotates counterclockwise by angle

\theta = -\frac

Note that the above examples show that the sum of diagonalizable matrices need not be diagonalizable.

How to diagonalize a matrix

Diagonalizing a matrix is the same process as finding its

eigenvalues and eigenvectors In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...

, in the case that the eigenvectors form a basis. For example, consider the matrix :

A=\left begin
0 & 1 & \!\!\!-2\\
0 & 1 & 0\\
1 & \!\!\!-1 & 3
\end\right

The roots of the

p(\lambda)=\det(\lambda I-A)

are the eigenvalues Solving the linear system

\left(1I-A\right) \mathbf = \mathbf

gives the eigenvectors

\mathbf_1 = (1,1,0)

and while

\left(2I-A\right)\mathbf = \mathbf

gives that is,

A \mathbf_i = \lambda_i \mathbf_i

for These vectors form a basis of so we can assemble them as the column vectors of a change-of-basis matrix

P

to get:

= \begin 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 2 \end = D .

We may see this equation in terms of transformations:

P

takes the standard basis to the eigenbasis, so we have:

P^ AP \mathbf_i =
P^ A \mathbf_i =
P^ (\lambda_i\mathbf_i) =
\lambda_i\mathbf_i,

so that

P^ AP

has the standard basis as its eigenvectors, which is the defining property of Note that there is no preferred order of the eigenvectors in changing the order of the

eigenvectors In linear algebra, an eigenvector ( ) or characteristic vector is a Vector (mathematics and physics), vector that has its direction (geometry), direction unchanged (or reversed) by a given linear map, linear transformation. More precisely, an e ...

P

just changes the order of the

eigenvalues In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...

in the diagonalized form of

Application to matrix functions

Diagonalization can be used to efficiently compute the powers of a matrix :

\begin 
  A^k &= \left(PDP^\right)^k = \left(PDP^\right) \left(PDP^\right) \cdots \left(PDP^\right) \\
      &= PD\left(P^P\right) D \left(P^P\right) \cdots \left(P^P\right) D P^ = PD^kP^,
\end

and the latter is easy to calculate since it only involves the powers of a diagonal matrix. For example, for the matrix

A

with eigenvalues

\lambda = 1,1,2

in the example above we compute: :

&= \begin 2 - 2^k & -1 + 2^k & 2 - 2^ \\ 0 & 1 & 0 \\ -1 + 2^k & 1 - 2^k & -1 + 2^ \end. \end

This approach can be generalized to

matrix exponential In mathematics, the matrix exponential is a matrix function on square matrix, square matrices analogous to the ordinary exponential function. It is used to solve systems of linear differential equations. In the theory of Lie groups, the matrix exp ...

and other

matrix function In mathematics, every analytic function can be used for defining a matrix function that maps square matrices with complex entries to square matrices of the same size. This is used for defining the exponential of a matrix, which is involved in th ...

s that can be defined as power series. For example, defining we have: :

&= \begin 2 e - e^2 & -e + e^2 & 2 e - 2 e^2 \\ 0 & e & 0 \\ -e + e^2 & e - e^2 & -e + 2 e^2 \end. \end

This is particularly useful in finding closed form expressions for terms of

linear recursive sequences In mathematics, the term ''linear'' is used in two distinct senses for two different properties: * linearity of a ''function'' (or '' mapping''); * linearity of a ''polynomial''. An example of a linear function is the function defined by f(x)=( ...

, such as the

Fibonacci numbers In mathematics, the Fibonacci sequence is a sequence in which each element is the sum of the two elements that precede it. Numbers that are part of the Fibonacci sequence are known as Fibonacci numbers, commonly denoted . Many writers begin the s ...

Particular application

For example, consider the following matrix: :

M = \begina & b - a\\ 0 & b\end.

Calculating the various powers of

M

reveals a surprising pattern: :

M^2 = \begina^2 & b^2-a^2 \\ 0 &b^2 \end,\quad
  M^3 = \begina^3 & b^3-a^3 \\ 0 &b^3 \end,\quad
  M^4 = \begina^4 & b^4-a^4 \\ 0 &b^4 \end,\quad
  \ldots

The above phenomenon can be explained by diagonalizing To accomplish this, we need a basis of

\R^2

consisting of eigenvectors of One such eigenvector basis is given by :

\mathbf = \begin 1 \\ 0 \end = \mathbf_1,\quad
  \mathbf = \begin 1 \\ 1 \end = \mathbf_1 + \mathbf_2,

where e_''i'' denotes the standard basis of R^''n''. The reverse change of basis is given by :

\mathbf_1 = \mathbf,\qquad \mathbf_2 = \mathbf - \mathbf.

Straightforward calculations show that :

M\mathbf = a\mathbf,\qquad M\mathbf = b\mathbf.

Thus, ''a'' and ''b'' are the eigenvalues corresponding to u and v, respectively. By linearity of matrix multiplication, we have that :

M^n \mathbf = a^n \mathbf,\qquad M^n \mathbf = b^n \mathbf.

Switching back to the standard basis, we have :

\begin
  M^n \mathbf_1 &= M^n \mathbf = a^n \mathbf_1, \\
  M^n \mathbf_2 &= M^n \left(\mathbf - \mathbf\right) = b^n \mathbf - a^n\mathbf = \left(b^n - a^n\right) \mathbf_1 + b^n\mathbf_2.
\end

The preceding relations, expressed in matrix form, are :

M^n = \begin a^n & b^n - a^n \\ 0 & b^n \end,

thereby explaining the above phenomenon.

Quantum mechanical application

quantum mechanical Quantum mechanics is the fundamental physical theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is the foundation of a ...

and

quantum chemical Quantum chemistry, also called molecular quantum mechanics, is a branch of physical chemistry focused on the application of quantum mechanics to chemical systems, particularly towards the quantum-mechanical calculation of electronic contributions ...

computations matrix diagonalization is one of the most frequently applied numerical processes. The basic reason is that the time-independent

Schrödinger equation The Schrödinger equation is a partial differential equation that governs the wave function of a non-relativistic quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after E ...

is an eigenvalue equation, albeit in most of the physical situations on an infinite dimensional

Hilbert space In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...

. A very common approximation is to truncate (or project) the Hilbert space to finite dimension, after which the Schrödinger equation can be formulated as an eigenvalue problem of a real symmetric, or complex Hermitian matrix. Formally this approximation is founded on the

variational principle A variational principle is a mathematical procedure that renders a physical problem solvable by the calculus of variations, which concerns finding functions that optimize the values of quantities that depend on those functions. For example, the pr ...

, valid for Hamiltonians that are bounded from below. First-order perturbation theory also leads to matrix eigenvalue problem for degenerate states.

Notes

References

{{Matrix classes Matrices (mathematics)

Definition

Characterization

Diagonalization

Simultaneous diagonalization

Examples

Diagonalizable matrices

Matrices that are not diagonalizable

How to diagonalize a matrix

Application to matrix functions

Particular application

Quantum mechanical application

See also

Notes

References