In mathematics, the Bauer–Fike theorem is a standard result in the

perturbation theory In mathematics and applied mathematics, perturbation theory comprises methods for finding an approximate solution to a problem, by starting from the exact solution of a related, simpler problem. A critical feature of the technique is a middl ...

of the

eigenvalue In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denoted ...

of a complex-valued

diagonalizable matrix In linear algebra, a square matrix A is called diagonalizable or non-defective if it is similar to a diagonal matrix, i.e., if there exists an invertible matrix P and a diagonal matrix D such that or equivalently (Such D are not unique.) ...

. In its substance, it states an absolute upper bound for the deviation of one perturbed matrix eigenvalue from a properly chosen eigenvalue of the exact matrix. Informally speaking, what it says is that ''the sensitivity of the eigenvalues is estimated by the condition number of the matrix of eigenvectors''. The theorem was proved by

Friedrich L. Bauer Friedrich Ludwig "Fritz" Bauer (10 June 1924 – 26 March 2015) was a German pioneer of computer science and professor at the Technical University of Munich. Life Bauer earned his Abitur in 1942 and served in the Wehrmacht during World War ...

and C. T. Fike in 1960.

The setup

In what follows we assume that: * is a

; * is the non-singular

eigenvector matrix In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denoted ...

such that , where is a diagonal matrix. * If is invertible, its condition number in -norm is denoted by and defined by: ::

\kappa_p(X)=\, X\, _p \left \, X^ \right \, _p.

The Bauer–Fike Theorem

:Bauer–Fike Theorem. Let be an eigenvalue of . Then there exists such that: ::

, \lambda-\mu,  \leq \kappa_p (V) \,  \delta A \, _p

Proof. We can suppose , otherwise take and the result is trivially true since . Since is an eigenvalue of , we have and so :

\begin
0 &= \det(A+\delta A-\mu I) \\
  &= \det(V^)\det(A+\delta A-\mu I)\det(V) \\
  &= \det \left ( V^ (A+\delta A-\mu I) V \right ) \\
  &= \det \left ( V^AV + V^\delta AV- V^ \mu I V \right ) \\
  &= \det \left (\Lambda+V^\delta AV-\mu I \right ) \\
  &= \det(\Lambda-\mu I)\det \left ((\Lambda-\mu I)^V^\delta AV +I \right ) \\
\end

However our assumption, , implies that: and therefore we can write: :

\det \left ((\Lambda-\mu I)^V^\delta AV +I \right ) = 0.

This reveals to be an eigenvalue of :

(\Lambda-\mu I)^V^\delta AV.

Since all -norms are consistent matrix norms we have where is an eigenvalue of . In this instance this gives us: :

1 = , -1,  \leq \left \, (\Lambda-\mu I)^V^\delta AV \right \, _p \leq \left \, (\Lambda-\mu I)^ \right \, _p \left \, V^\right \, _p \, V\, _p\, \delta A\, _p = \left \, (\Lambda-\mu I)^ \right \, _p\ \kappa_p(V)\, \delta A\, _p

But is a diagonal matrix, the -norm of which is easily computed: :

\left \,  \left (\Lambda-\mu I \right )^ \right \, _p\ =\max_ \frac =\max_\frac\ = \frac

whence: :

\min_, \lambda-\mu, \leq\ \kappa_p(V)\, \delta A\, _p.

An Alternate Formulation

The theorem can also be reformulated to better suit numerical methods. In fact, dealing with real eigensystem problems, one often has an exact matrix , but knows only an approximate eigenvalue-eigenvector couple, and needs to bound the error. The following version comes in help. :Bauer–Fike Theorem (Alternate Formulation). Let be an approximate eigenvalue-eigenvector couple, and . Then there exists such that: ::

\left , \lambda-\lambda^a \right , \leq \kappa_p (V)\frac

Proof. We can suppose , otherwise take and the result is trivially true since . So exists, so we can write: :

\boldsymbol^a = \left (A-\lambda^a I \right )^ \boldsymbol= V \left (D-\lambda^a I \right )^V^\boldsymbol

since is diagonalizable; taking the -norm of both sides, we obtain: :

\left \,  \boldsymbol^a \right \, _p= \left \, V \left (D-\lambda^a I \right )^V^\boldsymbol \right \, _p \leq \, V\, _p \left \,  \left (D-\lambda^a I \right )^ \right \, _p \left \, V^ \right \, _p \, \boldsymbol\, _p =\kappa_p(V) \left \,  \left (D-\lambda^a I \right )^ \right \, _p \, \boldsymbol\, _p.

However :

\left (D- \lambda^a I \right )^

is a diagonal matrix and its -norm is easily computed: :

\left \, \left (D-\lambda^a I \right )^ \right \, _p = \max_\frac =\max_ \frac=\frac

whence: :

\min_ \left , \lambda-\lambda^a \right ,  \leq\kappa_p(V) \frac.

A Relative Bound

Both formulations of Bauer–Fike theorem yield an absolute bound. The following corollary is useful whenever a relative bound is needed: :Corollary. Suppose is invertible and that is an eigenvalue of . Then there exists such that: ::

\frac\leq\kappa_p (V) \left \, A^\delta A \right \, _p

Note. can be formally viewed as the ''relative variation of'' , just as is the relative variation of . Proof. Since is an eigenvalue of and , by multiplying by from left we have: :

-A^(A+\delta A)\boldsymbol=-\mu A^\boldsymbol.

If we set: :

A^a =\mu A^, \qquad (\delta A)^a = -A^\delta A

then we have: :

\left (A^a + (\delta A)^a - I \right )\boldsymbol = \boldsymbol

which means that is an eigenvalue of , with as an eigenvector. Now, the eigenvalues of are , while it has the same

as . Applying the Bauer–Fike theorem to with eigenvalue , gives us: :

\min_ \left, \frac-1\ = \min_\frac \leq \kappa_p (V) \left \, A^\delta A \right \, _p

The Case of Normal Matrices

If is

normal Normal(s) or The Normal(s) may refer to: Film and television * ''Normal'' (2003 film), starring Jessica Lange and Tom Wilkinson * ''Normal'' (2007 film), starring Carrie-Anne Moss, Kevin Zegers, Callum Keith Rennie, and Andrew Airlie * ''Norma ...

, is a

unitary matrix In linear algebra, a complex square matrix is unitary if its conjugate transpose is also its inverse, that is, if U^* U = UU^* = UU^ = I, where is the identity matrix. In physics, especially in quantum mechanics, the conjugate transpose is ...

, therefore: :

\, V\, _2= \left \, V^ \right \, _2 = 1,

so that . The Bauer–Fike theorem then becomes: :

\exists\lambda\in\Lambda(A): \quad  , \lambda-\mu, \leq\, \delta A\, _2

Or in alternate formulation: :

\exists\lambda\in\Lambda(A): \quad \left , \lambda-\lambda^a \right , \leq\frac

which obviously remains true if is a

Hermitian matrix In mathematics, a Hermitian matrix (or self-adjoint matrix) is a complex square matrix that is equal to its own conjugate transpose—that is, the element in the -th row and -th column is equal to the complex conjugate of the element in the -t ...

. In this case, however, a much stronger result holds, known as the Weyl's theorem on eigenvalues. In the hermitian case one can also restate the Bauer–Fike theorem in the form that the map that maps a matrix to its

spectrum A spectrum (plural ''spectra'' or ''spectrums'') is a condition that is not limited to a specific set of values but can vary, without gaps, across a continuum. The word was first used scientifically in optics to describe the rainbow of colors ...

is a non-expansive function with respect to the

Hausdorff distance In mathematics, the Hausdorff distance, or Hausdorff metric, also called Pompeiu–Hausdorff distance, measures how far two subsets of a metric space are from each other. It turns the set of non-empty compact subsets of a metric space into a me ...

on the set of compact subsets of .

References

* * {{DEFAULTSORT:Bauer-Fike theorem Spectral theory Theorems in analysis Articles containing proofs