The Schur complement is a key tool in the fields of
linear algebra
Linear algebra is the branch of mathematics concerning linear equations such as
:a_1x_1+\cdots +a_nx_n=b,
linear maps such as
:(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n,
and their representations in vector spaces and through matrix (mathemat ...
, the theory of
matrices,
numerical analysis
Numerical analysis is the study of algorithms that use numerical approximation (as opposed to symbolic computation, symbolic manipulations) for the problems of mathematical analysis (as distinguished from discrete mathematics). It is the study of ...
, and
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
.
It is defined for a
block matrix. Suppose ''p'', ''q'' are
nonnegative integers such that ''p + q > 0'', and suppose ''A'', ''B'', ''C'', ''D'' are respectively ''p'' × ''p'', ''p'' × ''q'', ''q'' × ''p'', and ''q'' × ''q'' matrices of complex numbers. Let
so that ''M'' is a (''p'' + ''q'') × (''p'' + ''q'') matrix.
If ''D'' is invertible, then the Schur complement of the block ''D'' of the matrix ''M'' is the ''p'' × ''p'' matrix defined by
If ''A'' is invertible, the Schur complement of the block ''A'' of the matrix ''M'' is the ''q'' × ''q'' matrix defined by
In the case that ''A'' or ''D'' is
singular
Singular may refer to:
* Singular, the grammatical number that denotes a unit quantity, as opposed to the plural and other forms
* Singular or sounder, a group of boar, see List of animal names
* Singular (band), a Thai jazz pop duo
*'' Singula ...
, substituting a
generalized inverse
In mathematics, and in particular, algebra, a generalized inverse (or, g-inverse) of an element ''x'' is an element ''y'' that has some properties of an inverse element but not necessarily all of them. The purpose of constructing a generalized inv ...
for the inverses on ''M/A'' and ''M/D'' yields the generalized Schur complement.
The Schur complement is named after
Issai Schur
Issai Schur (10 January 1875 – 10 January 1941) was a Russian mathematician who worked in Germany for most of his life. He studied at the Humboldt University of Berlin, University of Berlin. He obtained his doctorate in 1901, became lecturer i ...
who used it to prove
Schur's lemma
In mathematics, Schur's lemma is an elementary but extremely useful statement in representation theory of groups and algebras. In the group case it says that if ''M'' and ''N'' are two finite-dimensional irreducible representations
of a gro ...
, although it had been used previously.
Emilie Virginia Haynsworth was the first to call it the ''Schur complement''. The Schur complement is sometimes referred to as the ''Feshbach map'' after a physicist
Herman Feshbach.
Background
The Schur complement arises when performing a block
Gaussian elimination
In mathematics, Gaussian elimination, also known as row reduction, is an algorithm for solving systems of linear equations. It consists of a sequence of row-wise operations performed on the corresponding matrix of coefficients. This method can a ...
on the matrix ''M''. In order to eliminate the elements below the block diagonal, one multiplies the matrix ''M'' by a ''block lower triangular'' matrix on the right as follows:
where ''I
p'' denotes a ''p''×''p''
identity matrix
In linear algebra, the identity matrix of size n is the n\times n square matrix with ones on the main diagonal and zeros elsewhere. It has unique properties, for example when the identity matrix represents a geometric transformation, the obje ...
. As a result, the Schur complement
appears in the upper-left ''p''×''p'' block.
Continuing the elimination process beyond this point (i.e., performing a block
Gauss–Jordan elimination),
leads to an
LDU decomposition of ''M'', which reads
Thus, the inverse of ''M'' may be expressed involving ''D''
−1 and the inverse of Schur's complement, assuming it exists, as
The above relationship comes from the elimination operations that involve ''D''
−1 and ''M/D''. An equivalent derivation can be done with the roles of ''A'' and ''D'' interchanged. By equating the expressions for ''M''
−1 obtained in these two different ways, one can establish the
matrix inversion lemma, which relates the two Schur complements of ''M'': ''M/D'' and ''M/A'' (see ''"Derivation from LDU decomposition"'' in ).
Properties
* If ''p'' and ''q'' are both 1 (i.e., ''A'', ''B'', ''C'' and ''D'' are all scalars), we get the familiar formula for the inverse of a 2-by-2 matrix:
:
: provided that ''AD'' − ''BC'' is non-zero.
* In general, if ''A'' is invertible, then
:
: whenever this inverse exists.
* (Schur's formula) When ''A'', respectively ''D'', is invertible, the determinant of ''M'' is also clearly seen to be given by
:
, respectively
:
,
: which generalizes the determinant formula for 2 × 2 matrices.
* (Guttman rank additivity formula) If ''D'' is invertible, then the
rank of ''M'' is given by
:
* (
Haynsworth inertia additivity formula) If ''A'' is invertible, then the ''inertia'' of the block matrix ''M'' is equal to the inertia of ''A'' plus the inertia of ''M''/''A''.
* (Quotient identity)
.
* The Schur complement of a
Laplacian matrix is also a Laplacian matrix.
Application to solving linear equations
The Schur complement arises naturally in solving a system of linear equations such as
.
Assuming that the submatrix
is invertible, we can eliminate
from the equations, as follows.
Substituting this expression into the second equation yields
:
We refer to this as the ''reduced equation'' obtained by eliminating
from the original equation. The matrix appearing in the reduced equation is called the Schur complement of the first block
in
:
:
.
Solving the reduced equation, we obtain
:
Substituting this into the first equation yields
:
We can express the above two equation as:
:
Therefore, a formulation for the inverse of a block matrix is:
:
In particular, we see that the Schur complement is the inverse of the
block entry of the inverse of
.
In practice, one needs
to be
well-conditioned in order for this algorithm to be numerically accurate.
This method is useful in electrical engineering to reduce the dimension of a network's equations. It is especially useful when element(s) of the output vector are zero. For example, when
or
is zero, we can eliminate the associated rows of the coefficient matrix without any changes to the rest of the output vector. If
is null then the above equation for
reduces to
, thus reducing the dimension of the coefficient matrix while leaving
unmodified. This is used to advantage in electrical engineering where it is referred to as node elimination or
Kron reduction.
Applications to probability theory and statistics
Suppose the random column vectors ''X'', ''Y'' live in R
''n'' and R
''m'' respectively, and the vector (''X'', ''Y'') in R
''n'' + ''m'' has a
multivariate normal distribution whose covariance is the symmetric positive-definite matrix
:
where
is the covariance matrix of ''X'',
is the covariance matrix of ''Y'' and
is the covariance matrix between ''X'' and ''Y''.
Then the
conditional covariance of ''X'' given ''Y'' is the Schur complement of ''C'' in
:
:
If we take the matrix
above to be, not a covariance of a random vector, but a ''sample'' covariance, then it may have a
Wishart distribution. In that case, the Schur complement of ''C'' in
also has a Wishart distribution.
Conditions for positive definiteness and semi-definiteness
Let ''X'' be a
symmetric matrix
In linear algebra, a symmetric matrix is a square matrix that is equal to its transpose. Formally,
Because equal matrices have equal dimensions, only square matrices can be symmetric.
The entries of a symmetric matrix are symmetric with ...
of real numbers given by
Then
* If ''A'' is invertible, then ''X'' is positive definite if and only if ''A'' and its complement ''X/A'' are both positive definite:
:
* If ''C'' is invertible, then ''X'' is positive definite if and only if ''C'' and its complement ''X/C'' are both positive definite:
:
* If ''A'' is positive definite, then ''X'' is positive semi-definite if and only if the complement ''X/A'' is positive semi-definite:
:
* If ''C'' is positive definite, then ''X'' is positive semi-definite if and only if the complement ''X/C'' is positive semi-definite:
:
The first and third statements can be derived
[Boyd, S. and Vandenberghe, L. (2004), "Convex Optimization", Cambridge University Press (Appendix A.5.5)] by considering the minimizer of the quantity
as a function of ''v'' (for fixed ''u'').
Furthermore, since
and similarly for positive semi-definite matrices, the second (respectively fourth) statement is immediate from the first (resp. third) statement.
There is also a sufficient and necessary condition for the positive semi-definiteness of ''X'' in terms of a generalized Schur complement.
Precisely,
*
and
*
where
denotes a
generalized inverse
In mathematics, and in particular, algebra, a generalized inverse (or, g-inverse) of an element ''x'' is an element ''y'' that has some properties of an inverse element but not necessarily all of them. The purpose of constructing a generalized inv ...
of
.
See also
*
Woodbury matrix identity
*
Quasi-Newton method
*
Haynsworth inertia additivity formula
*
Gaussian process
In probability theory and statistics, a Gaussian process is a stochastic process (a collection of random variables indexed by time or space), such that every finite collection of those random variables has a multivariate normal distribution. The di ...
*
Total least squares
*
Guyan reduction in computational mechanics
References
{{reflist
Linear algebra
Issai Schur