In linear algebra, a QR decomposition, also known as a QR factorization or QU factorization, is a decomposition of a matrix ''A'' into a product ''A'' = ''QR'' of an orthonormal matrix ''Q'' and an upper triangular matrix ''R''. QR decomposition is often used to solve the linear least squares (LLS) problem and is the basis for a particular eigenvalue algorithm, the QR algorithm.
Cases and definitions
Square matrix
Any real square matrix ''A'' may be decomposed as
:<math>A = QR,</math>
where ''Q'' is an orthogonal matrix (its columns are orthogonal unit vectors, meaning <math>Q^\mathsf{T} = Q^{-1}</math>) and ''R'' is an upper triangular matrix (also called right triangular matrix). If ''A'' is invertible, then the factorization is unique if we require the diagonal elements of ''R'' to be positive.
If instead ''A'' is a complex square matrix, then there is a decomposition ''A'' = ''QR'' where ''Q'' is a unitary matrix (so the conjugate transpose satisfies <math>Q^* = Q^{-1}</math>).
If ''A'' has ''n'' linearly independent columns, then the first ''n'' columns of ''Q'' form an orthonormal basis for the column space of ''A''. More generally, the first ''k'' columns of ''Q'' form an orthonormal basis for the span of the first ''k'' columns of ''A'' for any 1 ≤ ''k'' ≤ ''n''. The fact that any column ''k'' of ''A'' only depends on the first ''k'' columns of ''Q'' corresponds to the triangular form of ''R''.
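The uniqueness statement can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper name `qr_positive_diagonal` and the test matrix are illustrative choices, not part of the article. It takes whatever factorization `numpy.linalg.qr` returns and flips signs so that the diagonal of ''R'' is positive.

```python
import numpy as np

def qr_positive_diagonal(A):
    """QR factorization normalized so that diag(R) > 0 (A assumed invertible)."""
    Q, R = np.linalg.qr(A)
    # Flip the sign of column j of Q and row j of R wherever r_jj < 0.
    # With D = diag(signs), (Q D)(D R) = Q R because D @ D = I.
    signs = np.sign(np.diag(R))
    signs[signs == 0] = 1.0
    return Q * signs, signs[:, None] * R

# Illustrative invertible matrix (an assumption made for this sketch).
A = np.array([[2.0, 1.0, 0.0],
              [1.0, 3.0, 1.0],
              [0.0, 1.0, 4.0]])
Q, R = qr_positive_diagonal(A)

print(np.allclose(Q @ R, A))            # A is reproduced
print(np.allclose(Q.T @ Q, np.eye(3)))  # Q is orthogonal
print(np.allclose(R, np.triu(R)))       # R is upper triangular
```

Because only signs are changed, any two QR routines applied to the same invertible matrix should agree after this normalization.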
Rectangular matrix
More generally, we can factor a complex ''m''×''n'' matrix ''A'', with ''m'' ≥ ''n'', as the product of an ''m''×''m'' unitary matrix ''Q'' and an ''m''×''n'' upper triangular matrix ''R''. As the bottom (''m''−''n'') rows of an ''m''×''n'' upper triangular matrix consist entirely of zeroes, it is often useful to partition ''R'', or both ''R'' and ''Q'':
:<math>A = QR = Q \begin{bmatrix} R_1 \\ 0 \end{bmatrix} = \begin{bmatrix} Q_1 & Q_2 \end{bmatrix} \begin{bmatrix} R_1 \\ 0 \end{bmatrix} = Q_1 R_1,</math>
where ''R''<sub>1</sub> is an ''n''×''n'' upper triangular matrix, 0 is an (''m'' − ''n'')×''n'' zero matrix, ''Q''<sub>1</sub> is ''m''×''n'', ''Q''<sub>2</sub> is ''m''×(''m'' − ''n''), and ''Q''<sub>1</sub> and ''Q''<sub>2</sub> both have orthonormal columns.
Golub and Van Loan call ''Q''<sub>1</sub>''R''<sub>1</sub> the ''thin QR factorization'' of ''A''; Trefethen and Bau call this the ''reduced QR factorization''. If ''A'' is of full rank ''n'' and we require that the diagonal elements of ''R''<sub>1</sub> are positive then ''R''<sub>1</sub> and ''Q''<sub>1</sub> are unique, but in general ''Q''<sub>2</sub> is not. ''R''<sub>1</sub> is then equal to the upper triangular factor of the Cholesky decomposition of ''A''<sup>*</sup>''A'' (= ''A''<sup>T</sup>''A'' if ''A'' is real).
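The partition into full and thin factors, and the link to the Cholesky factor, can be illustrated as follows. This is a sketch assuming NumPy; the random 6×3 test matrix and all variable names are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((6, 3))            # m = 6, n = 3, full column rank

Q, R = np.linalg.qr(A, mode="complete")    # full factors: Q is 6x6, R is 6x3
Q1, R1 = np.linalg.qr(A, mode="reduced")   # thin factors: Q1 is 6x3, R1 is 3x3
Q2 = Q[:, 3:]                              # the remaining m - n columns of Q

# Normalize signs so that diag(R1) > 0, which makes R1 unique.
s = np.sign(np.diag(R1))
R1_pos = s[:, None] * R1

# Cholesky factor of A^T A: numpy returns lower-triangular L with A^T A = L L^T.
L = np.linalg.cholesky(A.T @ A)

print(np.allclose(R[3:], 0.0))     # bottom m - n rows of R vanish
print(np.allclose(Q1 @ R1, A))     # thin factorization reproduces A
print(np.allclose(L.T, R1_pos))    # R1 equals the upper Cholesky factor
```

In practice one would not form ''A''<sup>T</sup>''A'' to obtain ''R''<sub>1</sub>, since doing so squares the condition number; the comparison here is only a consistency check.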
QL, RQ and LQ decompositions
Analogously, we can define QL, RQ, and LQ decompositions, with ''L'' being a ''lower'' triangular matrix.
Computing the QR decomposition
There are several methods for actually computing the QR decomposition, such as the Gram–Schmidt process, Householder transformations, or Givens rotations. Each has a number of advantages and disadvantages.
Using the Gram–Schmidt process
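Consider the Gram–Schmidt process applied to the columns of the full column rank matrix <math>A = \begin{bmatrix} \mathbf{a}_1 & \cdots & \mathbf{a}_n \end{bmatrix}</math>, with inner product <math>\langle \mathbf{v}, \mathbf{w} \rangle = \mathbf{v}^\mathsf{T} \mathbf{w}</math> (or <math>\langle \mathbf{v}, \mathbf{w} \rangle = \mathbf{v}^* \mathbf{w}</math> for the complex case).
Define the projection:
:<math>\operatorname{proj}_{\mathbf{u}} \mathbf{a} = \frac{\langle \mathbf{u}, \mathbf{a} \rangle}{\langle \mathbf{u}, \mathbf{u} \rangle} \mathbf{u}</math>
then:
:<math>\begin{align}
\mathbf{u}_1 &= \mathbf{a}_1, & \mathbf{e}_1 &= \frac{\mathbf{u}_1}{\|\mathbf{u}_1\|} \\
\mathbf{u}_2 &= \mathbf{a}_2 - \operatorname{proj}_{\mathbf{u}_1} \mathbf{a}_2, & \mathbf{e}_2 &= \frac{\mathbf{u}_2}{\|\mathbf{u}_2\|} \\
&\;\;\vdots & &\;\;\vdots \\
\mathbf{u}_k &= \mathbf{a}_k - \sum_{j=1}^{k-1} \operatorname{proj}_{\mathbf{u}_j} \mathbf{a}_k, & \mathbf{e}_k &= \frac{\mathbf{u}_k}{\|\mathbf{u}_k\|}
\end{align}</math>
We can now express the <math>\mathbf{a}_i</math>s over our newly computed orthonormal basis:
:<math>\begin{align}
\mathbf{a}_1 &= \langle \mathbf{e}_1, \mathbf{a}_1 \rangle \mathbf{e}_1 \\
\mathbf{a}_2 &= \langle \mathbf{e}_1, \mathbf{a}_2 \rangle \mathbf{e}_1 + \langle \mathbf{e}_2, \mathbf{a}_2 \rangle \mathbf{e}_2 \\
&\;\;\vdots \\
\mathbf{a}_k &= \sum_{j=1}^{k} \langle \mathbf{e}_j, \mathbf{a}_k \rangle \mathbf{e}_j
\end{align}</math>
where <math>\langle \mathbf{e}_i, \mathbf{a}_i \rangle = \|\mathbf{u}_i\|</math>. This can be written in matrix form:
:<math>A = QR</math>
where:
:<math>Q = \begin{bmatrix} \mathbf{e}_1 & \cdots & \mathbf{e}_n \end{bmatrix}</math>
and
:<math>R = \begin{bmatrix}
\langle \mathbf{e}_1, \mathbf{a}_1 \rangle & \langle \mathbf{e}_1, \mathbf{a}_2 \rangle & \langle \mathbf{e}_1, \mathbf{a}_3 \rangle & \cdots \\
0 & \langle \mathbf{e}_2, \mathbf{a}_2 \rangle & \langle \mathbf{e}_2, \mathbf{a}_3 \rangle & \cdots \\
0 & 0 & \langle \mathbf{e}_3, \mathbf{a}_3 \rangle & \cdots \\
\vdots & \vdots & \vdots & \ddots
\end{bmatrix}.</math>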
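The construction described above can be sketched in a few lines. This is a minimal illustration for the real case, assuming NumPy; the function name `gram_schmidt_qr` and the 3×3 test matrix are choices made for this sketch and are not taken from the text.

```python
import numpy as np

def gram_schmidt_qr(A):
    """Classical Gram-Schmidt QR of a real matrix with full column rank."""
    A = np.asarray(A, dtype=float)
    m, n = A.shape
    Q = np.zeros((m, n))
    R = np.zeros((n, n))
    for k in range(n):
        # r_jk = <e_j, a_k> for j < k: coefficients of a_k on earlier e_j.
        R[:k, k] = Q[:, :k].T @ A[:, k]
        # u_k = a_k minus its projections onto e_1, ..., e_{k-1}.
        u = A[:, k] - Q[:, :k] @ R[:k, k]
        R[k, k] = np.linalg.norm(u)   # <e_k, a_k> = ||u_k||
        Q[:, k] = u / R[k, k]         # e_k = u_k / ||u_k||
    return Q, R

# Illustrative matrix chosen for this sketch (an assumption).
A = np.array([[12.0, -51.0,   4.0],
              [ 6.0, 167.0, -68.0],
              [-4.0,  24.0, -41.0]])
Q, R = gram_schmidt_qr(A)
print(np.round(R, 6))
```

For this particular matrix the column norms work out to integers, so ''R'' has diagonal entries 14, 175 and 35, which makes the result easy to check by hand.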
Example
Consider the decomposition of
:
Recall that an orthonormal matrix <math>Q</math> has the property <math>Q^\mathsf{T} Q = I</math>.
Then, we can calculate <math>Q</math> by means of Gram–Schmidt as follows:
:
Thus, we have
:
Relation to RQ decomposition
The RQ decomposition transforms a matrix ''A'' into the product of an upper triangular matrix ''R'' (also known as right-triangular) and an orthogonal matrix ''Q''. The only difference from QR decomposition is the order of these matrices.
QR decomposition is Gram–Schmidt orthogonalization of columns of ''A'', started from the first column.
RQ decomposition is Gram–Schmidt orthogonalization of rows of ''A'', started from the last row.
Advantages and disadvantages
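The Gram–Schmidt process is inherently numerically unstable: in floating-point arithmetic the computed columns of ''Q'' can lose orthogonality, particularly when the columns of ''A'' are nearly linearly dependent. While the application of the projections has an appealing geometric analogy to orthogonalization, the orthogonalization itself is prone to numerical error. A significant advantage is the ease of implementation.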
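The loss of orthogonality can be observed directly. The sketch below, assuming NumPy, applies classical Gram–Schmidt to a 10×10 Hilbert matrix (an ill-conditioned test case chosen for this illustration) and compares the result with `numpy.linalg.qr`, which relies on LAPACK's Householder-based routines. The helper names are hypothetical.

```python
import numpy as np

def classical_gram_schmidt(A):
    """Return the Q factor computed by classical Gram-Schmidt."""
    m, n = A.shape
    Q = np.zeros((m, n))
    for k in range(n):
        # Subtract the projections of a_k onto the previously computed e_j.
        u = A[:, k] - Q[:, :k] @ (Q[:, :k].T @ A[:, k])
        Q[:, k] = u / np.linalg.norm(u)
    return Q

def hilbert(n):
    """n-by-n Hilbert matrix, a standard ill-conditioned test case."""
    i = np.arange(1, n + 1)
    return 1.0 / (i[:, None] + i[None, :] - 1.0)

A = hilbert(10)
Q_gs = classical_gram_schmidt(A)
Q_ref, _ = np.linalg.qr(A)   # stable reference factorization

# Departure from orthogonality, measured as ||Q^T Q - I||.
err_gs = np.linalg.norm(Q_gs.T @ Q_gs - np.eye(10))
err_ref = np.linalg.norm(Q_ref.T @ Q_ref - np.eye(10))
print(f"Gram-Schmidt: {err_gs:.2e}, reference: {err_ref:.2e}")
```

The exact figures depend on the platform, but the Gram–Schmidt error should be many orders of magnitude larger than the reference error.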
Using Householder reflections
A Householder reflection (or ''Householder transformation'') is a transformation that takes a vector and reflects it about some plane or hyperplane. We can use this operation to calculate the ''QR'' factorization of an ''m''-by-''n'' matrix ''A'' with ''m'' ≥ ''n''.
''Q'' can be used to reflect a vector in such a way that all coordinates but one disappear.
Let <math>\mathbf{x}</math> be an arbitrary real ''m''-dimensional column vector of ''A'' such that <math>\|\mathbf{x}\| = |\alpha|</math> for a scalar ''α''. If the algorithm is implemented using floating-point arithmetic, then ''α'' should get the opposite sign as the ''k''-th coordinate of <math>\mathbf{x}</math>, where <math>x_k</math> is to be the pivot coordinate after which all entries are 0 in the final upper triangular form of ''A'', to avoid loss of significance. In the complex case, set
:<math>\alpha = -e^{i \arg x_k} \|\mathbf{x}\|</math>
and substitute transposition by conjugate transposition in the construction of ''Q'' below.
Then, where <math>\mathbf{e}_1</math> is the vector <math>\begin{bmatrix} 1 & 0 & \cdots & 0 \end{bmatrix}^\mathsf{T}</math>, <math>\|\cdot\|</math> is the Euclidean norm and ''I'' is an ''m''×''m'' identity matrix, set
:<math>\begin{align}
\mathbf{u} &= \mathbf{x} - \alpha \mathbf{e}_1, \\
\mathbf{v} &= \frac{\mathbf{u}}{\|\mathbf{u}\|}, \\
Q &= I - 2 \mathbf{v} \mathbf{v}^\mathsf{T}.
\end{align}</math>
Or, if ''A'' is complex
:<math>Q = I - 2 \mathbf{v} \mathbf{v}^*.</math>
''Q'' is an ''m''-by-''m'' Householder matrix, which is both symmetric and orthogonal (Hermitian and unitary in the complex case), and
:<math>Q \mathbf{x} = \begin{bmatrix} \alpha \\ 0 \\ \vdots \\ 0 \end{bmatrix}.</math>
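This can be used to gradually transform an ''m''-by-''n'' matrix ''A'' to upper triangular form. First, we multiply ''A'' with the Householder matrix ''Q''<sub>1</sub> we obtain when we choose the first matrix column for <math>\mathbf{x}</math>. This results in a matrix ''Q''<sub>1</sub>''A'' with zeros in the left column (except for the first row).
:<math>Q_1 A = \begin{bmatrix}
\alpha_1 & \star & \cdots & \star \\
0 & & & \\
\vdots & & A' & \\
0 & & &
\end{bmatrix}</math>
This can be repeated for ''A''′ (obtained from ''Q''<sub>1</sub>''A'' by deleting the first row and first column), resulting in a Householder matrix ''Q''′<sub>2</sub>. Note that ''Q''′<sub>2</sub> is smaller than ''Q''<sub>1</sub>. Since we want it really to operate on ''Q''<sub>1</sub>''A'' instead of ''A''′ we need to expand it to the upper left, filling in a 1, or in general:
:<math>Q_k = \begin{bmatrix} I_{k-1} & 0 \\ 0 & Q_k' \end{bmatrix}.</math>
After ''t'' iterations of this process, where ''t'' = min(''m'' − 1, ''n''),
:<math>R = Q_t \cdots Q_2 Q_1 A</math>
is an upper triangular matrix. So, with
:<math>\begin{align}
Q^\mathsf{T} &= Q_t \cdots Q_2 Q_1, \\
Q &= Q_1^\mathsf{T} Q_2^\mathsf{T} \cdots Q_t^\mathsf{T},
\end{align}</math>
''A'' = ''QR'' is a QR decomposition of ''A''.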
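The iteration described above can be sketched for the real case as follows, assuming NumPy. The function name `householder_qr` and the test matrix are illustrative assumptions. Rather than forming each embedded matrix explicitly, the sketch applies each reflection only to the trailing rows of the working matrix, which is mathematically equivalent.

```python
import numpy as np

def householder_qr(A):
    """Householder QR of a real m-by-n matrix (m >= n): full Q and R."""
    R = np.array(A, dtype=float)
    m, n = R.shape
    Q = np.eye(m)
    for k in range(min(m - 1, n)):
        x = R[k:, k]
        norm_x = np.linalg.norm(x)
        if norm_x == 0.0:
            continue  # nothing to eliminate in this column
        # alpha takes the opposite sign of the pivot to avoid cancellation.
        alpha = -norm_x if x[0] >= 0 else norm_x
        u = x.copy()
        u[0] -= alpha
        v = u / np.linalg.norm(u)
        # Apply the reflection I - 2 v v^T to the trailing rows of R ...
        R[k:, :] -= 2.0 * np.outer(v, v @ R[k:, :])
        # ... and accumulate Q = Q_1 Q_2 ... Q_t on the right.
        Q[:, k:] -= 2.0 * np.outer(Q[:, k:] @ v, v)
    return Q, np.triu(R)  # triu discards rounding noise below the diagonal

# Illustrative matrix chosen for this sketch (an assumption).
A = np.array([[12.0, -51.0,   4.0],
              [ 6.0, 167.0, -68.0],
              [-4.0,  24.0, -41.0]])
Q, R = householder_qr(A)
print(np.allclose(Q @ R, A), np.allclose(Q.T @ Q, np.eye(3)))
```

Because of the sign rule for ''α'', the diagonal of the computed ''R'' may contain negative entries; the factorization is still valid and differs from a positive-diagonal one only by signs.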
This method has greater numerical stability than the Gram–Schmidt method above.
In numerical tests the computed factors ''Q'' and ''R'' satisfy <math>QR \approx A</math> at machine precision. Also, orthogonality is preserved: <math>Q^\mathsf{T} Q \approx I</math>. However, the accuracy of ''Q'' and ''R'' decreases with condition number:
For a well-conditioned example (, ):
In an ill-conditioned test (, ):
The following table gives the number of operations in the ''k''-th step of the QR-decomposition by the Householder transformation, assuming a square matrix with size ''n''.
Summing these numbers over the ''n'' − 1 steps (for a square matrix of size ''n''), the complexity of the algorithm (in terms of floating point multiplications) is given by
:<math>\frac{2}{3} n^3 + n^2 + \frac{1}{3} n - 2 = O\left(n^3\right).</math>
Example
Let us calculate the decomposition of
:
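First, we need to find a reflection that transforms the first column of matrix ''A'', vector <math>\mathbf{a}_1</math>, into <math>\|\mathbf{a}_1\| \mathbf{e}_1 = \begin{bmatrix} \alpha & 0 & 0 \end{bmatrix}^\mathsf{T}</math>.
Now,
:<math>\mathbf{u} = \mathbf{x} - \alpha \mathbf{e}_1,</math>
and
:<math>\mathbf{v} = \frac{\mathbf{u}}{\|\mathbf{u}\|}.</math>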
Here,
: and
Therefore
: and and then
:
Now observe:
:
so we already have almost a triangular matrix. We only need to zero the (3, 2) entry.
Take the (1, 1) minor, and then apply the process again to
:
By the same method as above, we obtain the matrix of the Householder transformation
:
after performing a direct sum with 1 to make sure the next step in the process works properly.
Now, we find
:
Or, to four decimal digits,
:
The matrix ''Q'' is orthogonal and ''R'' is upper triangular, so ''A'' = ''QR'' is the required QR decomposition.
Advantages and disadvantages
The use of Householder transformations is the simplest of the numerically stable QR decomposition algorithms, because reflections are the mechanism for producing zeroes in the ''R'' matrix. However, the Householder reflection algorithm is bandwidth heavy and difficult to parallelize, as every reflection that produces a new zero element changes the entirety of both ''Q'' and ''R'' matrices.
Parallel implementation of Householder QR
The Householder QR method can be implemented in parallel with algorithms such as the TSQR algorithm (which stands for ''Tall Skinny QR''). This algorithm can be applied in the case when the matrix ''A'' has ''m'' ≫ ''n''. It uses a binary reduction tree to compute a local Householder QR decomposition at each node in the forward pass, and reconstitutes the ''Q'' matrix in the backward pass. The binary tree structure aims at decreasing the amount of communication between processors to increase performance.
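The forward pass of such a reduction can be sketched serially as below, assuming NumPy. The function name `tsqr_r` and the parameter `n_blocks` are hypothetical, the local factorizations use `numpy.linalg.qr` as a stand-in for a local Householder QR, and the backward pass that reconstitutes ''Q'' is deliberately omitted. In a real parallel setting each block would live on a different processor.

```python
import numpy as np

def tsqr_r(A, n_blocks=4):
    """R factor of a tall-skinny matrix via a binary reduction tree (sketch)."""
    # Leaves: local QR of each row block; only the small R factors are kept.
    rs = [np.linalg.qr(block, mode="r")
          for block in np.array_split(A, n_blocks, axis=0)]
    # Reduction: stack neighbouring R factors pairwise and factor again
    # until a single R remains at the root of the tree.
    while len(rs) > 1:
        paired = []
        for i in range(0, len(rs), 2):
            stacked = np.vstack(rs[i:i + 2])
            paired.append(np.linalg.qr(stacked, mode="r"))
        rs = paired
    return rs[0]

rng = np.random.default_rng(0)
A = rng.standard_normal((1000, 5))        # m >> n
R_tree = tsqr_r(A)
R_ref = np.linalg.qr(A, mode="r")

# The two R factors agree up to the sign of each row.
print(np.allclose(np.abs(R_tree), np.abs(R_ref)))
```

Only ''n''×''n'' triangular factors travel up the tree, which is why the communication volume is small compared with exchanging whole row blocks.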
Using Givens rotations
QR decompositions can also be computed with a series of Givens rotations. Each rotation zeroes an element in the subdiagonal of the matrix, forming the ''R'' matrix. The concatenation of all the Givens rotations forms the orthogonal ''Q'' matrix.
In practice, Givens rotations are not actually performed by building a whole matrix and doing a matrix multiplication. A Givens rotation procedure is used instead which does the equivalent of the sparse Givens matrix multiplication, without the extra work of handling the sparse elements. The Givens rotation procedure is useful in situations where only relatively few off-diagonal elements need to be zeroed, and is more easily parallelized than Householder transformations.
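A rotation procedure of this kind can be sketched as follows, assuming NumPy. The function name `givens_qr`, the elimination order (bottom-up within each column, rotating adjacent rows) and the test matrix are choices made for this illustration. Each rotation touches only two rows of ''R'' and two columns of ''Q''.

```python
import numpy as np

def givens_qr(A):
    """QR by Givens rotations; returns full Q (m x m) and R (m x n)."""
    R = np.array(A, dtype=float)
    m, n = R.shape
    Q = np.eye(m)
    for j in range(n):
        # Zero column j from the bottom up, rotating rows i-1 and i.
        for i in range(m - 1, j, -1):
            a, b = R[i - 1, j], R[i, j]
            if b == 0.0:
                continue  # entry is already zero
            r = np.hypot(a, b)
            c, s = a / r, b / r
            G = np.array([[c, s], [-s, c]])   # maps (a, b) to (r, 0)
            # Only two rows of R and two columns of Q change per rotation.
            R[i - 1:i + 1, :] = G @ R[i - 1:i + 1, :]
            Q[:, i - 1:i + 1] = Q[:, i - 1:i + 1] @ G.T
    return Q, np.triu(R)  # triu discards rounding noise below the diagonal

# Illustrative matrix chosen for this sketch (an assumption).
A = np.array([[12.0, -51.0,   4.0],
              [ 6.0, 167.0, -68.0],
              [-4.0,  24.0, -41.0]])
Q, R = givens_qr(A)
print(np.allclose(Q @ R, A), np.allclose(Q.T @ Q, np.eye(3)))
```

Since the update `Q @ G.T` is paired with `G @ R`, the product ''QR'' is unchanged by every rotation, so it equals ''A'' throughout.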
Example
Let us calculate the decomposition of
:
First, we need to form a rotation matrix that will zero the lowermost left element, <math>a_{31}</math>. We form this matrix using the Givens rotation method, and call the matrix <math>G_1</math>. We will first rotate the vector <math>\begin{bmatrix} a_{11} & a_{31} \end{bmatrix}</math> to point along the ''X'' axis. This vector has an angle <math>\theta = \arctan\left(\frac{-a_{31}}{a_{11}}\right)</math>. We create the orthogonal Givens rotation matrix, <math>G_1</math>:
:<math>G_1 = \begin{bmatrix}
\cos\theta & 0 & -\sin\theta \\
0 & 1 & 0 \\
\sin\theta & 0 & \cos\theta
\end{bmatrix}</math>
And the result of <math>G_1 A</math> now has a zero in the <math>a_{31}</math> element.
We can similarly form Givens matrices <math>G_2</math> and <math>G_3</math>, which will zero the sub-diagonal elements <math>a_{21}</math> and <math>a_{32}</math>, forming a triangular matrix <math>R</math>. The orthogonal matrix <math>Q^\mathsf{T}</math> is formed from the product of all the Givens matrices, <math>Q^\mathsf{T} = G_3 G_2 G_1</math>. Thus, we have <math>G_3 G_2 G_1 A = Q^\mathsf{T} A = R</math>, and the ''QR'' decomposition is <math>A = QR</math>.
Advantages and disadvantages
The QR decomposition via Givens rotations is the most involved to implement, as the ordering of the rows required to fully exploit the algorithm is not trivial to determine. However, it has a significant advantage in that each new zero element affects only the row with the element to be zeroed (''i'') and a row above (''j''). This makes the Givens rotation algorithm more bandwidth efficient and parallelizable than the Householder reflection technique.
Connection to a determinant or a product of eigenvalues
We can use QR decomposition to find the determinant of a square matrix. Suppose a matrix is decomposed as <math>A = QR</math>. Then we have
:<math>\det A = \det Q \det R.</math>
<math>Q</math> can be chosen such that <math>\det Q = 1</math>. Thus,
:<math>\det A = \det R = \prod_i r_{ii},</math>
where the <math>r_{ii}</math> are the entries on the diagonal of <math>R</math>. Furthermore, because the determinant equals the product of the eigenvalues, we have
:<math>\prod_i r_{ii} = \prod_i \lambda_i,</math>
where the <math>\lambda_i</math> are eigenvalues of <math>A</math>.
We can extend the above properties to a non-square complex matrix <math>A</math> by introducing the definition of QR decomposition for non-square complex matrices and replacing eigenvalues with singular values.
Start with a QR decomposition for a non-square matrix ''A'':
:<math>A = Q \begin{bmatrix} R \\ 0 \end{bmatrix}, \qquad Q^* Q = I,</math>
where <math>0</math> denotes the zero matrix and <math>Q</math> is a unitary matrix.
From the properties of the singular value decomposition (SVD) and the determinant of a matrix, we have
:<math>\Big|\prod_i r_{ii}\Big| = \prod_i \sigma_i,</math>
where the <math>\sigma_i</math> are the singular values of <math>A</math>.
Note that the singular values of <math>A</math> and <math>R</math> are identical, although their complex eigenvalues may be different. However, if ''A'' is square, then
:<math>\prod_i \sigma_i = \Big|\prod_i \lambda_i\Big|.</math>
It follows that the QR decomposition can be used to efficiently calculate the product of the eigenvalues or singular values of a matrix.
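These identities are easy to check numerically. The following sketch assumes NumPy; both test matrices are illustrative assumptions. It compares the product of the diagonal of ''R'' with the determinant and with the product of the singular values, for a square and for a tall matrix.

```python
import numpy as np

# Square case: |det A| equals the product of |r_ii|.
A = np.array([[12.0, -51.0,   4.0],
              [ 6.0, 167.0, -68.0],
              [-4.0,  24.0, -41.0]])
R = np.linalg.qr(A, mode="r")
abs_det_from_r = np.abs(np.prod(np.diag(R)))
sigma = np.linalg.svd(A, compute_uv=False)

print(np.isclose(abs_det_from_r, np.abs(np.linalg.det(A))))
print(np.isclose(abs_det_from_r, np.prod(sigma)))

# Non-square case: the product of |r_ii| equals the product of singular values.
rng = np.random.default_rng(0)
B = rng.standard_normal((6, 3))
R_b = np.linalg.qr(B, mode="r")
prod_r_b = np.abs(np.prod(np.diag(R_b)))
prod_sigma_b = np.prod(np.linalg.svd(B, compute_uv=False))
print(np.isclose(prod_r_b, prod_sigma_b))
```

The absolute value is taken because a library routine is free to return a ''Q'' with determinant −1, in which case the sign of the product of the diagonal entries differs from that of the determinant.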
Column pivoting
Pivoted QR differs from ordinary Gram–Schmidt in that it takes the largest remaining column at the beginning of each new step (column pivoting) and thus introduces a permutation matrix ''P'':
:<math>AP = QR \quad \iff \quad A = QR P^\mathsf{T}</math>
Column pivoting is useful when ''A'' is (nearly) rank deficient, or is suspected of being so. It can also improve numerical accuracy. ''P'' is usually chosen so that the diagonal elements of ''R'' are non-increasing: <math>|r_{11}| \ge |r_{22}| \ge \cdots \ge |r_{nn}|</math>. This can be used to find the (numerical) rank of ''A'' at lower computational cost than a singular value decomposition, forming the basis of so-called rank-revealing QR algorithms.
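A minimal sketch of column pivoting, assuming NumPy, is given below. It uses modified Gram–Schmidt for brevity, which is a simplification chosen for this example rather than the method a production library would use. The function name `pivoted_qr` and the absolute threshold `tol` are hypothetical.

```python
import numpy as np

def pivoted_qr(A, tol=1e-10):
    """Column-pivoted QR via modified Gram-Schmidt: A[:, perm] ~ Q @ R."""
    W = np.array(A, dtype=float)   # working copy, deflated column by column
    m, n = W.shape
    steps = min(m, n)
    Q = np.zeros((m, steps))
    R = np.zeros((steps, n))
    perm = np.arange(n)
    rank = 0
    for k in range(steps):
        # Pivot: bring the remaining column of largest norm to position k.
        p = k + int(np.argmax(np.linalg.norm(W[:, k:], axis=0)))
        W[:, [k, p]] = W[:, [p, k]]
        R[:, [k, p]] = R[:, [p, k]]
        perm[[k, p]] = perm[[p, k]]
        pivot_norm = np.linalg.norm(W[:, k])
        if pivot_norm <= tol:
            break                  # remaining columns are numerically zero
        R[k, k] = pivot_norm
        Q[:, k] = W[:, k] / pivot_norm
        # Remove the new direction from all remaining columns.
        R[k, k + 1:] = Q[:, k] @ W[:, k + 1:]
        W[:, k + 1:] -= np.outer(Q[:, k], R[k, k + 1:])
        rank += 1
    return Q, R, perm, rank

# Rank-2 test matrix: the third column is the sum of the first two.
rng = np.random.default_rng(0)
B = rng.standard_normal((5, 2))
A = B @ np.array([[1.0, 0.0, 1.0],
                  [0.0, 1.0, 1.0]])
Q, R, perm, rank = pivoted_qr(A)
print(rank, np.allclose(A[:, perm], Q @ R))
```

Here `perm` plays the role of ''P'': indexing the columns of ''A'' with it corresponds to forming ''AP''. The step at which the pivot norm drops below `tol` gives the numerical rank.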
Using for solution to linear inverse problems
Compared to the direct matrix inverse, inverse solutions using QR decomposition are more numerically stable as evidenced by their reduced condition numbers.
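To solve the underdetermined (''m'' < ''n'') linear problem <math>A \mathbf{x} = \mathbf{b}</math>, where the matrix ''A'' has dimensions ''m''×''n'' and rank ''m'', first find the QR factorization of the transpose of ''A'': <math>A^\mathsf{T} = QR</math>, where ''Q'' is an orthogonal matrix (i.e. <math>Q^\mathsf{T} = Q^{-1}</math>), and ''R'' has a special form: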