HOME

TheInfoList



OR:

In
linear algebra Linear algebra is the branch of mathematics concerning linear equations such as :a_1x_1+\cdots +a_nx_n=b, linear maps such as :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matrix (mathemat ...
and
functional analysis Functional analysis is a branch of mathematical analysis, the core of which is formed by the study of vector spaces endowed with some kind of limit-related structure (for example, Inner product space#Definition, inner product, Norm (mathematics ...
, a projection is a
linear transformation In mathematics, and more specifically in linear algebra, a linear map (also called a linear mapping, linear transformation, vector space homomorphism, or in some contexts linear function) is a mapping V \to W between two vector spaces that pr ...
P from a
vector space In mathematics and physics, a vector space (also called a linear space) is a set (mathematics), set whose elements, often called vector (mathematics and physics), ''vectors'', can be added together and multiplied ("scaled") by numbers called sc ...
to itself (an endomorphism) such that P\circ P=P. That is, whenever P is applied twice to any vector, it gives the same result as if it were applied once (i.e. P is
idempotent Idempotence (, ) is the property of certain operations in mathematics and computer science whereby they can be applied multiple times without changing the result beyond the initial application. The concept of idempotence arises in a number of pl ...
). It leaves its
image An image or picture is a visual representation. An image can be Two-dimensional space, two-dimensional, such as a drawing, painting, or photograph, or Three-dimensional space, three-dimensional, such as a carving or sculpture. Images may be di ...
unchanged. This definition of "projection" formalizes and generalizes the idea of graphical projection. One can also consider the effect of a projection on a geometrical object by examining the effect of the projection on points in the object.


Definitions

A projection on a vector space V is a linear operator P\colon V \to V such that P^2 = P. When V has an
inner product In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, ofte ...
and is complete, i.e. when V is a
Hilbert space In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
, the concept of
orthogonality In mathematics, orthogonality is the generalization of the geometric notion of '' perpendicularity''. Although many authors use the two terms ''perpendicular'' and ''orthogonal'' interchangeably, the term ''perpendicular'' is more specifically ...
can be used. A projection P on a Hilbert space V is called an orthogonal projection if it satisfies \langle P \mathbf x, \mathbf y \rangle = \langle \mathbf x, P \mathbf y \rangle for all \mathbf x, \mathbf y \in V. A projection on a Hilbert space that is not orthogonal is called an oblique projection.


Projection matrix

* A square matrix P is called a projection matrix if it is equal to its square, i.e. if P^2 = P. * A square matrix P is called an orthogonal projection matrix if P^2 = P = P^ for a real
matrix Matrix (: matrices or matrixes) or MATRIX may refer to: Science and mathematics * Matrix (mathematics), a rectangular array of numbers, symbols or expressions * Matrix (logic), part of a formula in prenex normal form * Matrix (biology), the m ...
, and respectively P^2 = P = P^ for a
complex Complex commonly refers to: * Complexity, the behaviour of a system whose components interact in multiple ways so possible interactions are difficult to describe ** Complex system, a system composed of many components which may interact with each ...
matrix, where P^ denotes the
transpose In linear algebra, the transpose of a Matrix (mathematics), matrix is an operator which flips a matrix over its diagonal; that is, it switches the row and column indices of the matrix by producing another matrix, often denoted by (among other ...
of P and P^ denotes the adjoint or Hermitian transpose of P. * A projection matrix that is not an orthogonal projection matrix is called an oblique projection matrix. The
eigenvalue In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...
s of a projection matrix must be 0 or 1.


Examples


Orthogonal projection

For example, the function which maps the point (x,y,z) in three-dimensional space \mathbb^3 to the point (x,y,0) is an orthogonal projection onto the ''xy''-plane. This function is represented by the matrix P = \begin 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end. The action of this matrix on an arbitrary
vector Vector most often refers to: * Euclidean vector, a quantity with a magnitude and a direction * Disease vector, an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematics a ...
is P \begin x \\ y \\ z \end = \begin x \\ y \\ 0 \end. To see that P is indeed a projection, i.e., P = P^2, we compute P^2 \begin x \\ y \\ z \end = P \begin x \\ y \\ 0 \end = \begin x \\ y \\ 0 \end = P\begin x \\ y \\ z \end. Observing that P^ = P shows that the projection is an orthogonal projection.


Oblique projection

A simple example of a non-orthogonal (oblique) projection is P = \begin 0 & 0 \\ \alpha & 1 \end. Via
matrix multiplication In mathematics, specifically in linear algebra, matrix multiplication is a binary operation that produces a matrix (mathematics), matrix from two matrices. For matrix multiplication, the number of columns in the first matrix must be equal to the n ...
, one sees that P^2 = \begin 0 & 0 \\ \alpha & 1 \end \begin 0 & 0 \\ \alpha & 1 \end = \begin 0 & 0 \\ \alpha & 1 \end = P. showing that P is indeed a projection. The projection P is orthogonal
if and only if In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...
\alpha = 0 because only then P^ = P.


Properties and classification


Idempotence

By definition, a projection P is
idempotent Idempotence (, ) is the property of certain operations in mathematics and computer science whereby they can be applied multiple times without changing the result beyond the initial application. The concept of idempotence arises in a number of pl ...
(i.e. P^2 = P).


Open map

Every projection is an
open map In mathematics, more specifically in topology, an open map is a function between two topological spaces that maps open sets to open sets. That is, a function f : X \to Y is open if for any open set U in X, the image f(U) is open in Y. Likewise, ...
onto its image, meaning that it maps each
open set In mathematics, an open set is a generalization of an Interval (mathematics)#Definitions_and_terminology, open interval in the real line. In a metric space (a Set (mathematics), set with a metric (mathematics), distance defined between every two ...
in the domain to an open set in the
subspace topology In topology and related areas of mathematics, a subspace of a topological space (''X'', ''𝜏'') is a subset ''S'' of ''X'' which is equipped with a topology induced from that of ''𝜏'' called the subspace topology (or the relative topology ...
of the
image An image or picture is a visual representation. An image can be Two-dimensional space, two-dimensional, such as a drawing, painting, or photograph, or Three-dimensional space, three-dimensional, such as a carving or sculpture. Images may be di ...
. That is, for any vector \mathbf and any ball B_\mathbf (with positive radius) centered on \mathbf, there exists a ball B_ (with positive radius) centered on P\mathbf that is wholly contained in the image P(B_\mathbf).


Complementarity of image and kernel

Let W be a finite-dimensional vector space and P be a projection on W. Suppose the subspaces U and V are the
image An image or picture is a visual representation. An image can be Two-dimensional space, two-dimensional, such as a drawing, painting, or photograph, or Three-dimensional space, three-dimensional, such as a carving or sculpture. Images may be di ...
and kernel of P respectively. Then P has the following properties: # P is the identity operator I on U: \forall \mathbf x \in U: P \mathbf x = \mathbf x. # We have a
direct sum The direct sum is an operation between structures in abstract algebra, a branch of mathematics. It is defined differently but analogously for different kinds of structures. As an example, the direct sum of two abelian groups A and B is anothe ...
W = U \oplus V. Every vector \mathbf x \in W may be decomposed uniquely as \mathbf x = \mathbf u + \mathbf v with \mathbf u = P \mathbf x and \mathbf v = \mathbf x - P \mathbf x = \left(I-P\right) \mathbf x, and where \mathbf u \in U, \mathbf v \in V. The image and kernel of a projection are ''complementary'', as are P and Q = I - P. The operator Q is also a projection as the image and kernel of P become the kernel and image of Q and vice versa. We say P is a projection along V onto U (kernel/image) and Q is a projection along U onto V.


Spectrum

In infinite-dimensional vector spaces, the
spectrum A spectrum (: spectra or spectrums) is a set of related ideas, objects, or properties whose features overlap such that they blend to form a continuum. The word ''spectrum'' was first used scientifically in optics to describe the rainbow of co ...
of a projection is contained in \ as (\lambda I - P)^ = \frac 1 \lambda I + \frac 1 P. Only 0 or 1 can be an
eigenvalue In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...
of a projection. This implies that an orthogonal projection P is always a positive semi-definite matrix. In general, the corresponding
eigenspace In linear algebra, an eigenvector ( ) or characteristic vector is a Vector (mathematics and physics), vector that has its direction (geometry), direction unchanged (or reversed) by a given linear map, linear transformation. More precisely, an e ...
s are (respectively) the kernel and range of the projection. Decomposition of a vector space into direct sums is not unique. Therefore, given a subspace V, there may be many projections whose range (or kernel) is V. If a projection is nontrivial it has minimal polynomial x^2 - x = x (x-1), which factors into distinct linear factors, and thus P is diagonalizable.


Product of projections

The product of projections is not in general a projection, even if they are orthogonal. If two projections commute then their product is a projection, but the converse is false: the product of two non-commuting projections may be a projection. If two orthogonal projections commute then their product is an orthogonal projection. If the product of two orthogonal projections is an orthogonal projection, then the two orthogonal projections commute (more generally: two self-adjoint endomorphisms commute if and only if their product is self-adjoint).


Orthogonal projections

When the vector space W has an
inner product In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, ofte ...
and is complete (is a
Hilbert space In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
) the concept of
orthogonality In mathematics, orthogonality is the generalization of the geometric notion of '' perpendicularity''. Although many authors use the two terms ''perpendicular'' and ''orthogonal'' interchangeably, the term ''perpendicular'' is more specifically ...
can be used. An orthogonal projection is a projection for which the range U and the kernel V are orthogonal subspaces. Thus, for every \mathbf x and \mathbf y in W, \langle P \mathbf x, (\mathbf y - P \mathbf y) \rangle = \langle (\mathbf x - P \mathbf x) , P \mathbf y \rangle = 0. Equivalently: \langle \mathbf x, P \mathbf y \rangle = \langle P \mathbf x, P \mathbf y \rangle = \langle P \mathbf x, \mathbf y \rangle. A projection is orthogonal if and only if it is
self-adjoint In mathematics, an element of a *-algebra is called self-adjoint if it is the same as its adjoint (i.e. a = a^*). Definition Let \mathcal be a *-algebra. An element a \in \mathcal is called self-adjoint if The set of self-adjoint elements ...
. Using the self-adjoint and idempotent properties of P, for any \mathbf x and \mathbf y in W we have P\mathbf \in U, \mathbf - P\mathbf \in V, and \langle P \mathbf x, \mathbf y - P \mathbf y \rangle = \langle \mathbf x, \left(P-P^2\right) \mathbf y \rangle = 0 where \langle \cdot, \cdot \rangle is the inner product associated with W. Therefore, P and I - P are orthogonal projections. The other direction, namely that if P is orthogonal then it is self-adjoint, follows from the implication from \langle (\mathbf x - P \mathbf x) , P \mathbf y \rangle = \langle P \mathbf x, (\mathbf y - P \mathbf y) \rangle = 0 to \langle \mathbf x, P \mathbf y \rangle = \langle P \mathbf x, P\mathbf y \rangle = \langle P \mathbf x, \mathbf y \rangle = \langle \mathbf x, P^* \mathbf y \rangle for every x and y in W; thus P=P^*. The existence of an orthogonal projection onto a closed subspace follows from the Hilbert projection theorem.


Properties and special cases

An orthogonal projection is a bounded operator. This is because for every \mathbf v in the vector space we have, by the
Cauchy–Schwarz inequality The Cauchy–Schwarz inequality (also called Cauchy–Bunyakovsky–Schwarz inequality) is an upper bound on the absolute value of the inner product between two vectors in an inner product space in terms of the product of the vector norms. It is ...
: \left \, P \mathbf v\right\, ^2 = \langle P \mathbf v, P \mathbf v \rangle = \langle P \mathbf v, \mathbf v \rangle \leq \left\, P \mathbf v\right\, \cdot \left\, \mathbf v\right\, Thus \left\, P \mathbf v\right\, \leq \left\, \mathbf v\right\, . For finite-dimensional complex or real vector spaces, the standard inner product can be substituted for \langle \cdot, \cdot \rangle.


=Formulas

= A simple case occurs when the orthogonal projection is onto a line. If \mathbf u is a
unit vector In mathematics, a unit vector in a normed vector space is a Vector (mathematics and physics), vector (often a vector (geometry), spatial vector) of Norm (mathematics), length 1. A unit vector is often denoted by a lowercase letter with a circumfle ...
on the line, then the projection is given by the
outer product In linear algebra, the outer product of two coordinate vectors is the matrix whose entries are all products of an element in the first vector with an element in the second vector. If the two coordinate vectors have dimensions ''n'' and ''m'', the ...
P_\mathbf = \mathbf u \mathbf u^\mathsf. (If \mathbf u is complex-valued, the transpose in the above equation is replaced by a Hermitian transpose). This operator leaves u invariant, and it annihilates all vectors orthogonal to \mathbf u, proving that it is indeed the orthogonal projection onto the line containing u. A simple way to see this is to consider an arbitrary vector \mathbf x as the sum of a component on the line (i.e. the projected vector we seek) and another perpendicular to it, \mathbf x = \mathbf x_\parallel + \mathbf x_\perp. Applying projection, we get P_ \mathbf x = \mathbf u \mathbf u^\mathsf \mathbf x_\parallel + \mathbf u \mathbf u^\mathsf \mathbf x_\perp = \mathbf u \left( \sgn\left(\mathbf u^\mathsf \mathbf x_\parallel\right) \left \, \mathbf x_\parallel \right \, \right) + \mathbf u \cdot \mathbf 0 = \mathbf x_\parallel by the properties of the
dot product In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a Scalar (mathematics), scalar as a result". It is also used for other symmetric bilinear forms, for example in a pseudo-Euclidean space. N ...
of parallel and perpendicular vectors. This formula can be generalized to orthogonal projections on a subspace of arbitrary
dimension In physics and mathematics, the dimension of a mathematical space (or object) is informally defined as the minimum number of coordinates needed to specify any point within it. Thus, a line has a dimension of one (1D) because only one coo ...
. Let \mathbf u_1, \ldots, \mathbf u_k be an
orthonormal basis In mathematics, particularly linear algebra, an orthonormal basis for an inner product space V with finite Dimension (linear algebra), dimension is a Basis (linear algebra), basis for V whose vectors are orthonormal, that is, they are all unit vec ...
of the subspace U, with the assumption that the integer k \geq 1, and let A denote the n \times k matrix whose columns are \mathbf u_1, \ldots, \mathbf u_k, i.e., A = \begin \mathbf u_1 & \cdots & \mathbf u_k \end. Then the projection is given by: P_A = A A^\mathsf which can be rewritten as P_A = \sum_i \langle \mathbf u_i, \cdot \rangle \mathbf u_i. The matrix A^\mathsf is the partial isometry that vanishes on the orthogonal complement of U, and A is the isometry that embeds U into the underlying vector space. The range of P_A is therefore the ''final space'' of A. It is also clear that A A^ is the identity operator on U. The orthonormality condition can also be dropped. If \mathbf u_1, \ldots, \mathbf u_k is a (not necessarily orthonormal) basis with k \geq 1, and A is the matrix with these vectors as columns, then the projection is: P_A = A \left(A^\mathsf A\right)^ A^\mathsf. The matrix A still embeds U into the underlying vector space but is no longer an isometry in general. The matrix \left(A^\mathsfA\right)^ is a "normalizing factor" that recovers the norm. For example, the rank-1 operator \mathbf u \mathbf u^\mathsf is not a projection if \left\, \mathbf u \right\, \neq 1. After dividing by \mathbf u^\mathsf \mathbf u = \left\, \mathbf u \right\, ^2, we obtain the projection \mathbf u \left(\mathbf u^\mathsf \mathbf u \right)^ \mathbf u^\mathsf onto the subspace spanned by u. In the general case, we can have an arbitrary positive definite matrix D defining an inner product \langle x, y \rangle_D = y^\dagger Dx, and the projection P_A is given by P_A x = \operatorname_ \left\, x - y\right\, ^2_D. Then P_A = A \left(A^\mathsf D A\right)^ A^\mathsf D. When the range space of the projection is generated by a frame (i.e. the number of generators is greater than its dimension), the formula for the projection takes the form: P_A = A A^+. Here A^+ stands for the Moore–Penrose pseudoinverse. This is just one of many ways to construct the projection operator. If \begin A & B \end is a non-singular matrix and A^\mathsfB = 0 (i.e., B is the null space matrix of A), the following holds: \begin I &= \begin A & B \end \begin A & B \end^\begin A^\mathsf \\ B^\mathsf \end^ \begin A^\mathsf \\ B^\mathsf \end \\ &= \begin A & B \end \left( \begin A^\mathsf \\ B^\mathsf \end \begin A & B \end \right )^ \begin A^\mathsf \\B^\mathsf \end \\ &= \begin A & B \end \beginA^\mathsfA&O\\O&B^\mathsfB\end^ \begin A^\mathsf \\ B^\mathsf \end\\ pt &= A \left(A^\mathsfA\right)^ A^\mathsf + B \left(B^\mathsfB\right)^ B^\mathsf \end If the orthogonal condition is enhanced to A^\mathsfW B = A^\mathsfW^\mathsfB = 0 with W non-singular, the following holds: I = \beginA & B\end \begin\left(A^\mathsf W A\right)^ A^\mathsf \\ \left(B^\mathsf W B\right)^ B^\mathsf \end W. All these formulas also hold for complex inner product spaces, provided that the conjugate transpose is used instead of the transpose. Further details on sums of projectors can be found in Banerjee and Roy (2014). Also see Banerjee (2004) for application of sums of projectors in basic
spherical trigonometry Spherical trigonometry is the branch of spherical geometry that deals with the metrical relationships between the edge (geometry), sides and angles of spherical triangles, traditionally expressed using trigonometric functions. On the sphere, ge ...
.


Oblique projections

The term ''oblique projections'' is sometimes used to refer to non-orthogonal projections. These projections are also used to represent spatial figures in two-dimensional drawings (see oblique projection), though not as frequently as orthogonal projections. Whereas calculating the fitted value of an ordinary least squares regression requires an orthogonal projection, calculating the fitted value of an instrumental variables regression requires an oblique projection. A projection is defined by its kernel and the basis vectors used to characterize its range (which is a complement of the kernel). When these basis vectors are orthogonal to the kernel, then the projection is an orthogonal projection. When these basis vectors are not orthogonal to the kernel, the projection is an oblique projection, or just a projection.


A matrix representation formula for a nonzero projection operator

Let P \colon V \to V be a linear operator such that P^2 = P and assume that P is not the zero operator. Let the vectors \mathbf u_1, \ldots, \mathbf u_k form a basis for the range of P, and assemble these vectors in the n \times k matrix A. Then k \geq 1, otherwise k = 0 and P is the zero operator. The range and the kernel are complementary spaces, so the kernel has dimension n - k. It follows that the orthogonal complement of the kernel has dimension k. Let \mathbf v_1, \ldots, \mathbf v_k form a basis for the orthogonal complement of the kernel of the projection, and assemble these vectors in the matrix B. Then the projection P (with the condition k \geq 1) is given by P = A \left(B^\mathsf A\right)^ B^\mathsf. This expression generalizes the formula for orthogonal projections given above. A standard proof of this expression is the following. For any vector \mathbf x in the vector space V, we can decompose \mathbf = \mathbf_1 + \mathbf_2, where vector \mathbf_1 = P(\mathbf) is in the image of P, and vector \mathbf_2 = \mathbf - P(\mathbf). So P(\mathbf_2) = P(\mathbf) - P^2(\mathbf)= \mathbf, and then \mathbf_2 is in the kernel of P, which is the null space of A. In other words, the vector \mathbf_1 is in the column space of A, so \mathbf_1 = A \mathbf for some k dimension vector \mathbf and the vector \mathbf_2 satisfies B^\mathsf \mathbf_2=\mathbf by the construction of B. Put these conditions together, and we find a vector \mathbf so that B^\mathsf (\mathbf-A\mathbf)=\mathbf. Since matrices A and B are of full rank k by their construction, the k\times k-matrix B^\mathsf A is invertible. So the equation B^\mathsf (\mathbf-A\mathbf)=\mathbf gives the vector \mathbf= (B^A)^ B^ \mathbf. In this way, P\mathbf = \mathbf_1 = A\mathbf= A(B^A)^ B^ \mathbf for any vector \mathbf \in V and hence P = A(B^A)^ B^. In the case that P is an orthogonal projection, we can take A = B, and it follows that P=A \left(A^\mathsf A\right)^ A^\mathsf. By using this formula, one can easily check that P=P^\mathsf. In general, if the vector space is over complex number field, one then uses the Hermitian transpose A^* and has the formula P=A \left(A^* A\right)^ A^*. Recall that one can express the Moore–Penrose inverse of the matrix A by A^= (A^*A)^A^* since A has full column rank, so P=A A^.


Singular values

I-P is also an oblique projection. The singular values of P and I-P can be computed by an
orthonormal basis In mathematics, particularly linear algebra, an orthonormal basis for an inner product space V with finite Dimension (linear algebra), dimension is a Basis (linear algebra), basis for V whose vectors are orthonormal, that is, they are all unit vec ...
of A. Let Q_A be an orthonormal basis of A and let Q_A^ be the orthogonal complement of Q_A. Denote the singular values of the matrix Q_A^T A (B^T A)^ B^T Q_A^ by the positive values \gamma_1 \ge \gamma_2 \ge \ldots \ge \gamma_k . With this, the singular values for P are: \sigma_i = \begin \sqrt & 1 \le i \le k \\ 0 & \text \end and the singular values for I-P are \sigma_i = \begin \sqrt & 1 \le i \le k \\ 1 & k+1 \le i \le n-k \\ 0 & \text \end This implies that the largest singular values of P and I-P are equal, and thus that the
matrix norm In the field of mathematics, norms are defined for elements within a vector space. Specifically, when the vector space comprises matrices, such norms are referred to as matrix norms. Matrix norms differ from vector norms in that they must also ...
of the oblique projections are the same. However, the condition number satisfies the relation \kappa(I-P) = \frac \ge \frac = \kappa(P), and is therefore not necessarily equal.


Finding projection with an inner product

Let V be a vector space (in this case a plane) spanned by orthogonal vectors \mathbf u_1, \mathbf u_2, \dots, \mathbf u_p. Let y be a vector. One can define a projection of \mathbf y onto V as \operatorname_V \mathbf y = \frac \mathbf u^i where repeated indices are summed over ( Einstein sum notation). The vector \mathbf y can be written as an orthogonal sum such that \mathbf y = \operatorname_V \mathbf y + \mathbf z. \operatorname_V \mathbf y is sometimes denoted as \hat. There is a theorem in linear algebra that states that this \mathbf z is the smallest distance (the '' orthogonal distance'') from \mathbf y to V and is commonly used in areas such as
machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
.


Canonical forms

Any projection P=P^2 on a vector space of dimension d over a field is a
diagonalizable matrix In linear algebra, a square matrix A is called diagonalizable or non-defective if it is matrix similarity, similar to a diagonal matrix. That is, if there exists an invertible matrix P and a diagonal matrix D such that . This is equivalent to ...
, since its minimal polynomial divides x^2-x, which splits into distinct linear factors. Thus there exists a basis in which P has the form :P = I_r\oplus 0_ where r is the rank of P. Here I_r is the
identity matrix In linear algebra, the identity matrix of size n is the n\times n square matrix with ones on the main diagonal and zeros elsewhere. It has unique properties, for example when the identity matrix represents a geometric transformation, the obje ...
of size r, 0_ is the
zero matrix In mathematics, particularly linear algebra, a zero matrix or null matrix is a matrix all of whose entries are zero. It also serves as the additive identity of the additive group of m \times n matrices, and is denoted by the symbol O or 0 followe ...
of size d-r, and \oplus is the
direct sum The direct sum is an operation between structures in abstract algebra, a branch of mathematics. It is defined differently but analogously for different kinds of structures. As an example, the direct sum of two abelian groups A and B is anothe ...
operator. If the vector space is complex and equipped with an
inner product In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, ofte ...
, then there is an ''orthonormal'' basis in which the matrix of ''P'' is :P = \begin1&\sigma_1 \\ 0&0\end \oplus \cdots \oplus \begin1&\sigma_k \\ 0&0\end \oplus I_m \oplus 0_s. where \sigma_1 \geq \sigma_2\geq \dots \geq \sigma_k > 0. The
integer An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
s k,s,m and the real numbers \sigma_i are uniquely determined. 2k+s+m=d. The factor I_m \oplus 0_s corresponds to the maximal invariant subspace on which P acts as an ''orthogonal'' projection (so that ''P'' itself is orthogonal if and only if k=0) and the \sigma_i-blocks correspond to the ''oblique'' components.


Projections on normed vector spaces

When the underlying vector space X is a (not necessarily finite-dimensional) normed vector space, analytic questions, irrelevant in the finite-dimensional case, need to be considered. Assume now X is a
Banach space In mathematics, more specifically in functional analysis, a Banach space (, ) is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vectors and ...
. Many of the algebraic results discussed above survive the passage to this context. A given direct sum decomposition of X into complementary subspaces still specifies a projection, and vice versa. If X is the direct sum X = U \oplus V, then the operator defined by P(u+v) = u is still a projection with range U and kernel V. It is also clear that P^2 = P. Conversely, if P is projection on X, i.e. P^2 = P, then it is easily verified that (1-P)^2 = (1-P). In other words, 1 - P is also a projection. The relation P^2 = P implies 1 = P + (1-P) and X is the direct sum \operatorname(P) \oplus \operatorname(1 - P). However, in contrast to the finite-dimensional case, projections need not be continuous in general. If a subspace U of X is not closed in the norm topology, then the projection onto U is not continuous. In other words, the range of a continuous projection P must be a closed subspace. Furthermore, the kernel of a continuous projection (in fact, a continuous linear operator in general) is closed. Thus a ''continuous'' projection P gives a decomposition of X into two complementary ''closed'' subspaces: X = \operatorname(P) \oplus \ker(P) = \ker(1-P) \oplus \ker(P). The converse holds also, with an additional assumption. Suppose U is a closed subspace of X. If there exists a closed subspace V such that , then the projection P with range U and kernel V is continuous. This follows from the closed graph theorem. Suppose and . One needs to show that Px=y. Since U is closed and , ''y'' lies in U, i.e. . Also, . Because V is closed and , we have x-y \in V, i.e. P(x-y)=Px-Py=Px-y=0, which proves the claim. The above argument makes use of the assumption that both U and V are closed. In general, given a closed subspace U, there need not exist a complementary closed subspace V, although for
Hilbert space In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
s this can always be done by taking the orthogonal complement. For Banach spaces, a one-dimensional subspace always has a closed complementary subspace. This is an immediate consequence of
Hahn–Banach theorem In functional analysis, the Hahn–Banach theorem is a central result that allows the extension of bounded linear functionals defined on a vector subspace of some vector space to the whole space. The theorem also shows that there are sufficient ...
. Let U be the linear span of u. By Hahn–Banach, there exists a bounded
linear functional In mathematics, a linear form (also known as a linear functional, a one-form, or a covector) is a linear mapIn some texts the roles are reversed and vectors are defined as linear maps from covectors to scalars from a vector space to its field of ...
\varphi such that . The operator P(x)=\varphi(x)u satisfies P^2=P, i.e. it is a projection. Boundedness of \varphi implies continuity of P and therefore \ker(P) = \operatorname(I-P) is a closed complementary subspace of U.


Applications and further considerations

Projections (orthogonal and otherwise) play a major role in
algorithm In mathematics and computer science, an algorithm () is a finite sequence of Rigour#Mathematics, mathematically rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algo ...
s for certain linear algebra problems: * QR decomposition (see Householder transformation and Gram–Schmidt decomposition); *
Singular value decomposition In linear algebra, the singular value decomposition (SVD) is a Matrix decomposition, factorization of a real number, real or complex number, complex matrix (mathematics), matrix into a rotation, followed by a rescaling followed by another rota ...
* Reduction to Hessenberg form (the first step in many eigenvalue algorithms) *
Linear regression In statistics, linear regression is a statistical model, model that estimates the relationship between a Scalar (mathematics), scalar response (dependent variable) and one or more explanatory variables (regressor or independent variable). A mode ...
* Projective elements of matrix algebras are used in the construction of certain K-groups in Operator K-theory As stated above, projections are a special case of idempotents. Analytically, orthogonal projections are non-commutative generalizations of characteristic functions. Idempotents are used in classifying, for instance, semisimple algebras, while
measure theory In mathematics, the concept of a measure is a generalization and formalization of geometrical measures (length, area, volume) and other common notions, such as magnitude (mathematics), magnitude, mass, and probability of events. These seemingl ...
begins with considering characteristic functions of measurable sets. Therefore, as one can imagine, projections are very often encountered in the context of
operator algebra In functional analysis, a branch of mathematics, an operator algebra is an algebra of continuous linear operators on a topological vector space, with the multiplication given by the composition of mappings. The results obtained in the study o ...
s. In particular, a
von Neumann algebra In mathematics, a von Neumann algebra or W*-algebra is a *-algebra of bounded operators on a Hilbert space that is closed in the weak operator topology and contains the identity operator. It is a special type of C*-algebra. Von Neumann al ...
is generated by its complete lattice of projections.


Generalizations

More generally, given a map between normed vector spaces T\colon V \to W, one can analogously ask for this map to be an isometry on the orthogonal complement of the kernel: that (\ker T)^\perp \to W be an isometry (compare Partial isometry); in particular it must be
onto In mathematics, a surjective function (also known as surjection, or onto function ) is a function such that, for every element of the function's codomain, there exists one element in the function's domain such that . In other words, for a f ...
. The case of an orthogonal projection is when ''W'' is a subspace of ''V.'' In
Riemannian geometry Riemannian geometry is the branch of differential geometry that studies Riemannian manifolds, defined as manifold, smooth manifolds with a ''Riemannian metric'' (an inner product on the tangent space at each point that varies smooth function, smo ...
, this is used in the definition of a Riemannian submersion.


See also

* Centering matrix, which is an example of a projection matrix. * Dykstra's projection algorithm to compute the projection onto an intersection of sets * Invariant subspace *
Least-squares spectral analysis Least-squares spectral analysis (LSSA) is a method of estimating a Spectral density estimation#Overview, frequency spectrum based on a least-squares fit of Sine wave, sinusoids to data samples, similar to Fourier analysis. Fourier analysis, the ...
* Orthogonalization * Properties of trace


Notes


References

* * * * Brezinski, Claude: ''Projection Methods for Systems of Equations'', North-Holland, ISBN 0-444-82777-3 (1997).


External links

* , from MIT OpenCourseWare * , by Pavel Grinfeld.
Planar Geometric Projections Tutorial
– a simple-to-follow tutorial explaining the different types of planar geometric projections. {{linear algebra Functional analysis Linear algebra Linear operators