In
geometry
Geometry (; ) is a branch of mathematics concerned with properties of space such as the distance, shape, size, and relative position of figures. Geometry is, along with arithmetic, one of the oldest branches of mathematics. A mathematician w ...
and
linear algebra, a principal axis is a certain line in a
Euclidean space
Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are ''Euclidean spaces ...
associated with a
ellipsoid or
hyperboloid, generalizing the major and minor
axes of an
ellipse or
hyperbola. The principal axis theorem states that the principal axes are
perpendicular, and gives a constructive procedure for finding them.
Mathematically, the principal axis theorem is a generalization of the method of
completing the square from
elementary algebra. In
linear algebra and
functional analysis
Functional analysis is a branch of mathematical analysis, the core of which is formed by the study of vector spaces endowed with some kind of limit-related structure (for example, Inner product space#Definition, inner product, Norm (mathematics ...
, the principal axis theorem is a geometrical counterpart of the
spectral theorem. It has applications to the
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
of
principal components analysis
Principal component analysis (PCA) is a Linear map, linear dimensionality reduction technique with applications in exploratory data analysis, visualization and Data Preprocessing, data preprocessing.
The data is linear map, linearly transformed ...
and the
singular value decomposition. In
physics
Physics is the scientific study of matter, its Elementary particle, fundamental constituents, its motion and behavior through space and time, and the related entities of energy and force. "Physical science is that department of knowledge whi ...
, the theorem is fundamental to the studies of
angular momentum and
birefringence.
Motivation
The equations in the
Cartesian plane
define, respectively, an ellipse and a hyperbola. In each case, the and axes are the principal axes. This is easily seen, given that there are no ''cross-terms'' involving products in either expression. However, the situation is more complicated for equations like
Here some method is required to determine whether this is an
ellipse or a
hyperbola. The basic observation is that if, by
completing the square, the quadratic expression can be reduced to a sum of two squares then the equation defines an ellipse, whereas if it reduces to a difference of two squares then the equation represents a hyperbola:
Thus, in our example expression, the problem is how to absorb the coefficient of the cross-term into the functions and . Formally, this problem is similar to the problem of
matrix diagonalization, where one tries to find a suitable coordinate system in which the matrix of a
linear transformation is
diagonal. The first step is to find a matrix in which the technique of diagonalization can be applied.
The trick is to write the
quadratic form
In mathematics, a quadratic form is a polynomial with terms all of degree two (" form" is another name for a homogeneous polynomial). For example,
4x^2 + 2xy - 3y^2
is a quadratic form in the variables and . The coefficients usually belong t ...
as
where the cross-term has been split into two equal parts. The matrix in the above decomposition is a
symmetric matrix. In particular, by the
spectral theorem, it has
real eigenvalues and is
diagonalizable by an
orthogonal matrix (''orthogonally diagonalizable'').
To orthogonally diagonalize , one must first find its eigenvalues, and then find an
orthonormal eigenbasis. Calculation reveals that the eigenvalues of are
with corresponding eigenvectors
Dividing these by their respective lengths yields an orthonormal eigenbasis:
Now the matrix is an orthogonal matrix, since it has orthonormal columns, and is diagonalized by:
This applies to the present problem of "diagonalizing" the quadratic form through the observation that
Thus, the equation
is that of an ellipse, since the left side can be written as the sum of two squares.
It is tempting to simplify this expression by pulling out factors of 2. However, it is important ''not'' to do this. The quantities
have a geometrical meaning. They determine an ''orthonormal coordinate system'' on In other words, they are obtained from the original coordinates by the application of a rotation (and possibly a reflection). Consequently, one may use the and coordinates to make statements about ''length and angles'' (particularly length), which would otherwise be more difficult in a different choice of coordinates (by rescaling them, for instance). For example, the maximum distance from the origin on the ellipse
occurs when , so at the points . Similarly, the minimum distance is where .
It is possible now to read off the major and minor axes of this ellipse. These are precisely the individual
eigenspaces of the matrix , since these are where or . Symbolically, the principal axes are
To summarize:
* The equation is for an ellipse, since both eigenvalues are positive. (Otherwise, if one were positive and the other negative, it would be a hyperbola.)
* The principal axes are the lines spanned by the eigenvectors.
* The minimum and maximum distances to the origin can be read off the equation in diagonal form.
Using this information, it is possible to attain a clear geometrical picture of the ellipse: to graph it, for instance.
Formal statement
The principal axis theorem concerns
quadratic forms in which are
homogeneous polynomials of degree 2. Any quadratic form may be represented as
where is a symmetric matrix.
The first part of the theorem is contained in the following statements guaranteed by the spectral theorem:
* The eigenvalues of are real.
* is diagonalizable, and the eigenspaces of are mutually orthogonal.
In particular, is ''orthogonally diagonalizable'', since one may take a basis of each eigenspace and apply the
Gram-Schmidt process separately within the eigenspace to obtain an orthonormal eigenbasis.
For the second part, suppose that the eigenvalues of are (possibly repeated according to their
algebraic multiplicities) and the corresponding orthonormal eigenbasis is . Then,
and
where is the -th entry of . Furthermore,
: The -th principal axis is the line determined by equating for all . The -th principal axis is the span of the vector .
See also
*
Sylvester's law of inertia
References
* {{cite book, authorlink=Gilbert Strang, first=Gilbert, last=Strang, title=Introduction to Linear Algebra, publisher=Wellesley-Cambridge Press, year=1994, isbn=0-9614088-5-5
Theorems in geometry
Theorems in linear algebra