In
probability
Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...
and
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, an elliptical distribution is any member of a broad family of
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
s that generalize the
multivariate normal distribution. In the simplified two and three dimensional case, the joint distribution forms an
ellipse
In mathematics, an ellipse is a plane curve surrounding two focus (geometry), focal points, such that for all points on the curve, the sum of the two distances to the focal points is a constant. It generalizes a circle, which is the special ty ...
and an
ellipsoid
An ellipsoid is a surface that can be obtained from a sphere by deforming it by means of directional Scaling (geometry), scalings, or more generally, of an affine transformation.
An ellipsoid is a quadric surface; that is, a Surface (mathemat ...
, respectively, in iso-density plots.
In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the normal distribution is used in ''classical''
multivariate analysis
Multivariate statistics is a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable, i.e., '' multivariate random variables''.
Multivariate statistics concerns understanding the differ ...
, while elliptical distributions are used in ''generalized'' multivariate analysis, for the study of symmetric distributions with tails that are
heavy, like the
multivariate t-distribution, or light (in comparison with the normal distribution). Some statistical methods that were originally motivated by the study of the normal distribution have good performance for general elliptical distributions (with finite variance), particularly for spherical distributions (which are defined below). Elliptical distributions are also used in
robust statistics
Robust statistics are statistics that maintain their properties even if the underlying distributional assumptions are incorrect. Robust Statistics, statistical methods have been developed for many common problems, such as estimating location parame ...
to evaluate proposed multivariate-statistical procedures.
Definition
Elliptical distributions are defined in terms of the
characteristic function of probability theory. A random vector
on a
Euclidean space
Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are ''Euclidean spaces ...
has an ''elliptical distribution'' if its characteristic function
satisfies the following
functional equation (for every column-vector
)
:
for some
location parameter
In statistics, a location parameter of a probability distribution is a scalar- or vector-valued parameter x_0, which determines the "location" or shift of the distribution. In the literature of location parameter estimation, the probability distr ...
, some
nonnegative-definite matrix and some scalar function
.
The definition of elliptical distributions for ''real'' random-vectors has been extended to accommodate random vectors in Euclidean spaces over the
field of
complex number
In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...
s, so facilitating applications in
time-series analysis. Computational methods are available for generating
pseudo-random vectors from elliptical distributions, for use in
Monte Carlo
Monte Carlo ( ; ; or colloquially ; , ; ) is an official administrative area of Monaco, specifically the Ward (country subdivision), ward of Monte Carlo/Spélugues, where the Monte Carlo Casino is located. Informally, the name also refers to ...
simulation
A simulation is an imitative representation of a process or system that could exist in the real world. In this broad sense, simulation can often be used interchangeably with model. Sometimes a clear distinction between the two terms is made, in ...
s for example.
Some elliptical distributions are alternatively defined in terms of their
density functions. An elliptical distribution with a density function ''f'' has the form:
:
where
is the
normalizing constant,
is an
-dimensional
random vector
In probability, and statistics, a multivariate random variable or random vector is a list or vector of mathematical variables each of whose value is unknown, either because the value has not yet occurred or because there is imperfect knowledge ...
with
median vector (which is also the mean vector if the latter exists), and
is a
positive definite matrix which is proportional to the
covariance matrix if the latter exists.
Examples
Examples include the following multivariate probability distributions:
*
Multivariate normal distribution
*
Multivariate ''t''-distribution
*
Symmetric multivariate stable distribution
*
Symmetric multivariate Laplace distribution
*
Multivariate logistic distribution
* Multivariate symmetric general
hyperbolic distribution[
]
Properties
In the 2-dimensional case, if the density exists, each iso-density locus (the set of ''x''1,''x''2 pairs all giving a particular value of ) is an ellipse
In mathematics, an ellipse is a plane curve surrounding two focus (geometry), focal points, such that for all points on the curve, the sum of the two distances to the focal points is a constant. It generalizes a circle, which is the special ty ...
or a union of ellipses (hence the name elliptical distribution). More generally, for arbitrary ''n'', the iso-density loci are unions of ellipsoid
An ellipsoid is a surface that can be obtained from a sphere by deforming it by means of directional Scaling (geometry), scalings, or more generally, of an affine transformation.
An ellipsoid is a quadric surface; that is, a Surface (mathemat ...
s. All these ellipsoids or ellipses have the common center μ and are scaled copies (homothets) of each other.
The multivariate normal distribution is the special case in which . While the multivariate normal is unbounded (each element of can take on arbitrarily large positive or negative values with non-zero probability, because for all non-negative ), in general elliptical distributions can be bounded or unbounded—such a distribution is bounded if for all greater than some value.
There exist elliptical distributions that have undefined mean
A mean is a quantity representing the "center" of a collection of numbers and is intermediate to the extreme values of the set of numbers. There are several kinds of means (or "measures of central tendency") in mathematics, especially in statist ...
, such as the Cauchy distribution (even in the univariate case). Because the variable ''x'' enters the density function quadratically, all elliptical distributions are symmetric about
If two subsets of a jointly elliptical random vector are uncorrelated
In probability theory and statistics, two real-valued random variables, X, Y, are said to be uncorrelated if their covariance, \operatorname ,Y= \operatorname Y- \operatorname \operatorname /math>, is zero. If two variables are uncorrelated, ther ...
, then if their means exist they are mean independent of each other (the mean of each subvector conditional on the value of the other subvector equals the unconditional mean).
If random vector ''X'' is elliptically distributed, then so is ''DX'' for any matrix ''D'' with full row rank. Thus any linear combination of the components of ''X'' is elliptical (though not necessarily with the same elliptical distribution), and any subset of ''X'' is elliptical.
Applications
Elliptical distributions are used in statistics and in economics. They are also used to calculate the landing footprints of spacecraft.
In mathematical economics, elliptical distributions have been used to describe portfolios in mathematical finance
Mathematical finance, also known as quantitative finance and financial mathematics, is a field of applied mathematics, concerned with mathematical modeling in the financial field.
In general, there exist two separate branches of finance that req ...
.
Statistics: Generalized multivariate analysis
In statistics, the multivariate ''normal'' distribution (of Gauss) is used in ''classical'' multivariate analysis
Multivariate statistics is a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable, i.e., '' multivariate random variables''.
Multivariate statistics concerns understanding the differ ...
, in which most methods for estimation and hypothesis-testing are motivated for the normal distribution. In contrast to classical multivariate analysis, ''generalized'' multivariate analysis refers to research on elliptical distributions without the restriction of normality.
For suitable elliptical distributions, some classical methods continue to have good properties. Under finite-variance assumptions, an extension of Cochran's theorem (on the distribution of quadratic forms) holds.
Spherical distribution
An elliptical distribution with a zero mean and variance in the form where is the identity-matrix is called a ''spherical distribution''. For spherical distributions, classical results on parameter-estimation and hypothesis-testing hold have been extended. Similar results hold for linear models, and indeed also for complicated models (especially for the growth curve model). The analysis of multivariate models uses multilinear algebra
Multilinear algebra is the study of Function (mathematics), functions with multiple vector space, vector-valued Argument of a function, arguments, with the functions being Linear map, linear maps with respect to each argument. It involves concept ...
(particularly Kronecker product
In mathematics, the Kronecker product, sometimes denoted by ⊗, is an operation on two matrices of arbitrary size resulting in a block matrix. It is a specialization of the tensor product (which is denoted by the same symbol) from vector ...
s and vectorization) and matrix calculus
In mathematics, matrix calculus is a specialized notation for doing multivariable calculus, especially over spaces of matrix (mathematics), matrices. It collects the various partial derivatives of a single Function (mathematics), function with ...
.
Robust statistics: Asymptotics
Another use of elliptical distributions is in robust statistics
Robust statistics are statistics that maintain their properties even if the underlying distributional assumptions are incorrect. Robust Statistics, statistical methods have been developed for many common problems, such as estimating location parame ...
, in which researchers examine how statistical procedures perform on the class of elliptical distributions, to gain insight into the procedures' performance on even more general problems,[
] for example by using the limiting theory of statistics ("asymptotics").
Economics and finance
Elliptical distributions are important in portfolio theory because, if the returns on all assets available for portfolio formation are jointly elliptically distributed, then all portfolios can be characterized completely by their location and scale – that is, any two portfolios with identical location and scale of portfolio return have identical distributions of portfolio return. Various features of portfolio analysis, including mutual fund separation theorem In Modern portfolio theory, portfolio theory, a mutual fund separation theorem, mutual fund theorem, or separation theorem is a theorem stating that, under certain conditions, any investor's optimal portfolio can be constructed by holding each of ce ...
s and the Capital Asset Pricing Model
In finance, the capital asset pricing model (CAPM) is a model used to determine a theoretically appropriate required rate of return of an asset, to make decisions about adding assets to a Diversification (finance), well-diversified Portfolio (f ...
, hold for all elliptical distributions.
Notes
References
*
*
*
*
*
*
*:Originally
*
*
*
Further reading
* A collection of papers.
{{DEFAULTSORT:Elliptical Distribution
Types of probability distributions
Location-scale family probability distributions
Multivariate statistics
Normal distribution
Ellipsoids
Ellipses