In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the multivariate ''t''-distribution (or multivariate Student distribution) is a
multivariate probability distribution. It is a generalization to
random vector
In probability, and statistics, a multivariate random variable or random vector is a list or vector of mathematical variables each of whose value is unknown, either because the value has not yet occurred or because there is imperfect knowledge ...
s of the
Student's ''t''-distribution, which is a distribution applicable to univariate
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s. While the case of a
random matrix
In probability theory and mathematical physics, a random matrix is a matrix-valued random variable—that is, a matrix in which some or all of its entries are sampled randomly from a probability distribution. Random matrix theory (RMT) is the ...
could be treated within this structure, the
matrix ''t''-distribution is distinct and makes particular use of the matrix structure.
Definition
One common method of construction of a multivariate ''t''-distribution, for the case of
dimensions, is based on the observation that if
and
are independent and distributed as
and
(i.e.
multivariate normal and
chi-squared distribution
In probability theory and statistics, the \chi^2-distribution with k Degrees of freedom (statistics), degrees of freedom is the distribution of a sum of the squares of k Independence (probability theory), independent standard normal random vari ...
s) respectively, the matrix
is a ''p'' × ''p'' matrix, and
is a constant vector then the random variable
has the density
:
and is said to be distributed as a multivariate ''t''-distribution with parameters
. Note that
is not the covariance matrix since the covariance is given by
(for
).
The constructive definition of a multivariate ''t''-distribution simultaneously serves as a sampling algorithm:
# Generate
and
, independently.
# Compute
.
This formulation gives rise to the hierarchical representation of a multivariate ''t''-distribution as a scale-mixture of normals:
where
indicates a gamma distribution with density proportional to
, and
conditionally follows
.
In the special case
, the distribution is a
multivariate Cauchy distribution.
Derivation
There are in fact many candidates for the multivariate generalization of
Student's ''t''-distribution. An extensive survey of the field has been given by Kotz and Nadarajah (2004). The essential issue is to define a probability density function of several variables that is the appropriate generalization of the formula for the univariate case. In one dimension (
), with
and
, we have the
probability density function
In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
:
and one approach is to use a corresponding function of several variables. This is the basic idea of
elliptical distribution
In probability and statistics, an elliptical distribution is any member of a broad family of probability distributions that generalize the multivariate normal distribution. In the simplified two and three dimensional case, the joint distribution f ...
theory, where one writes down a corresponding function of
variables
that replaces
by a quadratic function of all the
. It is clear that this only makes sense when all the marginal distributions have the same
degrees of freedom
In many scientific fields, the degrees of freedom of a system is the number of parameters of the system that may vary independently. For example, a point in the plane has two degrees of freedom for translation: its two coordinates; a non-infinite ...
. With
, one has a simple choice of multivariate density function
:
which is the standard but not the only choice.
An important special case is the standard bivariate ''t''-distribution, ''p'' = 2:
:
Note that
.
Now, if
is the identity matrix, the density is
:
The difficulty with the standard representation is revealed by this formula, which does not factorize into the product of the marginal one-dimensional distributions. When
is diagonal the standard representation can be shown to have zero
correlation
In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistics ...
but the
marginal distribution
In probability theory and statistics, the marginal distribution of a subset of a collection of random variables is the probability distribution of the variables contained in the subset. It gives the probabilities of various values of the variable ...
s are not
statistically independent
Independence is a fundamental notion in probability theory, as in statistics and the theory of stochastic processes. Two event (probability theory), events are independent, statistically independent, or stochastically independent if, informally s ...
.
A notable spontaneous occurrence of the elliptical multivariate distribution is its formal mathematical appearance when least squares methods are applied to multivariate normal data such as the classical Markowitz minimum variance econometric solution for asset portfolios.
Cumulative distribution function
The definition of the
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ever ...
(cdf) in one dimension can be extended to multiple dimensions by defining the following probability (here
is a real vector):
:
There is no simple formula for
, but it can b
approximated numericallyvia
Monte Carlo integration.
[
]
Conditional Distribution
This was developed by Muirhead
and Cornish. but later derived using the simpler chi-squared ratio representation above, by Roth
and Ding. Let vector
follow a multivariate ''t'' distribution and partition into two subvectors of
elements:
:
where
, the known mean vectors are
and the scale matrix is
.
Roth and Ding find the conditional distribution
to be a new ''t''-distribution with modified parameters.
:
An equivalent expression in Kotz et. al. is somewhat less concise.
Thus the conditional distribution is most easily represented as a two-step procedure. Form first the intermediate distribution
above then, using the parameters below, the explicit conditional distribution becomes
:
where
:
Effective degrees of freedom,
is augmented by the number of disused variables
.
:
is the conditional mean of
:
is the
Schur complement of
.
:
is the squared
Mahalanobis distance
The Mahalanobis distance is a distance measure, measure of the distance between a point P and a probability distribution D, introduced by Prasanta Chandra Mahalanobis, P. C. Mahalanobis in 1936. The mathematical details of Mahalanobis distance ...
of
from
with scale matrix
:
is the conditional scale matrix for
.
Copulas based on the multivariate ''t''
The use of such distributions is enjoying renewed interest due to applications in
mathematical finance
Mathematical finance, also known as quantitative finance and financial mathematics, is a field of applied mathematics, concerned with mathematical modeling in the financial field.
In general, there exist two separate branches of finance that req ...
, especially through the use of the Student's ''t''
copula.
Elliptical representation
Constructed as an
elliptical distribution
In probability and statistics, an elliptical distribution is any member of a broad family of probability distributions that generalize the multivariate normal distribution. In the simplified two and three dimensional case, the joint distribution f ...
, take the simplest centralised case with spherical symmetry and no scaling,
, then the multivariate ''
t''-PDF takes the form
:
where
and
= degrees of freedom as defined in Muirhead
section 1.5. The covariance of
is
:
The aim is to convert the Cartesian PDF to a radial one. Kibria and Joarder, define radial measure
and, noting that the density is dependent only on r
2, we get
which is equivalent to the variance of
-element vector
treated as a univariate heavy-tail zero-mean random sequence with uncorrelated, yet statistically dependent, elements.
Radial Distribution
follows the
Fisher-Snedecor or
distribution:
:
having mean value
.
-distributions arise naturally in tests of sums of squares of sampled data after normalization by the sample standard deviation.
By a change of random variable to
in the equation above, retaining
-vector
, we have
and probability distribution
:
which is a regular
Beta-prime distribution having mean value
.
Cumulative Radial Distribution
Given the Beta-prime distribution, the radial cumulative distribution function of
is known:
:
where
is the incomplete
Beta function
In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral
: \Beta(z_1,z_2) = \int_0^1 t^ ...
and applies with a spherical
assumption.
In the scalar case,
, the distribution is equivalent to Student-''t'' with the equivalence
, the variable ''t'' having double-sided tails for CDF purposes, i.e. the "two-tail-t-test".
The radial distribution can also be derived via a straightforward coordinate transformation from Cartesian to spherical. A constant radius surface at
with PDF
is an iso-density surface. Given this density value, the quantum of probability on a shell of surface area
and thickness
at
is
.
The enclosed
-sphere of radius
has surface area
. Substitution into
shows that the shell has element of probability
which is equivalent to radial density function
:
which further simplifies to
where
is the
Beta function
In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral
: \Beta(z_1,z_2) = \int_0^1 t^ ...
.
Changing the radial variable to
returns the previous Beta Prime distribution
:
To scale the radial variables without changing the radial shape function, define scale matrix
, yielding a 3-parameter Cartesian density function, ie. the probability
in volume element
is
:
or, in terms of scalar radial variable
,
:
Radial Moments
The moments of all the radial variables , with the spherical distribution assumption, can be derived from the Beta Prime distribution. If
then
, a known result. Thus, for variable
we have
:
The moments of
are
:
while introducing the scale matrix
yields
:
Moments relating to radial variable
are found by setting
and
whereupon
:
Linear Combinations and Affine Transformation
Full Rank Transform
This closely relates to the multivariate normal method and is described in Kotz and Nadarajah, Kibria and Joarder, Roth, and Cornish. Starting from a somewhat simplified version of the central MV-t pdf:
, where
is a constant and
is arbitrary but fixed, let
be a full-rank matrix and form vector
. Then, by straightforward change of variables
:
The matrix of partial derivatives is
and the Jacobian becomes
. Thus
:
The denominator reduces to
:
In full:
:
which is a regular MV-''t'' distribution.
In general if
and
has full rank
then
:
Marginal Distributions
This is a special case of the rank-reducing linear transform below. Kotz defines marginal distributions as follows. Partition
into two subvectors of
elements:
:
with
, means
, scale matrix
then
,
such that
:
:
If a transformation is constructed in the form
:
then vector
, as discussed below, has the same distribution as the marginal distribution of
.
Rank-Reducing Linear Transform
In the linear transform case, if
is a rectangular matrix
, of rank
the result is dimensionality reduction. Here, Jacobian
is seemingly rectangular but the value
in the denominator pdf is nevertheless correct. There is a discussion of rectangular matrix product determinants in Aitken. In general if
and
has full rank
then
:
:
''In extremis'', if ''m'' = 1 and
becomes a row vector, then scalar ''Y'' follows a univariate double-sided Student-t distribution defined by
with the same
degrees of freedom. Kibria et. al. use the affine transformation to find the marginal distributions which are also MV-''t''.
* During affine transformations of variables with elliptical distributions all vectors must ultimately derive from one initial isotropic spherical vector
whose elements remain 'entangled' and are not statistically independent.
* A vector of independent student-''t'' samples is not consistent with the multivariate ''t'' distribution.
* Adding two sample multivariate ''t'' vectors generated with independent Chi-squared samples and different
values:
will not produce internally consistent distributions, though they will yield a
Behrens-Fisher problem.
* Taleb compares many examples of fat-tail elliptical ''vs'' non-elliptical multivariate distributions
Related concepts
* In univariate statistics, the
Student's ''t''-test makes use of
Student's ''t''-distribution
* The elliptical multivariate-''t'' distribution arises spontaneously in linearly constrained least squares solutions involving multivariate normal source data, for example the Markowitz global minimum variance solution in financial portfolio analysis.
which addresses an ensemble of normal random vectors or a random matrix. It does not arise in ordinary least squares (OLS) or multiple regression with fixed dependent and independent variables which problem tends to produce well-behaved normal error probabilities.
*
Hotelling's ''T''-squared distribution is a distribution that arises in multivariate statistics.
* The
matrix ''t''-distribution is a distribution for random variables arranged in a matrix structure.
See also
*
Multivariate normal distribution
In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional ( univariate) normal distribution to higher dimensions. One d ...
, which is the limiting case of the multivariate Student's t-distribution when
.
*
Chi distribution
In probability theory and statistics, the chi distribution is a continuous probability distribution over the non-negative real line. It is the distribution of the positive square root of a sum of squared independent Gaussian random variables. E ...
, the
pdf
Portable document format (PDF), standardized as ISO 32000, is a file format developed by Adobe Inc., Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, computer hardware, ...
of the scaling factor in the construction the Student's t-distribution and also the
2-norm (or
Euclidean norm
Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are ''Euclidean spaces'' ...
) of a multivariate normally distributed vector (centered at zero).
**
Rayleigh distribution#Student's t, random vector length of multivariate ''t''-distribution
*
Mahalanobis distance
The Mahalanobis distance is a distance measure, measure of the distance between a point P and a probability distribution D, introduced by Prasanta Chandra Mahalanobis, P. C. Mahalanobis in 1936. The mathematical details of Mahalanobis distance ...
References
Literature
*
*
*
External links
Copula Methods vs Canonical Multivariate Distributions: the multivariate Student T distribution with general degrees of freedom
{{DEFAULTSORT:Multivariate Normal Distribution
Continuous distributions
Multivariate continuous distributions