In
probability theory
Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...
and
statistics, the chi distribution is a continuous
probability distribution. It is the distribution of the positive square root of the sum of squares of a set of independent random variables each following a standard
normal distribution
In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is
:
f(x) = \frac e^
The parameter \mu i ...
, or equivalently, the distribution of the
Euclidean distance of the random variables from the origin. It is thus related to the
chi-squared distribution
In probability theory and statistics, the chi-squared distribution (also chi-square or \chi^2-distribution) with k degrees of freedom is the distribution of a sum of the squares of k independent standard normal random variables. The chi-squar ...
by describing the distribution of the positive square roots of a variable obeying a chi-squared distribution.
If
are
independent,
normally distributed random variables with mean 0 and
standard deviation 1, then the statistic
:
is distributed according to the chi distribution. The chi distribution has one parameter,
, which specifies the number of
degrees of freedom
Degrees of freedom (often abbreviated df or DOF) refers to the number of independent variables or parameters of a thermodynamic system. In various scientific fields, the word "freedom" is used to describe the limits to which physical movement or ...
(i.e. the number of random variables
).
The most familiar examples are the
Rayleigh distribution (chi distribution with two
degrees of freedom
Degrees of freedom (often abbreviated df or DOF) refers to the number of independent variables or parameters of a thermodynamic system. In various scientific fields, the word "freedom" is used to describe the limits to which physical movement or ...
) and the
Maxwell–Boltzmann distribution of the molecular speeds in an
ideal gas (chi distribution with three degrees of freedom).
Definitions
Probability density function
The
probability density function
In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) c ...
(pdf) of the chi-distribution is
:
where
is the
gamma function
In mathematics, the gamma function (represented by , the capital letter gamma from the Greek alphabet) is one commonly used extension of the factorial function to complex numbers. The gamma function is defined for all complex numbers except th ...
.
Cumulative distribution function
The cumulative distribution function is given by:
:
where
is the
regularized gamma function.
Generating functions
The
moment-generating function is given by:
:
where
is Kummer's
confluent hypergeometric function. The
characteristic function is given by:
:
Properties
Moments
The raw
moments are then given by:
:
where
is the
gamma function
In mathematics, the gamma function (represented by , the capital letter gamma from the Greek alphabet) is one commonly used extension of the factorial function to complex numbers. The gamma function is defined for all complex numbers except th ...
. Thus the first few raw moments are:
:
:
:
:
:
:
where the rightmost expressions are derived using the recurrence relationship for the gamma function:
:
From these expressions we may derive the following relationships:
Mean:
, which is close to
for large ''k''
Variance:
, which approaches
as ''k'' increases
Skewness:
Kurtosis excess:
Entropy
The entropy is given by:
:
where
is the
polygamma function.
Large n approximation
We find the large n=k+1 approximation of the mean and variance of chi distribution. This has application e.g. in finding the distribution of standard deviation of a sample of normally distributed population, where n is the sample size.
The mean is then:
:
We use the
Legendre duplication formula to write:
:
,
so that:
:
Using
Stirling's approximation
In mathematics, Stirling's approximation (or Stirling's formula) is an approximation for factorials. It is a good approximation, leading to accurate results even for small values of n. It is named after James Stirling, though a related but less p ...
for Gamma function, we get the following expression for the mean:
:
::