Skew Normal Distribution
   HOME

TheInfoList



OR:

In
probability theory Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
and
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the skew normal distribution is a
continuous probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
that generalises the
normal distribution In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is f(x) = \frac ...
to allow for non-zero
skewness In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For a unimodal ...
.


Definition

Let \phi(x) denote the
standard normal In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is f(x) = \frac e^ ...
probability density function In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
:\phi(x)=\frace^ with the
cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ever ...
given by :\Phi(x) = \int_^ \phi(t)\ \mathrm dt = \frac \left 1 + \operatorname \left(\frac\right)\right where "erf" is the
error function In mathematics, the error function (also called the Gauss error function), often denoted by , is a function \mathrm: \mathbb \to \mathbb defined as: \operatorname z = \frac\int_0^z e^\,\mathrm dt. The integral here is a complex Contour integrat ...
. Then the probability density function (pdf) of the skew-normal distribution with parameter \alpha is given by :f(x) = 2\phi(x)\Phi(\alpha x). \, This distribution was first introduced by O'Hagan and Leonard (1976). Alternative forms to this distribution, with the corresponding quantile function, have been given by Ashour and Abdel-Hamid and by Mudholkar and Hutson. A stochastic process that underpins the distribution was described by Andel, Netuka and Zvara (1984). Both the distribution and its stochastic process underpinnings were consequences of the symmetry argument developed in Chan and Tong (1986), which applies to multivariate cases beyond normality, e.g. skew multivariate t distribution and others. The distribution is a particular case of a general class of distributions with probability density functions of the form f(x) = 2 \phi(x) \Phi(x) where \phi(\cdot) is any
PDF Portable document format (PDF), standardized as ISO 32000, is a file format developed by Adobe Inc., Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, computer hardware, ...
symmetric about zero and \Phi(\cdot) is any CDF whose PDF is symmetric about zero. To add
location In geography, location or place is used to denote a region (point, line, or area) on Earth's surface. The term ''location'' generally implies a higher degree of certainty than ''place'', the latter often indicating an entity with an ambiguous bou ...
and scale parameters to this, one makes the usual transform x\rightarrow\frac. One can verify that the normal distribution is recovered when \alpha = 0, and that the absolute value of the
skewness In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For a unimodal ...
increases as the absolute value of \alpha increases. The distribution is right skewed if \alpha>0 and is left skewed if \alpha<0. The probability density function with location \xi, scale \omega, and parameter \alpha becomes :f(x) = \frac\phi\left(\frac\right)\Phi\left(\alpha \left(\frac\right)\right). \, The skewness ( \gamma_1 ) of the distribution is limited to slightly less than the interval (-1,1) . As has been shown, the mode (maximum) m_o of the distribution is unique. For general \alpha there is no analytic expression for m_o , but a quite accurate (numerical) approximation is: \begin \delta &= \frac \\ m_o (\alpha) &\approx \sqrt\delta - \left(1-\frac\right) \frac - \frac e^ \\ \end


Estimation

Maximum likelihood In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed stati ...
estimates for \xi, \omega, and \alpha can be computed numerically, but no closed-form expression for the estimates is available unless \alpha=0. In contrast, the method of moments has a closed-form expression since the skewness equation can be inverted with :, \delta, = \sqrt where \delta = \frac and the sign of \delta is the same as the sign of \gamma_1. Consequently, \alpha = \frac, \omega = \frac, and \xi=\mu-\omega\delta\sqrt where \mu and \sigma are the mean and standard deviation. As long as the sample skewness \hat_1 is not too large, these formulas provide method of moments estimates \hat\alpha, \hat\omega, and \hat\xi based on a sample's \hat\mu, \hat\sigma, and \hat\gamma_1. The maximum (theoretical) skewness is obtained by setting in the skewness equation, giving \gamma_1 \approx 0.9952717. However it is possible that the sample skewness is larger, and then \alpha cannot be determined from these equations. When using the method of moments in an automatic fashion, for example to give starting values for maximum likelihood iteration, one should therefore let (for example) , \hat_1, = \min(0.99, , (1/n)\sum, ). Concern has been expressed about the impact of skew normal methods on the reliability of inferences based upon them.Pewsey, Arthur. "Problems of inference for Azzalini's skewnormal distribution." Journal of Applied Statistics 27.7 (2000): 859-870
/ref>


Related distributions

The exponentially modified normal distribution is another 3-parameter distribution that is a generalization of the normal distribution to skewed cases. The skew normal still has a normal-like tail in the direction of the skew, with a shorter tail in the other direction; that is, its density is asymptotically proportional to e^ for some positive k. Thus, in terms of the
seven states of randomness The seven states of randomness in probability theory, fractals and risk analysis are extensions of the concept of randomness as modeled by the normal distribution. These seven states were first introduced by BenoƮt Mandelbrot in his 1997 book ...
, it shows "proper mild randomness". In contrast, the exponentially modified normal has an exponential tail in the direction of the skew; its density is asymptotically proportional to e^. In the same terms, it shows "borderline mild randomness". Thus, the skew normal is useful for modeling skewed distributions which nevertheless have no more outliers than the normal, while the exponentially modified normal is useful for cases with an increased incidence of outliers in (just) one direction.


See also

*
Generalized normal distribution The generalized normal distribution (GND) or generalized Gaussian distribution (GGD) is either of two families of parametric continuous probability distributions on the real line. Both families add a shape parameter to the normal distribution. ...
*
Log-normal distribution In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normal distribution, normally distributed. Thus, if the random variable is log-normally distributed ...


References


External links


The multi-variate skew-normal distribution with an application to body mass, height and Body Mass Index



The Skew-Normal Probability Distribution (and related distributions, such as the skew-t)




{{DEFAULTSORT:Skew Normal Distribution Continuous distributions Normal distribution