In
probability theory
Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
and
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, kurtosis (from , ''kyrtos'' or ''kurtos'', meaning "curved, arching") refers to the degree of “tailedness” in the
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
of a
real-valued
In mathematics, value may refer to several, strongly related notions.
In general, a mathematical value may be any definite mathematical object. In elementary mathematics, this is most often a number – for example, a real number such as or an ...
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
. Similar to
skewness
In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined.
For a unimodal ...
, kurtosis provides insight into specific characteristics of a distribution. Various methods exist for quantifying kurtosis in theoretical distributions, and corresponding techniques allow estimation based on sample data from a population. It’s important to note that different measures of kurtosis can yield varying
interpretations.
The standard measure of a distribution's kurtosis, originating with
Karl Pearson
Karl Pearson (; born Carl Pearson; 27 March 1857 – 27 April 1936) was an English biostatistician and mathematician. He has been credited with establishing the discipline of mathematical statistics. He founded the world's first university ...
, is a scaled version of the fourth
moment of the distribution. This number is related to the tails of the distribution, not its peak; hence, the sometimes-seen characterization of kurtosis as "
peakedness" is incorrect. For this measure, higher kurtosis corresponds to greater extremity of
deviations (or
outlier
In statistics, an outlier is a data point that differs significantly from other observations. An outlier may be due to a variability in the measurement, an indication of novel data, or it may be the result of experimental error; the latter are ...
s), and not the configuration of data near
the mean.
Excess kurtosis, typically compared to a value of 0, characterizes the “tailedness” of a distribution. A univariate
normal distribution
In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is
f(x) = \frac ...
has an excess kurtosis of 0. Negative excess kurtosis indicates a platykurtic distribution, which doesn’t necessarily have a flat top but produces fewer or less extreme outliers than the normal distribution. For instance, the
uniform distribution (i.e. one that is uniformly finite over some bound and zero elsewhere) is platykurtic. On the other hand, positive excess kurtosis signifies a leptokurtic distribution. The
Laplace distribution
In probability theory and statistics, the Laplace distribution is a continuous probability distribution named after Pierre-Simon Laplace. It is also sometimes called the double exponential distribution, because it can be thought of as two exponen ...
, for example, has tails that decay more slowly than a Gaussian, resulting in more outliers. To simplify comparison with the normal distribution, excess kurtosis is calculated as Pearson’s kurtosis minus 3. Some authors and software packages use “kurtosis” to refer specifically to excess kurtosis, but this article distinguishes between the two for clarity.
Alternative measures of kurtosis are: the
L-kurtosis
In statistics, L-moments are a sequence of statistics used to summarize the shape of a probability distribution. They are linear combinations of order statistics ( L-statistics) analogous to conventional moments, and can be used to calculate qua ...
, which is a scaled version of the fourth
L-moment
In statistics, L-moments are a sequence of statistics used to summarize the shape of a probability distribution. They are linear combinations of order statistics ( L-statistics) analogous to conventional moments, and can be used to calculate qua ...
; measures based on four population or sample
quantiles
In statistics and probability, quantiles are cut points dividing the range of a probability distribution into continuous intervals with equal probabilities or dividing the observations in a sample in the same way. There is one fewer quantile than ...
. These are analogous to the alternative measures of
skewness
In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined.
For a unimodal ...
that are not based on ordinary moments.
Pearson moments
The kurtosis is the fourth
standardized moment
In probability theory and statistics, a standardized moment of a probability distribution is a moment (often a higher degree central moment) that is normalized, typically by a power of the standard deviation, rendering the moment scale invariant ...
, defined as
where is the fourth
central moment
In probability theory and statistics, a central moment is a moment of a probability distribution of a random variable about the random variable's mean; that is, it is the expected value of a specified integer power of the deviation of the random ...
and is the
standard deviation
In statistics, the standard deviation is a measure of the amount of variation of the values of a variable about its Expected value, mean. A low standard Deviation (statistics), deviation indicates that the values tend to be close to the mean ( ...
. Several letters are used in the literature to denote the kurtosis. A very common choice is , which is fine as long as it is clear that it does not refer to a
cumulant
In probability theory and statistics, the cumulants of a probability distribution are a set of quantities that provide an alternative to the '' moments'' of the distribution. Any two probability distributions whose moments are identical will have ...
. Other choices include , to be similar to the notation for skewness, although sometimes this is instead reserved for the excess kurtosis.
The kurtosis is bounded below by the squared
skewness
In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined.
For a unimodal ...
plus 1:
where is the third
central moment
In probability theory and statistics, a central moment is a moment of a probability distribution of a random variable about the random variable's mean; that is, it is the expected value of a specified integer power of the deviation of the random ...
. The lower bound is realized by the
Bernoulli distribution
In probability theory and statistics, the Bernoulli distribution, named after Swiss mathematician Jacob Bernoulli, is the discrete probability distribution of a random variable which takes the value 1 with probability p and the value 0 with pro ...
. There is no upper limit to the kurtosis of a general probability distribution, and it may be infinite.
A reason why some authors favor the excess kurtosis is that cumulants are
extensive. Formulas related to the extensive property are more naturally expressed in terms of the excess kurtosis. For example, let be independent random variables for which the fourth moment exists, and let be the random variable defined by the sum of the . The excess kurtosis of is
where
is the standard deviation of . In particular if all of the have the same variance, then this simplifies to
The reason not to subtract 3 is that the bare
moment better generalizes to
multivariate distribution
Multivariate is the quality of having multiple variables.
It may also refer to:
In mathematics
* Multivariable calculus
* Multivariate function
* Multivariate polynomial
* Multivariate interpolation
* Multivariate optimization
In computing
* ...
s, especially when independence is not assumed. The
cokurtosis
In probability theory and statistics, cokurtosis is a measure of how much two random variables change together. Cokurtosis is the fourth standardized cross central moment. If two random variables exhibit a high level of cokurtosis they will tend t ...
between pairs of variables is an order four
tensor
In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects associated with a vector space. Tensors may map between different objects such as vectors, scalars, and even other ...
. For a bivariate normal distribution, the cokurtosis tensor has off-diagonal terms that are neither 0 nor 3 in general, so attempting to "correct" for an excess becomes confusing. It is true, however, that the joint cumulants of degree greater than two for any
multivariate normal distribution
In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional ( univariate) normal distribution to higher dimensions. One d ...
are zero.
For two random variables, and , not necessarily independent, the kurtosis of the sum, , is
Note that the fourth-power
binomial coefficient
In mathematics, the binomial coefficients are the positive integers that occur as coefficients in the binomial theorem. Commonly, a binomial coefficient is indexed by a pair of integers and is written \tbinom. It is the coefficient of the t ...
s (1, 4, 6, 4, 1) appear in the above equation.
Interpretation
The interpretation of the Pearson measure of kurtosis (or excess kurtosis) was once debated, but it is now well-established. As noted by Westfall in 2014, "...''its unambiguous interpretation relates to tail extremity.'' Specifically, it reflects either the presence of existing outliers (for sample kurtosis) or the tendency to produce outliers (for the kurtosis of a probability distribution). The underlying logic is straightforward: Kurtosis represents the average (or
expected value
In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...
) of standardized data raised to the fourth power. Standardized values less than 1—corresponding to data within one standard deviation of the mean (where the “peak” occurs)—contribute minimally to kurtosis. This is because raising a number less than 1 to the fourth power brings it closer to zero. The meaningful contributors to kurtosis are data values outside the peak region, i.e., the outliers. Therefore, kurtosis primarily measures outliers and provides no information about the central "peak".
Numerous misconceptions about kurtosis relate to notions of peakedness. One such misconception is that kurtosis measures both the “peakedness” of a distribution and the
heaviness of its tail . Other incorrect interpretations include notions like “lack of shoulders” (where the “shoulder” refers vaguely to the area between the peak and the tail, or more specifically, the region about one
standard deviation
In statistics, the standard deviation is a measure of the amount of variation of the values of a variable about its Expected value, mean. A low standard Deviation (statistics), deviation indicates that the values tend to be close to the mean ( ...
from the mean) or “bimodality.” Balanda and
MacGillivray argue that the standard definition of kurtosis “poorly captures the kurtosis, peakedness, or tail weight of a distribution.”Instead, they propose a vague definition of kurtosis as the location- and scale-free movement of
probability mass from the distribution’s shoulders into its center and tails.
Moors' interpretation
In 1986, Moors gave an interpretation of kurtosis. Let
where is a random variable, is the mean and is the standard deviation.
Now by definition of the kurtosis
, and by the well-known identity
The kurtosis can now be seen as a measure of the dispersion of around its expectation. Alternatively it can be seen to be a measure of the dispersion of around and . attains its minimal value in a symmetric two-point distribution. In terms of the original variable , the kurtosis is a measure of the dispersion of around the two values .
High values of arise in two circumstances:
* where the probability mass is concentrated around the mean and the data-generating process produces occasional values far from the mean
* where the probability mass is concentrated in the tails of the distribution.
Maximal entropy
The
entropy
Entropy is a scientific concept, most commonly associated with states of disorder, randomness, or uncertainty. The term and the concept are used in diverse fields, from classical thermodynamics, where it was first recognized, to the micros ...
of a distribution is
.
For any
with
positive definite, among all probability distributions on
with mean
and covariance
, the normal distribution
has the largest entropy.
Since mean
and covariance
are the first two moments, it is natural to consider extension to higher moments. In fact, by
Lagrange multiplier
In mathematical optimization, the method of Lagrange multipliers is a strategy for finding the local maxima and minima of a function (mathematics), function subject to constraint (mathematics), equation constraints (i.e., subject to the conditio ...
method, for any prescribed first n moments, if there exists some probability distribution of form
that has the prescribed moments (if it is feasible), then it is the maximal entropy distribution under the given constraints.
By serial expansion,
so if a random variable has probability distribution
, where
is a normalization constant, then its kurtosis is
Excess kurtosis
The ''excess kurtosis'' is defined as kurtosis minus 3. There are 3 distinct regimes as described below.
Mesokurtic
Distributions with zero excess kurtosis are called mesokurtic, or mesokurtotic. The most prominent example of a mesokurtic distribution is the normal distribution family, regardless of the values of its
parameter
A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
s. A few other well-known distributions can be mesokurtic, depending on parameter values: for example, the
binomial distribution
In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
is mesokurtic for
.
Leptokurtic
A distribution with
positive
Positive is a property of positivity and may refer to:
Mathematics and science
* Positive formula, a logical formula not containing negation
* Positive number, a number that is greater than 0
* Plus sign, the sign "+" used to indicate a positi ...
excess kurtosis is called leptokurtic, or leptokurtotic. "Lepto-" means "slender". A leptokurtic distribution has ''
fatter tails''. Examples of leptokurtic distributions include the
Student's t-distribution
In probability theory and statistics, Student's distribution (or simply the distribution) t_\nu is a continuous probability distribution that generalizes the Normal distribution#Standard normal distribution, standard normal distribu ...
,
Rayleigh distribution
In probability theory and statistics, the Rayleigh distribution is a continuous probability distribution for nonnegative-valued random variables. Up to rescaling, it coincides with the chi distribution with two degrees of freedom.
The distributi ...
,
Laplace distribution
In probability theory and statistics, the Laplace distribution is a continuous probability distribution named after Pierre-Simon Laplace. It is also sometimes called the double exponential distribution, because it can be thought of as two exponen ...
,
exponential distribution
In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuousl ...
,
Poisson distribution
In probability theory and statistics, the Poisson distribution () is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time if these events occur with a known const ...
and the
logistic distribution
In probability theory and statistics, the logistic distribution is a continuous probability distribution. Its cumulative distribution function is the logistic function, which appears in logistic regression and feedforward neural networks. It rese ...
. Such distributions are sometimes termed ''super-Gaussian''.
Platykurtic

A distribution with
negative excess kurtosis is called platykurtic, or platykurtotic. "Platy-" means "broad". A platykurtic distribution has ''thinner tails''. Examples of platykurtic distributions include the
continuous
Continuity or continuous may refer to:
Mathematics
* Continuity (mathematics), the opposing concept to discreteness; common examples include
** Continuous probability distribution or random variable in probability and statistics
** Continuous ...
and
discrete uniform distribution
In probability theory and statistics, the discrete uniform distribution is a symmetric probability distribution wherein each of some finite whole number ''n'' of outcome values are equally likely to be observed. Thus every one of the ''n'' out ...
s, and the
raised cosine distribution
In probability theory and statistics, the raised cosine distribution is a continuous probability distribution supported on the interval mu-s,\mu+s/math>. The probability density function (PDF) is
:f(x;\mu,s)=\frac
\left +\cos\left(\frac\,\pi\ri ...
. The most platykurtic distribution of all is the
Bernoulli distribution
In probability theory and statistics, the Bernoulli distribution, named after Swiss mathematician Jacob Bernoulli, is the discrete probability distribution of a random variable which takes the value 1 with probability p and the value 0 with pro ...
with ''p'' = 1/2 (for example the number of times one obtains "heads" when flipping a coin once, a
coin toss
A coin is a small object, usually round and flat, used primarily as a medium of exchange or legal tender. They are standardized in weight, and produced in large quantities at a mint in order to facilitate trade. They are most often issued by a ...
), for which the excess kurtosis is −2.
Graphical examples
The Pearson type VII family
The effects of kurtosis are illustrated using a
parametric family
In mathematics and its applications, a parametric family or a parameterized family is a indexed family, family of objects (a set of related objects) whose differences depend only on the chosen values for a set of parameters.
Common examples are p ...
of distributions whose kurtosis can be adjusted while their lower-order moments and cumulants remain constant. Consider the
Pearson type VII family, which is a special case of the
Pearson type IV family restricted to symmetric densities. The
probability density function
In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
is given by
where is a
scale parameter
In probability theory and statistics, a scale parameter is a special kind of numerical parameter of a parametric family of probability distributions. The larger the scale parameter, the more spread out the distribution.
Definition
If a family ...
and is a
shape parameter
In probability theory and statistics, a shape parameter (also known as form parameter) is a kind of numerical parameter of a parametric family of probability distributionsEveritt B.S. (2002) Cambridge Dictionary of Statistics. 2nd Edition. CUP.
th ...
.
All densities in this family are symmetric. The -th moment exists provided . For the kurtosis to exist, we require . Then the mean and
skewness
In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined.
For a unimodal ...
exist and are both identically zero. Setting makes the variance equal to unity. Then the only free parameter is , which controls the fourth moment (and cumulant) and hence the kurtosis. One can reparameterize with
, where
is the excess kurtosis as defined above. This yields a one-parameter leptokurtic family with zero mean, unit variance, zero skewness, and arbitrary non-negative excess kurtosis. The reparameterized density is
In the limit as
one obtains the density
which is shown as the red curve in the images on the right.
In the other direction as
one obtains the
standard normal
In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is
f(x) = \frac e^ ...
density as the limiting distribution, shown as the black curve.
In the images on the right, the blue curve represents the density
with excess kurtosis of 2. The top image shows that leptokurtic densities in this family have a higher peak than the mesokurtic normal density, although this conclusion is only valid for this select family of distributions. The comparatively fatter tails of the leptokurtic densities are illustrated in the second image, which plots the natural logarithm of the Pearson type VII densities: the black curve is the logarithm of the standard normal density, which is a
parabola
In mathematics, a parabola is a plane curve which is Reflection symmetry, mirror-symmetrical and is approximately U-shaped. It fits several superficially different Mathematics, mathematical descriptions, which can all be proved to define exactl ...
. One can see that the normal density allocates little probability mass to the regions far from the mean ("has thin tails"), compared with the blue curve of the leptokurtic Pearson type VII density with excess kurtosis of 2. Between the blue curve and the black are other Pearson type VII densities with = 1, 1/2, 1/4, 1/8, and 1/16. The red curve again shows the upper limit of the Pearson type VII family, with
(which, strictly speaking, means that the fourth moment does not exist). The red curve decreases the slowest as one moves outward from the origin ("has fat tails").
Other well-known distributions
Several well-known, unimodal, and symmetric distributions from different parametric families are compared here. Each has a mean and skewness of zero. The parameters have been chosen to result in a variance equal to 1 in each case. The images on the right show curves for the following seven densities, on a
linear scale
A linear scale, also called a bar scale, scale bar, graphic scale, or graphical scale, is a means of visually showing the scale of a map, nautical chart, engineering drawing, or architectural drawing. A scale bar is common element of map layo ...
and
logarithmic scale
A logarithmic scale (or log scale) is a method used to display numerical data that spans a broad range of values, especially when there are significant differences among the magnitudes of the numbers involved.
Unlike a linear Scale (measurement) ...
:
* D:
Laplace distribution
In probability theory and statistics, the Laplace distribution is a continuous probability distribution named after Pierre-Simon Laplace. It is also sometimes called the double exponential distribution, because it can be thought of as two exponen ...
, also known as the double exponential distribution, red curve (two straight lines in the log-scale plot), excess kurtosis = 3
* S:
hyperbolic secant distribution
In probability theory and statistics, the hyperbolic secant distribution is a continuous probability distribution whose probability density function and characteristic function are proportional to the hyperbolic secant function. The hyperbolic se ...
, orange curve, excess kurtosis = 2
* L:
logistic distribution
In probability theory and statistics, the logistic distribution is a continuous probability distribution. Its cumulative distribution function is the logistic function, which appears in logistic regression and feedforward neural networks. It rese ...
, green curve, excess kurtosis = 1.2
* N:
normal distribution
In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is
f(x) = \frac ...
, black curve (inverted parabola in the log-scale plot), excess kurtosis = 0
* C:
raised cosine distribution
In probability theory and statistics, the raised cosine distribution is a continuous probability distribution supported on the interval mu-s,\mu+s/math>. The probability density function (PDF) is
:f(x;\mu,s)=\frac
\left +\cos\left(\frac\,\pi\ri ...
, cyan curve, excess kurtosis = −0.593762...
* W:
Wigner semicircle distribution
The Wigner semicircle distribution, named after the physicist Eugene Wigner, is the probability distribution defined on the domain minus;''R'', ''R''whose probability density function ''f'' is a scaled semicircle, i.e. a semi-ellipse, centered at ...
, blue curve, excess kurtosis = −1
* U:
uniform distribution, magenta curve (shown for clarity as a rectangle in both images), excess kurtosis = −1.2.
Note that in these cases the platykurtic densities have bounded
support
Support may refer to:
Arts, entertainment, and media
* Supporting character
* Support (art), a solid surface upon which a painting is executed
Business and finance
* Support (technical analysis)
* Child support
* Customer support
* Income Su ...
, whereas the densities with positive or zero excess kurtosis are supported on the whole
real line
A number line is a graphical representation of a straight line that serves as spatial representation of numbers, usually graduated like a ruler with a particular origin (geometry), origin point representing the number zero and evenly spaced mark ...
.
One cannot infer that high or low kurtosis distributions have the characteristics indicated by these examples. There exist platykurtic densities with infinite support,
*e.g.,
exponential power distribution
The generalized normal distribution (GND) or generalized Gaussian distribution (GGD) is either of two families of parametric statistics, parametric continuous probability distributions on the real number, real line. Both families add a shape para ...
s with sufficiently large shape parameter ''b''
and there exist leptokurtic densities with finite support.
*e.g., a distribution that is uniform between −3 and −0.3, between −0.3 and 0.3, and between 0.3 and 3, with the same density in the (−3, −0.3) and (0.3, 3) intervals, but with 20 times more density in the (−0.3, 0.3) interval
Also, there exist platykurtic densities with infinite peakedness,
*e.g., an equal mixture of the
beta distribution
In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval , 1
The comma is a punctuation mark that appears in several variants in different languages. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical; others give it the appearance of a miniature fille ...
or (0, 1) in terms of two positive Statistical parameter, parameters, denoted by ''alpha'' (''α'') an ...
with parameters 0.5 and 1 with its reflection about 0.0
and there exist leptokurtic densities that appear flat-topped,
*e.g., a mixture of distribution that is uniform between −1 and 1 with a T(4.0000001)
Student's t-distribution
In probability theory and statistics, Student's distribution (or simply the distribution) t_\nu is a continuous probability distribution that generalizes the Normal distribution#Standard normal distribution, standard normal distribu ...
, with mixing probabilities 0.999 and 0.001.
Sample kurtosis
Definitions
A natural but biased estimator
For a
sample
Sample or samples may refer to:
* Sample (graphics), an intersection of a color channel and a pixel
* Sample (material), a specimen or small quantity of something
* Sample (signal), a digital discrete sample of a continuous analog signal
* Sample ...
of ''n'' values, a
method of moments estimator of the population excess kurtosis can be defined as
where is the fourth sample
moment about the mean
In probability theory and statistics, a central moment is a moment of a probability distribution of a random variable about the random variable's mean; that is, it is the expected value of a specified integer power of the deviation of the random ...
, ''m''
2 is the second sample moment about the mean (that is, the
sample variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion, ...
), is the -th value, and
is the
sample mean
The sample mean (sample average) or empirical mean (empirical average), and the sample covariance or empirical covariance are statistics computed from a sample of data on one or more random variables.
The sample mean is the average value (or me ...
.
This formula has the simpler representation,
where the
values are the standardized data values using the standard deviation defined using rather than in the denominator.
For example, suppose the data values are 0, 3, 4, 1, 2, 3, 0, 2, 1, 3, 2, 0, 2, 2, 3, 2, 5, 2, 3, 999.
Then the values are −0.239, −0.225, −0.221, −0.234, −0.230, −0.225, −0.239, −0.230, −0.234, −0.225, −0.230, −0.239, −0.230, −0.230, −0.225, −0.230, −0.216, −0.230, −0.225, 4.359
and the values are 0.003, 0.003, 0.002, 0.003, 0.003, 0.003, 0.003, 0.003, 0.003, 0.003, 0.003, 0.003, 0.003, 0.003, 0.003, 0.003, 0.002, 0.003, 0.003, 360.976.
The average of these values is 18.05 and the excess kurtosis is thus . This example makes it clear that data near the "middle" or "peak" of the distribution do not contribute to the kurtosis statistic, hence kurtosis does not measure "peakedness". It is simply a measure of the outlier, 999 in this example.
Standard unbiased estimator
Given a sub-set of samples from a population, the sample excess kurtosis
above is a
biased estimator
In statistics, the bias of an estimator (or bias function) is the difference between this estimator's expected value and the true value of the parameter being estimated. An estimator or decision rule with zero bias is called ''unbiased''. In stat ...
of the population excess kurtosis. An alternative estimator of the population excess kurtosis, which is unbiased in random samples of a normal distribution, is defined as follows:
where is the unique symmetric
unbiased
Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is inaccurate, closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individ ...
estimator of the fourth
cumulant
In probability theory and statistics, the cumulants of a probability distribution are a set of quantities that provide an alternative to the '' moments'' of the distribution. Any two probability distributions whose moments are identical will have ...
, is the unbiased estimate of the second cumulant (identical to the unbiased estimate of the sample variance), is the fourth sample moment about the mean, is the second sample moment about the mean, is the -th value, and
is the sample mean. This adjusted Fisher–Pearson standardized moment coefficient
is the version found in
Excel and several statistical packages including
Minitab,
SAS, and
SPSS
SPSS Statistics is a statistical software suite developed by IBM for data management, advanced analytics, multivariate analysis, business intelligence, and criminal investigation. Long produced by SPSS Inc., it was acquired by IBM in 2009. Versi ...
.
[Doane DP, Seward LE (2011) J Stat Educ 19 (2)]
Unfortunately, in nonnormal samples
is itself generally biased.
Upper bound
An upper bound for the sample kurtosis of () real numbers is
where
is the corresponding sample skewness.
Variance under normality
The variance of the sample kurtosis of a sample of size from the
normal distribution
In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is
f(x) = \frac ...
is
Stated differently, under the assumption that the underlying random variable
is normally distributed, it can be shown that
.
Applications
The sample kurtosis is a useful measure of whether there is a problem with outliers in a data set. Larger kurtosis indicates a more serious outlier problem, and may lead the researcher to choose alternative statistical methods.
D'Agostino's K-squared test
In statistics, D'Agostino's ''K''2 test, named for Ralph D'Agostino, is a goodness-of-fit measure of departure from normality, that is the test aims to gauge the compatibility of given data with the null hypothesis that the data is a realizatio ...
is a
goodness-of-fit
The goodness of fit of a statistical model describes how well it fits a set of observations. Measures of goodness of fit typically summarize the discrepancy between observed values and the values expected under the model in question. Such measure ...
normality test
In statistics, normality tests are used to determine if a data set is well-modeled by a normal distribution
In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distributio ...
based on a combination of the sample skewness and sample kurtosis, as is the
Jarque–Bera test
In statistics, the Jarque–Bera test is a goodness-of-fit test of whether sample data have the skewness and kurtosis matching a normal distribution. The test is named after Carlos Jarque and Anil K. Bera.
The test statistic is always nonnegativ ...
for normality.
For non-normal samples, the variance of the sample variance depends on the kurtosis; for details, please see
variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
.
Pearson's definition of kurtosis is used as an indicator of intermittency in
turbulence
In fluid dynamics, turbulence or turbulent flow is fluid motion characterized by chaotic changes in pressure and flow velocity. It is in contrast to laminar flow, which occurs when a fluid flows in parallel layers with no disruption between ...
. It is also used in magnetic resonance imaging to quantify non-Gaussian diffusion.
A concrete example is the following lemma by He, Zhang, and Zhang:
Assume a random variable has expectation
, variance
and kurtosis
Assume we sample
many independent copies. Then
This shows that with
many samples, we will see one that is above the expectation with probability at least
.
In other words: If the kurtosis is large, we might see a lot values either all below or above the mean.
Kurtosis convergence
Applying
band-pass filter
A band-pass filter or bandpass filter (BPF) is a device that passes frequencies within a certain range and rejects ( attenuates) frequencies outside that range.
It is the inverse of a '' band-stop filter''.
Description
In electronics and s ...
s to
digital image
A digital image is an image composed of picture elements, also known as pixels, each with '' finite'', '' discrete quantities'' of numeric representation for its intensity or gray level that is an output from its two-dimensional functions f ...
s, kurtosis values tend to be uniform, independent of the range of the filter. This behavior, termed ''kurtosis convergence'', can be used to detect image splicing in
forensic analysis
Forensic science combines principles of law and science to investigate criminal activity. Through crime scene investigations and laboratory analysis, forensic scientists are able to link suspects to evidence. An example is determining the time and ...
.
Seismic signal analysis
Kurtosis can be used in
geophysics
Geophysics () is a subject of natural science concerned with the physical processes and Physical property, properties of Earth and its surrounding space environment, and the use of quantitative methods for their analysis. Geophysicists conduct i ...
to distinguish different types of
seismic signals. It is particularly effective in differentiating seismic signals generated by human footsteps from other signals. This is useful in security and surveillance systems that rely on seismic detection.
Weather prediction
In
meteorology
Meteorology is the scientific study of the Earth's atmosphere and short-term atmospheric phenomena (i.e. weather), with a focus on weather forecasting. It has applications in the military, aviation, energy production, transport, agricultur ...
, kurtosis is used to analyze weather data distributions. It helps predict extreme weather events by assessing the probability of outlier values in historical data,
which is valuable for long-term climate studies and short-term weather forecasting.
Other measures
A different measure of "kurtosis" is provided by using
L-moment
In statistics, L-moments are a sequence of statistics used to summarize the shape of a probability distribution. They are linear combinations of order statistics ( L-statistics) analogous to conventional moments, and can be used to calculate qua ...
s instead of the ordinary moments.
See also
*
Kurtosis risk
In statistics and decision theory, kurtosis risk is the risk that results when a statistical model assumes the normal distribution, but is applied to observations that have a tendency to occasionally be much farther (in terms of number of standar ...
*
Maximum entropy probability distribution
In statistics and information theory, a maximum entropy probability distribution has entropy that is at least as great as that of all other members of a specified class of probability distributions. According to the principle of maximum entropy, ...
References
Further reading
Alternative source(Comparison of kurtosis estimators)
*
External links
*
Kurtosis calculatorFree Online Software (Calculator)computes various types of skewness and kurtosis statistics for any dataset (includes small and large sample tests)..
on th
Celebrating 100 years of Kurtosisa history of the topic, with different measures of kurtosis.
{{Statistics, descriptive
Moments (mathematics)
Statistical deviation and dispersion