In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, L-moments are a sequence of statistics used to summarize the shape of a
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
.
They are
linear combination
In mathematics, a linear combination or superposition is an Expression (mathematics), expression constructed from a Set (mathematics), set of terms by multiplying each term by a constant and adding the results (e.g. a linear combination of ''x'' a ...
s of
order statistic
In statistics, the ''k''th order statistic of a statistical sample is equal to its ''k''th-smallest value. Together with Ranking (statistics), rank statistics, order statistics are among the most fundamental tools in non-parametric statistics and ...
s (
L-statistics) analogous to conventional
moments, and can be used to calculate quantities analogous to
standard deviation
In statistics, the standard deviation is a measure of the amount of variation of the values of a variable about its Expected value, mean. A low standard Deviation (statistics), deviation indicates that the values tend to be close to the mean ( ...
,
skewness
In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined.
For a unimodal ...
and
kurtosis
In probability theory and statistics, kurtosis (from , ''kyrtos'' or ''kurtos'', meaning "curved, arching") refers to the degree of “tailedness” in the probability distribution of a real-valued random variable. Similar to skewness, kurtos ...
, termed the L-scale, L-skewness and L-kurtosis respectively (the L-mean is identical to the conventional
mean
A mean is a quantity representing the "center" of a collection of numbers and is intermediate to the extreme values of the set of numbers. There are several kinds of means (or "measures of central tendency") in mathematics, especially in statist ...
). Standardised L-moments are called L-moment ratios and are analogous to
standardized moment
In probability theory and statistics, a standardized moment of a probability distribution is a moment (often a higher degree central moment) that is normalized, typically by a power of the standard deviation, rendering the moment scale invariant ...
s. Just as for conventional moments, a theoretical distribution has a set of population L-moments. Sample L-moments can be defined for a sample from the population, and can be used as estimators of the population L-moments.
Population L-moments
For a random variable , the th population L-moment is
[
where denotes the th ]order statistic
In statistics, the ''k''th order statistic of a statistical sample is equal to its ''k''th-smallest value. Together with Ranking (statistics), rank statistics, order statistics are among the most fundamental tools in non-parametric statistics and ...
(th smallest value) in an independent
Independent or Independents may refer to:
Arts, entertainment, and media Artist groups
* Independents (artist group), a group of modernist painters based in Pennsylvania, United States
* Independentes (English: Independents), a Portuguese artist ...
sample of size from the distribution of and denotes expected value
In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...
operator. In particular, the first four population L-moments are
Note that the coefficients of the th L-moment are the same as in the th term of the binomial transform, as used in the -order finite difference
A finite difference is a mathematical expression of the form . Finite differences (or the associated difference quotients) are often used as approximations of derivatives, such as in numerical differentiation.
The difference operator, commonly d ...
(finite analog to the derivative).
The first two of these L-moments have conventional names:
* is the "mean", "L-mean", or "L-location",
* is the "L-scale".
The L-scale is equal to half the Mean absolute difference.[
]
Analytic calculation
Expectations are often defined in terms of probability density function
In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
s, but the connection in terms of these between the order statistics and their underlying random variable is rather remote. A closer connection can be found in terms of cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ever ...
s (CDFs), since these (see this section) satisfy
In particular one may define polynomials and express .
Having a CDF , the expectation may be expressed using a Stieltjes integral
Thomas Joannes Stieltjes ( , ; 29 December 1856 – 31 December 1894) was a Dutch mathematician. He was a pioneer in the field of moment problems and contributed to the study of continued fractions. The Thomas Stieltjes Institute for Mathematics ...
as
thus
where is straight off the derivative of . This integral can often be made more tractable by introducing the quantile function via the change of variables :
Since the L-moments are linear combinations of such expectations, the corresponding integrals can be combined into one for each moment, where the integrand is times a polynomial. We have
where
are the shifted Legendre polynomials, orthogonal on .
In particular
Sillitto's Theorem
The above integral formula for has the form of a generalised Fourier coefficient, and they appeared as such in the literature years before being named moments. In the notation of this article, Sillitto proved
However Hosking cautions that partial sum
In mathematics, a series is, roughly speaking, an addition of infinitely many terms, one after the other. The study of series is a major part of calculus and its generalization, mathematical analysis. Series are used in most areas of mathemati ...
s of this series tend to give poor approximations for the tails of the distribution, and need not be monotonic. Similar problems arise with the Cornish–Fisher expansion of in terms of the cumulant
In probability theory and statistics, the cumulants of a probability distribution are a set of quantities that provide an alternative to the '' moments'' of the distribution. Any two probability distributions whose moments are identical will have ...
s of .
Sample L-moments
The sample L-moments can be computed as the population L-moments of the sample, summing over ''r''-element subsets of the sample hence averaging by dividing by the binomial coefficient
In mathematics, the binomial coefficients are the positive integers that occur as coefficients in the binomial theorem. Commonly, a binomial coefficient is indexed by a pair of integers and is written \tbinom. It is the coefficient of the t ...
:
Grouping these by order statistic counts the number of ways an element of an element sample can be the th element of an element subset, and yields formulas of the form below. Direct estimators for the first four L-moments in a finite sample of observations are:[
]
where is the th order statistic
In statistics, the ''k''th order statistic of a statistical sample is equal to its ''k''th-smallest value. Together with Ranking (statistics), rank statistics, order statistics are among the most fundamental tools in non-parametric statistics and ...
and is a binomial coefficient
In mathematics, the binomial coefficients are the positive integers that occur as coefficients in the binomial theorem. Commonly, a binomial coefficient is indexed by a pair of integers and is written \tbinom. It is the coefficient of the t ...
. Sample L-moments can also be defined indirectly in terms of probability weighted moments,[
][
]
which leads to a more efficient algorithm
In mathematics and computer science, an algorithm () is a finite sequence of Rigour#Mathematics, mathematically rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algo ...
for their computation.[
]
L-moment ratios
A set of ''L-moment ratios'', or scaled L-moments, is defined by
The most useful of these are called the ''L-skewness'', and the ''L-kurtosis''.
L-moment ratios lie within the interval Tighter bounds can be found for some specific L-moment ratios; in particular, the L-kurtosis lies in and[
A quantity analogous to the ]coefficient of variation
In probability theory and statistics, the coefficient of variation (CV), also known as normalized root-mean-square deviation (NRMSD), percent RMS, and relative standard deviation (RSD), is a standardized measure of dispersion of a probability ...
, but based on L-moments, can also be defined:
which is called the "coefficient of L-variation", or "L-CV". For a non-negative random variable, this lies in the interval [ and is identical to the ]Gini coefficient
In economics, the Gini coefficient ( ), also known as the Gini index or Gini ratio, is a measure of statistical dispersion intended to represent the income distribution, income inequality, the wealth distribution, wealth inequality, or the ...
.[
]
Related quantities
L-moments are statistical quantities that are derived from probability weighted moments (PWM) which were defined earlier (1979).[ PWM are used to efficiently estimate the parameters of distributions expressable in inverse form such as the Gumbel,][ the Tukey lambda, and the Wakeby distributions.
]
Usage
There are two common ways that L-moments are used, in both cases analogously to the conventional moments:
# As summary statistics
In descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount of information as simply as possible. Statisticians commonly try to describe the observations in
* a measure of ...
for data.
# To derive estimators for the parameters of probability distributions
In probability theory and statistics, a probability distribution is a function that gives the probabilities of occurrence of possible events for an experiment. It is a mathematical description of a random phenomenon in terms of its sample spac ...
, applying the method of moments to the L-moments rather than conventional moments.
In addition to doing these with standard moments, the latter (estimation) is more commonly done using maximum likelihood
In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed stati ...
methods; however using L-moments provides a number of advantages. Specifically, L-moments are more robust than conventional moments, and existence of higher L-moments only requires that the random variable have finite mean. One disadvantage of L-moment ratios for estimation is their typically smaller sensitivity. For instance, the Laplace distribution has a kurtosis of 6 and weak exponential tails, but a larger 4th L-moment ratio than e.g. the student-t distribution with d.f.=3, which has an infinite kurtosis and much heavier tails.
As an example consider a dataset with a few data points and one outlying data value. If the ordinary standard deviation
In statistics, the standard deviation is a measure of the amount of variation of the values of a variable about its Expected value, mean. A low standard Deviation (statistics), deviation indicates that the values tend to be close to the mean ( ...
of this data set is taken it will be highly influenced by this one point: however, if the L-scale is taken it will be far less sensitive to this data value. Consequently, L-moments are far more meaningful when dealing with outliers in data than conventional moments. However, there are also other better suited methods to achieve an even higher robustness than just replacing moments by L-moments. One example of this is using L-moments as summary statistics in extreme value theory
Extreme value theory or extreme value analysis (EVA) is the study of extremes in statistical distributions.
It is widely used in many disciplines, such as structural engineering, finance, economics, earth sciences, traffic prediction, and Engin ...
(EVT). This application shows the limited robustness of L-moments, i.e. L-statistics are not resistant statistics, as a single extreme value can throw them off, but because they are only linear (not higher-order statistics), they are less affected by extreme values than conventional moments.
Another advantage L-moments have over conventional moments is that their existence only requires the random variable to have finite mean, so the L-moments exist even if the higher conventional moments do not exist (for example, for Student's t distribution with low degrees of freedom
In many scientific fields, the degrees of freedom of a system is the number of parameters of the system that may vary independently. For example, a point in the plane has two degrees of freedom for translation: its two coordinates; a non-infinite ...
). A finite variance is required in addition in order for the standard errors of estimates of the L-moments to be finite.[
Some appearances of L-moments in the statistical literature include the book by David & Nagaraja (2003, Section 9.9) and a number of papers.][ A number of favourable comparisons of L-moments with ordinary moments have been reported.
]
Values for some common distributions
The table below gives expressions for the first two L moments and numerical values of the first two L-moment ratios of some common continuous probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
s with constant L-moment ratios.[
]
More complex expressions have been derived for some further distributions for which the L-moment ratios vary with one or more of the distributional parameters, including the log-normal, Gamma
Gamma (; uppercase , lowercase ; ) is the third letter of the Greek alphabet. In the system of Greek numerals it has a value of 3. In Ancient Greek, the letter gamma represented a voiced velar stop . In Modern Greek, this letter normally repr ...
, generalized Pareto, generalized extreme value, and generalized logistic distributions.[
The notation for the parameters of each distribution is the same as that used in the linked article. In the expression for the mean of the Gumbel distribution, is the
]
Extensions
''Trimmed L-moments'' are generalizations of L-moments that give zero weight to extreme observations. They are therefore more robust to the presence of outliers, and unlike L-moments they may be well-defined for distributions for which the mean does not exist, such as the Cauchy distribution
The Cauchy distribution, named after Augustin-Louis Cauchy, is a continuous probability distribution. It is also known, especially among physicists, as the Lorentz distribution (after Hendrik Lorentz), Cauchy–Lorentz distribution, Lorentz(ian) ...
.
See also
*L-estimator
In statistics, an L-estimator (or L-statistic) is an estimator which is a linear combination of order statistics of the measurements. This can be as little as a single point, as in the median (of an odd number of values), or as many as all points ...
References
External links
The L-moments page
Jonathan R.M. Hosking, IBM Research
IBM Research is the research and development division for IBM, an American Multinational corporation, multinational information technology company. IBM Research is headquartered at the Thomas J. Watson Research Center in Yorktown Heights, New York ...
L Moments.
Dataplot reference manual, vol. 1, auxiliary chapter. National Institute of Standards and Technology
The National Institute of Standards and Technology (NIST) is an agency of the United States Department of Commerce whose mission is to promote American innovation and industrial competitiveness. NIST's activities are organized into Outline of p ...
, 2006. Accessed 2010-05-25.
Lmo
lightweight Python includes functions for fast calculation of L-moments, trimmed L-moments, and multivariate L-comoments.
{{Statistics, descriptive
Moments (mathematics)
Summary statistics