statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...

, L-moments are a sequence of statistics used to summarize the shape of a

probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...

. They are

linear combination In mathematics, a linear combination or superposition is an Expression (mathematics), expression constructed from a Set (mathematics), set of terms by multiplying each term by a constant and adding the results (e.g. a linear combination of ''x'' a ...

s of

order statistic In statistics, the ''k''th order statistic of a statistical sample is equal to its ''k''th-smallest value. Together with Ranking (statistics), rank statistics, order statistics are among the most fundamental tools in non-parametric statistics and ...

s ( L-statistics) analogous to conventional moments, and can be used to calculate quantities analogous to

standard deviation In statistics, the standard deviation is a measure of the amount of variation of the values of a variable about its Expected value, mean. A low standard Deviation (statistics), deviation indicates that the values tend to be close to the mean ( ...

skewness In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For a unimodal ...

and

kurtosis In probability theory and statistics, kurtosis (from , ''kyrtos'' or ''kurtos'', meaning "curved, arching") refers to the degree of “tailedness” in the probability distribution of a real-valued random variable. Similar to skewness, kurtos ...

, termed the L-scale, L-skewness and L-kurtosis respectively (the L-mean is identical to the conventional

mean A mean is a quantity representing the "center" of a collection of numbers and is intermediate to the extreme values of the set of numbers. There are several kinds of means (or "measures of central tendency") in mathematics, especially in statist ...

). Standardised L-moments are called L-moment ratios and are analogous to

standardized moment In probability theory and statistics, a standardized moment of a probability distribution is a moment (often a higher degree central moment) that is normalized, typically by a power of the standard deviation, rendering the moment scale invariant ...

s. Just as for conventional moments, a theoretical distribution has a set of population L-moments. Sample L-moments can be defined for a sample from the population, and can be used as estimators of the population L-moments.

Population L-moments

For a random variable , the th population L-moment is

\lambda_r = \frac \sum_^ (-1)^k \binom \operatorname X_\, ,

where denotes the th

(th smallest value) in an

independent Independent or Independents may refer to: Arts, entertainment, and media Artist groups * Independents (artist group), a group of modernist painters based in Pennsylvania, United States * Independentes (English: Independents), a Portuguese artist ...

sample of size from the distribution of and

\mathbb

denotes

expected value In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...

operator. In particular, the first four population L-moments are

\right) . \end

Note that the coefficients of the th L-moment are the same as in the th term of the

binomial transform In combinatorics, the binomial transform is a sequence transformation (i.e., a transform of a sequence) that computes its forward differences. It is closely related to the Euler transform, which is the result of applying the binomial transform to ...

, as used in the -order

finite difference A finite difference is a mathematical expression of the form . Finite differences (or the associated difference quotients) are often used as approximations of derivatives, such as in numerical differentiation. The difference operator, commonly d ...

(finite analog to the derivative). The first two of these L-moments have conventional names: *

\lambda_1

is the "mean", "L-mean", or "L-location", *

\lambda_2

is the "L-scale". The L-scale is equal to half the Mean absolute difference.

Analytic calculation

Expectations are often defined in terms of

probability density function In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...

s, but the connection in terms of these between the order statistics

X_

and their underlying random variable

X

is rather remote. A closer connection can be found in terms of

cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ever ...

s (CDFs), since these (see this section) satisfy

F_(x) = \sum_^n \binom F_X(x)^j \bigl( 1 - F_X(x) \bigr)^ .

In particular one may define polynomials

b_(y) = \sum_^n \binom y^j (1-y)^

and express

F_ = b_ \circ F_X

. Having a CDF

F_X

, the expectation

\mathbb\

may be expressed using a

Stieltjes integral Thomas Joannes Stieltjes ( , ; 29 December 1856 – 31 December 1894) was a Dutch mathematician. He was a pioneer in the field of moment problems and contributed to the study of continued fractions. The Thomas Stieltjes Institute for Mathematics ...

\mathbb\ = \int_ x \, dF_X(x) ,

thus

\mathbb\ = 
  \int_ x \, d(b_ \circ F_X)(x) =
  \int_ x b_'\bigl( F_X(x) \bigr) \, dF_X(x)

where

b_'

is straight off the derivative of

b_

. This integral can often be made more tractable by introducing the

quantile function In probability and statistics, the quantile function is a function Q: ,1\mapsto \mathbb which maps some probability x \in ,1/math> of a random variable v to the value of the variable y such that P(v\leq y) = x according to its probability distr ...

Q_X

via the

change of variables In mathematics, a change of variables is a basic technique used to simplify problems in which the original variables are replaced with functions of other variables. The intent is that when expressed in new variables, the problem may become si ...

y = F_X(x), x = Q_X(y)

\mathbb\ = 
  \int_ x b_'\bigl( F_X(x) \bigr) \, dF_X(x) =
  \int_0^1 Q_X(y) b_'(y) \, dy.

Since the L-moments are linear combinations of such expectations, the corresponding integrals can be combined into one for each moment, where the integrand is

Q_X(y)

times a polynomial. We have

\lambda_n = \int_0^1 Q_X(y) \widetilde_(y) \, dy

where

\widetilde_m(y) = \sum_^m (-1)^ \binom \binom y^k

are the shifted Legendre polynomials, orthogonal on . In particular

\lambda_4 &= \int_0^1 Q_X(y) \left(20 y^3 - 30 y^2 + 12 y - 1\right) dy. \end

Sillitto's Theorem

The above integral formula for

\lambda_n

has the form of a generalised Fourier coefficient, and they appeared as such in the literature years before being named moments. In the notation of this article, Sillitto proved However Hosking cautions that

partial sum In mathematics, a series is, roughly speaking, an addition of infinitely many terms, one after the other. The study of series is a major part of calculus and its generalization, mathematical analysis. Series are used in most areas of mathemati ...

s of this series tend to give poor approximations for the tails of the distribution, and need not be monotonic. Similar problems arise with the Cornish–Fisher expansion of

Q_X

in terms of the

cumulant In probability theory and statistics, the cumulants of a probability distribution are a set of quantities that provide an alternative to the '' moments'' of the distribution. Any two probability distributions whose moments are identical will have ...

s of

X

Sample L-moments

The sample L-moments can be computed as the population L-moments of the sample, summing over ''r''-element subsets of the sample

\left\,

hence averaging by dividing by the

binomial coefficient In mathematics, the binomial coefficients are the positive integers that occur as coefficients in the binomial theorem. Commonly, a binomial coefficient is indexed by a pair of integers and is written \tbinom. It is the coefficient of the t ...

\lambda_r = \frac\, \sum_ (-1)^ \binom\, x_j \,.

Grouping these by order statistic counts the number of ways an element of an element sample can be the th element of an element subset, and yields formulas of the form below. Direct estimators for the first four L-moments in a finite sample of observations are:

x_ \end

where is the th

and

\tbinom

is a

. Sample L-moments can also be defined indirectly in terms of probability weighted moments, which leads to a more efficient

algorithm In mathematics and computer science, an algorithm () is a finite sequence of Rigour#Mathematics, mathematically rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algo ...

for their computation.

L-moment ratios

A set of ''L-moment ratios'', or scaled L-moments, is defined by

\tau_r = \lambda_r / \lambda_2, \qquad r = 3, 4, \dots ~.

The most useful of these are

\tau_3 ,

called the ''L-skewness'', and

\tau_4 ,

the ''L-kurtosis''. L-moment ratios lie within the interval Tighter bounds can be found for some specific L-moment ratios; in particular, the L-kurtosis

\tau_4

lies in and

\tfrac \left( 5 \tau_3^2 - 1 \right) \leq \tau_4 < 1 \, .

A quantity analogous to the

coefficient of variation In probability theory and statistics, the coefficient of variation (CV), also known as normalized root-mean-square deviation (NRMSD), percent RMS, and relative standard deviation (RSD), is a standardized measure of dispersion of a probability ...

, but based on L-moments, can also be defined:

\tau = \lambda_2 / \lambda_1 \, ,

which is called the "coefficient of L-variation", or "L-CV". For a non-negative random variable, this lies in the interval and is identical to the

Gini coefficient In economics, the Gini coefficient ( ), also known as the Gini index or Gini ratio, is a measure of statistical dispersion intended to represent the income distribution, income inequality, the wealth distribution, wealth inequality, or the ...

Related quantities

L-moments are statistical quantities that are derived from probability weighted moments (PWM) which were defined earlier (1979). PWM are used to efficiently estimate the parameters of distributions expressable in inverse form such as the Gumbel, the Tukey lambda, and the Wakeby distributions.

Usage

There are two common ways that L-moments are used, in both cases analogously to the conventional moments: # As

summary statistics In descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount of information as simply as possible. Statisticians commonly try to describe the observations in * a measure of ...

for data. # To derive estimators for the parameters of

probability distributions In probability theory and statistics, a probability distribution is a function that gives the probabilities of occurrence of possible events for an experiment. It is a mathematical description of a random phenomenon in terms of its sample spac ...

, applying the method of moments to the L-moments rather than conventional moments. In addition to doing these with standard moments, the latter (estimation) is more commonly done using

maximum likelihood In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed stati ...

methods; however using L-moments provides a number of advantages. Specifically, L-moments are more robust than conventional moments, and existence of higher L-moments only requires that the random variable have finite mean. One disadvantage of L-moment ratios for estimation is their typically smaller sensitivity. For instance, the Laplace distribution has a kurtosis of 6 and weak exponential tails, but a larger 4th L-moment ratio than e.g. the student-t distribution with d.f.=3, which has an infinite kurtosis and much heavier tails. As an example consider a dataset with a few data points and one outlying data value. If the ordinary

of this data set is taken it will be highly influenced by this one point: however, if the L-scale is taken it will be far less sensitive to this data value. Consequently, L-moments are far more meaningful when dealing with outliers in data than conventional moments. However, there are also other better suited methods to achieve an even higher robustness than just replacing moments by L-moments. One example of this is using L-moments as summary statistics in

extreme value theory Extreme value theory or extreme value analysis (EVA) is the study of extremes in statistical distributions. It is widely used in many disciplines, such as structural engineering, finance, economics, earth sciences, traffic prediction, and Engin ...

(EVT). This application shows the limited robustness of L-moments, i.e. L-statistics are not resistant statistics, as a single extreme value can throw them off, but because they are only linear (not higher-order statistics), they are less affected by extreme values than conventional moments. Another advantage L-moments have over conventional moments is that their existence only requires the random variable to have finite mean, so the L-moments exist even if the higher conventional moments do not exist (for example, for

Student's t distribution In probability theory and statistics, Student's distribution (or simply the distribution) t_\nu is a continuous probability distribution that generalizes the standard normal distribution. Like the latter, it is symmetric around zero ...

with low

degrees of freedom In many scientific fields, the degrees of freedom of a system is the number of parameters of the system that may vary independently. For example, a point in the plane has two degrees of freedom for translation: its two coordinates; a non-infinite ...

). A finite variance is required in addition in order for the standard errors of estimates of the L-moments to be finite. Some appearances of L-moments in the statistical literature include the book by David & Nagaraja (2003, Section 9.9) and a number of papers. A number of favourable comparisons of L-moments with ordinary moments have been reported.

Values for some common distributions

The table below gives expressions for the first two L moments and numerical values of the first two L-moment ratios of some common

continuous probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...

s with constant L-moment ratios. More complex expressions have been derived for some further distributions for which the L-moment ratios vary with one or more of the distributional parameters, including the

log-normal In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normal distribution, normally distributed. Thus, if the random variable is log-normally distributed ...

Gamma Gamma (; uppercase , lowercase ; ) is the third letter of the Greek alphabet. In the system of Greek numerals it has a value of 3. In Ancient Greek, the letter gamma represented a voiced velar stop . In Modern Greek, this letter normally repr ...

, generalized Pareto, generalized extreme value, and generalized logistic distributions. The notation for the parameters of each distribution is the same as that used in the linked article. In the expression for the mean of the

Gumbel distribution In probability theory and statistics, the Gumbel distribution (also known as the type-I generalized extreme value distribution) is used to model the distribution of the maximum (or the minimum) of a number of samples of various distributions. Thi ...

, is the

Extensions

''Trimmed L-moments'' are generalizations of L-moments that give zero weight to extreme observations. They are therefore more robust to the presence of outliers, and unlike L-moments they may be well-defined for distributions for which the mean does not exist, such as the

Cauchy distribution The Cauchy distribution, named after Augustin-Louis Cauchy, is a continuous probability distribution. It is also known, especially among physicists, as the Lorentz distribution (after Hendrik Lorentz), Cauchy–Lorentz distribution, Lorentz(ian) ...

References

External links

The L-moments page
Jonathan R.M. Hosking,

IBM Research IBM Research is the research and development division for IBM, an American Multinational corporation, multinational information technology company. IBM Research is headquartered at the Thomas J. Watson Research Center in Yorktown Heights, New York ...

L Moments.
Dataplot reference manual, vol. 1, auxiliary chapter.

National Institute of Standards and Technology The National Institute of Standards and Technology (NIST) is an agency of the United States Department of Commerce whose mission is to promote American innovation and industrial competitiveness. NIST's activities are organized into Outline of p ...

, 2006. Accessed 2010-05-25.
Lmo
lightweight Python includes functions for fast calculation of L-moments, trimmed L-moments, and multivariate L-comoments. {{Statistics, descriptive Moments (mathematics) Summary statistics