
In
statistics, an L-estimator is an
estimator
In statistics, an estimator is a rule for calculating an estimate of a given quantity based on observed data: thus the rule (the estimator), the quantity of interest (the estimand) and its result (the estimate) are distinguished. For example, the ...
which is a linear combination of
order statistic
In statistics, the ''k''th order statistic of a statistical sample is equal to its ''k''th-smallest value. Together with rank statistics, order statistics are among the most fundamental tools in non-parametric statistics and inference.
Importa ...
s of the measurements (which is also called an L-statistic). This can be as little as a single point, as in the median (of an odd number of values), or as many as all points, as in the mean.
The main benefits of L-estimators are that they are often extremely simple, and often
robust statistics
Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, suc ...
: assuming sorted data, they are very easy to calculate and interpret, and are often resistant to outliers. They thus are useful in robust statistics, as
descriptive statistics
A descriptive statistic (in the count noun sense) is a summary statistic that quantitatively describes or summarizes features from a collection of information, while descriptive statistics (in the mass noun sense) is the process of using and a ...
, in
statistics education, and when computation is difficult. However, they are
inefficient
Efficiency is the often measurable ability to avoid wasting materials, energy, efforts, money, and time in doing something or in producing a desired result. In a more general sense, it is the ability to do things well, successfully, and without ...
, and in modern times robust statistics
M-estimator
In statistics, M-estimators are a broad class of extremum estimators for which the objective function is a sample average. Both non-linear least squares and maximum likelihood estimation are special cases of M-estimators. The definition of M-est ...
s are preferred, though these are much more difficult computationally. In many circumstances L-estimators are reasonably efficient, and thus adequate for initial estimation.
Examples
A basic example is the
median. Given ''n'' values
, if
is odd, the median equals
, the
-th order statistic; if
is even, it is the average of two order statistics:
. These are both linear combinations of order statistics, and the median is therefore a simple example of an L-estimator.
A more detailed list of examples includes: with a single point, the maximum, the minimum, or any single order statistic or
quantile; with one or two points, the median; with two points, the
mid-range
In statistics, the mid-range or mid-extreme is a measure of central tendency of a sample defined as the arithmetic mean of the maximum and minimum values of the data set:
:M=\frac.
The mid-range is closely related to the range, a measure of ...
, the
range, the
midsummary (
trimmed
''Trimmed'' is a 1922 American silent Western film directed by Harry A. Pollard and featuring Hoot Gibson. It is not known whether the film currently survives, and it may be a lost film.
Cast
* Hoot Gibson as Dale Garland
* Patsy Ruth Miller ...
mid-range, including the
midhinge), and the trimmed range (including the
interquartile range
In descriptive statistics, the interquartile range (IQR) is a measure of statistical dispersion, which is the spread of the data. The IQR may also be called the midspread, middle 50%, fourth spread, or H‑spread. It is defined as the difference ...
and
interdecile range); with three points, the
trimean In statistics the trimean (TM), or Tukey's trimean, is a measure of a probability distribution's location defined as a weighted average of the distribution's median and its two quartiles:
: TM= \frac
This is equivalent to the average of the m ...
; with a fixed fraction of the points, the
trimmed mean (including
interquartile mean) and the
Winsorized mean; with all points, the mean.
Note that some of these (such as median, or mid-range) are measures of
central tendency
In statistics, a central tendency (or measure of central tendency) is a central or typical value for a probability distribution.Weisberg H.F (1992) ''Central Tendency and Variability'', Sage University Paper Series on Quantitative Applications ...
, and are used as estimators for a
location parameter
In geography, location or place are used to denote a region (point, line, or area) on Earth's surface or elsewhere. The term ''location'' generally implies a higher degree of certainty than ''place'', the latter often indicating an entity with an ...
, such as the mean of a normal distribution, while others (such as range or trimmed range) are measures of
statistical dispersion
In statistics, dispersion (also called variability, scatter, or spread) is the extent to which a distribution is stretched or squeezed. Common examples of measures of statistical dispersion are the variance, standard deviation, and interquartil ...
, and are used as estimators of a
scale parameter
In probability theory and statistics, a scale parameter is a special kind of numerical parameter of a parametric family of probability distributions. The larger the scale parameter, the more spread out the distribution.
Definition
If a family o ...
, such as the
standard deviation of a normal distribution.
L-estimators can also measure the
shape
A shape or figure is a graphical representation of an object or its external boundary, outline, or external surface, as opposed to other properties such as color, texture, or material type.
A plane shape or plane figure is constrained to lie on ...
of a distribution, beyond location and scale. For example, the midhinge minus the median is a 3-term L-estimator that measures the
skewness
In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined.
For a unimo ...
, and other differences of midsummaries give measures of asymmetry at different points in the tail.
Sample
L-moments are L-estimators for the population L-moment, and have rather complex expressions. L-moments are generally treated separately; see that article for details.
Robustness
L-estimators are often
statistically resistant, having a high
breakdown point. This is defined as the fraction of the measurements which can be arbitrarily changed without causing the resulting estimate to tend to infinity (i.e., to "break down"). The breakdown point of an L-estimator is given by the closest order statistic to the minimum or maximum: for instance, the median has a breakdown point of 50% (the highest possible), and a ''n''% trimmed or
Winsorized mean has a breakdown point of ''n''%.
Not all L-estimators are robust; if it includes the minimum or maximum, then it has a breakdown point of 0. These non-robust L-estimators include the minimum, maximum, mean, and mid-range. The trimmed equivalents are robust, however.
Robust L-estimators used to measure dispersion, such as the IQR, provide
robust measures of scale In statistics, robust measures of scale are methods that quantify the statistical dispersion in a sample of numerical data while resisting outliers. The most common such robust statistics are the ''interquartile range'' (IQR) and the ''median absol ...
.
Applications
In practical use in
robust statistics
Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, suc ...
, L-estimators have been replaced by
M-estimator
In statistics, M-estimators are a broad class of extremum estimators for which the objective function is a sample average. Both non-linear least squares and maximum likelihood estimation are special cases of M-estimators. The definition of M-est ...
s, which provide robust statistics that also have high relative
efficiency, at the cost of being much more computationally complex and opaque.
However, the simplicity of L-estimators means that they are easily interpreted and visualized, and makes them suited for
descriptive statistics
A descriptive statistic (in the count noun sense) is a summary statistic that quantitatively describes or summarizes features from a collection of information, while descriptive statistics (in the mass noun sense) is the process of using and a ...
and
statistics education; many can even be computed mentally from a
five-number summary or
seven-number summary, or visualized from a
box plot
In descriptive statistics, a box plot or boxplot is a method for graphically demonstrating the locality, spread and skewness groups of numerical data through their quartiles. In addition to the box on a box plot, there can be lines (which are cal ...
. L-estimators play a fundamental role in many approaches to
non-parametric statistics
Nonparametric statistics is the branch of statistics that is not based solely on parametrized families of probability distributions (common examples of parameters are the mean and variance). Nonparametric statistics is based on either being distri ...
.
Though non-parametric, L-estimators are frequently used for
parameter estimation
Estimation theory is a branch of statistics that deals with estimating the values of parameters based on measured empirical data that has a random component. The parameters describe an underlying physical setting in such a way that their val ...
, as indicated by the name, though they must often be adjusted to yield an
unbiased consistent estimator
In statistics, a consistent estimator or asymptotically consistent estimator is an estimator—a rule for computing estimates of a parameter ''θ''0—having the property that as the number of data points used increases indefinitely, the resul ...
. The choice of L-estimator and adjustment depend on the distribution whose parameter is being estimated.
For example, when estimating a
location parameter
In geography, location or place are used to denote a region (point, line, or area) on Earth's surface or elsewhere. The term ''location'' generally implies a higher degree of certainty than ''place'', the latter often indicating an entity with an ...
, for a symmetric distribution a symmetric L-estimator (such as the median or midhinge) will be unbiased. However, if the distribution has
skew, symmetric L-estimators will generally be biased and require adjustment. For example, in a skewed distribution, the
nonparametric skew
In statistics and probability theory, the nonparametric skew is a statistic occasionally used with random variables that take real values.Arnold BC, Groeneveld RA (1995) Measuring skewness with respect to the mode. The American Statistician 49 ( ...
(and
Pearson's skewness coefficients) measure the bias of the median as an estimator of the mean.
When estimating a
scale parameter
In probability theory and statistics, a scale parameter is a special kind of numerical parameter of a parametric family of probability distributions. The larger the scale parameter, the more spread out the distribution.
Definition
If a family o ...
, such as when using an L-estimator as a
robust measures of scale In statistics, robust measures of scale are methods that quantify the statistical dispersion in a sample of numerical data while resisting outliers. The most common such robust statistics are the ''interquartile range'' (IQR) and the ''median absol ...
, such as to estimate the
population variance or population
standard deviation, one generally must multiply by a
scale factor to make it an unbiased consistent estimator; see
scale parameter: estimation.
For example, dividing the IQR by
(using the
error function
In mathematics, the error function (also called the Gauss error function), often denoted by , is a complex function of a complex variable defined as:
:\operatorname z = \frac\int_0^z e^\,\mathrm dt.
This integral is a special (non- elementa ...
) makes it an unbiased, consistent estimator for the population standard deviation if the data follow a
normal distribution
In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is
:
f(x) = \frac e^
The parameter \mu i ...
.
L-estimators can also be used as statistics in their own right – for example, the median is a measure of location, and the IQR is a measure of dispersion. In these cases, the sample statistics can act as estimators of their own
expected value
In probability theory, the expected value (also called expectation, expectancy, mathematical expectation, mean, average, or first moment) is a generalization of the weighted average. Informally, the expected value is the arithmetic mean of a ...
; for example, the sample median is an estimator of the population median.
Advantages
Beyond simplicity, L-estimators are also frequently easy to calculate and robust.
Assuming sorted data, L-estimators involving only a few points can be calculated with far fewer mathematical operations than efficient estimates. Before the advent of
electronic calculators and
computers, these provided a useful way to extract much of the information from a sample with minimal labour. These remained in practical use through the early and mid 20th century, when automated sorting of
punch card data was possible, but computation remained difficult, and is still of use today, for estimates given a list of numerical values in non-
machine-readable form, where data input is more costly than manual sorting. They also allow rapid estimation.
L-estimators are often much more robust than maximally efficient conventional methods – the median is maximally
statistically resistant, having a 50%
breakdown point, and the X% trimmed mid-range has an X% breakdown point, while the sample mean (which is maximally efficient) is minimally robust, breaking down for a single outlier.
Efficiency
While L-estimators are not as efficient as other statistics, they often have reasonably high relative efficiency, and show that a large fraction of the information used in estimation can be obtained using only a few points – as few as one, two, or three. Alternatively, they show that order statistics contain a significant amount of information.
For example, in terms of efficiency, given a
sample of a
normally-distributed numerical parameter, the
arithmetic mean
In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the ''mean'' or the '' average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The coll ...
(average) for the
population
Population typically refers to the number of people in a single area, whether it be a city or town, region, country, continent, or the world. Governments typically quantify the size of the resident population within their jurisdiction using ...
can be estimated with maximum efficiency by computing the
sample mean
The sample mean (or "empirical mean") and the sample covariance are statistics computed from a sample of data on one or more random variables.
The sample mean is the average value (or mean value) of a sample of numbers taken from a larger po ...
– adding all the members of the sample and dividing by the number of members.
However, for a large data set (over 100 points) from a symmetric population, the mean can be estimated reasonably efficiently relative to the best estimate by L-estimators. Using a single point, this is done by taking the
median of the sample, with no calculations required (other than sorting); this yields an efficiency of 64% or better (for all ''n''). Using two points, a simple estimate is the
midhinge (the 25%
trimmed
''Trimmed'' is a 1922 American silent Western film directed by Harry A. Pollard and featuring Hoot Gibson. It is not known whether the film currently survives, and it may be a lost film.
Cast
* Hoot Gibson as Dale Garland
* Patsy Ruth Miller ...
mid-range
In statistics, the mid-range or mid-extreme is a measure of central tendency of a sample defined as the arithmetic mean of the maximum and minimum values of the data set:
:M=\frac.
The mid-range is closely related to the range, a measure of ...
), but a more efficient estimate is the 29% trimmed mid-range, that is, averaging the two values 29% of the way in from the smallest and the largest values: the 29th and 71st percentiles; this has an efficiency of about 81%. For three points, the
trimean In statistics the trimean (TM), or Tukey's trimean, is a measure of a probability distribution's location defined as a weighted average of the distribution's median and its two quartiles:
: TM= \frac
This is equivalent to the average of the m ...
(average of median and midhinge) can be used, though the average of the 20th, 50th, and 80th percentile yields 88% efficiency. Using further points yield higher efficiency, though it is notable that only 3 points are needed for very high efficiency.
For estimating the standard deviation of a normal distribution, the scaled
interdecile range gives a reasonably efficient estimator, though instead taking the 7% trimmed range (the difference between the 7th and 93rd percentiles) and dividing by 3 (corresponding to 86% of the data of a normal distribution falling within 1.5 standard deviations of the mean) yields an estimate of about 65% efficiency.
For small samples, L-estimators are also relatively efficient: the midsummary of the 3rd point from each end has an efficiency around 84% for samples of size about 10, and the range divided by
has reasonably good efficiency for sizes up to 20, though this drops with increasing ''n'' and the scale factor can be improved (efficiency 85% for 10 points). Other heuristic estimators for small samples include the range over ''n'' (for standard error), and the range squared over the median (for the chi-squared of a Poisson distribution).
See also
*
L-moment
*
M-estimator
In statistics, M-estimators are a broad class of extremum estimators for which the objective function is a sample average. Both non-linear least squares and maximum likelihood estimation are special cases of M-estimators. The definition of M-est ...
References
*
*
*
*
* – sec. 5.2.2
*
{{refend
Nonparametric statistics
Robust statistics
Estimator