statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...

, the mid-range or mid-extreme is a measure of

central tendency In statistics, a central tendency (or measure of central tendency) is a central or typical value for a probability distribution.Weisberg H.F (1992) ''Central Tendency and Variability'', Sage University Paper Series on Quantitative Applications in ...

of a sample defined as the

arithmetic mean In mathematics and statistics, the arithmetic mean ( ), arithmetic average, or just the ''mean'' or ''average'' is the sum of a collection of numbers divided by the count of numbers in the collection. The collection is often a set of results fr ...

of the maximum and minimum values of the

data set A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more table (database), database tables, where every column (database), column of a table represents a particular Variable (computer sci ...

: :

M=\frac.

The mid-range is closely related to the range, a measure of

statistical dispersion In statistics, dispersion (also called variability, scatter, or spread) is the extent to which a distribution is stretched or squeezed. Common examples of measures of statistical dispersion are the variance, standard deviation, and interquartil ...

defined as the difference between maximum and minimum values. The two measures are complementary in sense that if one knows the mid-range and the range, one can find the sample maximum and minimum values. The mid-range is rarely used in practical statistical analysis, as it lacks

efficiency Efficiency is the often measurable ability to avoid making mistakes or wasting materials, energy, efforts, money, and time while performing a task. In a more general sense, it is the ability to do things well, successfully, and without waste. ...

as an estimator for most distributions of interest, because it ignores all intermediate points, and lacks

robustness Robustness is the property of being strong and healthy in constitution. When it is transposed into a system A system is a group of interacting or interrelated elements that act according to a set of rules to form a unified whole. A system, ...

, as outliers change it significantly. Indeed, for many distributions it is one of the least efficient and least robust statistics. However, it finds some use in special cases: it is the maximally efficient estimator for the center of a uniform distribution, trimmed mid-ranges address robustness, and as an

L-estimator In statistics, an L-estimator (or L-statistic) is an estimator which is a linear combination of order statistics of the measurements. This can be as little as a single point, as in the median (of an odd number of values), or as many as all points ...

, it is simple to understand and compute.

Robustness

The midrange is highly sensitive to outliers and ignores all but two data points. It is therefore a very non- robust statistic, having a breakdown point of 0, meaning that a single observation can change it arbitrarily. Further, it is highly influenced by outliers: increasing the sample maximum or decreasing the sample minimum by ''x'' changes the mid-range by

x/2,

while it changes the sample mean, which also has breakdown point of 0, by only

x/n.

It is thus of little use in practical statistics, unless outliers are already handled. A trimmed midrange is known as a – the ''n''% trimmed midrange is the average of the ''n''% and (100−''n'')% percentiles, and is more robust, having a breakdown point of ''n''%. In the middle of these is the midhinge, which is the 25% midsummary. The

median The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...

can be interpreted as the fully trimmed (50%) mid-range; this accords with the convention that the median of an even number of points is the mean of the two middle points. These trimmed midranges are also of interest as

descriptive statistics A descriptive statistic (in the count noun sense) is a summary statistic that quantitatively describes or summarizes features from a collection of information, while descriptive statistics (in the mass noun sense) is the process of using and an ...

or as

s of central location or

skewness In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For a unimodal ...

: differences of midsummaries, such as midhinge minus the median, give measures of skewness at different points in the tail.

Efficiency

Despite its drawbacks, in some cases it is useful: the midrange is a highly efficient

estimator In statistics, an estimator is a rule for calculating an estimate of a given quantity based on Sample (statistics), observed data: thus the rule (the estimator), the quantity of interest (the estimand) and its result (the estimate) are distinguish ...

of μ, given a small sample of a sufficiently platykurtic distribution, but it is inefficient for

mesokurtic In probability theory and statistics, kurtosis (from , ''kyrtos'' or ''kurtos'', meaning "curved, arching") refers to the degree of “tailedness” in the probability distribution of a real-valued random variable. Similar to skewness, kurtosis ...

distributions, such as the normal. For example, for a

continuous uniform distribution In probability theory and statistics, the continuous uniform distributions or rectangular distributions are a family of symmetric probability distributions. Such a distribution describes an experiment where there is an arbitrary outcome that li ...

with unknown maximum and minimum, the mid-range is the uniformly minimum-variance unbiased estimator (UMVU) estimator for the mean. The sample maximum and sample minimum, together with sample size, are a sufficient statistic for the population maximum and minimum – the distribution of other samples, conditional on a given maximum and minimum, is just the uniform distribution between the maximum and minimum and thus add no information. See German tank problem for further discussion. Thus the mid-range, which is an unbiased and sufficient estimator of the population mean, is in fact the UMVU: using the sample mean just adds noise based on the uninformative distribution of points within this range. Conversely, for the normal distribution, the sample mean is the UMVU estimator of the mean. Thus for platykurtic distributions, which can often be thought of as between a uniform distribution and a normal distribution, the informativeness of the middle sample points versus the extrema values varies from "equal" for normal to "uninformative" for uniform, and for different distributions, one or the other (or some combination thereof) may be most efficient. A robust analog is the trimean, which averages the midhinge (25% trimmed mid-range) and median.

Small samples

For small sample sizes (''n'' from 4 to 20) drawn from a sufficiently platykurtic distribution (negative excess kurtosis, defined as γ₂ = (μ₄/(μ₂)²) − 3), the mid-range is an efficient estimator of the mean ''μ''. The following table summarizes empirical data comparing three estimators of the mean for distributions of varied kurtosis; the modified mean is the truncated mean, where the maximum and minimum are eliminated. For ''n'' = 1 or 2, the midrange and the mean are equal (and coincide with the median), and are most efficient for all distributions. For ''n'' = 3, the modified mean is the median, and instead the mean is the most efficient measure of central tendency for values of ''γ''₂ from 2.0 to 6.0 as well as from −0.8 to 2.0.

Sampling properties

For a sample of size ''n'' from the standard normal distribution, the mid-range ''M'' is unbiased, and has a variance given by: :

\operatorname(M)=\frac.

For a sample of size ''n'' from the standard

Laplace distribution In probability theory and statistics, the Laplace distribution is a continuous probability distribution named after Pierre-Simon Laplace. It is also sometimes called the double exponential distribution, because it can be thought of as two exponen ...

, the mid-range ''M'' is unbiased, and has a variance given by: :

\operatorname(M)=\frac

and, in particular, the variance does not decrease to zero as the sample size grows. For a sample of size ''n'' from a zero-centred uniform distribution, the mid-range ''M'' is unbiased, ''nM'' has an asymptotic distribution which is a

Deviation

While the mean of a set of values minimizes the sum of squares of deviations and the

minimizes the average absolute deviation, the midrange minimizes the maximum deviation (defined as

\max\left, x_i-m\

): it is a solution to a variational problem.

References