In statistics, Dixon's ''Q'' test, or simply the ''Q'' test, is used for identification and rejection of

outlier In statistics, an outlier is a data point that differs significantly from other observations. An outlier may be due to a variability in the measurement, an indication of novel data, or it may be the result of experimental error; the latter are ...

s. This assumes normal distribution and per Robert Dean and Wilfrid Dixon, and others, this test should be used sparingly and never more than once in a data set. To apply a ''Q'' test for bad data, arrange the data in order of increasing values and calculate ''Q'' as defined: :

Q = \frac

Where ''gap'' is the

absolute difference The absolute difference of two real numbers x and y is given by , x-y, , the absolute value of their difference. It describes the distance on the real line between the points corresponding to x and y. It is a special case of the Lp distance for ...

between the outlier in question and the closest number to it. If ''Q'' > ''Q''_table, where ''Q''_table is a reference value corresponding to the sample size and confidence level, then reject the questionable point. Note that only one point may be rejected from a data set using a ''Q'' test.

Example

Consider the data set: :

0.189,\ 0.167,\ 0.187,\ 0.183,\ 0.186,\ 0.182,\ 0.181,\ 0.184,\ 0.181,\ 0.177 \,

Now rearrange in increasing order: :

0.167,\ 0.177,\ 0.181,\ 0.181,\ 0.182,\ 0.183,\ 0.184,\ 0.186,\ 0.187,\ 0.189 \,

We hypothesize that 0.167 is an outlier. Calculate ''Q'': :

Q=\frac = \frac=0.455.

With 10 observations and at 90%

confidence Confidence is a state of being clear-headed either that a hypothesis or prediction is correct or that a chosen course of action is the best or most effective. Confidence comes from a Latin word 'fidere' which means "to trust"; therefore, having ...

, ''Q'' = 0.455 > 0.412 = ''Q''_table, so we conclude 0.167 is indeed an outlier. However, at 95% confidence, ''Q'' = 0.455 < 0.466 = ''Q''_table 0.167 is not considered an outlier. McBaneHalpern, Arthur M. "Experimental physical chemistry : a laboratory textbook." 3rd ed. / Arthur M. Halpern , George C. McBane. New York : W. H. Freeman, c200
Library of Congress
/ref> notes: Dixon provided related tests intended to search for more than one outlier, but they are much less frequently used than the ''r₁₀'' or ''Q'' version that is intended to eliminate a single outlier.

Table

This table summarizes the limit values of the two-tailed Dixon's ''Q'' test.

References

* Robert B. Dean and Wilfrid J. Dixon (1951) "Simplified Statistics for Small Numbers of Observations". Anal. Chem., 1951, 23 (4), 636–638
AbstractFull text PDF
* Rorabacher, D. B. (1991) "Statistical Treatment for Rejection of Deviant Values: Critical Values of Dixon Q Parameter and Related Subrange Ratios at the 95 percent Confidence Level". Anal. Chem., 63 (2), 139–146
PDF
(including larger tables of limit values) * McBane, George C. (2006) "Programs to Compute Distribution Functions and Critical Values for Extreme Value Ratios for Outlier Detection". J. Statistical Software 16(3):1–9, 200
Article (PDF) and Software (Fortan-90, Zipfile)
* Shivanshu Shrivastava, A. Rajesh, P. K. Bora (2014) "Sliding window Dixon's tests for malicious users' suppression in a cooperative spectrum sensing system" IET Communications, 2014, 8 (7) *W. J. Dixon. The Annals of Mathematical Statistics. Vol. 21, No. 4 (Dec., 1950), pp. 488-506

External links

includes 'dixon.test' function.
Dixon's test in Communications
– use of Dixon's test in cognitive radio communications (by Shivanshu Shrivastava) {{DEFAULTSORT:Dixon's Q Test Statistical tests Robust statistics Statistical outliers

Example

Table

See also

References

Further reading

External links