Kuiper's test
   HOME

TheInfoList



OR:

Kuiper's test is used in statistics to
test Test(s), testing, or TEST may refer to: * Test (assessment), an educational assessment intended to measure the respondents' knowledge or other abilities Arts and entertainment * ''Test'' (2013 film), an American film * ''Test'' (2014 film), ...
that whether a given distribution, or family of distributions, is contradicted by evidence from a sample of data. It is named after Dutch mathematician
Nicolaas Kuiper Nicolaas Hendrik Kuiper (; 28 June 1920 – 12 December 1994) was a Dutch mathematician, known for Kuiper's test and proving Kuiper's theorem. He also contributed to the Nash embedding theorem. Kuiper studied at University of Leiden in 1937-4 ...
. Kuiper's test is closely related to the better-known
Kolmogorov–Smirnov test In statistics, the Kolmogorov–Smirnov test (K–S test or KS test) is a nonparametric test of the equality of continuous (or discontinuous, see Section 2.2), one-dimensional probability distributions that can be used to compare a sample wit ...
(or K-S test as it is often called). As with the K-S test, the discrepancy statistics ''D''+ and ''D'' represent the absolute sizes of the most positive and most negative differences between the two cumulative distribution functions that are being compared. The trick with Kuiper's test is to use the quantity ''D''+ + ''D'' as the test statistic. This small change makes Kuiper's test as sensitive in the tails as at the median and also makes it invariant under cyclic transformations of the independent variable. The
Anderson–Darling test The Anderson–Darling test is a statistical test of whether a given sample of data is drawn from a given probability distribution. In its basic form, the test assumes that there are no parameters to be estimated in the distribution being tested, ...
is another test that provides equal sensitivity at the tails as the median, but it does not provide the cyclic invariance. This invariance under cyclic transformations makes Kuiper's test invaluable when testing for cyclic variations by time of year or day of the week or time of day, and more generally for testing the fit of, and differences between, circular probability distributions.


Definition

The test statistic, ''V'', for Kuiper's test is defined as follows. Let ''F'' be the continuous cumulative distribution function which is to be the
null hypothesis In scientific research, the null hypothesis (often denoted ''H''0) is the claim that no difference or relationship exists between two sets of data or variables being analyzed. The null hypothesis is that any experimentally observed difference is d ...
. Denote the sample of data which are independent realisations of random variables, having ''F'' as their distribution function, by ''xi'' (''i''=1,...,''n''). Then define Pearson, E.S., Hartley, H.O. (1972) ''Biometrika Tables for Statisticians, Volume 2'', CUP. (page 118) :z_i=F(x_i), :D^+ = \mathrm \left /n- z_i \right :D^- = \mathrm \left _i-(i-1)/n \right and finally, :V=D^+ + D^- . Tables for the critical points of the test statistic are available, Pearson, E.S., Hartley, H.O. (1972) ''Biometrika Tables for Statisticians, Volume 2'', CUP. (Table 54) and these include certain cases where the distribution being tested is not fully known, so that parameters of the family of distributions are
estimated Estimation (or estimating) is the process of finding an estimate or approximation, which is a value that is usable for some purpose even if input data may be incomplete, uncertain, or unstable. The value is nonetheless usable because it is der ...
.


Example

We could test the hypothesis that computers fail more during some times of the year than others. To test this, we would collect the dates on which the test set of computers had failed and build an empirical distribution function. The
null hypothesis In scientific research, the null hypothesis (often denoted ''H''0) is the claim that no difference or relationship exists between two sets of data or variables being analyzed. The null hypothesis is that any experimentally observed difference is d ...
is that the failures are uniformly distributed. Kuiper's statistic does not change if we change the beginning of the year and does not require that we bin failures into months or the like.Watson, G.S. (1961) "Goodness-Of-Fit Tests on a Circle", '' Biometrika'', 48 (1/2), 109–114 Another test statistic having this property is the Watson statistic, which is related to the Cramér–von Mises test. However, if failures occur mostly on weekends, many uniform-distribution tests such as K-S and Kuiper would miss this, since weekends are spread throughout the year. This inability to distinguish distributions with a comb-like shape from
continuous uniform distribution In probability theory and statistics, the continuous uniform distribution or rectangular distribution is a family of symmetric probability distributions. The distribution describes an experiment where there is an arbitrary outcome that lies betw ...
s is a key problem with all statistics based on a variant of the K-S test. Kuiper's test, applied to the event times modulo one week, is able to detect such a pattern. Using event times that have been modulated with the K-S test can result in different results depending on how the data is phased. In this example, the K-S test may detect the non-uniformity if the data is set to start the week on Saturday, but fail to detect the non-uniformity if the week starts on Wednesday.


See also

*
Kolmogorov–Smirnov test In statistics, the Kolmogorov–Smirnov test (K–S test or KS test) is a nonparametric test of the equality of continuous (or discontinuous, see Section 2.2), one-dimensional probability distributions that can be used to compare a sample wit ...


References

{{Reflist Statistical tests Nonparametric statistics Directional statistics 1960 introductions