In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, Scheffé's method, named after
American statistician
A statistician is a person who works with Theory, theoretical or applied statistics. The profession exists in both the private sector, private and public sectors.
It is common to combine statistical knowledge with expertise in other subjects, a ...
Henry Scheffé, is a method for adjusting
significance levels in a
linear regression
In statistics, linear regression is a statistical model, model that estimates the relationship between a Scalar (mathematics), scalar response (dependent variable) and one or more explanatory variables (regressor or independent variable). A mode ...
analysis to account for
multiple comparisons. It is particularly useful in
analysis of variance
Analysis of variance (ANOVA) is a family of statistical methods used to compare the Mean, means of two or more groups by analyzing variance. Specifically, ANOVA compares the amount of variation ''between'' the group means to the amount of variati ...
(a special case of regression analysis), and in constructing simultaneous
confidence bands for regressions involving
basis functions.
Scheffé's method is a single-step multiple comparison procedure which applies to the set of estimates of all possible
contrasts among the factor level means, not just the pairwise differences considered by the
Tukey–Kramer method. It works on similar principles as the
Working–Hotelling procedure for estimating mean responses in regression, which applies to the set of all possible factor levels.
The method
Let
be the
mean
A mean is a quantity representing the "center" of a collection of numbers and is intermediate to the extreme values of the set of numbers. There are several kinds of means (or "measures of central tendency") in mathematics, especially in statist ...
s of some variable in
disjoint populations.
An arbitrary contrast is defined by
:
where
:
If
are all equal to each other, then all contrasts among them are . Otherwise, some contrasts differ from .
Technically there are infinitely many contrasts. The simultaneous confidence coefficient is exactly
, whether the factor level sample sizes are equal or unequal. (Usually only a finite number of comparisons are of interest. In this case, Scheffé's method is typically quite conservative, and the
family-wise error rate (experimental error rate) will generally be much smaller than
.)
We estimate
by
:
for which the estimated variance is
:
where
*
is the size of the sample taken from the
th population (the one whose mean is
), and
*
is the estimated variance of the
errors.
It can be shown that the probability is
that all confidence limits of the type
:
are simultaneously correct, where as usual
is the size of the whole population. Norman R. Draper and Harry Smith, in their 'Applied Regression Analysis' (see references), indicate that
should be in the equation in place of
. The slip with
is a result of failing to allow for the additional effect of the constant term in many regressions. That the result based on
is wrong is readily seen by considering
, as in a standard simple linear regression. That formula would then reduce to one with the usual
-distribution, which is appropriate for predicting/estimating for a single value of the independent variable, not for constructing a confidence band for a range of values of the independent value. Also note that the formula is for dealing with the mean values for a range of independent values, not for comparing with individual values such as individual observed data values.
Denoting Scheffé significance in a table
Frequently, subscript letters are used to indicate which values are significantly different using the Scheffé method. For example, when mean values of variables that have been analyzed using an
ANOVA are presented in a table, they are assigned a different letter subscript based on a Scheffé contrast. Values that are not significantly different based on the post-hoc Scheffé contrast will have the same subscript and values that are significantly different will have different subscripts (i.e. 15
a, 17
a, 34
b would mean that the first and second variables both differ from the third variable but not each other because they are both assigned the subscript "a").
Comparison with the Tukey–Kramer method
If only a fixed number of pairwise comparisons are to be made, the
Tukey–Kramer method will result in a more precise confidence interval. In the general case when many or all contrasts might be of interest, the Scheffé method is more appropriate and will give narrower confidence intervals in the case of a large number of comparisons.
References
*
*
External links
Scheffé's method
{{DEFAULTSORT:Scheffe's method
Multiple comparisons