In
statistics, the multivariate Behrens–Fisher problem is the problem of testing for the equality of means from two
multivariate normal
In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional (univariate) normal distribution to higher dimensions. One ...
distributions when the covariance matrices are unknown and possibly not equal. Since this is a generalization of the univariate
Behrens-Fisher problem, it inherits all of the difficulties that arise in the univariate problem.
Notation and problem formulation
Let
be independent random samples from two
-variate normal distributions with unknown mean vectors
and unknown
dispersion matrices . The index
refers to the first or second population, and the
th observation from the
th population is
.
The multivariate Behrens–Fisher problem is to test the null hypothesis
that the means are equal versus the alternative
of non-equality:
:
Define some statistics, which are used in the various attempts to solve the multivariate Behrens–Fisher problem, by
:
The sample means
and sum-of-squares matrices
are
sufficient
In logic and mathematics, necessity and sufficiency are terms used to describe a conditional or implicational relationship between two statements. For example, in the conditional statement: "If then ", is necessary for , because the truth of ...
for the multivariate normal parameters
, so it suffices to perform inference be based on just these statistics. The distributions of
and
are independent and are, respectively,
multivariate normal
In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional (univariate) normal distribution to higher dimensions. One ...
and
Wishart:
:
Background
In the case where the dispersion matrices are equal, the distribution of the
statistic is known to be an
F distribution
In probability theory and statistics, the ''F''-distribution or F-ratio, also known as Snedecor's ''F'' distribution or the Fisher–Snedecor distribution (after Ronald Fisher and George W. Snedecor) is a continuous probability distribution ...
under the null and a
noncentral F-distribution
In probability theory and statistics, the noncentral ''F''-distribution is a continuous probability distribution that is a noncentral generalization of the (ordinary) ''F''-distribution. It describes the distribution of the quotient (''X''/''n' ...
under the alternative.
The main problem is that when the true values of the dispersion matrix are unknown, then under the null hypothesis the probability of rejecting
via a
test depends on the unknown dispersion matrices.
In practice, this dependency harms inference when the dispersion matrices are far from each other or when the sample size is not large enough to estimate them accurately.
Now, the mean vectors are independently and normally distributed,
:
but the sum
does not follow the Wishart distribution,
which makes inference more difficult.
Proposed solutions
Proposed solutions are based on a few main strategies:
* Compute statistics which mimick the
statistic and which have an approximate
distribution with estimated degrees of freedom (df).
* Use
generalized p-values
In statistics, a generalized ''p''-value is an extended version of the classical ''p''-value, which except in a limited number of applications, provides only approximate solutions.
Conventional statistical methods do not provide exact solutions ...
based on
generalized test variables.
* Use Roy's union-intersection principle
Approaches using the ''T''2 with approximate degrees of freedom
Below,
indicates the
trace operator
In mathematics, the trace operator extends the notion of the restriction of a function to the boundary of its domain to "generalized" functions in a Sobolev space. This is particularly important for the study of partial differential equations wit ...
.
Yao (1965)
(as cited by
)
:
where
:
Johansen (1980)
(as cited by
)
:
where
:
and
:
Nel and Van der Merwe's (1986)
(as cited by
)
:
where
:
Comments on performance
Kim (1992) proposed a solution that is based on a variant of
. Although its power is high, the fact that it is not invariant makes it less attractive. Simulation studies by Subramaniam and Subramaniam (1973) show that the size of Yao's test is closer to the nominal level than that of James's. Christensen and Rencher (1997) performed numerical studies comparing several of these testing procedures and concluded that Kim and Nel and Van der Merwe's tests had the highest power. However, these two procedures are not invariant.
Krishnamoorthy and Yu (2004)
Krishnamoorthy and Yu (2004) proposed a procedure which adjusts in Nel and Var der Merwe (1986)'s approximate df for the denominator of
under the null distribution to make it invariant. They show that the approximate degrees of freedom lies in the interval