In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the likelihood-ratio test is a
hypothesis test that involves comparing the
goodness of fit
The goodness of fit of a statistical model describes how well it fits a set of observations. Measures of goodness of fit typically summarize the discrepancy between observed values and the values expected under the model in question. Such measur ...
of two competing
statistical model
A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repre ...
s, typically one found by
maximization over the entire
parameter space and another found after imposing some
constraint, based on the ratio of their
likelihoods. If the more constrained model (i.e., the
null hypothesis
The null hypothesis (often denoted ''H''0) is the claim in scientific research that the effect being studied does not exist. The null hypothesis can also be described as the hypothesis in which no relationship exists between two sets of data o ...
) is supported by the
observed data, the two likelihoods should not differ by more than
sampling error
In statistics, sampling errors are incurred when the statistical characteristics of a population are estimated from a subset, or sample, of that population. Since the sample does not include all members of the population, statistics of the sample ...
. Thus the likelihood-ratio test tests whether this ratio is
significantly different from one, or equivalently whether its
natural logarithm
The natural logarithm of a number is its logarithm to the base of a logarithm, base of the e (mathematical constant), mathematical constant , which is an Irrational number, irrational and Transcendental number, transcendental number approxima ...
is significantly different from zero.
The likelihood-ratio test, also known as Wilks test, is the oldest of the three classical approaches to hypothesis testing, together with the
Lagrange multiplier test and the
Wald test. In fact, the latter two can be conceptualized as approximations to the likelihood-ratio test, and are asymptotically equivalent. In the case of comparing two models each of which has no unknown
parameters, use of the likelihood-ratio test can be justified by the
Neyman–Pearson lemma. The lemma demonstrates that the test has the highest
power among all competitors.
Definition
General
Suppose that we have a
statistical model
A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repre ...
with
parameter space . A
null hypothesis
The null hypothesis (often denoted ''H''0) is the claim in scientific research that the effect being studied does not exist. The null hypothesis can also be described as the hypothesis in which no relationship exists between two sets of data o ...
is often stated by saying that the parameter
lies in a specified subset
of
. The
alternative hypothesis is thus that
lies in the
complement of
, i.e. in
, which is denoted by
. The likelihood ratio test statistic for the null hypothesis
is given by:
: