statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...

, the likelihood-ratio test is a hypothesis test that involves comparing the

goodness of fit The goodness of fit of a statistical model describes how well it fits a set of observations. Measures of goodness of fit typically summarize the discrepancy between observed values and the values expected under the model in question. Such measur ...

of two competing

statistical model A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repre ...

s, typically one found by maximization over the entire parameter space and another found after imposing some constraint, based on the ratio of their likelihoods. If the more constrained model (i.e., the

null hypothesis The null hypothesis (often denoted ''H''0) is the claim in scientific research that the effect being studied does not exist. The null hypothesis can also be described as the hypothesis in which no relationship exists between two sets of data o ...

) is supported by the observed data, the two likelihoods should not differ by more than

sampling error In statistics, sampling errors are incurred when the statistical characteristics of a population are estimated from a subset, or sample, of that population. Since the sample does not include all members of the population, statistics of the sample ...

. Thus the likelihood-ratio test tests whether this ratio is significantly different from one, or equivalently whether its

natural logarithm The natural logarithm of a number is its logarithm to the base of a logarithm, base of the e (mathematical constant), mathematical constant , which is an Irrational number, irrational and Transcendental number, transcendental number approxima ...

is significantly different from zero. The likelihood-ratio test, also known as Wilks test, is the oldest of the three classical approaches to hypothesis testing, together with the Lagrange multiplier test and the Wald test. In fact, the latter two can be conceptualized as approximations to the likelihood-ratio test, and are asymptotically equivalent. In the case of comparing two models each of which has no unknown parameters, use of the likelihood-ratio test can be justified by the Neyman–Pearson lemma. The lemma demonstrates that the test has the highest power among all competitors.

Definition

General

Suppose that we have a

with parameter space

\Theta

. A

is often stated by saying that the parameter

\theta

lies in a specified subset

\Theta_0

\Theta

. The alternative hypothesis is thus that

\theta

lies in the complement of

\Theta_0

, i.e. in

\Theta ~ \backslash ~ \Theta_0

, which is denoted by

\Theta_0^\text

. The likelihood ratio test statistic for the null hypothesis

H_0 \, : \, \theta \in \Theta_0

is given by: :

\lambda_\text = -2 \ln \left \frac \right /math>

where the quantity inside the brackets is called the likelihood ratio. Here, the \sup notation refers to the

supremum In mathematics, the infimum (abbreviated inf; : infima) of a subset S of a partially ordered set P is the greatest element in P that is less than or equal to each element of S, if such an element exists. If the infimum of S exists, it is unique, ...

. As all likelihoods are positive, and as the constrained maximum cannot exceed the unconstrained maximum, the likelihood ratio is bounded between zero and one. Often the likelihood-ratio test statistic is expressed as a difference between the log-likelihoods :

\lambda_\text = -2 \left \ell( \theta_0 ) - \ell( \hat ) ~\right /math>
where 
: \ell( \hat ) \equiv \ln \left \sup_ \mathcal(\theta) ~\right is the logarithm of the maximized likelihood function \mathcal, and \ell(\theta_0) is the maximal value in the special case that the null hypothesis is true (but not necessarily a value that maximizes \mathcal for the sampled data) and
: \theta_0 \in \Theta_0 \qquad \text \qquad \hat \in \Theta~ denote the respective arguments of the maxima and the allowed ranges they're embedded in. Multiplying by −2 ensures mathematically that (by Wilks' theorem) \lambda_\text converges asymptotically to being ²-distributed if the null hypothesis happens to be true. The finite-sample distribution s of likelihood-ratio statistics are generally unknown.

The likelihood-ratio test requires that the models be nested – i.e. the more complex model can be transformed into the simpler model by imposing constraints on the former's parameters. Many common test statistics are tests for nested models and can be phrased as log-likelihood ratios or approximations thereof: e.g. the''Z''-test, the''F''-test, the''G''-test, and Pearson's chi-squared test; for an illustration with the one-sample ''t''-test, see below.

If the models are not nested, then instead of the likelihood-ratio test, there is a generalization of the test that can usually be used: for details, see '' relative likelihood''.

Case of simple hypotheses

A simple-vs.-simple hypothesis test has completely specified models under both the null hypothesis and the alternative hypothesis, which for convenience are written in terms of fixed values of a notional parameter

\theta

: :

\begin
H_0 &:& \theta=\theta_0 ,\\
H_1 &:& \theta=\theta_1 .
\end

In this case, under either hypothesis, the distribution of the data is fully specified: there are no unknown parameters to estimate. For this case, a variant of the likelihood-ratio test is available: :

\Lambda(x) = \frac.

Some older references may use the reciprocal of the function above as the definition. Thus, the likelihood ratio is small if the alternative model is better than the null model. The likelihood-ratio test provides the decision rule as follows: :If

~\Lambda > c ~

, do not reject

H_0

; :If

~\Lambda < c ~

, reject

H_0

; :If

~\Lambda = c ~

, reject

H_0

with probability

~q~

. : The values

c

and

q

are usually chosen to obtain a specified significance level

\alpha

, via the relation :

~q~

\operatorname(\Lambda=c \mid H_0)~+~\operatorname(\Lambda < c \mid H_0)~=~\alpha~.

The Neyman–Pearson lemma states that this likelihood-ratio test is the most powerful among all level

\alpha

tests for this case.

Interpretation

The likelihood ratio is a function of the data

x

; therefore, it is a

statistic A statistic (singular) or sample statistic is any quantity computed from values in a sample which is considered for a statistical purpose. Statistical purposes include estimating a population parameter, describing a sample, or evaluating a hypot ...

, although unusual in that the statistic's value depends on a parameter,

\theta

. The likelihood-ratio test rejects the null hypothesis if the value of this statistic is too small. How small is too small depends on the significance level of the test, i.e. on what probability of Type I error is considered tolerable (Type I errors consist of the rejection of a null hypothesis that is true). The numerator corresponds to the likelihood of an observed outcome under the

. The denominator corresponds to the maximum likelihood of an observed outcome, varying parameters over the whole parameter space. The numerator of this ratio is less than the denominator; so, the likelihood ratio is between 0 and 1. Low values of the likelihood ratio mean that the observed result was much less likely to occur under the null hypothesis as compared to the alternative. High values of the statistic mean that the observed outcome was nearly as likely to occur under the null hypothesis as the alternative, and so the null hypothesis cannot be rejected.

An example

The following example is adapted and abridged from . Suppose that we have a random sample, of size , from a population that is normally-distributed. Both the mean, , and the standard deviation, , of the population are unknown. We want to test whether the mean is equal to a given value, . Thus, our null hypothesis is and our alternative hypothesis is . The likelihood function is :

\mathcal(\mu,\sigma \mid x) = \left(2\pi\sigma^2\right)^ \exp\left( -\sum_^n \frac\right)\,.

With some calculation (omitted here), it can then be shown that :

\lambda_ = n \ln\left 1 + \frac\right

where is the -statistic with degrees of freedom. Hence we may use the known exact distribution of to draw inferences.

Asymptotic distribution: Wilks’ theorem

If the distribution of the likelihood ratio corresponding to a particular null and alternative hypothesis can be explicitly determined then it can directly be used to form decision regions (to sustain or reject the null hypothesis). In most cases, however, the exact distribution of the likelihood ratio corresponding to specific hypotheses is very difficult to determine. Assuming is true, there is a fundamental result by Samuel S. Wilks: As the sample size

n

approaches

\infty

, and if the null hypothesis lies strictly within the interior of the parameter space, the test statistic

\lambda_\text

defined above will be asymptotically chi-squared distributed (

\chi^2

) with

degrees of freedom In many scientific fields, the degrees of freedom of a system is the number of parameters of the system that may vary independently. For example, a point in the plane has two degrees of freedom for translation: its two coordinates; a non-infinite ...

equal to the difference in dimensionality of

\Theta

and

\Theta_0

. This implies that for a great variety of hypotheses, we can calculate the likelihood ratio

\lambda

for the data and then compare the observed

\lambda_\text

to the

\chi^2

value corresponding to a desired

statistical significance In statistical hypothesis testing, a result has statistical significance when a result at least as "extreme" would be very infrequent if the null hypothesis were true. More precisely, a study's defined significance level, denoted by \alpha, is the ...

as an ''approximate'' statistical test. Other extensions exist.

References

External links

Practical application of likelihood ratio test described

R Package: Wald's Sequential Probability Ratio Test

Online Clinical Calculator {{DEFAULTSORT:Likelihood-Ratio Test Statistical ratios Statistical tests