HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, Whittle likelihood is an approximation to the
likelihood function A likelihood function (often simply called the likelihood) measures how well a statistical model explains observed data by calculating the probability of seeing that data under different parameter values of the model. It is constructed from the ...
of a stationary Gaussian
time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...
. It is named after the mathematician and statistician Peter Whittle, who introduced it in his PhD thesis in 1951. It is commonly used in
time series analysis In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...
and
signal processing Signal processing is an electrical engineering subfield that focuses on analyzing, modifying and synthesizing ''signals'', such as audio signal processing, sound, image processing, images, Scalar potential, potential fields, Seismic tomograph ...
for parameter estimation and signal detection.


Context

In a stationary Gaussian time series model, the
likelihood function A likelihood function (often simply called the likelihood) measures how well a statistical model explains observed data by calculating the probability of seeing that data under different parameter values of the model. It is constructed from the ...
is (as usual in Gaussian models) a function of the associated mean and covariance parameters. With a large number (N) of observations, the (N \times N) covariance matrix may become very large, making computations very costly in practice. However, due to stationarity, the covariance matrix has a rather simple structure, and by using an approximation, computations may be simplified considerably (from O(N^2) to O(N\log(N))). The idea effectively boils down to assuming a heteroscedastic zero-mean Gaussian model in Fourier domain; the model formulation is based on the time series'
discrete Fourier transform In mathematics, the discrete Fourier transform (DFT) converts a finite sequence of equally-spaced Sampling (signal processing), samples of a function (mathematics), function into a same-length sequence of equally-spaced samples of the discre ...
and its
power spectral density In signal processing, the power spectrum S_(f) of a continuous time signal x(t) describes the distribution of power into frequency components f composing that signal. According to Fourier analysis, any physical signal can be decomposed into ...
.
See also:


Definition

Let X_1,\ldots,X_N be a stationary Gaussian time series with (''one-sided'') power spectral density S_1(f), where N is even and samples are taken at constant sampling intervals \Delta_t. Let \tilde_1,\ldots,\tilde_ be the (complex-valued)
discrete Fourier transform In mathematics, the discrete Fourier transform (DFT) converts a finite sequence of equally-spaced Sampling (signal processing), samples of a function (mathematics), function into a same-length sequence of equally-spaced samples of the discre ...
(DFT) of the time series. Then for the Whittle likelihood one effectively assumes independent zero-mean
Gaussian distribution In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real number, real-valued random variable. The general form of its probability density function is f(x ...
s for all \tilde_j with variances for the real and imaginary parts given by : \operatorname\left(\operatorname(\tilde_j)\right) = \operatorname \left( \operatorname (\tilde_j)\right) = S_1(f_j) where f_j=\frac j is the jth Fourier frequency. This approximate model immediately leads to the (logarithmic) likelihood function :\log\left(P(x_1,\ldots,x_N)\right) \propto -\sum_j \left( \log\left( S_1(f_j) \right) + \frac \right) where , \cdot, denotes the absolute value with , \tilde_j, ^2=\left(\operatorname(\tilde_j)\right)^2 + \left( \operatorname (\tilde_j)\right)^2.


Special case of a known noise spectrum

In case the noise spectrum is assumed a-priori ''known'', and noise properties are not to be inferred from the data, the likelihood function may be simplified further by ignoring constant terms, leading to the sum-of-squares expression :\log\left(P(x_1,\ldots,x_N)\right) \;\propto\; -\sum_j\frac This expression also is the basis for the common
matched filter In signal processing, the output of the matched filter is given by correlating a known delayed signal, or ''template'', with an unknown signal to detect the presence of the template in the unknown signal. This is equivalent to convolving the unkn ...
.


Accuracy of approximation

The Whittle likelihood in general is only an approximation, it is only exact if the spectrum is constant, i.e., in the trivial case of
white noise In signal processing, white noise is a random signal having equal intensity at different frequencies, giving it a constant power spectral density. The term is used with this or similar meanings in many scientific and technical disciplines, i ...
. The
efficiency Efficiency is the often measurable ability to avoid making mistakes or wasting materials, energy, efforts, money, and time while performing a task. In a more general sense, it is the ability to do things well, successfully, and without waste. ...
of the Whittle approximation always depends on the particular circumstances. Note that due to
linearity In mathematics, the term ''linear'' is used in two distinct senses for two different properties: * linearity of a '' function'' (or '' mapping''); * linearity of a '' polynomial''. An example of a linear function is the function defined by f(x) ...
of the Fourier transform, Gaussianity in Fourier domain implies Gaussianity in time domain and vice versa. What makes the Whittle likelihood only approximately accurate is related to the
sampling theorem Sampling may refer to: *Sampling (signal processing), converting a continuous signal into a discrete signal *Sample (graphics), Sampling (graphics), converting continuous colors into discrete color components *Sampling (music), the reuse of a soun ...
—the effect of Fourier-transforming only a ''finite'' number of data points, which also manifests itself as
spectral leakage The Fourier transform of a function of time, s(t), is a complex-valued function of frequency, S(f), often referred to as a frequency spectrum. Any LTI system theory, linear time-invariant operation on s(t) produces a new spectrum of the form H(f)� ...
in related problems (and which may be ameliorated using the same methods, namely, windowing). In the present case, the implicit periodicity assumption implies correlation between the first and last samples (x_1 and x_N), which are effectively treated as "neighbouring" samples (like x_1 and x_2).


Applications


Parameter estimation

Whittle's likelihood is commonly used to estimate signal parameters for signals that are buried in non-white noise. The noise spectrum then may be assumed known, or it may be inferred along with the signal parameters.


Signal detection

Signal detection is commonly performed with the
matched filter In signal processing, the output of the matched filter is given by correlating a known delayed signal, or ''template'', with an unknown signal to detect the presence of the template in the unknown signal. This is equivalent to convolving the unkn ...
, which is based on the Whittle likelihood for the case of a ''known'' noise power spectral density. The matched filter effectively does a
maximum-likelihood In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed stati ...
fit of the signal to the noisy data and uses the resulting likelihood ratio as the detection statistic. The matched filter may be generalized to an analogous procedure based on a
Student-t distribution In probability theory and statistics, Student's  distribution (or simply the  distribution) t_\nu is a continuous probability distribution that generalizes the Normal distribution#Standard normal distribution, standard normal distribu ...
by also considering uncertainty (e.g.
estimation Estimation (or estimating) is the process of finding an estimate or approximation, which is a value that is usable for some purpose even if input data may be incomplete, uncertain, or unstable. The value is nonetheless usable because it is d ...
uncertainty) in the noise spectrum. On the technical side, the EM algorithm may be utilized here, effectively leading to repeated or iterative matched-filtering.


Spectrum estimation

The Whittle likelihood is also applicable for estimation of the noise spectrum, either alone or in conjunction with signal parameters.


See also

* * * * * * *


References

{{Statistics, analysis Time series Time series models Frequency-domain analysis Statistical inference Statistical models Statistical signal processing Signal estimation Normal distribution