
In
estimation theory
Estimation theory is a branch of statistics that deals with estimating the values of Statistical parameter, parameters based on measured empirical data that has a random component. The parameters describe an underlying physical setting in such ...
and
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the Cramér–Rao bound (CRB) relates to
estimation of a deterministic (fixed, though unknown) parameter. The result is named in honor of
Harald Cramér and
Calyampudi Radhakrishna Rao,
but has also been derived independently by
Maurice Fréchet,
Georges Darmois,
and by
Alexander Aitken and
Harold Silverstone. It is also known as Fréchet-Cramér–Rao or Fréchet-Darmois-Cramér-Rao lower bound. It states that the
precision of any
unbiased estimator
In statistics, the bias of an estimator (or bias function) is the difference between this estimator's expected value and the true value of the parameter being estimated. An estimator or decision rule with zero bias is called ''unbiased''. In stat ...
is at most the
Fisher information; or (equivalently) the reciprocal of the Fisher information is a lower bound on its
variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
.
An unbiased estimator that achieves this bound is said to be (fully) ''
efficient''. Such a solution achieves the lowest possible
mean squared error
In statistics, the mean squared error (MSE) or mean squared deviation (MSD) of an estimator (of a procedure for estimating an unobserved quantity) measures the average of the squares of the errors—that is, the average squared difference betwee ...
among all unbiased methods, and is, therefore, the
minimum variance unbiased (MVU) estimator. However, in some cases, no unbiased technique exists which achieves the bound. This may occur either if for any unbiased estimator, there exists another with a strictly smaller variance, or if an MVU estimator exists, but its variance is strictly greater than the inverse of the Fisher information.
The Cramér–Rao bound can also be used to bound the variance of estimators of given bias. In some cases, a biased approach can result in both a variance and a
mean squared error
In statistics, the mean squared error (MSE) or mean squared deviation (MSD) of an estimator (of a procedure for estimating an unobserved quantity) measures the average of the squares of the errors—that is, the average squared difference betwee ...
that are the unbiased Cramér–Rao lower bound; see
estimator bias.
Significant progress over the Cramér–Rao lower bound was proposed by
Anil Kumar Bhattacharyya through a series of works, called
Bhattacharyya bound.
Statement
The Cramér–Rao bound is stated in this section for several increasingly general cases, beginning with the case in which the parameter is a
scalar and its estimator is
unbiased
Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is inaccurate, closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individ ...
. All versions of the bound require certain regularity conditions, which hold for most well-behaved distributions. These conditions are listed
later in this section.
Scalar unbiased case
Suppose
is an unknown deterministic parameter that is to be estimated from
independent observations (measurements) of
, each from a distribution according to some
probability density function
In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
. The
variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
of any ''unbiased'' estimator
of
is then bounded by the
reciprocal of the
Fisher information :
:
where the Fisher information
is defined by
:
and
is the
natural logarithm
The natural logarithm of a number is its logarithm to the base of a logarithm, base of the e (mathematical constant), mathematical constant , which is an Irrational number, irrational and Transcendental number, transcendental number approxima ...
of the
likelihood function
A likelihood function (often simply called the likelihood) measures how well a statistical model explains observed data by calculating the probability of seeing that data under different parameter values of the model. It is constructed from the ...
for a single sample
and
denotes the
expected value
In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...
with respect to the density
of
. If not indicated, in what follows, the expectation is taken with respect to
.
If
is twice differentiable and certain regularity conditions hold, then the Fisher information can also be defined as follows:
:
The
efficiency
Efficiency is the often measurable ability to avoid making mistakes or wasting materials, energy, efforts, money, and time while performing a task. In a more general sense, it is the ability to do things well, successfully, and without waste.
...
of an unbiased estimator
measures how close this estimator's variance comes to this lower bound; estimator efficiency is defined as
:
or the minimum possible variance for an unbiased estimator divided by its actual variance.
The Cramér–Rao lower bound thus gives
:
.
General scalar case
A more general form of the bound can be obtained by considering a biased estimator
, whose expectation is not
but a function of this parameter, say,
. Hence
is not generally equal to 0. In this case, the bound is given by
:
where
is the derivative of
(by
), and
is the Fisher information defined above.
Bound on the variance of biased estimators
Apart from being a bound on estimators of functions of the parameter, this approach can be used to derive a bound on the variance of biased estimators with a given bias, as follows. Consider an estimator
with bias
, and let
. By the result above, any unbiased estimator whose expectation is
has variance greater than or equal to
. Thus, any estimator
whose bias is given by a function
satisfies
:
The unbiased version of the bound is a special case of this result, with
.
It's trivial to have a small variance − an "estimator" that is constant has a variance of zero. But from the above equation, we find that the
mean squared error
In statistics, the mean squared error (MSE) or mean squared deviation (MSD) of an estimator (of a procedure for estimating an unobserved quantity) measures the average of the squares of the errors—that is, the average squared difference betwee ...
of a biased estimator is bounded by
:
using the standard decomposition of the MSE. Note, however, that if
this bound might be less than the unbiased Cramér–Rao bound
. For instance, in the
example of estimating variance below,
.
Multivariate case
Extending the Cramér–Rao bound to multiple parameters, define a parameter column
vector
Vector most often refers to:
* Euclidean vector, a quantity with a magnitude and a direction
* Disease vector, an agent that carries and transmits an infectious pathogen into another living organism
Vector may also refer to:
Mathematics a ...
:
with probability density function
which satisfies the two
regularity conditions below.
The
Fisher information matrix is a
matrix with element
defined as
:
Let
be an estimator of any vector function of parameters,
, and denote its expectation vector