HOME

TheInfoList



OR:

In
Bayesian statistics Bayesian statistics is a theory in the field of statistics based on the Bayesian interpretation of probability where probability expresses a ''degree of belief'' in an event. The degree of belief may be based on prior knowledge about the event, ...
, a credible interval is an interval within which an unobserved
parameter A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
value falls with a particular
probability Probability is the branch of mathematics concerning numerical descriptions of how likely an event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and 1, where, roughly speakin ...
. It is an interval in the domain of a
posterior probability distribution The posterior probability is a type of conditional probability that results from updating the prior probability with information summarized by the likelihood via an application of Bayes' rule. From an epistemological perspective, the posterior p ...
or a predictive distribution. The generalisation to multivariate problems is the credible region. Credible intervals are analogous to confidence intervals and confidence regions in
frequentist statistics Frequentist inference is a type of statistical inference based in frequentist probability, which treats “probability” in equivalent terms to “frequency” and draws conclusions from sample-data by means of emphasizing the frequency or pro ...
, although they differ on a philosophical basis: Bayesian intervals treat their bounds as fixed and the estimated parameter as a random variable, whereas frequentist confidence intervals treat their bounds as random variables and the parameter as a fixed value. Also, Bayesian credible intervals use (and indeed, require) knowledge of the situation-specific
prior distribution In Bayesian statistical inference, a prior probability distribution, often simply called the prior, of an uncertain quantity is the probability distribution that would express one's beliefs about this quantity before some evidence is taken in ...
, while the frequentist confidence intervals do not. For example, in an experiment that determines the distribution of possible values of the parameter \mu, if the
subjective probability Bayesian probability is an interpretation of the concept of probability, in which, instead of frequency or propensity of some phenomenon, probability is interpreted as reasonable expectation representing a state of knowledge or as quantification o ...
that \mu lies between 35 and 45 is 0.95, then 35 \le \mu \le 45 is a 95% credible interval.


Choosing a credible interval

Credible intervals are not unique on a posterior distribution. Methods for defining a suitable credible interval include: *Choosing the narrowest interval, which for a
unimodal distribution In mathematics, unimodality means possessing a unique mode. More generally, unimodality means there is only a single highest value, somehow defined, of some mathematical object. Unimodal probability distribution In statistics, a unimodal ...
will involve choosing those values of highest probability density including the mode (the ''
maximum a posteriori In Bayesian statistics, a maximum a posteriori probability (MAP) estimate is an estimate of an unknown quantity, that equals the mode of the posterior distribution. The MAP can be used to obtain a point estimate of an unobserved quantity on the ...
''). This is sometimes called the highest posterior density interval (HPDI). *Choosing the interval where the probability of being below the interval is as likely as being above it. This interval will include the
median In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fea ...
. This is sometimes called the equal-tailed interval. *Assuming that the mean exists, choosing the interval for which the
mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set. For a data set, the ''arithm ...
is the central point. It is possible to frame the choice of a credible interval within
decision theory Decision theory (or the theory of choice; not to be confused with choice theory) is a branch of applied probability theory concerned with the theory of making decisions based on assigning probabilities to various factors and assigning numerical ...
and, in that context, an optimal interval will always be a highest probability density set.O'Hagan, A. (1994) ''Kendall's Advanced Theory of Statistics, Vol 2B, Bayesian Inference'', Section 2.51. Arnold, Credible intervals can also be estimated through the use of simulation techniques such as
Markov chain Monte Carlo In statistics, Markov chain Monte Carlo (MCMC) methods comprise a class of algorithms for sampling from a probability distribution. By constructing a Markov chain that has the desired distribution as its equilibrium distribution, one can obtain a ...
.


Contrasts with confidence interval

A frequentist 95% confidence interval means that with a large number of repeated samples, 95% of such calculated confidence intervals would include the true value of the parameter. In frequentist terms, the parameter is ''fixed'' (cannot be considered to have a distribution of possible values) and the confidence interval is ''random'' (as it depends on the random sample). Bayesian credible intervals can be quite different from frequentist confidence intervals for two reasons: *credible intervals incorporate problem-specific contextual information from the
prior distribution In Bayesian statistical inference, a prior probability distribution, often simply called the prior, of an uncertain quantity is the probability distribution that would express one's beliefs about this quantity before some evidence is taken in ...
whereas confidence intervals are based only on the data; *credible intervals and confidence intervals treat
nuisance parameter Nuisance (from archaic ''nocence'', through Fr. ''noisance'', ''nuisance'', from Lat. ''nocere'', "to hurt") is a common law tort. It means that which causes offence, annoyance, trouble or injury. A nuisance can be either public (also "common" ...
s in radically different ways. For the case of a single parameter and data that can be summarised in a single
sufficient statistic In statistics, a statistic is ''sufficient'' with respect to a statistical model and its associated unknown parameter if "no other statistic that can be calculated from the same sample provides any additional information as to the value of the par ...
, it can be shown that the credible interval and the confidence interval ''will'' coincide if the unknown parameter is a
location parameter In geography, location or place are used to denote a region (point, line, or area) on Earth's surface or elsewhere. The term ''location'' generally implies a higher degree of certainty than ''place'', the latter often indicating an entity with an ...
(i.e. the forward probability function has the form \mathrm(x, \mu) = f(x - \mu) ), with a prior that is a uniform flat distribution;Jaynes, E. T. (1976).
Confidence Intervals vs Bayesian Intervals
, in ''Foundations of Probability Theory, Statistical Inference, and Statistical Theories of Science'', (W. L. Harper and C. A. Hooker, eds.), Dordrecht: D. Reidel, pp. 175 ''et seq''
and also if the unknown parameter is a
scale parameter In probability theory and statistics, a scale parameter is a special kind of numerical parameter of a parametric family of probability distributions. The larger the scale parameter, the more spread out the distribution. Definition If a family o ...
(i.e. the forward probability function has the form \mathrm(x, s) = f(x/s) ), with a
Jeffreys' prior In Bayesian probability, the Jeffreys prior, named after Sir Harold Jeffreys, is a non-informative (objective) prior distribution for a parameter space; its density function is proportional to the square root of the determinant of the Fisher inf ...
  \mathrm(s, I) \;\propto\; 1/s — the latter following because taking the logarithm of such a scale parameter turns it into a location parameter with a uniform distribution. But these are distinctly special (albeit important) cases; in general no such equivalence can be made.


References


Further reading

* {{Statistics Bayesian estimation Statistical intervals