Bayesian statistics Bayesian statistics ( or ) is a theory in the field of statistics based on the Bayesian interpretation of probability, where probability expresses a ''degree of belief'' in an event. The degree of belief may be based on prior knowledge about ...

, a credible interval is an interval used to characterize a

probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...

. It is defined such that an unobserved

parameter A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...

value has a particular

probability Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...

\gamma

to fall within it. For example, in an experiment that determines the distribution of possible values of the parameter

\mu

, if the probability that

\mu

lies between 35 and 45 is

\gamma=0.95

, then

35 \le \mu \le 45

is a 95% credible interval. Credible intervals are typically used to characterize

posterior probability The posterior probability is a type of conditional probability that results from updating the prior probability with information summarized by the likelihood via an application of Bayes' rule. From an epistemological perspective, the posteri ...

distributions or predictive probability distributions. Their generalization to disconnected or multivariate sets is called credible set or credible region. Credible intervals are a Bayesian analog to confidence intervals in frequentist statistics. The two concepts arise from different philosophies: Bayesian intervals treat their bounds as fixed and the estimated parameter as a random variable, whereas frequentist confidence intervals treat their bounds as random variables and the parameter as a fixed value. Also, Bayesian credible intervals use (and indeed, require) knowledge of the situation-specific prior distribution, while the frequentist confidence intervals do not.

Definitions

Credible sets are not unique, as any given probability distribution has an infinite number of

\gamma

-credible sets, i.e. sets of probability

\gamma

. For example, in the univariate case, there are multiple definitions for a suitable interval or set: *The smallest credible interval (SCI), sometimes also called the highest density interval. This interval necessarily contains the

median The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...

whenever

\gamma\geq 0.5

. When the distribution is unimodal, this interval also contains the mode. *The smallest credible set (SCS), sometimes also called the highest density region. For a multimodal distribution, this is not necessarily an interval as it can be disconnected. This set always contains the mode. *A quantile-based credible interval, which is computed by taking the inter-

quantile In statistics and probability, quantiles are cut points dividing the range of a probability distribution into continuous intervals with equal probabilities or dividing the observations in a sample in the same way. There is one fewer quantile t ...

interval

_\delta, q_/math> for some predefined \delta \in,1-\gamma /math>. For instance, the median credible interval (MCI) of probability \gamma is the interval where the probability of being below the interval is as likely as being above it, that is to say the interval_, q_/math>. It is sometimes also called the equal-tailed interval, and it always contains the

. Other quantile-based credible intervals can be defined, such as the lowest credible interval (LCI) which is

_0, q_/math>, or the highest credible interval (HCI) which is_, q_/math>. These intervals may be more suited for bounded variables.

One may also define an interval for which the

mean A mean is a quantity representing the "center" of a collection of numbers and is intermediate to the extreme values of the set of numbers. There are several kinds of means (or "measures of central tendency") in mathematics, especially in statist ...

is the central point, assuming that the mean exists.

\gamma

-Smallest Credible Sets (

\gamma

-SCS) can easily be generalized to the multivariate case, and are bounded by probability density contour lines.O'Hagan, A. (1994) ''Kendall's Advanced Theory of Statistics, Vol 2B, Bayesian Inference'', Section 2.51. Arnold, They always contain the mode, but not necessarily the

, the coordinate-wise median, nor the geometric median. Credible intervals can also be estimated through the use of simulation techniques such as

Markov chain Monte Carlo In statistics, Markov chain Monte Carlo (MCMC) is a class of algorithms used to draw samples from a probability distribution. Given a probability distribution, one can construct a Markov chain whose elements' distribution approximates it – that ...

Contrasts with confidence interval

A frequentist 95% confidence interval means that with a large number of repeated samples, 95% of such calculated confidence intervals would include the true value of the parameter. In frequentist terms, the parameter is ''fixed'' (cannot be considered to have a distribution of possible values) and the confidence interval is ''random'' (as it depends on the random sample). Bayesian credible intervals differ from frequentist confidence intervals by two major aspects: * credible intervals are intervals whose values have a (posterior) probability density, representing the plausibility that the parameter has those values, whereas confidence intervals regard the population parameter as fixed and therefore not the object of probability. Within confidence intervals, confidence refers to the randomness of the very confidence interval under repeated trials, whereas credible intervals analyse the uncertainty of the target parameter given the data at hand. *credible intervals and confidence intervals treat nuisance parameters in radically different ways. For the case of a single parameter and data that can be summarised in a single sufficient statistic, it can be shown that the credible interval and the confidence interval coincide if the unknown parameter is a

location parameter In statistics, a location parameter of a probability distribution is a scalar- or vector-valued parameter x_0, which determines the "location" or shift of the distribution. In the literature of location parameter estimation, the probability distr ...

(i.e. the forward probability function has the form

\mathrm(x, \mu) = f(x - \mu)

), with a prior that is a uniform flat distribution;Jaynes, E. T. (1976).
Confidence Intervals vs Bayesian Intervals
, in ''Foundations of Probability Theory, Statistical Inference, and Statistical Theories of Science'', (W. L. Harper and C. A. Hooker, eds.), Dordrecht: D. Reidel, pp. 175 ''et seq'' and also if the unknown parameter is a scale parameter (i.e. the forward probability function has the form

\mathrm(x, s) = f(x/s)

), with a Jeffreys' prior

\mathrm(s, I) \;\propto\; 1/s

— the latter following because taking the logarithm of such a scale parameter turns it into a location parameter with a uniform distribution. But these are distinctly special (albeit important) cases; in general no such equivalence can be made.

Definitions

Contrasts with confidence interval

References

Further reading