
In
Bayesian statistics
Bayesian statistics ( or ) is a theory in the field of statistics based on the Bayesian interpretation of probability, where probability expresses a ''degree of belief'' in an event. The degree of belief may be based on prior knowledge about ...
, a credible interval is an
interval used to characterize a
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
. It is defined such that an unobserved
parameter
A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
value has a particular
probability
Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...
to fall within it. For example, in an experiment that determines the distribution of possible values of the parameter
, if the probability that
lies between 35 and 45 is
, then
is a 95% credible interval.
Credible intervals are typically used to characterize
posterior probability
The posterior probability is a type of conditional probability that results from updating the prior probability with information summarized by the likelihood via an application of Bayes' rule. From an epistemological perspective, the posteri ...
distributions or
predictive probability distributions. Their generalization to disconnected or multivariate sets is called credible set or credible region.
Credible intervals are a
Bayesian analog to
confidence intervals in
frequentist statistics. The two concepts arise from different philosophies:
Bayesian intervals treat their bounds as fixed and the estimated parameter as a random variable, whereas frequentist confidence intervals treat their bounds as random variables and the parameter as a fixed value. Also, Bayesian credible intervals use (and indeed, require) knowledge of the situation-specific
prior distribution, while the frequentist confidence intervals do not.
Definitions
Credible sets are not unique, as any given probability distribution has an infinite number of
-credible sets, i.e. sets of probability
. For example, in the univariate case, there are multiple definitions for a suitable interval or set:
*The smallest credible interval (SCI), sometimes also called the highest density interval. This interval necessarily contains the
median
The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...
whenever
. When the distribution is
unimodal, this interval also contains the
mode.
*The smallest credible set (SCS), sometimes also called the highest density region. For a multimodal distribution, this is not necessarily an interval as it can be disconnected. This set always contains the
mode.
*A quantile-based credible interval, which is computed by taking the inter-
quantile
In statistics and probability, quantiles are cut points dividing the range of a probability distribution into continuous intervals with equal probabilities or dividing the observations in a sample in the same way. There is one fewer quantile t ...
interval