In the mathematical theory of probability, the Indian buffet process (IBP) is a
stochastic process defining a
probability distribution
In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomeno ...
over
sparse
Sparse is a computer software tool designed to find possible coding faults in the Linux kernel. Unlike other such tools, this static analysis tool was initially designed to only flag constructs that were likely to be of interest to kernel de ...
binary matrices with a finite number of rows and an infinite number of columns. This distribution is suitable to use as a
prior for models with potentially infinite number of features. The form of the prior ensures that only a finite number of features will be present in any finite set of observations but more features may appear as more data points are observed.
Indian buffet process prior
Let
be an
binary matrix indicating the presence or absence of a latent feature. The IBP places the following prior on
:
:
where
is the number of non-zero columns in
,
is the number of ones in column
of
,
is the ''N''th
harmonic number, and
is the number of occurrences of the non-zero
binary vector among the columns in
. The parameter
controls the expected number of features present in each observation.
In the Indian buffet process, the rows of
correspond to customers and the columns correspond to dishes in an infinitely long buffet. The first customer takes the first
dishes. The
-th customer then takes dishes that have been previously sampled with probability
, where
is the number of people who have already sampled dish
. He also takes
new dishes. Therefore,
is one if customer
tried the
-th dish and zero otherwise.
This process is infinitely exchangeable for an
equivalence class
In mathematics, when the elements of some set S have a notion of equivalence (formalized as an equivalence relation), then one may naturally split the set S into equivalence classes. These equivalence classes are constructed so that elements ...
of binary matrices defined by a
left-ordered many-to-one function.
is obtained by ordering the columns of the binary matrix
from left to right by the magnitude of the binary number expressed by that column, taking the first row as the most significant bit.
See also
*
Chinese restaurant process
In probability theory, the Chinese restaurant process is a discrete-time stochastic process, analogous to seating customers at tables in a restaurant.
Imagine a restaurant with an infinite number of circular tables, each with infinite capacity. Cu ...
References
{{reflist
*T.L. Griffiths and Z. Ghahraman
The Indian Buffet Process: An Introduction and Review Journal of Machine Learning Research, pp. 1185–1224, 2011.
*
Bayesian
Bayesian statistics