HOME

TheInfoList



OR:

The VEGAS algorithm, due to G. Peter Lepage, is a method for reducing error in
Monte Carlo simulation Monte Carlo methods, or Monte Carlo experiments, are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. The underlying concept is to use randomness to solve problems that might be determ ...
s by using a known or approximate
probability distribution In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomeno ...
function to concentrate the search in those areas of the
integrand In mathematics, an integral assigns numbers to functions in a way that describes displacement, area, volume, and other concepts that arise by combining infinitesimal data. The process of finding integrals is called integration. Along with ...
that make the greatest contribution to the final
integral In mathematics, an integral assigns numbers to functions in a way that describes displacement, area, volume, and other concepts that arise by combining infinitesimal data. The process of finding integrals is called integration. Along with ...
. The VEGAS algorithm is based on
importance sampling Importance sampling is a Monte Carlo method for evaluating properties of a particular distribution, while only having samples generated from a different distribution than the distribution of interest. Its introduction in statistics is generally att ...
. It samples points from the probability distribution described by the function , f, , so that the points are concentrated in the regions that make the largest contribution to the integral. The
GNU Scientific Library The GNU Scientific Library (or GSL) is a software library for numerical computations in applied mathematics and science. The GSL is written in C; wrappers are available for other programming languages. The GSL is part of the GNU Project and is d ...
(GSL) provides a VEGAS routine.


Sampling method

In general, if the Monte Carlo integral of f over a volume \Omega is sampled with points distributed according to a probability distribution described by the function g, we obtain an estimate \mathrm_g(f; N), :\mathrm_g(f; N) = \sum_i^N / g(x_i) . The
variance In probability theory and statistics, variance is the expectation of the squared deviation of a random variable from its population mean or sample mean. Variance is a measure of dispersion, meaning it is a measure of how far a set of number ...
of the new estimate is then :\mathrm_g(f; N) = \mathrm(f/g; N) where \mathrm(f;N) is the variance of the original estimate, \mathrm(f; N) = \mathrm(f^2; N) - (\mathrm(f; N))^2. If the probability distribution is chosen as g = , f, /\textstyle \int_\Omega , f(x), dx then it can be shown that the variance \mathrm_g(f; N) vanishes, and the error in the estimate will be zero. In practice it is not possible to sample from the exact distribution g for an arbitrary function, so importance sampling algorithms aim to produce efficient approximations to the desired distribution.


Approximation of probability distribution

The VEGAS algorithm approximates the exact distribution by making a number of passes over the integration region while
histogram A histogram is an approximate representation of the distribution of numerical data. The term was first introduced by Karl Pearson. To construct a histogram, the first step is to " bin" (or " bucket") the range of values—that is, divide the ent ...
ming the function f. Each histogram is used to define a sampling distribution for the next pass. Asymptotically this procedure converges to the desired distribution. In order to avoid the number of histogram bins growing like K^d with dimension ''d'' the probability distribution is approximated by a separable function: g(x_1, x_2, \ldots) = g_1(x_1) g_2(x_2) \cdots so that the number of bins required is only ''Kd''. This is equivalent to locating the peaks of the function from the projections of the integrand onto the coordinate axes. The efficiency of VEGAS depends on the validity of this assumption. It is most efficient when the peaks of the integrand are well-localized. If an integrand can be rewritten in a form which is approximately separable this will increase the efficiency of integration with VEGAS.


See also

*
Las Vegas algorithm In computing, a Las Vegas algorithm is a randomized algorithm that always gives correct results; that is, it always produces the correct result or it informs about the failure. However, the runtime of a Las Vegas algorithm differs depending on the ...
*
Monte Carlo integration In mathematics, Monte Carlo integration is a technique for numerical integration using random numbers. It is a particular Monte Carlo method that numerically computes a definite integral. While other algorithms usually evaluate the integrand at ...
*
Importance sampling Importance sampling is a Monte Carlo method for evaluating properties of a particular distribution, while only having samples generated from a different distribution than the distribution of interest. Its introduction in statistics is generally att ...


References


Monte Carlo methods Computational physics Statistical algorithms Variance reduction {{compu-physics-stub