VEGAS Algorithm
   HOME

TheInfoList



OR:

The VEGAS algorithm, due to
G. Peter Lepage G. Peter Lepage (born 13 April 1952) is a Canadian American theoretical physicist and an academic administrator. He was the Harold Tanner Dean of the College of Arts and Sciences at Cornell University from 2003 to 2013. Early life and educat ...
, is a method for reducing error in
Monte Carlo simulation Monte Carlo methods, or Monte Carlo experiments, are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. The underlying concept is to use randomness to solve problems that might be det ...
s by using a known or approximate
probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
function to concentrate the search in those areas of the
integrand In mathematics, an integral is the continuous analog of a sum, which is used to calculate areas, volumes, and their generalizations. Integration, the process of computing an integral, is one of the two fundamental operations of calculus,Inte ...
that make the greatest contribution to the final
integral In mathematics, an integral is the continuous analog of a Summation, sum, which is used to calculate area, areas, volume, volumes, and their generalizations. Integration, the process of computing an integral, is one of the two fundamental oper ...
. The VEGAS algorithm is based on
importance sampling Importance sampling is a Monte Carlo method for evaluating properties of a particular distribution, while only having samples generated from a different distribution than the distribution of interest. Its introduction in statistics is generally at ...
. It samples points from the probability distribution described by the function , f, , so that the points are concentrated in the regions that make the largest contribution to the integral. The
GNU Scientific Library The GNU Scientific Library (or GSL) is a software library for numerical computations in applied mathematics and science. The GSL is written in C (programming language), C; wrappers are available for other programming languages. The GSL is part of ...
(GSL) provides a VEGAS routine.


Sampling method

In general, if the Monte Carlo integral of f over a volume \Omega is sampled with points distributed according to a probability distribution described by the function g, we obtain an estimate \mathrm_g(f; N), :\mathrm_g(f; N) = \sum_i^N / g(x_i) . The
variance In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
of the new estimate is then :\mathrm_g(f; N) = \mathrm(f/g; N) where \mathrm(f;N) is the variance of the original estimate, \mathrm(f; N) = \mathrm(f^2; N) - (\mathrm(f; N))^2. If the probability distribution is chosen as g = , f, /\textstyle \int_\Omega , f(x), dx then it can be shown that the variance \mathrm_g(f; N) vanishes, and the error in the estimate will be zero. In practice it is not possible to sample from the exact distribution g for an arbitrary function, so importance sampling algorithms aim to produce efficient approximations to the desired distribution.


Approximation of probability distribution

The VEGAS algorithm approximates the exact distribution by making a number of passes over the integration region while
histogram A histogram is a visual representation of the frequency distribution, distribution of quantitative data. To construct a histogram, the first step is to Data binning, "bin" (or "bucket") the range of values— divide the entire range of values in ...
ming the function f. Each histogram is used to define a sampling distribution for the next pass. Asymptotically this procedure converges to the desired distribution. In order to avoid the number of histogram bins growing like K^d with dimension ''d'' the probability distribution is approximated by a separable function: g(x_1, x_2, \ldots) = g_1(x_1) g_2(x_2) \cdots so that the number of bins required is only ''Kd''. This is equivalent to locating the peaks of the function from the
projection Projection or projections may refer to: Physics * Projection (physics), the action/process of light, heat, or sound reflecting from a surface to another in a different direction * The display of images by a projector Optics, graphics, and carto ...
s of the integrand onto the coordinate axes. The efficiency of VEGAS depends on the validity of this assumption. It is most efficient when the peaks of the integrand are well-localized. If an integrand can be rewritten in a form which is approximately separable this will increase the efficiency of integration with VEGAS.


See also

*
Las Vegas algorithm In computing, a Las Vegas algorithm is a randomized algorithm that always gives Correctness (computer science), correct results; that is, it always produces the correct result or it informs about the failure. However, the runtime of a Las Vegas alg ...
*
Monte Carlo integration In mathematics, Monte Carlo integration is a technique for numerical integration using random numbers. It is a particular Monte Carlo method that numerically computes a definite integral. While other algorithms usually evaluate the integrand at ...
*
Importance sampling Importance sampling is a Monte Carlo method for evaluating properties of a particular distribution, while only having samples generated from a different distribution than the distribution of interest. Its introduction in statistics is generally at ...


References


Monte Carlo methods Computational physics Statistical algorithms Variance reduction {{compu-physics-stub