Variance-based sensitivity analysis (often referred to as the Sobol’ method or Sobol’ indices, after
Ilya M. Sobol’) is a form of global
sensitivity analysis
Sensitivity analysis is the study of how the uncertainty in the output of a mathematical model or system (numerical or otherwise) can be divided and allocated to different sources of uncertainty in its inputs. This involves estimating sensitivity ...
.
[Sobol, I.M. (2001), Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. ''MATH COMPUT SIMULAT'',55(1–3),271-280, ][Saltelli, A., Ratto, M., Andres, T., Campolongo, F., Cariboni, J., Gatelli, D. Saisana, M., and Tarantola, S., 2008, ''Global Sensitivity Analysis. The Primer'', John Wiley & Sons.] Working within a
probabilistic
Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...
framework, it decomposes the
variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
of the output of the model or system into fractions which can be attributed to inputs or sets of inputs. For example, given a model with two inputs and one output, one might find that 70% of the output variance is caused by the variance in the first input, 20% by the variance in the second, and 10% due to
interactions between the two. These percentages are directly interpreted as measures of sensitivity. Variance-based measures of sensitivity are attractive because they measure sensitivity across the whole input space (i.e. it is a global method), they can deal with
nonlinear
In mathematics and science, a nonlinear system (or a non-linear system) is a system in which the change of the output is not proportional to the change of the input. Nonlinear problems are of interest to engineers, biologists, physicists, mathe ...
responses, and they can measure the effect of interactions in non-
additive
Additive may refer to:
Mathematics
* Additive function, a function in number theory
* Additive map, a function that preserves the addition operation
* Additive set-function see Sigma additivity
* Additive category, a preadditive category with fin ...
systems.
[Saltelli, A., Annoni, P., 2010, How to avoid a perfunctory sensitivity analysis, ''Environmental Modeling and Software'' 25, 1508–1517.]
Decomposition of variance
From a
black box
In science, computing, and engineering, a black box is a system which can be viewed in terms of its inputs and outputs (or transfer characteristics), without any knowledge of its internal workings. Its implementation is "opaque" (black). The te ...
perspective, any
model
A model is an informative representation of an object, person, or system. The term originally denoted the plans of a building in late 16th-century English, and derived via French and Italian ultimately from Latin , .
Models can be divided in ...
may be viewed as a function ''Y''=''f''(X), where X is a vector of ''d'' uncertain model inputs , and ''Y'' is a chosen
univariate
In mathematics, a univariate object is an expression (mathematics), expression, equation, function (mathematics), function or polynomial involving only one Variable (mathematics), variable. Objects involving more than one variable are ''wikt:multi ...
model output (note that this approach examines scalar model outputs, but multiple outputs can be analysed by multiple independent sensitivity analyses). Furthermore, it will be assumed that the inputs are independently and uniformly distributed within the unit hypercube, i.e.
for
. This incurs no loss of generality because any input space can be transformed onto this unit hypercube. ''f''(X) may be decomposed in the following way,
[Sobol’, I. (1990). Sensitivity estimates for nonlinear mathematical models. ''Matematicheskoe Modelirovanie'' 2, 112–118. in Russian, translated in English in Sobol’ , I. (1993). Sensitivity analysis for non-linear mathematical models. ''Mathematical Modeling & Computational Experiment (Engl. Transl.)'', 1993, 1, 407–414.]
:
where ''f''
0 is a constant and ''f''
''i'' is a function of ''X''
''i'', ''f''
''ij'' a function of ''X''
''i'' and ''X''
''j'', etc. A condition of this decomposition is that,
:
i.e. all the terms in the
functional decomposition
In engineering, functional decomposition is the process of resolving a functional relationship into its constituent parts in such a way that the original function can be reconstructed (i.e., recomposed) from those parts.
This process of decompo ...
are
orthogonal
In mathematics, orthogonality (mathematics), orthogonality is the generalization of the geometric notion of ''perpendicularity''. Although many authors use the two terms ''perpendicular'' and ''orthogonal'' interchangeably, the term ''perpendic ...
. This leads to definitions of the terms of the functional decomposition in terms of conditional expected values,
:
:
:
From which it can be seen that ''f''
''i'' is the effect of varying ''X''
''i'' alone (known as the
main effect In the design of experiments and analysis of variance, a main effect is the effect of an independent variable on a dependent variable averaged across the levels of any other independent variables. The term is frequently used in the context of fact ...
of ''X''
''i''), and ''f''
''ij'' is the effect of varying ''X''
''i'' and ''X''
''j'' simultaneously, ''additional to the effect of their individual variations''. This is known as a second-order
interaction. Higher-order terms have analogous definitions.
Now, further assuming that the ''f''(X) is
square-integrable
In mathematics, a square-integrable function, also called a quadratically integrable function or L^2 function or square-summable function, is a real- or complex-valued measurable function for which the integral of the square of the absolute value ...
, the functional decomposition may be squared and integrated to give,
:
Notice that the left hand side is equal to the variance of ''Y'', and the terms of the right hand side are variance terms, now decomposed with respect to sets of the ''X''
''i''. This finally leads to the decomposition of variance expression,
:
where
:
,
:
and so on. The ''X''
~''i'' notation indicates the set of all variables ''except'' ''X''
''i''. The above variance decomposition shows how the variance of the model output can be decomposed into terms attributable to each input, as well as the interaction effects between them. Together, all terms sum to the total variance of the model output.
First-order indices
A direct variance-based measure of sensitivity ''S''
''i'', called the "first-order
sensitivity index", or "main effect index" is stated as follows,
[
:
This is the contribution to the output variance of the main effect of ''X''''i'', therefore it measures the effect of varying ''X''''i'' ''alone'', but averaged over variations in other input parameters. It is standardised by the total variance to provide a fractional contribution. Higher-order interaction indices ''S''''ij'', ''S''''ijk'' and so on can be formed by dividing other terms in the variance decomposition by Var(''Y''). Note that this has the implication that,
:
]
Total-effect index
Using the ''S''''i'', ''S''''ij'' and higher-order indices given above, one can build a picture of the importance of each variable in determining the output variance. However, when the number of variables is large, this requires the evaluation of 2''d''-1 indices, which can be too computationally demanding. For this reason, a measure known as the "Total-effect index" or "Total-order index", ''S''''Ti'', is used. This measures the contribution to the output variance of ''X''''i'', ''including'' all variance caused by its interactions, of any order, with any other input variables. It is given as,
:
Note that unlike the ''S''''i'',
:
due to the fact that the interaction effect between e.g. ''X''''i'' and ''X''''j'' is counted in both ''S''''Ti'' ''and'' ''S''''Tj''. In fact, the sum of the ''S''''Ti'' will only be equal to 1 when the model is purely additive
Additive may refer to:
Mathematics
* Additive function, a function in number theory
* Additive map, a function that preserves the addition operation
* Additive set-function see Sigma additivity
* Additive category, a preadditive category with fin ...
.
Calculation of indices
For analytically tractable functions, the indices above may be calculated analytically by evaluating the integrals in the decomposition. However, in the vast majority of cases they are estimated – this is usually done by the Monte Carlo method
Monte Carlo methods, or Monte Carlo experiments, are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. The underlying concept is to use randomness to solve problems that might be ...
.
Sampling sequences
The Monte Carlo approach involves generating a sequence of randomly distributed points inside the unit hypercube (strictly speaking these will be pseudorandom
A pseudorandom sequence of numbers is one that appears to be statistically random, despite having been produced by a completely deterministic and repeatable process. Pseudorandom number generators are often used in computer programming, as tradi ...
). In practice, it is common to substitute random sequences with low-discrepancy sequence
In mathematics, a low-discrepancy sequence is a sequence with the property that for all values of N, its subsequence x_1, \ldots, x_N has a low discrepancy of a sequence, discrepancy.
Roughly speaking, the discrepancy of a sequence is low if the p ...
s to improve the efficiency of the estimators. This is then known as the quasi-Monte Carlo method
In numerical analysis, the quasi-Monte Carlo method is a method for numerical integration and solving some other problems using low-discrepancy sequences (also called quasi-random sequences or sub-random sequences) to achieve variance reduction. ...
. Some low-discrepancy sequences commonly used in sensitivity analysis include the Sobol’ sequence and the Latin hypercube design.
Procedure
To calculate the indices using the (quasi) Monte Carlo method, the following steps are used:[
# Generate an ''N''×2''d'' sample matrix, i.e. each row is a sample point in the hyperspace of 2''d'' dimensions. This should be done with respect to the probability distributions of the input variables.
# Use the first ''d'' columns of the matrix as matrix A, and the remaining ''d'' columns as matrix B. This effectively gives two independent samples of ''N'' points in the ''d''-dimensional unit hypercube.
# Build ''d'' further ''N''×''d'' matrices AB''i'', for ''i'' = 1,2,...,d, such that the ''i''th column of AB''i'' is equal to the ''i''th column of B, and the remaining columns are from A.
# The A, B, and the ''d'' AB''i'' matrices in total specify ''N''(''d''+2) points in the input space (one for each row). Run the model at each design point in the A, B, and AB''i'' matrices, giving a total of ''N''(''d''+2) model evaluations – the corresponding f(A), f(B) and f(AB''i'') values.
# Calculate the sensitivity indices using the estimators below.
The accuracy of the estimators is of course dependent on ''N''. The value of ''N'' can be chosen by sequentially adding points and calculating the indices until the estimated values reach some acceptable convergence. For this reason, when using low-discrepancy sequences, it can be advantageous to use those that allow sequential addition of points (such as the Sobol’ sequence), as compared to those that do not (such as Latin hypercube sequences).
]
Estimators
There are a number of possible Monte Carlo estimators available for both indices. Two that are currently in general use are,[Andrea Saltelli, Paola Annoni, Ivano Azzini, Francesca Campolongo, Marco Ratto, and Stefano Tarantola. Variance based sensitivity analysis of model output. Design and estimator for the total sensitivity index. ''Computer Physics Communications'', 181(2):259{270, 2010]
:
and
:
for the estimation of the ''S''''i'' and the ''S''''Ti'' respectively.
Computational expense
For the estimation of the ''S''''i'' and the ''S''''Ti'' for all input variables, ''N''(''d''+2) model runs are required. Since ''N'' is often of the order of hundreds or thousands of runs, computational expense can quickly become a problem when the model takes a significant amount of time for a single run. In such cases, there are a number of techniques available to reduce the computational cost of estimating sensitivity indices, such as emulators
In computing, an emulator is hardware or software that enables one computer system (called the ''host'') to behave like another computer system (called the ''guest''). An emulator typically enables the host system to run software or use perip ...
, HDMR and FAST
Fast or FAST may refer to:
Arts and entertainment
* "Fast" (Juice Wrld song), 2019
* "Fast" (Luke Bryan song), 2016
* "Fast" (Sueco song), 2019
* "Fast" (GloToven song), 2019
* ''Fast'', an album by Custom, 2002
* ''Fast'', a 2010 short fil ...
.
See also
* Sensitivity analysis
Sensitivity analysis is the study of how the uncertainty in the output of a mathematical model or system (numerical or otherwise) can be divided and allocated to different sources of uncertainty in its inputs. This involves estimating sensitivity ...
* Monte Carlo method
Monte Carlo methods, or Monte Carlo experiments, are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. The underlying concept is to use randomness to solve problems that might be ...
* Quasi-Monte Carlo method
In numerical analysis, the quasi-Monte Carlo method is a method for numerical integration and solving some other problems using low-discrepancy sequences (also called quasi-random sequences or sub-random sequences) to achieve variance reduction. ...
* Sobol’ sequence
References
{{Reflist
Mathematical modeling
fr:Indice de Sobol