Geostatistics is a branch of
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
focusing on spatial or
spatiotemporal datasets. Developed originally to predict
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
s of
ore grades for
mining
Mining is the Resource extraction, extraction of valuable geological materials and minerals from the surface of the Earth. Mining is required to obtain most materials that cannot be grown through agriculture, agricultural processes, or feasib ...
operations, it is currently applied in diverse disciplines including
petroleum geology
Petroleum geology is the study of the origins, occurrence, movement, accumulation, and exploration of hydrocarbon fuels. It refers to the specific set of geological disciplines that are applied to the search for hydrocarbons ( oil exploration).
...
,
hydrogeology,
hydrology
Hydrology () is the scientific study of the movement, distribution, and management of water on Earth and other planets, including the water cycle, water resources, and drainage basin sustainability. A practitioner of hydrology is called a hydro ...
,
meteorology
Meteorology is the scientific study of the Earth's atmosphere and short-term atmospheric phenomena (i.e. weather), with a focus on weather forecasting. It has applications in the military, aviation, energy production, transport, agricultur ...
,
oceanography
Oceanography (), also known as oceanology, sea science, ocean science, and marine science, is the scientific study of the ocean, including its physics, chemistry, biology, and geology.
It is an Earth science, which covers a wide range of to ...
,
geochemistry
Geochemistry is the science that uses the tools and principles of chemistry to explain the mechanisms behind major geological systems such as the Earth's crust and its oceans. The realm of geochemistry extends beyond the Earth, encompassing the e ...
,
geometallurgy,
geography
Geography (from Ancient Greek ; combining 'Earth' and 'write', literally 'Earth writing') is the study of the lands, features, inhabitants, and phenomena of Earth. Geography is an all-encompassing discipline that seeks an understanding o ...
,
forestry
Forestry is the science and craft of creating, managing, planting, using, conserving and repairing forests and woodlands for associated resources for human and Natural environment, environmental benefits. Forestry is practiced in plantations and ...
,
environmental control,
landscape ecology,
soil science
Soil science is the study of soil as a natural resource on the surface of the Earth including soil formation, soil classification, classification and Soil survey, mapping; Soil physics, physical, Soil chemistry, chemical, Soil biology, biologica ...
, and
agriculture
Agriculture encompasses crop and livestock production, aquaculture, and forestry for food and non-food products. Agriculture was a key factor in the rise of sedentary human civilization, whereby farming of domesticated species created ...
(esp. in
precision farming). Geostatistics is applied in varied branches of
geography
Geography (from Ancient Greek ; combining 'Earth' and 'write', literally 'Earth writing') is the study of the lands, features, inhabitants, and phenomena of Earth. Geography is an all-encompassing discipline that seeks an understanding o ...
, particularly those involving the spread of diseases (
epidemiology
Epidemiology is the study and analysis of the distribution (who, when, and where), patterns and Risk factor (epidemiology), determinants of health and disease conditions in a defined population, and application of this knowledge to prevent dise ...
), the practice of commerce and military planning (
logistics
Logistics is the part of supply chain management that deals with the efficient forward and reverse flow of goods, services, and related information from the point of origin to the Consumption (economics), point of consumption according to the ...
), and the development of efficient
spatial networks. Geostatistical algorithms are incorporated in many places, including
geographic information systems (GIS).
Background
Geostatistics is intimately related to interpolation methods but extends far beyond simple interpolation problems. Geostatistical techniques rely on statistical models based on random function (or
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
) theory to model the uncertainty associated with spatial estimation and simulation.
A number of simpler interpolation methods/algorithms, such as
inverse distance weighting,
bilinear interpolation and
nearest-neighbor interpolation, were already well known before geostatistics.
[Isaaks, E. H. and Srivastava, R. M. (1989), ''An Introduction to Applied Geostatistics,'' Oxford University Press, New York, USA.] Geostatistics goes beyond the interpolation problem by considering the studied phenomenon at unknown locations as a set of correlated random variables.
Let be the value of the variable of interest at a certain location . This value is unknown (e.g., temperature, rainfall,
piezometric level, geological facies, etc.). Although there exists a value at location that could be measured, geostatistics considers this value as random since it was not measured or has not been measured yet. However, the randomness of is not complete. Still, it is defined by a
cumulative distribution function (CDF) that depends on certain information that is known about the value :
:
Typically, if the value of is known at locations close to (or in the
neighborhood
A neighbourhood (Commonwealth English) or neighborhood (American English) is a geographically localized community within a larger town, city, suburb or rural area, sometimes consisting of a single street and the buildings lining it. Neigh ...
of ) one can constrain the CDF of by this neighborhood: if a high spatial continuity is assumed, can only have values similar to the ones found in the neighborhood. Conversely, in the absence of spatial continuity can take any value. The spatial continuity of the random variables is described by a model of spatial continuity that can be either a parametric function in the case of
variogram-based geostatistics, or have a non-parametric form when using other methods such as
multiple-point simulation or
pseudo-genetic techniques.
By applying a single spatial model on an entire domain, one makes the assumption that is a
stationary process. It means that the same statistical properties are applicable on the entire domain. Several geostatistical methods provide ways of relaxing this stationarity assumption.
In this framework, one can distinguish two modeling goals:
#
Estimating the value for , typically by the
expectation, the
median
The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...
or the
mode of the CDF . This is usually denoted as an estimation problem.
#
Sampling from the entire probability density function by actually considering each possible outcome of it at each location. This is generally done by creating several alternative maps of , called realizations. Consider a domain discretized in grid nodes (or pixels). Each realization is a sample of the complete -dimensional joint distribution function
::
: In this approach, the presence of multiple solutions to the interpolation problem is acknowledged. Each realization is considered as a possible scenario of what the real variable could be. All associated workflows are then considering ensemble of realizations, and consequently ensemble of predictions that allow for probabilistic forecasting. Therefore, geostatistics is often used to generate or update spatial models when solving
inverse problem
An inverse problem in science is the process of calculating from a set of observations the causal factors that produced them: for example, calculating an image in X-ray computed tomography, sound source reconstruction, source reconstruction in ac ...
s.
A number of methods exist for both geostatistical estimation and multiple realizations approaches. Several reference books provide a comprehensive overview of the discipline.
Methods
Estimation
Kriging
Kriging is a group of geostatistical techniques to interpolate the value of a random field (e.g., the elevation, z, of the landscape as a function of the geographic location) at an unobserved location from observations of its value at nearby locations.
Bayesian estimation
Bayesian inference is a method of statistical inference in which
Bayes' theorem
Bayes' theorem (alternatively Bayes' law or Bayes' rule, after Thomas Bayes) gives a mathematical rule for inverting Conditional probability, conditional probabilities, allowing one to find the probability of a cause given its effect. For exampl ...
is used to update a probability model as more evidence or information becomes available. Bayesian inference is playing an increasingly important role in geostatistics. Bayesian estimation implements kriging through a spatial process, most commonly a
Gaussian process, and updates the process using
Bayes' Theorem
Bayes' theorem (alternatively Bayes' law or Bayes' rule, after Thomas Bayes) gives a mathematical rule for inverting Conditional probability, conditional probabilities, allowing one to find the probability of a cause given its effect. For exampl ...
to calculate its posterior. High-dimensional Bayesian geostatistics.
Finite difference method
Considering the principle of conservation of probability, recurrent difference equations (finite difference equations) were used in conjunction with lattices to compute probabilities quantifying uncertainty about the geological structures. This procedure is a numerical alternative method to Markov chains and Bayesian models.
Simulation
* Aggregation
* Dissagregation
*
Turning bands
*
Cholesky decomposition
* Truncated Gaussian
* Plurigaussian
* Annealing
* Spectral simulation
* Sequential Indicator
* Sequential Gaussian
* Dead Leave
*
Transition probabilities
*
Markov chain geostatistics
*
Support vector machine
*
Boolean simulation
* Genetic models
* Pseudo-genetic models
*
Cellular automata
* Multiple-Point Geostatistics
Definitions and tools
*
Regionalized variable theory
*
Covariance function
In probability theory and statistics, the covariance function describes how much two random variables change together (their ''covariance'') with varying spatial or temporal separation. For a random field or stochastic process ''Z''(''x'') on a dom ...
*
Semi-variance
*
Variogram
*
Kriging
*
Range (geostatistics)
*
Sill (geostatistics)
*
Nugget effect
*
Training image
*
Finite difference method
See also
*
Arbia's law of geography
*
Concepts and Techniques in Modern Geography
*
Multivariate interpolation
*
Spline interpolation
*
Geodemographic segmentation
*
Geodesy
Geodesy or geodetics is the science of measuring and representing the Figure of the Earth, geometry, Gravity of Earth, gravity, and Earth's rotation, spatial orientation of the Earth in Relative change, temporally varying Three-dimensional spac ...
*
Geographic Information Science
*
Geographic Information Systems
*
Geomatics
*
SaTScan
*
Remote sensing
Remote sensing is the acquisition of information about an physical object, object or phenomenon without making physical contact with the object, in contrast to in situ or on-site observation. The term is applied especially to acquiring inform ...
*
Pedometrics
*
Time geography
*
Tobler's first law of geography
*
Tobler's second law of geography
*
Uncertain geographic context problem
Notes
References
# Armstrong, M and Champigny, N, 1988, A Study on Kriging Small Blocks, CIM Bulletin, Vol 82, No 923
# Armstrong, M, 1992
Freedom of Speech?De Geeostatisticis, July, No 14
# Champigny, N, 1992
Geostatistics: A tool that works The Northern Miner, May 18
# Clark I, 1979
Practical Geostatistics Applied Science Publishers, London
# David, M, 1977, Geostatistical Ore Reserve Estimation, Elsevier Scientific Publishing Company, Amsterdam
# Hald, A, 1952, Statistical Theory with Engineering Applications, John Wiley & Sons, New York
# (best paper award IAMG 09)
# ISO/DIS 11648-1 Statistical aspects of sampling from bulk materials-Part1: General principles
# Lipschutz, S, 1968, Theory and Problems of Probability, McCraw-Hill Book Company, New York.
# Matheron, G. 1962. Traité de géostatistique appliquée. Tome 1, Editions Technip, Paris, 334 pp.
# Matheron, G. 1989. Estimating and choosing, Springer-Verlag, Berlin.
# McGrew, J. Chapman, & Monroe, Charles B., 2000. An introduction to statistical problem solving in geography, second edition, McGraw-Hill, New York.
# Merks, J W, 1992
Geostatistics or voodoo science The Northern Miner, May 18
#
Merks, J W
Abuse of statistics
CIM Bulletin, January 1993, Vol 86, No 966
# Myers, Donald E.
#
Philip, G M and Watson, D F, 1986, Matheronian Geostatistics; Quo Vadis?, Mathematical Geology, Vol 18, No 1
# Pyrcz, M.J. and Deutsch, C.V., 2014, Geostatistical Reservoir Modeling, 2nd Edition, Oxford University Press, New York, p. 448
# Sharov, A: Quantitative Population Ecology, 1996, https://web.archive.org/web/20020605050231/http://www.ento.vt.edu/~sharov/PopEcol/popecol.html
# Shine, J.A., Wakefield, G.I.: A comparison of supervised imagery classification using analyst-chosen and geostatistically-chosen training sets, 1999, https://web.archive.org/web/20020424165227/http://www.geovista.psu.edu/sites/geocomp99/Gc99/044/gc_044.htm
# Strahler, A. H., and Strahler A., 2006, Introducing Physical Geography, 4th Ed., Wiley.
# Tahmasebi, P., Hezarkhani, A., Sahimi, M., 2012
Multiple-point geostatistical modeling based on the cross-correlation functions Computational Geosciences, 16(3):779-79742.
# Volk, W, 1980, Applied Statistics for Engineers, Krieger Publishing Company, Huntington, New York.
External links
On-Line Library that chronicles Matheron's journey from classical statistics to the new science of geostatistics
{{Authority control