In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the Khmaladze transformation is a mathematical tool used in constructing convenient
goodness of fit
The goodness of fit of a statistical model describes how well it fits a set of observations. Measures of goodness of fit typically summarize the discrepancy between observed values and the values expected under the model in question. Such measur ...
tests for hypothetical
distribution functions. More precisely, suppose
are
i.i.d., possibly multi-dimensional, random observations generated from an unknown
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
. A classical problem in statistics is to decide how well a given hypothetical distribution function
, or a given hypothetical parametric family of distribution functions
, fits the set of observations. The Khmaladze transformation allows us to construct goodness of fit tests with desirable properties. It is named after
Estate V. Khmaladze.
Consider the sequence of
empirical distribution function
In statistics, an empirical distribution function ( an empirical cumulative distribution function, eCDF) is the Cumulative distribution function, distribution function associated with the empirical measure of a Sampling (statistics), sample. Th ...
s
based on a sequence of i.i.d random variables,
, as ''n'' increases. Suppose
is the hypothetical
distribution function of each
. To test whether the choice of
is correct or not, statisticians use the normalized difference,
:
This
, as a random process in
, is called the
empirical process
In probability theory, an empirical process is a stochastic process that characterizes the deviation of the empirical distribution function from its expectation.
In mean field theory, limit theorems (as the number of objects becomes large) are con ...
. Various
functionals of
are used as test statistics. The change of the variable
,
transforms to the so-called uniform empirical process
. The latter is an empirical processes based on independent random variables
, which are
uniformly distributed on