Autocorrelation, sometimes known as serial correlation in the discrete time case, measures the correlation of a signal with a delayed copy of itself. Essentially, it quantifies the similarity between observations of a random variable at different points in time. The analysis of autocorrelation is a mathematical tool for identifying repeating patterns or hidden periodicities within a signal obscured by noise. Autocorrelation is widely used in signal processing, time domain analysis, and time series analysis to understand the behavior of data over time. Different fields of study define autocorrelation differently, and not all of these definitions are equivalent. In some fields, the term is used interchangeably with autocovariance. Various time series models incorporate autocorrelation, such as unit root processes, trend-stationary processes, autoregressive processes, and moving average processes.
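As a rough illustration of how autocorrelation can expose a periodicity hidden in noise, the following sketch builds a sinusoid with an assumed period of 20 samples, buries it in Gaussian noise of comparable strength, and prints the normalized sample autocorrelation at a few lags. The helper name `sample_autocorrelation`, the signal parameters, and the biased (divide by n) estimator are illustrative assumptions, not something prescribed by the text above.

```python
import math
import random

def sample_autocorrelation(x, max_lag):
    """Normalized sample autocorrelation of a real sequence for lags 0..max_lag."""
    n = len(x)
    mean = sum(x) / n
    centered = [v - mean for v in x]
    c0 = sum(v * v for v in centered) / n  # lag-0 autocovariance (sample variance)
    return [
        sum(centered[t] * centered[t + k] for t in range(n - k)) / (n * c0)
        for k in range(max_lag + 1)
    ]

random.seed(0)
period = 20   # assumed period of the hidden component, in samples
n = 5000      # assumed record length
signal = [math.sin(2 * math.pi * t / period) + random.gauss(0.0, 1.0) for t in range(n)]

acf = sample_autocorrelation(signal, max_lag=40)
for lag in (0, 10, 20, 30, 40):
    print(f"lag {lag:2d}: {acf[lag]:+.3f}")
```

Under these assumptions the values at lags 20 and 40 should come out clearly positive and those at lags 10 and 30 clearly negative, which is the signature of a period-20 component that is hard to see in the raw samples.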

Autocorrelation of stochastic processes

In statistics, the autocorrelation of a real or complex random process is the Pearson correlation between values of the process at different times, as a function of the two times or of the time lag. Let $\left\{X_t\right\}$ be a random process, and $t$ be any point in time ($t$ may be an integer for a discrete-time process or a real number for a continuous-time process). Then $X_t$ is the value (or realization) produced by a given run of the process at time $t$. Suppose that the process has mean $\mu_t$ and variance $\sigma_t^2$ at time $t$, for each $t$. Then the definition of the autocorrelation function between times $t_1$ and $t_2$ is (Kun Il Park, Fundamentals of Probability and Stochastic Processes with Applications to Communications, Springer, 2018)

$$\operatorname{R}_{XX}(t_1,t_2) = \operatorname{E}\left[X_{t_1} \overline{X_{t_2}}\right]$$

where $\operatorname{E}$ is the expected value operator and the bar represents complex conjugation. Note that the expectation may not be well defined.

Subtracting the mean before multiplication yields the auto-covariance function between times $t_1$ and $t_2$:

$$\operatorname{K}_{XX}(t_1,t_2) = \operatorname{E}\left[(X_{t_1} - \mu_{t_1}) \overline{(X_{t_2} - \mu_{t_2})}\right] = \operatorname{E}\left[X_{t_1} \overline{X_{t_2}}\right] - \mu_{t_1} \overline{\mu_{t_2}}$$

Note that this expression is not well defined for all time series or processes, because the mean may not exist, or the variance may be zero (for a constant process) or infinite (for processes with distribution lacking well-behaved moments, such as certain types of power law).
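The two-time definitions can be approximated by averaging over many independent runs of a process. The sketch below does this for a random walk with drift, chosen here only as a convenient example of a process whose mean and variance change with time; the function names, the drift value, and the number of runs are assumptions made for illustration. Because the process is real-valued, the complex conjugate has no effect and is omitted.

```python
import random

def simulate_random_walk(steps, drift, rng):
    """One realization X_0..X_{steps-1} of a random walk with drift (not stationary)."""
    x, path = 0.0, []
    for _ in range(steps):
        x += drift + rng.gauss(0.0, 1.0)
        path.append(x)
    return path

def ensemble_moments(runs, t1, t2):
    """Ensemble estimates of the means, R(t1, t2) and K(t1, t2) for a real process."""
    m = len(runs)
    mu1 = sum(r[t1] for r in runs) / m
    mu2 = sum(r[t2] for r in runs) / m
    r_hat = sum(r[t1] * r[t2] for r in runs) / m                  # E[X_t1 X_t2]
    k_hat = sum((r[t1] - mu1) * (r[t2] - mu2) for r in runs) / m  # means removed first
    return mu1, mu2, r_hat, k_hat

rng = random.Random(1)
runs = [simulate_random_walk(steps=50, drift=0.1, rng=rng) for _ in range(2000)]
mu1, mu2, r_hat, k_hat = ensemble_moments(runs, t1=10, t2=40)

print("R(t1,t2) estimate:        ", r_hat)
print("K(t1,t2) estimate:        ", k_hat)
print("R(t1,t2) - mu_t1 * mu_t2: ", r_hat - mu1 * mu2)
```

The last two printed values agree up to rounding, mirroring the identity between the autocovariance and the autocorrelation minus the product of the means. For this particular walk with unit-variance steps, the theoretical autocovariance equals the number of steps taken by the earlier time (11 here), so the estimate should land near that value.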

Definition for wide-sense stationary stochastic process

If $\left\{X_t\right\}$ is a wide-sense stationary process, then the mean $\mu$ and the variance $\sigma^2$ are time-independent, and the autocovariance function depends only on the lag between $t_1$ and $t_2$: it depends on the time-distance between the pair of values but not on their position in time. This further implies that the autocovariance and autocorrelation can be expressed as a function of the time lag, and that this would be an even function of the lag $\tau=t_2-t_1$. Setting $t_1 = t$ and $t_2 = t+\tau$ gives the more familiar forms for the autocorrelation function

$$\operatorname{R}_{XX}(\tau) = \operatorname{E}\left[X_t \overline{X_{t+\tau}}\right]$$

and the auto-covariance function:

$$\operatorname{K}_{XX}(\tau) = \operatorname{E}\left[(X_t - \mu) \overline{(X_{t+\tau} - \mu)}\right] = \operatorname{E}\left[X_t \overline{X_{t+\tau}}\right] - \mu \overline{\mu}$$

In particular, note that

$$\operatorname{K}_{XX}(0) = \sigma^2 .$$
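For a wide-sense stationary process, a single long realization can stand in for the ensemble, with averages taken over time instead of over runs. The sketch below assumes a real first-order autoregressive process with coefficient 0.6 as the example and uses the biased (divide by n) time-average estimator; the names `sample_autocovariance`, `phi`, and the sample size are illustrative choices.

```python
import math
import random

def sample_autocovariance(x, max_lag):
    """Biased time-average estimate of K(tau) for tau = 0..max_lag (real data)."""
    n = len(x)
    mean = sum(x) / n
    d = [v - mean for v in x]
    return [sum(d[t] * d[t + k] for t in range(n - k)) / n for k in range(max_lag + 1)]

rng = random.Random(2)
phi, n = 0.6, 20000  # assumed AR(1) coefficient and record length

# Draw X_0 from the stationary distribution so the whole record is stationary.
x = [rng.gauss(0.0, 1.0 / math.sqrt(1.0 - phi ** 2))]
for _ in range(n - 1):
    x.append(phi * x[-1] + rng.gauss(0.0, 1.0))

k_hat = sample_autocovariance(x, max_lag=5)
mean = sum(x) / n
variance = sum((v - mean) ** 2 for v in x) / n

print("K(0) vs sample variance:", k_hat[0], variance)
print("K(tau) / K(0):", [round(k / k_hat[0], 3) for k in k_hat])
```

The lag-0 estimate coincides with the sample variance, matching the relation between the autocovariance at zero lag and the variance. For this assumed model the ratios should decay roughly like powers of `phi`, since the estimate depends only on the lag and not on where in the record the pair of values sits.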

Normalization

It is common practice in some disciplines (e.g. statistics and time series analysis) to normalize the autocovariance function to get a time-dependent Pearson correlation coefficient. However, in other disciplines (e.g. engineering) the normalization is usually dropped and the terms "autocorrelation" and "autocovariance" are used interchangeably.

The definition of the autocorrelation coefficient of a stochastic process is

$$\rho_{XX}(t_1,t_2) = \frac{\operatorname{K}_{XX}(t_1,t_2)}{\sigma_{t_1}\sigma_{t_2}} = \frac{\operatorname{E}\left[(X_{t_1} - \mu_{t_1}) \overline{(X_{t_2} - \mu_{t_2})}\right]}{\sigma_{t_1}\sigma_{t_2}} .$$

If the function $\rho_{XX}$ is well defined, its value must lie in the range $[-1,1]$, with 1 indicating perfect correlation and −1 indicating perfect anti-correlation.

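For a wide-sense stationary (WSS) process, the definition is

$$\rho_{XX}(\tau) = \frac{\operatorname{K}_{XX}(\tau)}{\sigma^2} = \frac{\operatorname{E}\left[(X_t - \mu) \overline{(X_{t+\tau} - \mu)}\right]}{\sigma^2} .$$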

The normalization is important both because the interpretation of the autocorrelation as a correlation provides a scale-free measure of the strength of statistical dependence, and because the normalization has an effect on the statistical properties of the estimated autocorrelations.
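The scale-free character of the normalized coefficient can be checked numerically. In the sketch below, a short made-up sequence is re-expressed in different units (multiplied by 10 and shifted by 3); the autocovariance changes by the square of the scale factor, while the normalized coefficient stays the same. The data values and function names are hypothetical, and the biased sample estimator is an assumed choice.

```python
import math

def autocovariance(x, lag):
    """Biased sample autocovariance of a real sequence at a non-negative lag."""
    n = len(x)
    mean = sum(x) / n
    return sum((x[t] - mean) * (x[t + lag] - mean) for t in range(n - lag)) / n

def autocorrelation_coefficient(x, lag):
    """Autocovariance divided by the lag-0 autocovariance (the sample variance)."""
    return autocovariance(x, lag) / autocovariance(x, 0)

x = [2.0, 4.0, 6.0, 4.0, 2.0, 0.0, 2.0, 4.0, 6.0, 4.0]  # made-up readings
y = [10.0 * v + 3.0 for v in x]                          # same readings, other units

for lag in range(4):
    print(
        lag,
        autocovariance(x, lag),
        autocovariance(y, lag),
        autocorrelation_coefficient(x, lag),
        autocorrelation_coefficient(y, lag),
    )
```

Each row shows the autocovariance of `y` as 100 times that of `x`, with the two coefficient columns matching up to rounding.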

Properties

Symmetry property

The fact that the autocorrelation function $\operatorname{R}_{XX}$ is an even function can be stated as

$$\operatorname{R}_{XX}(t_1,t_2) = \overline{\operatorname{R}_{XX}(t_2,t_1)}$$

respectively for a WSS process:

$$\operatorname{R}_{XX}(\tau) = \overline{\operatorname{R}_{XX}(-\tau)} .$$
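For complex-valued data the symmetry involves a conjugate, which is easy to get wrong in code. The sketch below estimates the autocorrelation at positive and negative lags from one record and compares each value with the conjugate of its mirror image. The convention (conjugate on the later sample), the test signal, and the biased estimator are assumptions made for illustration.

```python
import cmath
import random

def sample_autocorrelation(x, lag):
    """Biased estimate of R(lag) = E[X_t * conj(X_{t+lag})]; lag may be negative."""
    n = len(x)
    start, stop = max(0, -lag), min(n, n - lag)  # keep t and t+lag inside the record
    return sum(x[t] * x[t + lag].conjugate() for t in range(start, stop)) / n

rng = random.Random(3)
theta = rng.uniform(0.0, 2.0 * cmath.pi)  # random phase of the rotating component
x = [
    cmath.exp(1j * (0.3 * t + theta)) + complex(rng.gauss(0, 0.5), rng.gauss(0, 0.5))
    for t in range(1000)
]

for lag in (1, 2, 5):
    forward = sample_autocorrelation(x, lag)
    backward = sample_autocorrelation(x, -lag)
    print(lag, forward, backward.conjugate())
```

The two printed complex numbers on each row agree up to rounding. For real-valued data the conjugate does nothing and the relation reduces to ordinary evenness in the lag.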

Maximum at zero

For a WSS process:

$$\left| \operatorname{R}_{XX}(\tau) \right| \leq \operatorname{R}_{XX}(0)$$

Notice that $\operatorname{R}_{XX}(0)$ is always real.
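The biased sample estimate obeys the same bound for any finite sequence, which makes it a convenient sanity check on an implementation. The sketch below uses a deterministic complex sequence (hypothetical data, two rotating phasors) and compares the magnitudes at nonzero lags against the lag-0 value; the estimator and its name are assumptions, not part of the text above.

```python
import cmath

def sample_autocorrelation(x, lag):
    """Biased estimate of R(lag) = E[X_t * conj(X_{t+lag})] for lag >= 0."""
    n = len(x)
    return sum(x[t] * x[t + lag].conjugate() for t in range(n - lag)) / n

# Deterministic complex test sequence (hypothetical data): two rotating phasors.
x = [cmath.exp(0.4j * t) + 0.5 * cmath.exp(-1.1j * t) for t in range(200)]

r0 = sample_autocorrelation(x, 0)
magnitudes = [abs(sample_autocorrelation(x, lag)) for lag in range(1, 50)]

print("R(0) =", r0)
print("largest |R(tau)| for tau in 1..49:", max(magnitudes))
```

The lag-0 value is an average of squared magnitudes, so its imaginary part vanishes, and none of the other magnitudes exceeds it.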

Cauchy–Schwarz inequality

The Cauchy–Schwarz inequality for stochastic processes:

$$\left| \operatorname{R}_{XX}(t_1,t_2) \right|^2 \leq \operatorname{E}\left[ \left| X_{t_1} \right|^2 \right] \operatorname{E}\left[ \left| X_{t_2} \right|^2 \right]$$
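The Cauchy–Schwarz bound on the two-time autocorrelation can be checked on simulated data by replacing each expectation with an average over runs. The sketch below assumes a hypothetical ensemble in which the values at two times share a common complex Gaussian component plus independent noise; all names and parameters are illustrative.

```python
import random

def ensemble_mean(values):
    """Average over runs, standing in for the expectation operator."""
    return sum(values) / len(values)

rng = random.Random(4)

# Hypothetical ensemble: each run yields a correlated complex pair (X_t1, X_t2).
pairs = []
for _ in range(5000):
    common = complex(rng.gauss(0, 1), rng.gauss(0, 1))
    x_t1 = common + complex(rng.gauss(0, 0.5), rng.gauss(0, 0.5))
    x_t2 = 2.0 * common + complex(rng.gauss(0, 0.5), rng.gauss(0, 0.5))
    pairs.append((x_t1, x_t2))

r_12 = ensemble_mean([a * b.conjugate() for a, b in pairs])  # estimate of R(t1, t2)
power_1 = ensemble_mean([abs(a) ** 2 for a, _ in pairs])     # estimate of E[|X_t1|^2]
power_2 = ensemble_mean([abs(b) ** 2 for _, b in pairs])     # estimate of E[|X_t2|^2]

print(abs(r_12) ** 2, "<=", power_1 * power_2)
```

The left-hand value stays below the right-hand one; the gap closes only when the two values are proportional to each other in every run, which the independent noise terms prevent here.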

