The Granger causality test is a

statistical hypothesis test A statistical hypothesis test is a method of statistical inference used to decide whether the data provide sufficient evidence to reject a particular hypothesis. A statistical hypothesis test typically involves a calculation of a test statistic. T ...

for determining whether one

time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...

is useful in

forecasting Forecasting is the process of making predictions based on past and present data. Later these can be compared with what actually happens. For example, a company might Estimation, estimate their revenue in the next year, then compare it against the ...

another, first proposed in 1969. Ordinarily, regressions reflect "mere"

correlation In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistics ...

s, but

Clive Granger Sir Clive William John Granger (; 4 September 1934 – 27 May 2009) was a British econometrician known for his contributions to nonlinear time series analysis. He taught in Britain, at the University of Nottingham and in the United States, at t ...

argued that causality in

economics Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and interac ...

could be tested for by measuring the ability to predict the future values of a time series using prior values of another time series. Since the question of "true causality" is deeply philosophical, and because of the

post hoc ergo propter hoc ''Post hoc ergo propter hoc'' (Latin: 'after this, therefore because of this') is an informal fallacy that states "Since event Y ''followed'' event X, event Y must have been ''caused'' by event X." It is a fallacy in which an event is presumed to ...

fallacy of assuming that one thing preceding another can be used as a proof of causation,

econometrician Econometrics is an application of statistical methods to economic data in order to give empirical content to economic relationships. M. Hashem Pesaran (1987). "Econometrics", '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8� ...

s assert that the Granger test finds only "predictive causality". Using the term "causality" alone is a misnomer, as Granger-causality is better described as "precedence", or, as Granger himself later claimed in 1977, "temporally related". Rather than testing whether ''X'' ''causes'' Y, the Granger causality tests whether X ''forecasts'' ''Y.'' A time series ''X'' is said to Granger-cause ''Y'' if it can be shown, usually through a series of

t-test Student's ''t''-test is a statistical test used to test whether the difference between the response of two groups is Statistical significance, statistically significant or not. It is any statistical hypothesis testing, statistical hypothesis test ...

s and

F-test An F-test is a statistical test that compares variances. It is used to determine if the variances of two samples, or if the ratios of variances among multiple samples, are significantly different. The test calculates a Test statistic, statistic, ...

s on lagged values of ''X'' (and with lagged values of ''Y'' also included), that those ''X'' values provide statistically significant information about future values of ''Y''. Granger also stressed that some studies using "Granger causality" testing in areas outside economics reached "ridiculous" conclusions. "Of course, many ridiculous papers appeared", he said in his Nobel lecture. However, it remains a popular method for causality analysis in time series due to its computational simplicity. The original definition of Granger causality does not account for latent confounding effects and does not capture instantaneous and non-linear causal relationships, though several extensions have been proposed to address these issues.

Intuition

We say that a variable ''X'' that evolves over time ''Granger-causes'' another evolving variable ''Y'' if predictions of the value of ''Y'' based on its own past values ''and'' on the past values of ''X'' are better than predictions of ''Y'' based only on ''Ys own past values.

Underlying principles

Granger defined the causality relationship based on two principles: # The cause happens prior to its effect. # The cause has ''unique'' information about the future values of its effect. Given these two assumptions about causality, Granger proposed to test the following hypothesis for identification of a causal effect of

X

Y

: :

\mathbb (t+1) \in A\mid \mathcal(t) \neq \mathbb (t+1) \in A\mid \mathcal_(t)

where

\mathbb

refers to probability,

A

is an arbitrary non-empty set, and

\mathcal(t)

and

\mathcal_(t)

respectively denote the information available as of time

t

in the entire universe, and that in the modified universe in which

X

is excluded. If the above hypothesis is accepted, we say that

X

Granger-causes

Y

Method

If a

is a

stationary process In mathematics and statistics, a stationary process (also called a strict/strictly stationary process or strong/strongly stationary process) is a stochastic process whose statistical properties, such as mean and variance, do not change over time. M ...

, the test is performed using the level values of two (or more) variables. If the variables are non-stationary, then the test is done using first (or higher) differences. The number of lags to be included is usually chosen using an information criterion, such as the Akaike information criterion or the Schwarz information criterion. Any particular lagged value of one of the variables is retained in the regression if (1) it is significant according to a t-test, and (2) it and the other lagged values of the variable jointly add

explanatory power Explanatory power is the ability of a hypothesis or theory to explain the subject matter effectively to which it pertains. Its opposite is ''explanatory impotence''. In the past, various criteria or measures for explanatory power have been prop ...

to the model according to an F-test. Then the

null hypothesis The null hypothesis (often denoted ''H''0) is the claim in scientific research that the effect being studied does not exist. The null hypothesis can also be described as the hypothesis in which no relationship exists between two sets of data o ...

of no Granger causality is not rejected if and only if no lagged values of an explanatory variable have been retained in the regression. In practice it may be found that neither variable Granger-causes the other, or that each of the two variables Granger-causes the other.

Mathematical statement

Let ''y'' and ''x'' be stationary time series. To test the null hypothesis that ''x'' does not Granger-cause ''y'', one first finds the proper lagged values of ''y'' to include in a univariate autoregression of ''y'': :

y_t = a_0 + a_1y_ + a_2y_ + \cdots + a_my_ + \text_t.

Next, the autoregression is augmented by including lagged values of ''x'': :

y_t = a_0 + a_1y_ + a_2y_ + \cdots + a_my_ + b_p x_ + \cdots + b_qx_ + \text_t.

One retains in this regression all lagged values of ''x'' that are individually significant according to their t-statistics, provided that collectively they add explanatory power to the regression according to an F-test (whose null hypothesis is no explanatory power jointly added by the ''xs). In the notation of the above augmented regression, ''p'' is the shortest, and ''q'' is the longest, lag length for which the lagged value of ''x'' is significant. The null hypothesis that ''x'' does not Granger-cause ''y'' is not rejected if and only if no lagged values of ''x'' are retained in the regression.

Multivariate analysis

Multivariate Granger causality analysis is usually performed by fitting a vector autoregressive model (VAR) to the time series. In particular, let

X(t) \in \mathbb^

for

t=1, \ldots, T

be a

d

-dimensional multivariate time series. Granger causality is performed by fitting a VAR model with

L

time lags as follows: :

X(t) = \sum_^L A_X(t-\tau) + \varepsilon(t),

where

\varepsilon(t)

is a white Gaussian random vector, and

A_\tau

is a matrix for every

\tau

. A time series

X_i

is called a Granger cause of another time series

X_j

, if at least one of the elements

A_(j, i)

for

\tau = 1, \ldots, L

is significantly larger than zero (in absolute value).

Non-parametric test

The above linear methods are appropriate for testing Granger causality in the mean. However they are not able to detect Granger causality in higher moments, e.g., in the variance. Non-parametric tests for Granger causality are designed to address this problem. The definition of Granger causality in these tests is general and does not involve any modelling assumptions, such as a linear autoregressive model. The non-parametric tests for Granger causality can be used as diagnostic tools to build better

parametric model In statistics, a parametric model or parametric family or finite-dimensional model is a particular class of statistical models. Specifically, a parametric model is a family of probability distributions that has a finite number of parameters. Defi ...

s including higher order moments and/or non-linearity.

Limitations

As its name implies, Granger causality is not necessarily true causality. In fact, the Granger-causality tests fulfill only the

Humean definition of causality David Hume coined a sceptical, reductionist viewpoint on causality that inspired the logical-positivist definition of empirical law that "is a regularity or universal generalization of the form 'All Cs are Es' or, whenever C, then E". The Scotti ...

that identifies the cause-effect relations with constant conjunctions. If both ''X'' and ''Y'' are driven by a common third process with different lags, one might still fail to reject the

alternative hypothesis In statistical hypothesis testing, the alternative hypothesis is one of the proposed propositions in the hypothesis test. In general the goal of hypothesis test is to demonstrate that in the given condition, there is sufficient evidence supporting ...

of Granger causality. Yet, manipulation of one of the variables would not change the other. Indeed, the Granger-causality tests are designed to handle pairs of variables, and may produce misleading results when the true relationship involves three or more variables. Having said this, it has been argued that given a probabilistic view of causation, Granger causality can be considered true causality in that sense, especially when Reichenbach's "screening off" notion of probabilistic causation is taken into account. Other possible sources of misguiding test results are: (1) not frequent enough or too frequent sampling, (2) nonlinear causal relationship, (3) time series nonstationarity and nonlinearity and (4) existence of rational expectations. A similar test involving more variables can be applied with vector autoregression. The validity of the Granger causality test has been challenged in the academic literature, in a paper claiming that "not even the most fundamental requirement underlying any possible definition of causality is met by the Granger causality test... any definition of causality should refer to the prediction of the future from the past... we find that Granger also allows one to 'predict' the past from the future."

Extensions

A method for Granger causality has been developed that is not sensitive to deviations from the assumption that the error term is normally distributed. This method is especially useful in financial economics, since many financial variables are non-normally distributed. Recently, asymmetric causality testing has been suggested in the literature in order to separate the causal impact of positive changes from the negative ones. An extension of Granger (non-)causality testing to panel data is also available. A modified Granger causality test based on the GARCH (generalized auto-regressive conditional heteroscedasticity) type of integer-valued time series models is available in many areas.

Time-varying Granger causality

The extension of Granger causality to incorporate its dynamic, time-varying nature allows for a more nuanced understanding of how causal relationships in time-series data evolve over time. The methodology uses recursive techniques such as the Forward Expanding (FE), Rolling (RO), and Recursive Evolving (RE) windows to overcome the limitations of traditional Granger causality tests and understand changes in causal relationships across different periods. A central aspect of this methodology is the 'tvgc' command in Stata. Empirical applications, such as data involving transaction fees and economic sub-systems on Ethereum, highlight the dynamic nature of economic relationships over time.

In neuroscience

A long-held belief about neural function maintained that different areas of the brain were task specific; that the structural connectivity local to a certain area somehow dictated the function of that piece. Collecting work that has been performed over many years, there has been a move to a different, network-centric approach to describing information flow in the brain. Explanation of function is beginning to include the concept of networks existing at different levels and throughout different locations in the brain. The behavior of these networks can be described by non-deterministic processes that are evolving through time. That is to say that given the same input stimulus, you will not get the same output from the network. The dynamics of these networks are governed by probabilities so we treat them as stochastic (random) processes so that we can capture these kinds of dynamics between different areas of the brain. Different methods of obtaining some measure of information flow from the firing activities of a neuron and its surrounding ensemble have been explored in the past, but they are limited in the kinds of conclusions that can be drawn and provide little insight into the directional flow of information, its effect size, and how it can change with time. Recently Granger causality has been applied to address some of these issues. Put plainly, one examines how to best predict the future of a neuron: using either the entire ensemble or the entire ensemble except a certain target neuron. If the prediction is made worse by excluding the target neuron, then we say it has a "g-causal" relationship with the current neuron.

Extensions to point process models

Previous Granger-causality methods could only operate on continuous-valued data so the analysis of neural spike train recordings involved transformations that ultimately altered the stochastic properties of the data, indirectly altering the validity of the conclusions that could be drawn from it. In 2011, however, a new general-purpose Granger-causality framework was proposed that could directly operate on any modality, including neural-spike trains. Neural spike train data can be modeled as a point-process. A temporal point process is a stochastic time-series of binary events that occurs in continuous time. It can only take on two values at each point in time, indicating whether or not an event has actually occurred. This type of binary-valued representation of information suits the activity of neural populations because a single neuron's action potential has a typical waveform. In this way, what carries the actual information being output from a neuron is the occurrence of a "spike", as well as the time between successive spikes. Using this approach one could abstract the flow of information in a neural-network to be simply the spiking times for each neuron through an observation period. A point-process can be represented either by the timing of the spikes themselves, the waiting times between spikes, using a counting process, or, if time is discretized enough to ensure that in each window only one event has the possibility of occurring, that is to say one time bin can only contain one event, as a set of 1s and 0s, very similar to binary. One of the simplest types of neural-spiking models is the

Poisson process In probability theory, statistics and related fields, a Poisson point process (also known as: Poisson random measure, Poisson random point field and Poisson point field) is a type of mathematical object that consists of Point (geometry), points ...

. This however, is limited in that it is memory-less. It does not account for any spiking history when calculating the current probability of firing. Neurons, however, exhibit a fundamental (biophysical) history dependence by way of its relative and absolute refractory periods. To address this, a conditional intensity function is used to represent the

probability Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...

of a neuron spiking, conditioned on its own history. The conditional intensity function expresses the instantaneous firing probability and implicitly defines a complete probability model for the point process. It defines a probability per unit time. So if this unit time is taken small enough to ensure that only one spike could occur in that time window, then our conditional intensity function completely specifies the probability that a given neuron will fire in a certain time.

In computing

Software packages have been developed for measuring "Granger causality" in Python and R: * Python packag

* R packag