Hurst exponent
   HOME

TheInfoList



OR:

The Hurst exponent is used as a measure of
long-term memory Long-term memory (LTM) is the stage of the Atkinson–Shiffrin memory model in which informative knowledge is held indefinitely. It is defined in contrast to short-term and working memory, which persist for only about 18 to 30 seconds. Long-t ...
of
time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Ex ...
. It relates to the
autocorrelation Autocorrelation, sometimes known as serial correlation in the discrete time case, is the correlation of a signal with a delayed copy of itself as a function of delay. Informally, it is the similarity between observations of a random variable ...
s of the time series, and the rate at which these decrease as the lag between pairs of values increases. Studies involving the Hurst exponent were originally developed in
hydrology Hydrology () is the scientific study of the movement, distribution, and management of water on Earth and other planets, including the water cycle, water resources, and environmental watershed sustainability. A practitioner of hydrology is call ...
for the practical matter of determining optimum dam sizing for the
Nile river The Nile, , Bohairic , lg, Kiira , Nobiin: Áman Dawū is a major north-flowing river in northeastern Africa. It flows into the Mediterranean Sea. The Nile is the longest river in Africa and has historically been considered the longest riv ...
's volatile rain and drought conditions that had been observed over a long period of time. The name "Hurst exponent", or "Hurst coefficient", derives from
Harold Edwin Hurst Harold Edwin Hurst (1 January 1880 – 7 December 1978) was a British hydrologist from Leicester. Hurst's (1951) study on measuring the long-term storage capacity of reservoirs documented the presence of long-range dependence in hydrology, especi ...
(1880–1978), who was the lead researcher in these studies; the use of the standard notation ''H'' for the coefficient also relates to his name. In
fractal geometry In mathematics, a fractal is a geometric shape containing detailed structure at arbitrarily small scales, usually having a fractal dimension strictly exceeding the topological dimension. Many fractals appear similar at various scales, as illu ...
, the generalized Hurst exponent has been denoted by ''H'' or ''Hq'' in honor of both Harold Edwin Hurst and Ludwig Otto Hölder (1859–1937) by
Benoît Mandelbrot Benoit B. Mandelbrot (20 November 1924 – 14 October 2010) was a Polish-born French-American mathematician and polymath with broad interests in the practical sciences, especially regarding what he labeled as "the art of roughness" of phy ...
(1924–2010). ''H'' is directly related to
fractal dimension In mathematics, more specifically in fractal geometry, a fractal dimension is a ratio providing a statistical index of complexity comparing how detail in a pattern (strictly speaking, a fractal pattern) changes with the scale at which it is me ...
, ''D'', and is a measure of a data series' "mild" or "wild" randomness. The Hurst exponent is referred to as the "index of dependence" or "index of long-range dependence". It quantifies the relative tendency of a time series either to regress strongly to the mean or to cluster in a direction. A value ''H'' in the range 0.5–1 indicates a time series with long-term positive autocorrelation, meaning both that a high value in the series will probably be followed by another high value and that the values a long time into the future will also tend to be high. A value in the range 0 – 0.5 indicates a time series with long-term switching between high and low values in adjacent pairs, meaning that a single high value will probably be followed by a low value and that the value after that will tend to be high, with this tendency to switch between high and low values lasting a long time into the future. A value of ''H''=0.5 can indicate a completely uncorrelated series, but in fact it is the value applicable to series for which the autocorrelations at small time lags can be positive or negative but where the absolute values of the autocorrelations decay exponentially quickly to zero. This in contrast to the typically
power law In statistics, a power law is a functional relationship between two quantities, where a relative change in one quantity results in a proportional relative change in the other quantity, independent of the initial size of those quantities: one q ...
decay for the 0.5 < ''H'' < 1 and 0 < ''H'' < 0.5 cases.


Definition

The Hurst exponent, ''H'', is defined in terms of the asymptotic behaviour of the rescaled range as a function of the time span of a time series as follows; :\mathbb \left \frac \right C n^H \text n \to \infty \, , where; * R(n) is the range of the first n cumulative deviations from the mean * S(n) is the series (sum) of the first n
standard deviations In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...
* \mathbb \left \right \, is the
expected value In probability theory, the expected value (also called expectation, expectancy, mathematical expectation, mean, average, or first moment) is a generalization of the weighted average. Informally, the expected value is the arithmetic mean of a ...
* n is the time span of the observation (number of data points in a time series) * C is a constant.


Relation to Fractal Dimension

For self-similar time series, ''H'' is directly related to
fractal dimension In mathematics, more specifically in fractal geometry, a fractal dimension is a ratio providing a statistical index of complexity comparing how detail in a pattern (strictly speaking, a fractal pattern) changes with the scale at which it is me ...
, ''D'', where 1 < ''D'' < 2, such that ''D'' = 2 - ''H''. The values of the Hurst exponent vary between 0 and 1, with higher values indicating a smoother trend, less volatility, and less roughness. For more general time series or multi-dimensional process, the Hurst exponent and fractal dimension can be chosen independently, as the Hurst exponent represents structure over asymptotically longer periods, while fractal dimension represents structure over asymptotically shorter periods.


Estimating the exponent

A number of estimators of long-range dependence have been proposed in the literature. The oldest and best-known is the so-called rescaled range (R/S) analysis popularized by Mandelbrot and Wallis and based on previous hydrological findings of Hurst. Alternatives include DFA, Periodogram regression, aggregated variances, local Whittle's estimator, wavelet analysis, both in the
time domain Time domain refers to the analysis of mathematical functions, physical signals or time series of economic or environmental data, with respect to time. In the time domain, the signal or function's value is known for all real numbers, for the c ...
and
frequency domain In physics, electronics, control systems engineering, and statistics, the frequency domain refers to the analysis of mathematical functions or signals with respect to frequency, rather than time. Put simply, a time-domain graph shows how a s ...
.


Rescaled range (R/S) analysis

To estimate the Hurst exponent, one must first estimate the dependence of the rescaled range on the time span ''n'' of observation. A time series of full length ''N'' is divided into a number of shorter time series of length ''n'' = ''N'', ''N''/2, ''N''/4, ... The average rescaled range is then calculated for each value of ''n''. For a (partial) time series of length n, X=X_1,X_2,\dots, X_n \, , the rescaled range is calculated as follows: 1. Calculate the
mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value ( magnitude and sign) of a given data set. For a data set, the '' ar ...
; :m=\frac \sum_^ X_i \,. 2. Create a mean-adjusted series; :Y_t=X_-m \quad \text t=1,2, \dots ,n \,. 3. Calculate the cumulative deviate series Z; :Z_t= \sum_^ Y_ \quad \text t=1,2, \dots ,n \,. 4. Compute the range R; : R(n) =\operatorname\left (Z_1, Z_2, \dots, Z_n \right )- \operatorname\left (Z_1, Z_2, \dots, Z_n \right ). 5. Compute the
standard deviation In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, whil ...
S; :S(n)= \sqrt. 6. Calculate the rescaled range R(n)/S(n) and average over all the partial time series of length n. The Hurst exponent is estimated by fitting the
power law In statistics, a power law is a functional relationship between two quantities, where a relative change in one quantity results in a proportional relative change in the other quantity, independent of the initial size of those quantities: one q ...
\mathbb R(n)/S(n)= C n^H to the data. This can be done by plotting \log (n)/S(n)/math> as a function of \log n, and fitting a straight line; the slope of the line gives H (a more principled approach fits the power law in a maximum-likelihood fashion). Such a graph is called a box plot. However, this approach is known to produce biased estimates of the power-law exponent. For small n there is a significant deviation from the 0.5 slope. Anis and Lloyd estimated the theoretical (i.e., for white noise) values of the R/S statistic to be: \mathbb R(n)/S(n) = \begin \frac \sum\limits_^ \sqrt, & \textn\le 340 \\ \frac \sum\limits_^ \sqrt, & \textn>340 \end where \Gamma is the
Euler gamma function In mathematics, the gamma function (represented by , the capital letter gamma from the Greek alphabet) is one commonly used extension of the factorial function to complex numbers. The gamma function is defined for all complex numbers exce ...
. The Anis-Lloyd corrected R/S Hurst exponent is calculated as 0.5 plus the slope of R(n)/S(n) - \mathbb R(n)/S(n)/math>.


Confidence intervals

No asymptotic distribution theory has been derived for most of the Hurst exponent estimators so far. However, Weron used
bootstrapping In general, bootstrapping usually refers to a self-starting process that is supposed to continue or grow without external input. Etymology Tall boots may have a tab, loop or handle at the top known as a bootstrap, allowing one to use fingers ...
to obtain approximate functional forms for confidence intervals of the two most popular methods, i.e., for the Anis-Lloyd corrected R/S analysis: and for DFA: Here M=\log_2 N and N is the series length. In both cases only subseries of length n>50 were considered for estimating the Hurst exponent; subseries of smaller length lead to a high variance of the R/S estimates.


Generalized exponent

The basic Hurst exponent can be related to the expected size of changes, as a function of the lag between observations, as measured by E(, ''Xt+τ-Xt'', 2). For the generalized form of the coefficient, the exponent here is replaced by a more general term, denoted by ''q''. There are a variety of techniques that exist for estimating ''H'', however assessing the accuracy of the estimation can be a complicated issue. Mathematically, in one technique, the Hurst exponent can be estimated such that: :H_q = H(q), for a time series :g(t), t = 1, 2, ... may be defined by the scaling properties of its
structure A structure is an arrangement and organization of interrelated elements in a material object or system, or the object or system so organized. Material structures include man-made objects such as buildings and machines and natural objects such a ...
functions S_q (\tau): :S_q = \langle , g(t + \tau) - g(t), ^q \rangle_t \sim \tau^, \, where q > 0, \tau is the time lag and averaging is over the time window :t \gg \tau,\, usually the largest time scale of the system. Practically, in nature, there is no limit to time, and thus ''H'' is non-deterministic as it may only be estimated based on the observed data; e.g., the most dramatic daily move upwards ever seen in a stock market index can always be exceeded during some subsequent day. In the above mathematical estimation technique, the function ''H''(''q'') contains information about averaged generalized volatilities at scale \tau (only ''q'' = 1, 2 are used to define the volatility). In particular, the ''H''1 exponent indicates persistent (''H''1 > ½) or antipersistent (''H''1 < ½) behavior of the trend. For the BRW (
brown noise ] In science, Brownian noise, also known as Brown noise or red noise, is the type of signal noise produced by Brownian motion, hence its alternative name of random walk noise. The term "Brown noise" does not come from the color, but after R ...
, 1/f^2) one gets :H_q = \frac, and for
pink noise Pink noise or noise is a signal or process with a frequency spectrum such that the power spectral density (power per frequency interval) is inversely proportional to the frequency of the signal. In pink noise, each octave interval (halving ...
(1/f) :H_q = 0. The Hurst exponent for
white noise In signal processing, white noise is a random signal having equal intensity at different frequencies, giving it a constant power spectral density. The term is used, with this or similar meanings, in many scientific and technical disciplines ...
is dimension dependent, and for 1D and 2D it is :H^_q = \frac , \quad H^_q = -1. For the popular
Lévy stable process Levy, Lévy or Levies may refer to: People * Levy (surname), people with the surname Levy or Lévy * Levy Adcock (born 1988), American football player * Levy Barent Cohen (1747–1808), Dutch-born British financier and community worker * Levy F ...
es and
truncated Lévy process Truncation is the term used for limiting the number of digits right of the decimal point by discarding the least significant ones. Truncation may also refer to: Mathematics * Truncation (statistics) refers to measurements which have been cut ...
es with parameter α it has been found that :H_q = q/\alpha, for q < \alpha, and H_q = 1 for q \geq \alpha.
Multifractal detrended fluctuation analysis A multifractal system is a generalization of a fractal system in which a single exponent (the fractal dimension) is not enough to describe its dynamics; instead, a continuous spectrum of exponents (the so-called singularity spectrum) is needed ...
is one method to estimate H(q) from non-stationary time series. When H(q) is a non-linear function of q the time series is a multifractal system.


Note

In the above definition two separate requirements are mixed together as if they would be one. Here are the two independent requirements: (i) stationarity of the increments, x(t+T)-x(t)=x(T)-x(0) in distribution. This is the condition that yields longtime autocorrelations. (ii)
Self-similarity __NOTOC__ In mathematics, a self-similar object is exactly or approximately similar to a part of itself (i.e., the whole has the same shape as one or more of the parts). Many objects in the real world, such as coastlines, are statistically se ...
of the stochastic process then yields variance scaling, but is not needed for longtime memory. E.g., both
Markov process A Markov chain or Markov process is a stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event. Informally, this may be thought of as, "What happen ...
es (i.e., memory-free processes) and
fractional Brownian motion In probability theory, fractional Brownian motion (fBm), also called a fractal Brownian motion, is a generalization of Brownian motion. Unlike classical Brownian motion, the increments of fBm need not be independent. fBm is a continuous-time Gauss ...
scale at the level of 1-point densities (simple averages), but neither scales at the level of pair correlations or, correspondingly, the 2-point probability density. An efficient market requires a martingale condition, and unless the variance is linear in the time this produces nonstationary increments, x(t+T)-x(t)≠x(T)-x(0). Martingales are Markovian at the level of pair correlations, meaning that pair correlations cannot be used to beat a martingale market. Stationary increments with nonlinear variance, on the other hand, induce the longtime pair memory of
fractional Brownian motion In probability theory, fractional Brownian motion (fBm), also called a fractal Brownian motion, is a generalization of Brownian motion. Unlike classical Brownian motion, the increments of fBm need not be independent. fBm is a continuous-time Gauss ...
that would make the market beatable at the level of pair correlations. Such a market would necessarily be far from "efficient". An analysis of economic time series by means of the Hurst exponent using rescaled range and Detrended fluctuation analysis is conducted by econophysicist A.F. Bariviera. This paper studies the time varying character of Long-range dependency and, thus of informational efficiency. Hurst exponent has also been applied to the investigation of long-range dependency in DNA, and photonic
band gap In solid-state physics, a band gap, also called an energy gap, is an energy range in a solid where no electronic states can exist. In graphs of the electronic band structure of solids, the band gap generally refers to the energy difference ( ...
materials.


See also

* Long-range dependency *
Anomalous diffusion Anomalous diffusion is a diffusion process with a non-linear relationship between the mean squared displacement (MSD), \langle r^(\tau )\rangle , and time. This behavior is in stark contrast to Brownian motion, the typical diffusion process descri ...
* Rescaled range * Detrended fluctuation analysis


Implementations

* Matlab code for computing R/S, DFA, periodogram regression and wavelet estimates of the Hurst exponent and their corresponding confidence intervals is available from RePEc: https://ideas.repec.org/s/wuu/hscode.html * Implementation of R/S in Python: https://github.com/Mottl/hurst and of DFA and MFDFA in Python: https://github.com/LRydin/MFDFA * Matlab code for computing real Hurst and complex Hurst: https://www.mathworks.com/matlabcentral/fileexchange/49803-calculate-complex-hurst * Excel sheet can also be used to do so: https://www.researchgate.net/publication/272792633_Excel_Hurst_Calculator


References

{{DEFAULTSORT:Hurst Exponent Autocorrelation Fractals