Mixed-data Sampling
   HOME

TheInfoList



OR:

Econometric models involving data sampled at different frequencies are of general interest. Mixed-data sampling (MIDAS) is an
econometric Econometrics is an application of statistical methods to economic data in order to give empirical content to economic relationships. M. Hashem Pesaran (1987). "Econometrics", '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8ā ...
regression developed by
Eric Ghysels Eric Ghysels (born 1956 in Brussels) is a Belgian economist with interest in finance and time series econometrics, and in particular the fields of financial econometrics and financial technology. He is the Edward M. Bernstein Distinguished Profe ...
with several co-authors. There is now a substantial literature on MIDAS regressions and their applications, including Ghysels, Santa-Clara and Valkanov (2006), Ghysels, Sinko and Valkanov, Andreou, Ghysels and Kourtellos (2010) and Andreou, Ghysels and Kourtellos (2013).


MIDAS Regressions

A MIDAS regression is a direct forecasting tool which can relate future low-frequency data with current and lagged high-frequency indicators, and yield different forecasting models for each forecast horizon. It can flexibly deal with data sampled at different frequencies and provide a direct forecast of the low-frequency variable. It incorporates each individual high-frequency data in the regression, which solves the problems of losing potentially useful information and including mis-specification. A simple regression example has the
independent variable A variable is considered dependent if it depends on (or is hypothesized to depend on) an independent variable. Dependent variables are studied under the supposition or demand that they depend, by some law or rule (e.g., by a mathematical function ...
appearing at a higher frequency than the
dependent variable A variable is considered dependent if it depends on (or is hypothesized to depend on) an independent variable. Dependent variables are studied under the supposition or demand that they depend, by some law or rule (e.g., by a mathematical functio ...
: :y_t = \beta_0 + \beta_1 B(L^;\theta)x_t^ + \varepsilon_t^, where ''y'' is the dependent variable, ''x'' is the regressor, ''m'' denotes the frequency – for instance if ''y'' is yearly x_t^ is quarterly – \varepsilon is the disturbance and B(L^;\theta) is a lag distribution, for instance the
Beta function In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral : \Beta(z_1,z_2) = \int_0^1 t^ ...
or the Almon Lag. For example B(L^;\theta) = \sum_^K B(k; \theta) L^. The regression models can be viewed in some cases as substitutes for the
Kalman filter In statistics and control theory, Kalman filtering (also known as linear quadratic estimation) is an algorithm that uses a series of measurements observed over time, including statistical noise and other inaccuracies, to produce estimates of unk ...
when applied in the context of mixed frequency data. Bai, Ghysels and Wright (2013) examine the relationship between MIDAS regressions and Kalman filter state space models applied to mixed frequency data. In general, the latter involves a system of equations, whereas, in contrast, MIDAS regressions involve a (reduced form) single equation. As a consequence, MIDAS regressions might be less efficient, but also less prone to specification errors. In cases where the MIDAS regression is only an approximation, the approximation errors tend to be small.


Machine Learning MIDAS Regressions

The MIDAS can also be used for
machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
time series and panel data nowcasting. The machine learning MIDAS regressions involve
Legendre polynomials In mathematics, Legendre polynomials, named after Adrien-Marie Legendre (1782), are a system of complete and orthogonal polynomials with a wide number of mathematical properties and numerous applications. They can be defined in many ways, and t ...
. High-dimensional mixed frequency time series regressions involve certain data structures that once taken into account should improve the performance of unrestricted estimators in small samples. These structures are represented by groups covering lagged dependent variables and groups of lags for a single (high-frequency) covariate. To that end, the machine learning MIDAS approach exploits the sparse-group
LASSO A lasso or lazo ( or ), also called reata or la reata in Mexico, and in the United States riata or lariat (from Mexican Spanish lasso for roping cattle), is a loop of rope designed as a restraint to be thrown around a target and tightened when ...
(sg-LASSO) regularization that accommodates conveniently such structures. The attractive feature of the sg-LASSO estimator is that it allows us to combine effectively the approximately sparse and dense signals.


Software packages

Several software packages feature MIDAS regressions and related econometric methods. These include: * MIDAS Matlab Toolbox * midasr, R package * midasml, R package for High-Dimensional Mixed Frequency Time Series Data * EViews * Python * Julia * Stata,midasreg


Alternatives

In some situations it might be possible to alternatively use
temporal disaggregation Temporal may refer to: Entertainment * Temporal (band), an Australian metal band * ''Temporal'' (Radio Tarifa album), 1997 * ''Temporal'' (Love Spirals Downwards album), 2000 * ''Temporal'' (Isis album), 2012 * ''Temporal'' (video game), a 200 ...
methods (for
upsampling In digital signal processing, upsampling, expansion, and interpolation are terms associated with the process of sample rate conversion, resampling in a multi-rate digital signal processing system. ''Upsampling'' can be synonymous with ''expansion'' ...
time series data from e.g. monthly to daily).F. T. Denton. Adjustment of monthly or quarterly series to annual totals: An approach based on quadratic minimization. Journal of the American Statistical Association, Mar. 1971


References


See also

*
Distributed lag Distribution may refer to: Mathematics *Distribution (mathematics), generalized functions used to formulate solutions of partial differential equations *Probability distribution, the probability of a particular value or value range of a varia ...
*
ARMAX In the statistical analysis of time series, autoregressive–moving-average (ARMA) models are a way to describe a (weakly) stationary stochastic process using autoregression (AR) and a moving average (MA), each with a polynomial. They are a too ...
Econometric modeling Time series models Statistical forecasting {{econometrics-stub