HOME

TheInfoList



OR:

In
econometrics Econometrics is an application of statistical methods to economic data in order to give empirical content to economic relationships. M. Hashem Pesaran (1987). "Econometrics", '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8 ...
, a random effects model, also called a variance components model, is a
statistical model A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repre ...
where the model parameters are
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s. It is a kind of hierarchical linear model, which assumes that the data being analysed are drawn from a hierarchy of different populations whose differences relate to that hierarchy. A random effects model is a special case of a
mixed model A mixed model, mixed-effects model or mixed error-component model is a statistical model containing both fixed effects and random effects. These models are useful in a wide variety of disciplines in the physical, biological and social sciences. ...
. Contrast this to the
biostatistics Biostatistics (also known as biometry) is a branch of statistics that applies statistical methods to a wide range of topics in biology. It encompasses the design of biological experiments, the collection and analysis of data from those experimen ...
definitions, as biostatisticians use "fixed" and "random" effects to respectively refer to the population-average and subject-specific effects (and where the latter are generally assumed to be unknown, latent variables).


Qualitative description

Random effect models assist in controlling for
unobserved heterogeneity In economic theory and econometrics, the term heterogeneity refers to differences across the units being studied. For example, a macroeconomic model in which consumers are assumed to differ from one another is said to have heterogeneous agents. U ...
when the heterogeneity is constant over time and not correlated with independent variables. This constant can be removed from longitudinal data through differencing, since taking a first difference will remove any time invariant components of the model. Two common assumptions can be made about the individual specific effect: the random effects assumption and the fixed effects assumption. The random effects assumption is that the individual unobserved heterogeneity is uncorrelated with the independent variables. The fixed effect assumption is that the individual specific effect is correlated with the independent variables. If the random effects assumption holds, the random effects estimator is more efficient than the fixed effects model.


Simple example

Suppose m large elementary schools are chosen randomly from among thousands in a large country. Suppose also that n pupils of the same age are chosen randomly at each selected school. Their scores on a standard aptitude test are ascertained. Let Y_ be the score of the j-th pupil at the i-th school. A simple way to model this variable is : Y_ = \mu + U_i + W_,\, where \mu is the average test score for the entire population. In this model U_i is the school-specific random effect: it measures the difference between the average score at school i and the average score in the entire country. The term W_ is the individual-specific random effect, i.e., it's the deviation of the j-th pupil's score from the average for the i-th school. The model can be augmented by including additional explanatory variables, which would capture differences in scores among different groups. For example: : Y_ = \mu + \beta_1 \mathrm_ + \beta_2 \mathrm_ + U_i + W_,\, where \mathrm_ is a binary dummy variable and \mathrm_records, say, the average education level of a child's parents. This is a
mixed model A mixed model, mixed-effects model or mixed error-component model is a statistical model containing both fixed effects and random effects. These models are useful in a wide variety of disciplines in the physical, biological and social sciences. ...
, not a purely random effects model, as it introduces fixed-effects terms for Sex and Parents' Education.


Variance components

The variance of Y_ is the sum of the variances \tau^2 and \sigma^2 of U_i and W_ respectively. Let : \overline_ = \frac\sum_^n Y_ be the average, not of all scores at the i-th school, but of those at the i-th school that are included in the
random sample In this statistics, quality assurance, and survey methodology, sampling is the selection of a subset or a statistical sample (termed sample for short) of individuals from within a statistical population to estimate characteristics of the whole ...
. Let :\overline_ = \frac\sum_^m\sum_^n Y_ be the grand average. Let :SSW = \sum_^m\sum_^n (Y_ - \overline_)^2 \, :SSB = n\sum_^m (\overline_ - \overline_)^2 \, be respectively the sum of squares due to differences ''within'' groups and the sum of squares due to difference ''between'' groups. Then it can be shown that : \fracE(SSW) = \sigma^2 and : \fracE(SSB) = \frac + \tau^2. These "
expected mean square In statistics, expected mean squares (EMS) are the expected values of certain statistics arising in partitions of sums of squares in the analysis of variance (ANOVA). They can be used for ascertaining which statistic should appear in the denominator ...
s" can be used as the basis for
estimation Estimation (or estimating) is the process of finding an estimate or approximation, which is a value that is usable for some purpose even if input data may be incomplete, uncertain, or unstable. The value is nonetheless usable because it is d ...
of the "variance components" \sigma^2 and ''\tau^2. The \sigma^2 parameter is also called the
intraclass correlation coefficient In statistics, the intraclass correlation, or the intraclass correlation coefficient (ICC), is a descriptive statistic that can be used when quantitative measurements are made on units that are organized into groups. It describes how strongly uni ...
.


Marginal likelihood

For random effects models the
marginal likelihood A marginal likelihood is a likelihood function that has been integrated over the parameter space. In Bayesian statistics, it represents the probability of generating the observed sample for all possible values of the parameters; it can be under ...
s are important.Hedeker, D., Gibbons, R. D. (2006). Longitudinal Data Analysis. Deutschland: Wiley. Page 163 https://books.google.com/books?id=f9p9iIgzQSQC&pg=PA163


Applications

Random effects models used in practice include the
Bühlmann model In credibility theory, a branch of study in actuarial science, the Bühlmann model is a random effects model (or "variance components model" or hierarchical linear model) used to determine the appropriate premium for a group of insurance cont ...
of insurance contracts and the Fay-Herriot model used for
small area estimation Small area estimation is any of several statistical techniques involving the estimation of parameters for small sub-populations, generally used when the sub-population of interest is included in a larger survey. The term "small area" in this con ...
.


See also

*
Bühlmann model In credibility theory, a branch of study in actuarial science, the Bühlmann model is a random effects model (or "variance components model" or hierarchical linear model) used to determine the appropriate premium for a group of insurance cont ...
*
Hierarchical linear modeling Multilevel models are statistical models of parameters that vary at more than one level. An example could be a model of student performance that contains measures for individual students as well as measures for classrooms within which the studen ...
*
Fixed effects In statistics, a fixed effects model is a statistical model in which the model parameters are fixed or non-random quantities. This is in contrast to random effects models and mixed models in which all or some of the model parameters are random v ...
*
MINQUE In statistics, the theory of minimum norm quadratic unbiased estimation (MINQUE) was developed by C. R. Rao. MINQUE is a theory alongside other estimation methods in estimation theory, such as the method of moments or maximum likelihood estimatio ...
*
Covariance estimation In statistics, sometimes the covariance matrix of a multivariate random variable is not known but has to be estimated. Estimation of covariance matrices then deals with the question of how to approximate the actual covariance matrix on the basis ...
*
Conditional variance In probability theory and statistics, a conditional variance is the variance of a random variable given the value(s) of one or more other variables. Particularly in econometrics, the conditional variance is also known as the scedastic function or s ...
*
Panel analysis Panel (data) analysis is a statistical method, widely used in social science, epidemiology, and econometrics to analyze two-dimensional (typically cross sectional and longitudinal) panel data. The data are usually collected over time and over the s ...


Further reading

* * * *


References


External links


Fixed and random effects models
{{DEFAULTSORT:Random Effects Model Regression models Analysis of variance