HOME

TheInfoList



OR:

In statistics, a generalized linear mixed model (GLMM) is an extension to the
generalized linear model In statistics, a generalized linear model (GLM) is a flexible generalization of ordinary linear regression. The GLM generalizes linear regression by allowing the linear model to be related to the response variable via a ''link function'' and by ...
(GLM) in which the linear predictor contains
random effects In statistics, a random effects model, also called a variance components model, is a statistical model where the model parameters are random variables. It is a kind of hierarchical linear model, which assumes that the data being analysed are dr ...
in addition to the usual fixed effects. They also inherit from GLMs the idea of extending linear mixed models to non- normal data. GLMMs provide a broad range of models for the analysis of grouped data, since the differences between groups can be modelled as a random effect. These models are useful in the analysis of many kinds of data, including longitudinal data.


Model

GLMMs are generally defined such that, conditioned on the random effects u, the dependent variable y is distributed according to the exponential family with its expectation related to the linear predictor X\beta+Zu via a link function g: :g(E \vert u=X\beta+Zu. Here X and \beta are the fixed effects design matrix, and fixed effects respectively; Z and u are the random effects design matrix and random effects respectively. To understand this very brief definition you will first need to understand the definition of a
generalized linear model In statistics, a generalized linear model (GLM) is a flexible generalization of ordinary linear regression. The GLM generalizes linear regression by allowing the linear model to be related to the response variable via a ''link function'' and by ...
and of a
mixed model A mixed model, mixed-effects model or mixed error-component model is a statistical model containing both fixed effects and random effects. These models are useful in a wide variety of disciplines in the physical, biological and social sciences. ...
. Generalized linear mixed models are a special cases of
hierarchical generalized linear model In statistics, hierarchical generalized linear models extend generalized linear models by relaxing the assumption that error components are independent. This allows models to be built in situations where more than one error term is necessary and ...
s in which the random effects are normally distributed. The complete likelihood :\ln(y)=\ln\int p(y\vert u)p(u)du has no general closed form, and integrating over the random effects is usually extremely computationally intensive. In addition to numerically approximating this integral(e.g. via Gauss–Hermite quadrature), methods motivated by Laplace approximation have been proposed. For example, the penalized quasi-likelihood method, which essentially involves repeatedly fitting (i.e. doubly iterative) a weighted normal mixed model with a working variate, is implemented by various commercial and open source statistical programs.


Fitting a model

Fitting GLMMs via
maximum likelihood In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed sta ...
(as via AIC) involves integrating over the random effects. In general, those integrals cannot be expressed in analytical form. Various approximate methods have been developed, but none has good properties for all possible models and data sets (e.g. ungrouped binary data are particularly problematic). For this reason, methods involving
numerical quadrature In analysis, numerical integration comprises a broad family of algorithms for calculating the numerical value of a definite integral, and by extension, the term is also sometimes used to describe the numerical solution of differential equations ...
or
Markov chain Monte Carlo In statistics, Markov chain Monte Carlo (MCMC) methods comprise a class of algorithms for sampling from a probability distribution. By constructing a Markov chain that has the desired distribution as its equilibrium distribution, one can obtain ...
have increased in use, as increasing computing power and advances in methods have made them more practical. The
Akaike information criterion The Akaike information criterion (AIC) is an estimator of prediction error and thereby relative quality of statistical models for a given set of data. Given a collection of models for the data, AIC estimates the quality of each model, relative to ...
(AIC) is a common criterion for model selection. Estimates of AIC for GLMMs based on certain exponential family distributions have recently been obtained.


Software

* Several contributed packages in R provide GLMM functionality, including lme4 and glmm. * GLMM can be fitted using
SAS SAS or Sas may refer to: Arts, entertainment, and media * ''SAS'' (novel series), a French book series by Gérard de Villiers * ''Shimmer and Shine'', an American animated children's television series * Southern All Stars, a Japanese rock ba ...
and SPSS *
MATLAB MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementa ...
also provides a function called "fitglme" to fit GLMM models. * The Python package Statsmodels supports binomial and poisson implementation * The Julia package MixedModels.jl provides a function called GeneralizedLinearMixedModel that fits a GLMM to provided data. * DHARMa: residual diagnostics for hierarchical (multi-level/mixed) regression models (utk.edu)


See also

* Generalized estimating equation *
Hierarchical generalized linear model In statistics, hierarchical generalized linear models extend generalized linear models by relaxing the assumption that error components are independent. This allows models to be built in situations where more than one error term is necessary and ...


References

{{reflist Analysis of variance
Mixed model A mixed model, mixed-effects model or mixed error-component model is a statistical model containing both fixed effects and random effects. These models are useful in a wide variety of disciplines in the physical, biological and social sciences. ...