HOME

TheInfoList



OR:

The Gaussian network model (GNM) is a representation of a biological
macromolecule A macromolecule is a "molecule of high relative molecular mass, the structure of which essentially comprises the multiple repetition of units derived, actually or conceptually, from molecules of low relative molecular mass." Polymers are physi ...
as an elastic mass-and-
spring Spring(s) may refer to: Common uses * Spring (season), a season of the year * Spring (device), a mechanical device that stores energy * Spring (hydrology), a natural source of water * Spring (mathematics), a geometric surface in the shape of a he ...
network to study, understand, and characterize the mechanical aspects of its long-time large-scale dynamics. The model has a wide range of applications from small proteins such as enzymes composed of a single
domain A domain is a geographic area controlled by a single person or organization. Domain may also refer to: Law and human geography * Demesne, in English common law and other Medieval European contexts, lands directly managed by their holder rather ...
, to large
macromolecular assemblies In molecular biology, the term macromolecular assembly (MA) refers to massive chemical structures such as viruses and non-biologic nanoparticles, cellular organelles and cell membrane, membranes and ribosomes, etc. that are complex mixtures of p ...
such as a
ribosome Ribosomes () are molecular machine, macromolecular machines, found within all cell (biology), cells, that perform Translation (biology), biological protein synthesis (messenger RNA translation). Ribosomes link amino acids together in the order s ...
or a viral
capsid A capsid is the protein shell of a virus, enclosing its genetic material. It consists of several oligomeric (repeating) structural subunits made of protein called protomers. The observable 3-dimensional morphological subunits, which may or m ...
. Protein domain dynamics plays key roles in a multitude of molecular recognition and
cell signalling In biology, cell signaling (cell signalling in British English) is the process by which a cell interacts with itself, other cells, and the environment. Cell signaling is a fundamental property of all cellular life in both prokaryotes and eukary ...
processes. Protein domains, connected by intrinsically disordered
flexible linker In molecular biology, an intrinsically disordered protein (IDP) is a protein that lacks a fixed or ordered three-dimensional structure, typically in the absence of its macromolecular interaction partners, such as other proteins or RNA. IDPs rang ...
domains, induce long-range
allostery In the fields of biochemistry and pharmacology an allosteric regulator (or allosteric modulator) is a substance that binds to a site on an enzyme or Receptor (biochemistry), receptor distinct from the active site, resulting in a conformational ...
via protein domain dynamics. The resultant dynamic modes cannot be generally predicted from static structures of either the entire protein or individual domains. The Gaussian network model is a minimalist, coarse-grained approach to study biological molecules. In the model, proteins are represented by nodes corresponding to α-carbons of the amino acid residues. Similarly, DNA and RNA structures are represented with one to three nodes for each
nucleotide Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
. The model uses the harmonic approximation to model interactions. This coarse-grained representation makes the calculations computationally inexpensive. At the molecular level, many biological phenomena, such as catalytic activity of an
enzyme An enzyme () is a protein that acts as a biological catalyst by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different mol ...
, occur within the range of nano- to millisecond timescales. All atom simulation techniques, such as
molecular dynamics Molecular dynamics (MD) is a computer simulation method for analyzing the Motion (physics), physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamics ( ...
simulations, rarely reach microsecond trajectory length, depending on the size of the system and accessible computational resources. Normal mode analysis in the context of GNM, or elastic network (EN) models in general, provides insights on the longer-scale functional dynamic behaviors of macromolecules. Here, the model captures native state functional motions of a biomolecule at the cost of atomic detail. The inference obtained from this model is complementary to atomic detail simulation techniques. Another model for protein dynamics based on elastic mass-and-spring networks is the
Anisotropic Network Model The Anisotropic Network Model (ANM) is a simple yet powerful tool made for normal mode analysis of proteins, which has been successfully applied for exploring the relation between function and dynamics for many proteins. It is essentially an Elast ...
.


Gaussian network model theory

The Gaussian network model was proposed by Bahar, Atilgan, Haliloglu and Erman in 1997. The GNM is often analyzed using normal mode analysis, which offers an analytical formulation and unique solution for each structure. The GNM normal mode analysis differs from other normal mode analyses in that it is exclusively based on inter-residue contact topology, influenced by the theory of elasticity of Flory and the
Rouse model The Rouse model is one of the simplest coarse-grained descriptions of the dynamics of polymer chains. It treats a single polymer as an Ideal chain of ''N'' point-like beads connected by harmonic springs and neglects both excluded volume and ...
and does not take the three-dimensional directionality of motions into account.


Representation of structure as an elastic network

Figure 2 shows a schematic view of elastic network studied in GNM. Metal beads represent the nodes in this Gaussian network (residues of a protein) and springs represent the connections between the nodes (covalent and non-covalent interactions between residues). For nodes i and j, equilibrium position vectors, R0i and R0j, equilibrium distance vector, R0ij, instantaneous fluctuation vectors, ΔRi and ΔRj, and instantaneous distance vector, Rij, are shown in Figure 2. Instantaneous position vectors of these nodes are defined by Ri and Rj. The difference between equilibrium position vector and instantaneous position vector of residue i gives the instantaneous fluctuation vector, ΔRi = Ri - R0i. Hence, the instantaneous fluctuation vector between nodes i and j is expressed as ΔRij = ΔRj - ΔRi = Rij - R0ij.


Potential of the Gaussian network

The potential energy of the network in terms of ΔRi is :V_ = \frac\left \sum_^ (\Delta R_j-\Delta R_i)^2 \right \frac\left \sum_^ \Delta R_i \Gamma_ \Delta R_j\right/math> where γ is a force constant uniform for all springs and Γij is the ijth element of the Kirchhoff (or connectivity) matrix of inter-residue contacts, Γ, defined by :\Gamma_ = \left\{\begin{matrix} -1, & \mbox{if } i \ne j & \mbox{and }R_{ij} \le r_c \\ 0, & \mbox{if } i \ne j & \mbox{and }R_{ij} > r_c \\ -\sum_{j,j \ne i}^{N} \Gamma_{ij}, & \mbox{if } i = j \end{matrix}\right. ''r''c is a cutoff distance for spatial interactions and taken to be 7 Å for amino acid pairs (represented by their α-carbons). Expressing the X, Y and Z components of the fluctuation vectors ΔRi as ΔXT = �X1 ΔX2 ..... ΔXN ΔYT = �Y1 ΔY2 ..... ΔYN and ΔZT = �Z1 ΔZ2 ..... ΔZN above equation simplifies to :V_{GNM} = \frac{\gamma}{2} Delta X^T\Gamma \Delta X + \Delta Y^T\Gamma \Delta Y + \Delta Z^T\Gamma \Delta Z/math>


Statistical mechanics foundations

In the GNM, the probability distribution of all fluctuations, ''P''(ΔR) is ''isotropic'' :P(\Delta R)=P(\Delta X,\Delta Y,\Delta Z)=p(\Delta X)p(\Delta Y)p(\Delta Z) and ''Gaussian'' :p(\Delta X)\propto \exp\left\{ -\frac{\gamma}{2 k_B T} \Delta X^T\Gamma \Delta X \right\}=\exp\left\{ -\frac{1}{2} \left(\Delta X^T\left( \frac{k_B T}{\gamma} \Gamma^{-1} \right)^{-1} \Delta X \right) \right\} where ''k''''B'' is the Boltzmann constant and ''T'' is the absolute temperature. ''p''(ΔY) and ''p''(ΔZ) are expressed similarly. N-dimensional Gaussian probability density function with random variable vector x, mean vector μ and covariance matrix Σ is :W(x,\mu ,\Sigma ) = \frac{1}{\sqrt{(2\pi)^N , \Sigma} \exp\left\{ -\frac{1}{2} (x - \mu)^T \Sigma^{-1} (x - \mu) \right\} \sqrt{(2\pi)^N , \Sigma normalizes the distribution and , Σ, is the determinant of the covariance matrix. Similar to Gaussian distribution, normalized distribution for ΔXT = �X1 ΔX2 ..... ΔXNaround the equilibrium positions can be expressed as :p(\Delta X ) = \frac{1}{\sqrt{(2\pi)^N \frac{k_B T}{\gamma} , \Gamma^{-1 \exp\left\{ -\frac{1}{2} \left(\Delta X^T\left( \frac{k_B T}{\gamma} \Gamma^{-1} \right)^{-1} \Delta X \right) \right\} The normalization constant, also the partition function ''Z''X, is given by :Z_X = \int_0^\infty \exp\left\{ -\frac{1}{2} \left(\Delta X^T\left( \frac{k_B T}{\gamma} \Gamma^{-1} \right)^{-1} \Delta X \right) \right\}d\Delta X where \frac{k_B T}{\gamma} \Gamma^{-1} is the covariance matrix in this case. ''Z''Y and ''Z''Z are expressed similarly. This formulation requires inversion of the Kirchhoff matrix. In the GNM, the determinant of the Kirchhoff matrix is zero, hence calculation of its inverse requires
eigenvalue decomposition In linear algebra, eigendecomposition is the factorization of a matrix into a canonical form, whereby the matrix is represented in terms of its eigenvalues and eigenvectors. Only diagonalizable matrices can be factorized in this way. When the mat ...
. Γ−1 is constructed using the N-1 non-zero eigenvalues and associated eigenvectors. Expressions for ''p''(ΔY) and ''p''(ΔZ) are similar to that of ''p''(ΔX). The probability distribution of all fluctuations in GNM becomes :P(\Delta R) = p(\Delta X) p(\Delta Y) p(\Delta Z)=\frac{1} \exp\left\{ -\frac{3}{2} \left(\Delta X^T\left( \frac{k_B T}{\gamma} \Gamma^{-1} \right)^{-1} \Delta X \right) \right\} For this mass and spring system, the normalization constant in the preceding expression is the overall GNM partition function, ''Z''GNM, :Z_{GNM} = {Z_X}{Z_Y}{Z_Z} = {(2\pi)^{3N/2} \Biggl, {\frac{k_{B}T}{\gamma}{\Gamma^{-1}\Biggr^{3/2}


Expectation values of fluctuations and correlations

The expectation values of residue fluctuations, <ΔRi2> (also called mean-square fluctuations, MSFs), and their cross-correlations, <ΔRi · ΔRj> can be organized as the diagonal and off-diagonal terms, respectively, of a covariance matrix. Based on statistical mechanics, the covariance matrix for ΔX is given by :<\Delta X \cdot \Delta X^T > = \int \Delta X \cdot \Delta X^T p(\Delta X)d\Delta X=\frac{k_B T}{\gamma}\Gamma^{-1} The last equality is obtained by inserting the above p(ΔX) and taking the (generalized Gaussian) integral. Since, :<\Delta X \cdot \Delta X^T > = <\Delta Y \cdot \Delta Y^T > = <\Delta Z \cdot \Delta Z^T > =\frac{1}{3} <\Delta R \cdot \Delta R^T > <ΔRi2> and <ΔRi · ΔRj> follows :<\Delta R_i^2 > = \frac{3 k_B T}{\gamma}(\Gamma^{-1})_{ii} :<\Delta R_i \cdot \Delta R_j > = \frac{3 k_B T}{\gamma}(\Gamma^{-1})_{ij}


Mode decomposition

The GNM normal modes are found by diagonalization of the Kirchhoff matrix, Γ = UΛU''T''. Here, U is a unitary matrix, U''T'' = U−1, of the eigenvectors ui of Γ and Λ is the diagonal matrix of eigenvalues λi. The frequency and shape of a mode is represented by its eigenvalue and eigenvector, respectively. Since the Kirchhoff matrix is positive semi-definite, the first eigenvalue, λ1, is zero and the corresponding eigenvector have all its elements equal to 1/. This shows that the network model translationally invariant. Cross-correlations between residue fluctuations can be written as a sum over the N-1 nonzero modes as :<\Delta R_i \cdot \Delta R_j> = \frac{3 k_B T}{\gamma} \Lambda^{-1}U^T{ij}=\frac{3 k_B T}{\gamma}\sum_{k=1}^{N-1}\lambda_k^{-1}
_k u_k^T K, or k, is the eleventh letter of the Latin alphabet, used in the modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is ''kay'' (pronounced ), plural ''kays''. The letter ...
{ij} It follows that, ''ΔRi · ΔRj the contribution of an individual mode is expressed as : Delta R_i \cdot \Delta R_jk = \frac{3 k_B T}{\gamma}\lambda_k^{-1} _ki _kj where ''uksub>i is the ith element of uk.


Influence of local packing density

By definition, a diagonal element of the Kirchhoff matrix, Γii, is equal to the degree of a node in GNM that represents the corresponding residue's coordination number. This number is a measure of the local packing density around a given residue. The influence of local packing density can be assessed by series expansion of Γ−1 matrix. Γ can be written as a sum of two matrices, Γ = D + O, containing diagonal elements and off-diagonal elements of Γ. :Γ−1 = (D + O)−1 = D (I + D−1O) sup>−1 = (I + D−1O)−1D−1 = (I - D−1O + ...)D−1 = D−1 - D−1O D−1 + ... This expression shows that local packing density makes a significant contribution to expected fluctuations of residues. The terms that follow inverse of the diagonal matrix, are contributions of positional correlations to expected fluctuations.


GNM applications


Equilibrium fluctuations

Equilibrium fluctuations of biological molecules can be experimentally measured. In
X-ray crystallography X-ray crystallography is the experimental science of determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to Diffraction, diffract in specific directions. By measuring th ...
the B-factor (also called Debye-Waller or temperature factor) of each atom is a measure of its mean-square fluctuation near its equilibrium position in the native structure. In NMR experiments, this measure can be obtained by calculating root-mean-square differences between different models. In many applications and publications, including the original articles, it has been shown that expected residue fluctuations obtained by the GNM are in good agreement with the experimentally measured native state fluctuations. The relation between B-factors, for example, and expected residue fluctuations obtained from GNM is as follows :B_i = \frac{8\pi^2}{3}< \Delta R_{i} \cdot \Delta R_{i} > = \frac{8\pi^2 k_B T}{\gamma}(\Gamma^{-1})_{ii} Figure 3 shows an example of GNM calculation for the catalytic domain of the protein Cdc25B, a
cell division cycle The cell cycle, or cell-division cycle, is the sequential series of events that take place in a cell that causes it to divide into two daughter cells. These events include the growth of the cell, duplication of its DNA (DNA replication) and ...
dual-specificity phosphatase.


Physical meanings of slow and fast modes

Diagonalization of the Kirchhoff matrix decomposes the conformational motions into a spectrum of collective modes. The expected values of fluctuations and cross-correlations are obtained from linear combinations of fluctuations along these normal modes. The contribution of each mode is scaled with the inverse of that modes frequency. Hence, slow (low frequency) modes contribute most to the expected fluctuations. Along the few slowest modes, motions are shown to be collective and global and potentially relevant to functionality of the biomolecules. Fast (high frequency) modes, on the other hand, describe uncorrelated motions not inducing notable changes in the structure. GNM-based methods do not provide real dynamics but only an approximation based on the combination and interpolation of normal modes. Their applicability strongly depends on how collective the motion is.


Other specific applications

There are several major areas in which the Gaussian network model and other elastic network models have proved to be useful. These include: * Spring bead based network model: In spring-bead based network model, the springs and beads are used as components in the crosslinked network. Springs are cross-linked to represent mechanical behavior of the material and bridge molecular dynamics (MD) model and finite element (FE) model (see Figure. 5). The beads represent material mass of cluster bonds. Each spring is used to represent a cluster of polymer chains, instead of part of a single polymer chain. This simplification allows to bridge different models at multiple length scales and improves the simulation efficiency significantly. At each iteration step in the simulation, forces in the springs are applied to the nodes at the center of the beads, and the equilibrated nodal displacements throughout the system are calculated. Different from the traditional FE method for obtaining stress and strain, the spring–bead model provides the displacements of the nodes and forces in the springs. The equivalent strain and strain energy of spring–bead based network model can be defined and calculated using the displacements of nodes and the spring characteristics. Furthermore, the results from the network model can be scaled up to obtain the structural response at the macroscale using FE analysis. * Decomposition of flexible/rigid regions and domains of proteins * Characterization of functional motions and functionally important sites/residues of proteins, enzymes and large macromolecular assemblies * Refinement and dynamics of low-resolution structural data, e.g.
Cryo-electron microscopy Cryogenic electron microscopy (cryo-EM) is a transmission electron microscopy technique applied to samples cooled to cryogenic temperatures. For biological specimens, the structure is preserved by embedding in an environment of vitreous ice. An ...
*
Molecular replacement Molecular replacement (MR) is a method of solving the phase problem in X-ray crystallography. MR relies upon the existence of a previously solved protein structure which is similar to our unknown structure from which the diffraction data is deriv ...
for solving
X-ray structure X-ray crystallography is the experimental science of determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract in specific directions. By measuring the angles and ...
s, when a
conformational change In biochemistry, a conformational change is a change in the shape of a macromolecule, often induced by environmental factors. A macromolecule is usually flexible and dynamic. Its shape can change in response to changes in its environment or othe ...
occurred, with respect to a known structure * Integration with atomistic models and simulations * Investigation of folding/unfolding pathways and kinetics. * Annotation of functional implication in molecular evolution


Web servers

In practice, two kinds of calculations can be performed. The first kind (the GNM per se) makes use of the
Kirchhoff matrix In the mathematical field of graph theory, the Laplacian matrix, also called the graph Laplacian, admittance matrix, Kirchhoff matrix, or discrete Laplacian, is a matrix representation of a graph. Named after Pierre-Simon Laplace, the graph Lap ...
. The second kind (more specifically called either the Elastic Network Model or the Anisotropic Network Model) makes use of the
Hessian matrix In mathematics, the Hessian matrix, Hessian or (less commonly) Hesse matrix is a square matrix of second-order partial derivatives of a scalar-valued Function (mathematics), function, or scalar field. It describes the local curvature of a functio ...
associated to the corresponding set of harmonic springs. Both kinds of models can be used online, using the following servers.


GNM servers

* iGNM: A database of protein functional motions based on GNM http://ignm.ccbb.pitt.edu * oGNM: Online calculation of structural dynamics using GNM https://web.archive.org/web/20070516042756/http://ignm.ccbb.pitt.edu/GNM_Online_Calculation.htm


ENM/ANM servers

*
Anisotropic Network Model The Anisotropic Network Model (ANM) is a simple yet powerful tool made for normal mode analysis of proteins, which has been successfully applied for exploring the relation between function and dynamics for many proteins. It is essentially an Elast ...
web server http://www.ccbb.pitt.edu/anm * elNemo: Web-interface to The Elastic Network Model http://www.sciences.univ-nantes.fr/elnemo/ * AD-ENM: Analysis of Dynamics of an Elastic Network Model http://enm.lobos.nih.gov/ * WEBnm@: Web-server for Normal Mode Analysis of proteins http://apps.cbu.uib.no/webnma/home


Other relevant servers

* ProDy: An Application Programming Interface (API) in Python, that integrates GNM and ANM analyses and several molecular structure and sequence analyses and visualization tools: http://prody.csb.pitt.edu * HingeProt: An algorithm for protein hinge prediction using elastic network models http://www.prc.boun.edu.tr/appserv/prc/hingeprot/, or http://bioinfo3d.cs.tau.ac.il/HingeProt/hingeprot.html * DNABindProt: A Server for Determination of Potential DNA Binding Sites of Proteins http://www.prc.boun.edu.tr/appserv/prc/dnabindprot/ * MolMovDB: A database of macromolecular motions: http://www.molmovdb.org/


See also

*
Gaussian distribution In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real number, real-valued random variable. The general form of its probability density function is f(x ...
*
Harmonic oscillator In classical mechanics, a harmonic oscillator is a system that, when displaced from its equilibrium position, experiences a restoring force ''F'' proportional to the displacement ''x'': \vec F = -k \vec x, where ''k'' is a positive const ...
*
Hooke's law In physics, Hooke's law is an empirical law which states that the force () needed to extend or compress a spring by some distance () scales linearly with respect to that distance—that is, where is a constant factor characteristic of ...
*
Molecular dynamics Molecular dynamics (MD) is a computer simulation method for analyzing the Motion (physics), physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamics ( ...
*
Normal mode A normal mode of a dynamical system is a pattern of motion in which all parts of the system move sinusoidally with the same frequency and with a fixed phase relation. The free motion described by the normal modes takes place at fixed frequencies ...
*
Principal component analysis Principal component analysis (PCA) is a linear dimensionality reduction technique with applications in exploratory data analysis, visualization and data preprocessing. The data is linearly transformed onto a new coordinate system such that th ...
*
Protein dynamics In molecular biology, proteins are generally thought to adopt unique structures determined by their amino acid sequences. However, proteins are not strictly static objects, but rather populate ensembles of (sometimes similar) conformations. Tran ...
*
Rubber elasticity Rubber elasticity is the ability of solid rubber to be stretched up to a factor of 10 from its original length, and return to close to its original length upon release. This process can be repeated many times with no apparent Material failure th ...
*
Statistical mechanics In physics, statistical mechanics is a mathematical framework that applies statistical methods and probability theory to large assemblies of microscopic entities. Sometimes called statistical physics or statistical thermodynamics, its applicati ...


References


Primary sources

* * * Cui Q, Bahar I, (2006). Normal Mode Analysis: Theory and applications to biological and chemical systems, Chapman & Hall/CRC, London, UK


Specific citations

{{DEFAULTSORT:Gaussian Network Model Molecular modelling