Lattice proteins are highly simplified models of protein-like heteropolymer chains on lattice conformational space which are used to investigate
protein folding
Protein folding is the physical process by which a protein chain is translated to its native three-dimensional structure, typically a "folded" conformation by which the protein becomes biologically functional. Via an expeditious and reprodu ...
.
Simplification in lattice proteins is twofold: each whole residue (
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha ...
) is modeled as a single "bead" or "point" of a finite set of types (usually only two), and each residue is restricted to be placed on vertices of a (usually cubic)
lattice
Lattice may refer to:
Arts and design
* Latticework, an ornamental criss-crossed framework, an arrangement of crossing laths or other thin strips of material
* Lattice (music), an organized grid model of pitch ratios
* Lattice (pastry), an ornam ...
.
To guarantee the connectivity of the protein chain, adjacent residues on the backbone must be placed on adjacent vertices of the lattice.
Steric constraints are expressed by imposing that no more than one residue can be placed on the same lattice vertex.
Because proteins are such
large molecules, there are severe computational limits on the simulated timescales of their behaviour when modeled in all-atom detail. The
millisecond
A millisecond (from '' milli-'' and second; symbol: ms) is a unit of time in the International System of Units (SI) equal to one thousandth (0.001 or 10−3 or 1/1000) of a second and to 1000 microseconds.
A unit of 10 milliseconds may be ca ...
regime for all-atom simulations was not reached until 2010, and it is still not possible to fold all real proteins on a computer. Simplification significantly reduces the computational effort in handling the model, although even in this simplified scenario the protein folding problem is
NP-complete
In computational complexity theory, a problem is NP-complete when:
# it is a problem for which the correctness of each solution can be verified quickly (namely, in polynomial time) and a brute-force search algorithm can find a solution by tryin ...
.
Overview
Different versions of lattice proteins may adopt different types of lattice (typically square and triangular ones), in two or three dimensions, but it has been shown that generic lattices can be used and handled via a uniform approach.
Lattice proteins are made to resemble real proteins by introducing an ''
energy function
Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criterion, from some set of available alternatives. It is generally divided into two subfi ...
'', a set of conditions which specify the
interaction energy
In physics, interaction energy is the contribution to the total energy that is caused by an interaction between the objects being considered.
The interaction energy usually depends on the relative position of the objects. For example, Q_1 Q_2 / ( ...
between beads occupying adjacent lattice sites.
The energy function mimics the interactions between amino acids in real proteins, which include
steric
Steric effects arise from the spatial arrangement of atoms. When atoms come close together there is a rise in the energy of the molecule. Steric effects are nonbonding interactions that influence the shape ( conformation) and reactivity of ions ...
,
hydrophobic
In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water.
Hydrophobic molecules tend to be nonpolar and, ...
and
hydrogen bonding
In chemistry, a hydrogen bond (or H-bond) is a primarily electrostatic force of attraction between a hydrogen (H) atom which is covalently bound to a more electronegative "donor" atom or group (Dn), and another electronegative atom bearing ...
effects.
The beads are divided into types, and the energy function specifies the interactions depending on the bead type, just as different types of amino acids interact differently.
One of the most popular lattice models, the hydrophobic-polar model (
HP model), features just two bead types—
hydrophobic
In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water.
Hydrophobic molecules tend to be nonpolar and, ...
(H) and
polar
Polar may refer to:
Geography
Polar may refer to:
* Geographical pole, either of two fixed points on the surface of a rotating body or planet, at 90 degrees from the equator, based on the axis around which a body rotates
*Polar climate, the cli ...
(P)—and mimics the
hydrophobic effect
The hydrophobic effect is the observed tendency of nonpolar substances to aggregate in an aqueous solution and exclude water molecules. The word hydrophobic literally means "water-fearing", and it describes the segregation of water and nonpola ...
by specifying a favorable interaction between H beads.
For any sequence in any particular structure, an energy can be rapidly calculated from the energy function. For the simple HP model, this is an enumeration of all the contacts between H residues that are adjacent in the structure but not in the chain. Most researchers consider a lattice protein sequence ''protein-like'' only if it possesses a single structure with an energetic state lower than in any other structure, although there are exceptions that consider ensembles of possible folded states. This is the energetic ground state, or
native state
In biochemistry, the native state of a protein or nucleic acid is its properly folded and/or assembled form, which is operative and functional. The native state of a biomolecule may possess all four levels of biomolecular structure, with the s ...
. The relative positions of the beads in the native state constitute the lattice protein's
tertiary structure
Protein tertiary structure is the three dimensional shape of a protein. The tertiary structure will have a single polypeptide chain "backbone" with one or more protein secondary structures, the protein domains. Amino acid side chains may int ...
. Lattice proteins do not have genuine
secondary structure
Protein secondary structure is the three dimensional form of ''local segments'' of proteins. The two most common secondary structural elements are alpha helices and beta sheets, though beta turns and omega loops occur as well. Secondary struct ...
; however, some researchers have claimed that they can be extrapolated onto real protein structures which do include secondary structure, by appealing to the same law by which the
phase diagrams of different substances can be scaled onto one another (the
theorem of corresponding states
According to van der Waals, the theorem of corresponding states (or principle/law of corresponding states) indicates that all fluids, when compared at the same reduced temperature and reduced pressure, have approximately the same compressibility fa ...
).
By varying the energy function and the bead sequence of the chain (the
primary structure
Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthes ...
), effects on the native state structure and the
kinetics
Kinetics ( grc, κίνησις, , kinesis, ''movement'' or ''to move'') may refer to:
Science and medicine
* Kinetics (physics), the study of motion and its causes
** Rigid body kinetics, the study of the motion of rigid bodies
* Chemical k ...
of folding can be explored, and this may provide insights into the folding of real proteins. Some of the examples include study of folding processes in lattice proteins that have been discussed to resemble the two-phase folding kinetics in proteins. Lattice protein was shown to have quickly collapsed into compact state and followed by slow subsequent structure rearrangement into native state. Attempts to resolve
Levinthal paradox
Levinthal's paradox is a thought experiment, also constituting a self-reference in the theory of protein folding. In 1969, Cyrus Levinthal noted that, because of the very large number of degrees of freedom in an unfolded polypeptide chain, the mol ...
in protein folding are another efforts made in the field. As an example, study conducted by Fiebig and Dill examined searching method involving constraints in forming residue contacts in lattice protein to provide insights to the question of how a protein finds its native structure without global exhaustive searching. Lattice protein models have also been used to investigate the
energy landscape
An energy landscape is a mapping of possible states of a system. The concept is frequently used in physics, chemistry, and biochemistry, e.g. to describe all possible conformations of a molecular entity, or the spatial positions of interacting mo ...
s of proteins, i.e. the variation of their internal
free energy as a function of conformation.
Lattices
A
lattice
Lattice may refer to:
Arts and design
* Latticework, an ornamental criss-crossed framework, an arrangement of crossing laths or other thin strips of material
* Lattice (music), an organized grid model of pitch ratios
* Lattice (pastry), an ornam ...
is a set of orderly points that are connected by "edges".
These points are called vertices and are connected to a certain number other vertices in the lattice by edges. The number of vertices each individual vertex is connected to is called the
coordination number
In chemistry, crystallography, and materials science, the coordination number, also called ligancy, of a central atom in a molecule or crystal is the number of atoms, molecules or ions bonded to it. The ion/molecule/atom surrounding the central i ...
of the lattice, and it can be scaled up or down by changing the shape or
dimension
In physics and mathematics, the dimension of a mathematical space (or object) is informally defined as the minimum number of coordinates needed to specify any point within it. Thus, a line has a dimension of one (1D) because only one coor ...
(2-dimensional to 3-dimensional, for example) of the lattice.
This number is important in shaping the characteristics of the lattice protein because it controls the number of other
residues
Residue may refer to:
Chemistry and biology
* An amino acid, within a peptide chain
* Crop residue, materials left after agricultural processes
* Pesticide residue, refers to the pesticides that may remain on or in food after they are appli ...
allowed to be adjacent to a given residue.
It has been shown that for most proteins the coordination number of the lattice used should fall between 3 and 20, although most commonly used lattices have coordination numbers at the lower end of this range.
Lattice shape is an important factor in the accuracy of lattice protein models. Changing lattice shape can dramatically alter the shape of the energetically favorable conformations.
It can also add unrealistic constraints to the protein structure such as in the case of the
parity
Parity may refer to:
* Parity (computing)
** Parity bit in computing, sets the parity of data for the purpose of error detection
** Parity flag in computing, indicates if the number of set bits is odd or even in the binary representation of the r ...
problem where in square and cubic lattices residues of the same parity (odd or even numbered) cannot make hydrophobic contact.
It has also been reported that triangular lattices yield more accurate structures than other lattice shapes when compared to
crystallographic
Crystallography is the experimental science of determining the arrangement of atoms in crystalline solids. Crystallography is a fundamental subject in the fields of materials science and solid-state physics (condensed matter physics). The wo ...
data.
To combat the parity problem, several researchers have suggested using triangular lattices when possible, as well as a square matrix with diagonals for theoretical applications where the square matrix may be more appropriate.
Hexagonal lattices were introduced to alleviate sharp turns of adjacent residues in triangular lattices. Hexagonal lattices with diagonals have also been suggested as a way to combat the parity problem.
Hydrophobic-polar model

]
]
The
Hydrophobic-polar protein folding model, hydrophobic-polar protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respon ...
model is the original lattice protein model. It was first proposed by Dill et al. in 1985 as a way to overcome the significant cost and difficulty of predicting protein structure, using only the
hydrophobicity
In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water.
Hydrophobic molecules tend to be nonpolar and, t ...
of the
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha ...
s in the protein to predict the protein structure.
It is considered to be the paradigmatic lattice protein model.
The method was able to quickly give an estimate of protein structure by representing proteins as "short chains on a 2D square lattice" and has since become known as the hydrophobic-polar model. It breaks the protein folding problem into three separate problems: modeling the protein conformation, defining the energetic properties of the amino acids as they interact with one another to find said conformation, and developing an efficient algorithm for the prediction of these conformations. It is done by classifying amino acids in the protein as either hydrophobic or polar and assuming that the protein is being
folded in an
aqueous
An aqueous solution is a solution in which the solvent is water. It is mostly shown in chemical equations by appending (aq) to the relevant chemical formula. For example, a solution of table salt, or sodium chloride (NaCl), in water would ...
environment. The lattice statistical model seeks to recreate protein folding by minimizing the
free energy of the contacts between hydrophobic amino acids. Hydrophobic amino acid residues are predicted to group around each other, while hydrophilic residues interact with the surrounding water.
Different lattice types and algorithms were used to study protein folding with HP model. Efforts were made to obtain higher approximation ratios using
approximation algorithm
In computer science and operations research, approximation algorithms are efficient algorithms that find approximate solutions to optimization problems (in particular NP-hard problems) with provable guarantees on the distance of the returned solu ...
s in 2 dimensional and 3 dimensional, square and triangular lattices. Alternative to approximation algorithms, some
genetic algorithm
In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to gen ...
s were also exploited with square, triangular, and face-centered-cubic lattices.
Problems and alternative models
The simplicity of the hydrophobic-polar model has caused it to have several problems that people have attempted to correct with alternative lattice protein models.
Chief among these problems is the issue of
degeneracy
Degeneracy, degenerate, or degeneration may refer to:
Arts and entertainment
* ''Degenerate'' (album), a 2010 album by the British band Trigger the Bloodshed
* Degenerate art, a term adopted in the 1920s by the Nazi Party in Germany to descri ...
, which is when there is more than one minimum energy
conformation for the modeled protein, leading to uncertainty about which conformation is the native one. Attempts to address this include the HPNX model which classifies amino acids as hydrophobic (H),
positive
Positive is a property of positivity and may refer to:
Mathematics and science
* Positive formula, a logical formula not containing negation
* Positive number, a number that is greater than 0
* Plus sign, the sign "+" used to indicate a posi ...
(P), negative (N), or neutral (X) according to the
charge
Charge or charged may refer to:
Arts, entertainment, and media Films
* ''Charge, Zero Emissions/Maximum Speed'', a 2011 documentary
Music
* ''Charge'' (David Ford album)
* ''Charge'' (Machel Montano album)
* ''Charge!!'', an album by The Aqua ...
of the amino acid, adding additional parameters to reduce the number of
low energy
Lethargy is a state of tiredness, sleepiness, weariness, fatigue, sluggishness or lack of energy. It can be accompanied by depression, decreased motivation, or apathy. Lethargy can be a normal response to inadequate sleep, overexertion, overwo ...
conformations and allowing for more realistic protein simulations.
Another model is the Crippen model which uses protein characteristics taken from
crystal structures
A crystal or crystalline solid is a solid material whose constituents (such as atoms, molecules, or ions) are arranged in a highly ordered microscopic structure, forming a crystal lattice that extends in all directions. In addition, macros ...
to inform the choice of native conformation.
Another issue with lattice models is that they generally don't take into account the space taken up by amino acid
side chain
In organic chemistry and biochemistry, a side chain is a chemical group that is attached to a core part of the molecule called the "main chain" or backbone. The side chain is a hydrocarbon branching element of a molecule that is attached to a ...
s, instead considering only the
α-carbon
In the nomenclature of organic chemistry, a locant is a term to indicate the position of a functional group or substituent within a molecule.
Numeric locants
The International Union of Pure and Applied Chemistry (IUPAC) recommends the use of n ...
.
The side chain model addresses this by adding a side chain to the vertex adjacent to the α-carbon.
References
{{reflist
Protein structure