A neutral network is a set of
gene
In biology, the word gene (from , ; "... Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
s all related by
point mutation
A point mutation is a genetic mutation where a single nucleotide base is changed, inserted or deleted from a DNA or RNA sequence of an organism's genome. Point mutations have a variety of effects on the downstream protein product—consequences ...
s that have equivalent function or
fitness.
Each node represents a gene sequence and each line represents the mutation connecting two sequences. Neutral networks can be thought of as high, flat plateaus in a
fitness landscape
Fitness may refer to:
* Physical fitness, a state of health and well-being of the body
* Fitness (biology), an individual's ability to propagate its genes
* Fitness (cereal), a brand of breakfast cereals and granola bars
* ''Fitness'' (magazine), ...
. During
neutral evolution
The neutral theory of molecular evolution holds that most evolutionary changes occur at the molecular level, and most of the variation within and between species are due to random genetic drift of mutant alleles that are selectively neutral. The ...
, genes can randomly move through neutral networks and traverse regions of
sequence space
In functional analysis and related areas of mathematics, a sequence space is a vector space whose elements are infinite sequences of real or complex numbers. Equivalently, it is a function space whose elements are functions from the natural ...
which may have consequences for
robustness
Robustness is the property of being strong and healthy in constitution. When it is transposed into a system, it refers to the ability of tolerating perturbations that might affect the system’s functional body. In the same line ''robustness'' ca ...
and
evolvability
Evolvability is defined as the capacity of a system for adaptive evolution. Evolvability is the ability of a population of organisms to not merely generate genetic diversity, but to generate '' adaptive'' genetic diversity, and thereby evolve thr ...
.
Genetic and molecular causes
Neutral networks exist in
fitness landscape
Fitness may refer to:
* Physical fitness, a state of health and well-being of the body
* Fitness (biology), an individual's ability to propagate its genes
* Fitness (cereal), a brand of breakfast cereals and granola bars
* ''Fitness'' (magazine), ...
s since proteins are
robust
Robustness is the property of being strong and healthy in constitution. When it is transposed into a system, it refers to the ability of tolerating perturbations that might affect the system’s functional body. In the same line ''robustness'' ca ...
to mutations. This leads to extended networks of genes of equivalent function, linked by
neutral mutation Neutral mutations are changes in DNA sequence that are neither beneficial nor detrimental to the ability of an organism to survive and reproduce. In population genetics, mutations in which natural selection does not affect the spread of the mutatio ...
s. Proteins are resistant to mutations because many sequences can fold into highly similar
structural folds. A protein adopts a limited ensemble of native conformations because those conformers have lower energy than unfolded and mis-folded states (ΔΔG of folding). This is achieved by a distributed, internal network of cooperative interactions (
hydrophobic
In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water.
Hydrophobic molecules tend to be nonpolar and, ...
,
polar
Polar may refer to:
Geography
Polar may refer to:
* Geographical pole, either of two fixed points on the surface of a rotating body or planet, at 90 degrees from the equator, based on the axis around which a body rotates
*Polar climate, the cli ...
and
covalent
A covalent bond is a chemical bond that involves the sharing of electrons to form electron pairs between atoms. These electron pairs are known as shared pairs or bonding pairs. The stable balance of attractive and repulsive forces between atom ...
). Protein structural robustness results from few single mutations being sufficiently disruptive to compromise function. Proteins have also evolved to avoid
aggregation
Aggregation may refer to:
Business and economics
* Aggregation problem (economics)
* Purchasing aggregation, the joining of multiple purchasers in a group purchasing organization to increase their buying power
* Community Choice Aggregation, the ...
as partially folded proteins can combine to form large, repeating, insoluble
protein fibrils and masses. There is evidence that proteins show negative design features to reduce the exposure of aggregation-prone
beta-sheet
The beta sheet, (β-sheet) (also β-pleated sheet) is a common motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a gen ...
motifs in their structures.
Additionally, there is some evidence that the
genetic code
The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links ...
itself may be optimised such that most point mutations lead to similar amino acids (
conservative
Conservatism is a cultural, social, and political philosophy that seeks to promote and to preserve traditional institutions, practices, and values. The central tenets of conservatism may vary in relation to the culture and civilization in ...
). Together these factors create a
distribution of fitness effects
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, mitosi ...
of mutations that contains a high proportion of neutral and nearly-neutral mutations.
Evolution
Neutral networks are a subset of the sequences in
sequence space
In functional analysis and related areas of mathematics, a sequence space is a vector space whose elements are infinite sequences of real or complex numbers. Equivalently, it is a function space whose elements are functions from the natural ...
that have equivalent function, and so form a wide, flat
plateau
In geology and physical geography, a plateau (; ; ), also called a high plain or a tableland, is an area of a highland consisting of flat terrain that is raised sharply above the surrounding area on at least one side. Often one or more sides ...
in a
fitness landscape
Fitness may refer to:
* Physical fitness, a state of health and well-being of the body
* Fitness (biology), an individual's ability to propagate its genes
* Fitness (cereal), a brand of breakfast cereals and granola bars
* ''Fitness'' (magazine), ...
.
Neutral evolution
The neutral theory of molecular evolution holds that most evolutionary changes occur at the molecular level, and most of the variation within and between species are due to random genetic drift of mutant alleles that are selectively neutral. The ...
can therefore be visualised as a population diffusing from one set of sequence nodes, through the neutral network, to another cluster of sequence nodes. Since the majority of evolution is thought to be neutral,
[Kimura, Motoo. (1983). ''The neutral theory of molecular evolution.'' Cambridge] a large proportion of gene change is the movement though expansive neutral networks.
Robustness

The more neutral neighbours a sequence has, the more
robust to mutations it is since mutations are more likely to simply neutrally convert it into an equally functional sequence.
Indeed, if there are large differences between the number of neutral neighbours of different sequences within a neutral network, the population is predicted to evolve towards these robust sequences. This is sometimes called circum-neutrality and represents the movement of populations away from cliffs in the
fitness landscape
Fitness may refer to:
* Physical fitness, a state of health and well-being of the body
* Fitness (biology), an individual's ability to propagate its genes
* Fitness (cereal), a brand of breakfast cereals and granola bars
* ''Fitness'' (magazine), ...
.
In addition to in silico models,
these processes are beginning to be confirmed by
experimental evolution
Experimental evolution is the use of laboratory experiments or controlled field manipulations to explore evolutionary dynamics. Evolution may be observed in the laboratory as individuals/populations adapt to new environmental conditions by natura ...
of
cytochrome P450
Cytochromes P450 (CYPs) are a superfamily of enzymes containing heme as a cofactor that functions as monooxygenases. In mammals, these proteins oxidize steroids, fatty acids, and xenobiotics, and are important for the clearance of various compo ...
s
and
B-lactamase.
Evolvability
Interest in the interplay between
genetic drift
Genetic drift, also known as allelic drift or the Wright effect, is the change in the frequency of an existing gene variant (allele) in a population due to random chance.
Genetic drift may cause gene variants to disappear completely and there ...
and selection has been around since the 1930s when the shifting-balance theory proposed that in some situations, genetic drift could facilitate later adaptive evolution. Although the specifics of the theory were largely discredited, it drew attention to the possibility that drift could generate cryptic variation that, though neutral to current function, may affect selection for new functions (
evolvability
Evolvability is defined as the capacity of a system for adaptive evolution. Evolvability is the ability of a population of organisms to not merely generate genetic diversity, but to generate '' adaptive'' genetic diversity, and thereby evolve thr ...
).
By definition, all genes in a neutral network have equivalent function, however some may exhibit
promiscuous activities
Enzyme promiscuity is the ability of an enzyme to catalyse a fortuitous side reaction in addition to its main reaction. Although enzymes are remarkably specific catalysts, they can often perform side reactions in addition to their main, native cata ...
which could serve as starting points for adaptive evolution towards new functions. In terms of
sequence space
In functional analysis and related areas of mathematics, a sequence space is a vector space whose elements are infinite sequences of real or complex numbers. Equivalently, it is a function space whose elements are functions from the natural ...
, current theories predict that if the neutral networks for two different activities overlap, a neutrally evolving population may diffuse to regions of the neutral network of the first activity that allow it to access the second. This would only be the case when the distance between activities is smaller than the distance that a neutrally evolving population can cover. The degree of interpenetration of the two networks will determine how common cryptic variation for the promiscuous activity is in sequence space.
Mathematical Framework
The fact that neutral mutations were probably widespread was proposed by Freese and Yoshida in 1965.
Motoo Kimura
(November 13, 1924 – November 13, 1994) was a Japanese biologist best known for introducing the neutral theory of molecular evolution in 1968. He became one of the most influential theoretical population geneticists. He is remembered in gen ...
later crystallized a theory of neutral evolution in 1968 with King and Jukes independently proposing a similar theory (1969). Kimura computed the rate of nucleotide substitutions in a population (i.e. the average time for one base pair replacement to occur within a genome) and found it to be ~1.8 years. Such a high rate would not be tolerated by any mammalian population according to
Haldane
Haldane is a surname and a given name which may refer to:
People
* Clan Haldane, a Lowland Scottish clan
Surname
* A. R. B. Haldane (1900–1982), Scottish social historian and author
* Andrew "Ack-Ack" Haldane (1917–1944), U.S. Marine, Worl ...
's formula. He thus concluded that, in mammals, neutral (or nearly neutral) nucleotide substitution mutations of
DNA must dominate. He computed that such mutations were occurring at the rate of roughly 0-5 per year per gamete.

In later years, a new paradigm emerged, that placed
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
as a precursor molecule to
DNA. A primordial molecule principle was put forth as early as 1968 by
Crick, and lead to what is now known as
The RNA World Hypothesis.
DNA is found, predominantly, as fully
base paired double helices, while biological
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
is single stranded and often exhibits complex base-pairing interactions. These are due to its increased ability to form
hydrogen bond
In chemistry, a hydrogen bond (or H-bond) is a primarily electrostatic force of attraction between a hydrogen (H) atom which is covalently bound to a more electronegative "donor" atom or group (Dn), and another electronegative atom bearing ...
s, a fact which stems from the existence of the extra
hydroxyl
In chemistry, a hydroxy or hydroxyl group is a functional group with the chemical formula and composed of one oxygen atom covalently bonded to one hydrogen atom. In organic chemistry, alcohols and carboxylic acids contain one or more hydrox ...
group in the
ribose
Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally-occurring form, , is a component of the ribonucleotides from which RNA is built, and so this compou ...
sugar.
In the 1970s, Stein and
M. Waterman laid the groundwork for the combinatorics of
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
secondary structures.
Waterman
Waterman may refer to:
* Waterman (occupation), a river worker who transferred passengers across and along the city centre rivers in Britain
* The Chesapeake Bay term for commercial fishermen, particularly those who seek oysters and the blue crab
...
gave the first graph theoretic description of
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
secondary structures and their associated properties, and used them to produce an efficient
minimum free energy (MFE) folding algorithm. An
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
secondary structure can be viewed as a diagram over N labeled vertices with its Watson-Crick
base pairs represented as non-crossing arcs in the upper half plane. Therefore, a
secondary structure
Protein secondary structure is the three dimensional form of ''local segments'' of proteins. The two most common secondary structural elements are alpha helices and beta sheets, though beta turns and omega loops occur as well. Secondary struct ...
is a scaffold having many sequences compatible with its implied base pairing constraints. Later,
Smith and Waterman developed an algorithm that performed local sequence alignment.
Another prediction algorithm for
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
secondary structure was given by
Nussinov Nussinov's algorithm described the folding problem over a two letter alphabet as a planar graph optimization problem, where the quantity to be maximized is the number of matchings in the sequence string.
Come the year 1980, Howell et al. computed a generating function of all foldings of a sequence while
D. Sankoff (1985) described algorithms for alignment of finite sequences, the prediction of RNA secondary structures (folding), and the reconstruction of proto-sequences on a phylo-genetic tree. Later,
Waterman
Waterman may refer to:
* Waterman (occupation), a river worker who transferred passengers across and along the city centre rivers in Britain
* The Chesapeake Bay term for commercial fishermen, particularly those who seek oysters and the blue crab
...
and
Temple
A temple (from the Latin ) is a building reserved for spiritual rituals and activities such as prayer and sacrifice. Religions which erect temples include Christianity (whose temples are typically called church (building), churches), Hindui ...
(1986) produced a
polynomial time
In computer science, the time complexity is the computational complexity that describes the amount of computer time it takes to run an algorithm. Time complexity is commonly estimated by counting the number of elementary operations performed by ...
dynamic programming
Dynamic programming is both a mathematical optimization method and a computer programming method. The method was developed by Richard Bellman in the 1950s and has found applications in numerous fields, from aerospace engineering to economics.
I ...
(DP) algorithm for predicting general
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
secondary structure. while in the year 1990, John McCaskill presented a polynomial time DP algorithm for computing the full equilibrium partition function of an RNA secondary structure.
M. Zuker, implemented algorithms for computation of MFE
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
secondary structures based on the work of
Nussinov et al.,
Smith and Waterman and Studnicka, et al. Later L. Hofacker (et al., 1994), presented The
Vienna RNA package, a software package that integrated MFE folding and the computation of the partition function as well as base pairing probabilities.
Peter Schuster
Peter K. Schuster (born 7 March 1941) is a theoretical chemist known for his work with the German Nobel Laureate Manfred Eigen in developing the quasispecies model. His work has made great strides in the understanding of viruses and their replic ...
and W. Fontana (1994) shifted the focus towards sequence to structure maps (
genotype–phenotype) . They used an inverse folding algorithm, to produce computational evidence that
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
sequences sharing the same structure are distributed randomly in
sequence space
In functional analysis and related areas of mathematics, a sequence space is a vector space whose elements are infinite sequences of real or complex numbers. Equivalently, it is a function space whose elements are functions from the natural ...
. They observed that common structures can be reached from a random sequence by just a few mutations. These two facts lead them to conclude that the sequence space seemed to be
percolated by neutral networks of nearest neighbor mutants that fold to the same structure.
In 1997, C. Reidys Stadler and
Schuster Schuster ("shoemaker", "cobbler") is a common family name in German. It is also common among Ashkenazi-Jews, sometimes spelled as " Shuster".
People
* Blessed Alfredo Ildefonso Schuster (1880–1954), Italian cardinal and Archbishop of Milan
* ...
laid the mathematical foundations for the study and modelling of neutral networks of
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
secondary structures. Using a
random graph model they proved the existence of a threshold value for connectivity of random sub-graphs in a configuration space, parametrized by λ, the fraction of neutral neighbors. They showed that the networks are connected and
percolate
Percolation (from Latin ''percolare'', "to filter" or "trickle through"), in physics, chemistry and materials science, refers to the movement and filtering of fluids through porous materials.
It is described by Darcy's law.
Broader applicatio ...
sequence space if the fraction of neutral nearest neighbors exceeds λ*, a threshold value. Below this threshold the networks are partitioned into a largest
giant component
In network theory, a giant component is a connected component of a given random graph that contains a finite fraction of the entire graph's vertices.
Giant component in Erdős–Rényi model
Giant components are a prominent feature of the Erdő ...
and several smaller ones. Key results of this analysis where concerned with threshold functions for density and connectivity for neutral networks as well as
Schuster Schuster ("shoemaker", "cobbler") is a common family name in German. It is also common among Ashkenazi-Jews, sometimes spelled as " Shuster".
People
* Blessed Alfredo Ildefonso Schuster (1880–1954), Italian cardinal and Archbishop of Milan
* ...
's shape space conjecture.
See also
*
Neutral theory of molecular evolution
The neutral theory of molecular evolution holds that most evolutionary changes occur at the molecular level, and most of the variation within and between species are due to random genetic drift of mutant alleles that are selectively neutral. Th ...
*
RNA world
The RNA world is a hypothetical stage in the evolutionary history of life on Earth, in which self-replicating RNA molecules proliferated before the evolution of DNA and proteins. The term also refers to the hypothesis that posits the existenc ...
*
Nucleic acid secondary structure
Nucleic acid secondary structure is the basepairing interactions within a single nucleic acid polymer or between two polymers. It can be represented as a list of bases which are paired in a nucleic acid molecule.
The secondary structures of bio ...
References
{{genarch
Evolutionary biology
Genetics