HOME

TheInfoList



OR:

The NK model is a
mathematical model A mathematical model is an abstract and concrete, abstract description of a concrete system using mathematics, mathematical concepts and language of mathematics, language. The process of developing a mathematical model is termed ''mathematical m ...
described by its primary inventor
Stuart Kauffman Stuart Alan Kauffman (born September 28, 1939) is an American medical doctor, theoretical biology, theoretical biologist, and complex systems researcher who studies the origin of life on Earth. He was a professor at the University of Chicago, Un ...
as a "tunably rugged"
fitness landscape Fitness may refer to: * Physical fitness, a state of health and well-being of the body * Fitness culture, a sociocultural phenomenon surrounding exercise and physical fitness * Fitness (biology), an individual's ability to propagate its genes * ...
. "Tunable ruggedness" captures the intuition that both the overall size of the landscape and the number of its local "hills and valleys" can be adjusted via changes to its two parameters, N and K, with N being the length of a string of evolution and K determining the level of landscape ruggedness. The NK model has found application in a wide variety of fields, including the theoretical study of
evolutionary biology Evolutionary biology is the subfield of biology that studies the evolutionary processes such as natural selection, common descent, and speciation that produced the diversity of life on Earth. In the 1930s, the discipline of evolutionary biolo ...
,
immunology Immunology is a branch of biology and medicine that covers the study of Immune system, immune systems in all Organism, organisms. Immunology charts, measures, and contextualizes the Physiology, physiological functioning of the immune system in ...
, optimisation, technological evolution, team science, and
complex systems A complex system is a system composed of many components that may interact with one another. Examples of complex systems are Earth's global climate, organisms, the human brain, infrastructure such as power grid, transportation or communication s ...
. The model was also adopted in
organizational theory Organizational theory refers to a series of interrelated concepts that involve the sociological study of the structures and operations of formal social organizations. Organizational theory also seeks to explain how interrelated units of organiza ...
, where it is used to describe the way an
agent Agent may refer to: Espionage, investigation, and law *, spies or intelligence officers * Law of agency, laws involving a person authorized to act on behalf of another ** Agent of record, a person with a contractual agreement with an insuran ...
may search a landscape by manipulating various characteristics of itself. For example, an agent can be an
organization An organization or organisation (English in the Commonwealth of Nations, Commonwealth English; American and British English spelling differences#-ise, -ize (-isation, -ization), see spelling differences) is an legal entity, entity—such as ...
, the hills and valleys represent
profit Profit may refer to: Business and law * Profit (accounting), the difference between the purchase price and the costs of bringing to market * Profit (economics), normal profit and economic profit * Profit (real property), a nonpossessory inter ...
(or changes thereof), and movement on the landscape necessitates organizational decisions (such as adding product lines or altering the organizational structure), which tend to interact with each other and affect profit in a complex fashion. An early version of the model, which considered only the smoothest (K=0) and most rugged (K=N-1) landscapes, was presented in Kauffman and Levin (1987). The model as it is currently known first appeared in Kauffman and Weinberger (1989). One of the reasons why the model has attracted wide attention in optimisation is that it is a particularly simple instance of a so-called NP-complete problemWeinberger, E. (1996), "NP-completeness of Kauffman's N-k model, a Tuneably Rugged Fitness Landscape", Santa Fe Institute Working Paper, 96-02-003. which means it is difficult to find global optima. Recently, it was shown that the NK model for K > 1 is also PLS-complete which means than, in general, it is difficult to find even local fitness optima. This has consequences for the study of open-ended evolution.


Prototypical example: plasmid fitness

A
plasmid A plasmid is a small, extrachromosomal DNA molecule within a cell that is physically separated from chromosomal DNA and can replicate independently. They are most commonly found as small circular, double-stranded DNA molecules in bacteria and ...
is a small circle of DNA inside certain cells that can replicate independently of their host cells. Suppose we wish to study the fitness of plasmids. For simplicity, we model a plasmid as a ring of ''N'' possible genes, always in the same order, and each can have two possible states (active or inactive, type X or type Y, etc...). Then the plasmid is modelled by a
binary Binary may refer to: Science and technology Mathematics * Binary number, a representation of numbers using only two values (0 and 1) for each digit * Binary function, a function that takes two arguments * Binary operation, a mathematical op ...
string with length ''N'', and so the fitness function is F: \^N\to \R. The simplest model would have the genes not interacting with each other, and so we obtainF(S_1S_2\cdots S_N) = f_1(S_1) + f_2(S_2) + \cdots + f_N(S_N)where each f_i(S_i) denotes the contribution to fitness of gene S_i at location i. To model
epistasis Epistasis is a phenomenon in genetics in which the effect of a gene mutation is dependent on the presence or absence of mutations in one or more other genes, respectively termed modifier genes. In other words, the effect of the mutation is depe ...
, we introduce another factor ''K'', the number of other genes that a gene interacts with. It is reasonable to assume that on a plasmid, two genes interact if they are adjacent, thus givingF(S_1S_2\cdots S_N) = f_1(S_1, S_2) + f_2(S_2, S_3) + \cdots + f_(S_, S_N) + f_N(S_N, S_1)For example, when ''K = 1'', and ''N = 5'', : F(00101) = f_1(0,0) + f_2(0,1) + f_3(1,0) + f_4(0, 1) + f_5(1, 0) The NK model generalizes this by allowing arbitrary finite K, N, as well as allowing arbitrary definition of adjacency of genes (the genes do not necessarily lie on a circle or a line segment).


Mathematical definition

The NK model defines a
combinatorial Combinatorics is an area of mathematics primarily concerned with counting, both as a means and as an end to obtaining results, and certain properties of finite structures. It is closely related to many other areas of mathematics and has many ...
phase space The phase space of a physical system is the set of all possible physical states of the system when described by a given parameterization. Each possible state corresponds uniquely to a point in the phase space. For mechanical systems, the p ...
, consisting of every string (chosen from a given alphabet) of length N. For each string in this search space, a scalar value (called the '' fitness'') is defined. If a distance
metric Metric or metrical may refer to: Measuring * Metric system, an internationally adopted decimal system of measurement * An adjective indicating relation to measurement in general, or a noun describing a specific type of measurement Mathematics ...
is defined between strings, the resulting structure is a ''landscape''. Fitness values are defined according to the specific incarnation of the model, but the key feature of the NK model is that the fitness of a given string S is the sum of contributions from each locus f_i(S) in the string: :F(S) = \sum_i \tilde_i(S), and the contribution from each locus in general depends on its state and the state of K other loci,: :\tilde_i(S) = f_i(S_i, S_, \dots, S_), where k_ is the index of the jth neighbor of locus i. Hence, the fitness function f_i is a mapping between strings of length ''K'' + 1 and scalars, which Weinberger's later work calls "fitness contributions". Such fitness contributions are often chosen randomly from some specified probability distribution.


Example: the spin glass models

The 1D
Ising model The Ising model (or Lenz–Ising model), named after the physicists Ernst Ising and Wilhelm Lenz, is a mathematical models in physics, mathematical model of ferromagnetism in statistical mechanics. The model consists of discrete variables that r ...
of
spin glass In condensed matter physics, a spin glass is a magnetic state characterized by randomness, besides cooperative behavior in freezing of spins at a temperature called the "freezing temperature," ''T''f. In ferromagnetic solids, component atoms' ...
is usually written asH = -\sum_^N J_ S_i S_ - \mu\sum_^N h_i S_iwhere H is the Hamiltonian, which can be thought as energy. We can reformulate it as a special case of the NK model with ''K=1'':H = F(S) = \sum_ f_(S_i, S_j)by definingf_(S_i, S_) = -J_ S_i S_ - \mu h_i S_iIn general, the m-dimensional Ising model on a square grid \^m is an NK model with N = n^m, K = m. Since K roughly measures "ruggedness" of the fitness landscape (see below), we see that as the dimension of Ising model increases, its ruggedness also increases. When \mu= 0, this is the Edwards–Anderson model, which is exactly solvable. The Sherrington–Kirkpatrick model generalizes the Ising model by allowing all possible pairs of spins to interact (instead of a
grid graph In graph theory, a lattice graph, mesh graph, or grid graph is a Graph (discrete mathematics), graph whose graph drawing, drawing, Embedding, embedded in some Euclidean space , forms a regular tiling. This implies that the group (mathematics), g ...
, use the
complete graph In the mathematical field of graph theory, a complete graph is a simple undirected graph in which every pair of distinct vertices is connected by a unique edge. A complete digraph is a directed graph in which every pair of distinct vertices i ...
), thus it is also an NK model with K = N-1. Allowing all possible subsequences of spins to interact, instead of merely pairs, we obtain the infinite-range model, which is also an NK model with K = N-1.


Tunable topology

The value of ''K'' controls the degree of
epistasis Epistasis is a phenomenon in genetics in which the effect of a gene mutation is dependent on the presence or absence of mutations in one or more other genes, respectively termed modifier genes. In other words, the effect of the mutation is depe ...
in the NK model, or how much other loci affect the fitness contribution of a given locus. With ''K'' = 0, the fitness of a given string is a simple sum of individual contributions of loci: for nontrivial fitness functions, a
global optimum In mathematical analysis, the maximum and minimum of a function are, respectively, the greatest and least value taken by the function. Known generically as extremum, they may be defined either within a given range (the ''local'' or ''relative' ...
is present and easy to locate (the genome of all 0s if ''f''(0) > ''f''(1), or all 1s if ''f''(1) > ''f''(0)). For nonzero ''K'', the fitness of a string is a sum of fitnesses of substrings, which may interact to frustrate the system (consider how to achieve optimal fitness in the example above). Increasing ''K'' thus increases the ruggedness of the fitness landscape.


Variations with neutral spaces

The bare NK model does not support the phenomenon of ''neutral space'' -- that is, sets of genomes connected by single mutations that have the same fitness value. Two adaptations have been proposed to include this biologically important structure. The ''NKP model'' introduces a parameter P: a proportion P of the 2^K fitness contributions is set to zero, so that the contributions of several genetic motifs are degenerate . The ''NKQ model'' introduces a parameter Q and enforces a discretisation on the possible fitness contribution values so that each contribution takes one of Q possible values, again introducing degeneracy in the contributions from some genetic motifs . The bare NK model corresponds to the P = 0 and Q = \infty cases under these parameterisations.


Known results

In 1991, Weinberger published a detailed analysis of the case in which 1 << k \le N and the fitness contributions are chosen randomly. His analytical estimate of the number of local optima was later shown to be flawed . However, numerical experiments included in Weinberger's analysis support his analytical result that the expected fitness of a string is normally distributed with a mean of approximately \mu + \sigma \sqrt and a variance of approximately .


Applications

The NK model has found use in many fields, including in the study of spin glasses, collective problem solving,
epistasis Epistasis is a phenomenon in genetics in which the effect of a gene mutation is dependent on the presence or absence of mutations in one or more other genes, respectively termed modifier genes. In other words, the effect of the mutation is depe ...
and
pleiotropy Pleiotropy () is a condition in which a single gene or genetic variant influences multiple phenotypic traits. A gene that has such multiple effects is referred to as a ''pleiotropic gene''. Mutations in pleiotropic genes can impact several trait ...
in
evolutionary biology Evolutionary biology is the subfield of biology that studies the evolutionary processes such as natural selection, common descent, and speciation that produced the diversity of life on Earth. In the 1930s, the discipline of evolutionary biolo ...
, and combinatorial optimisation.


References

{{DEFAULTSORT:Nk Model Applied mathematics Mathematical and theoretical biology