The NK model is a mathematical model described by its primary inventor Stuart Kauffman as a "tunably rugged" fitness landscape. "Tunable ruggedness" captures the intuition that both the overall size of the landscape and the number of its local "hills and valleys" can be adjusted via changes to its two parameters,
N
K
N
K
The NK model has found application in a wide variety of fields, including the theoretical study of evolutionary biology, immunology, optimisation, technological evolution, team science,[1] and complex systems. The model was also adopted in organizational theory, where it is used to describe the way an agent may search a landscape by manipulating various characteristics of itself. For example, an agent can be an organization, the hills and valleys represent profit (or changes thereof), and movement on the landscape necessitates organizational decisions (such as adding product lines or altering the organizational structure), which tend to interact with each other and affect profit in a complex fashion.[2]
An early version of the model, which considered only the smoothest (
K=0
K=N-1
One of the reasons why the model has attracted wide attention in optimisation is that it is a particularly simple instance of a so-called NP-complete problem[5] which means it is difficult to find global optima. Recently, it was shown that the NK model for K > 1 is also PLS-complete[6] which means than, in general, it is difficult to find even local fitness optima. This has consequences for the study of open-ended evolution.
A plasmid is a small circle of DNA inside certain cells that can replicate independently of their host cells. Suppose we wish to study the fitness of plasmids.
For simplicity, we model a plasmid as a ring of N possible genes, always in the same order, and each can have two possible states (active or inactive, type X or type Y, etc...). Then the plasmid is modelled by a binary string with length N, and so the fitness function is
F:\{0,1\}N\to\R
The simplest model would have the genes not interacting with each other, and so we obtainwhere each
fi(Si)
Si
i
To model epistasis, we introduce another factor K, the number of other genes that a gene interacts with. It is reasonable to assume that on a plasmid, two genes interact if they are adjacent, thus givingFor example, when K = 1, and N = 5,
F(00101)=f1(0,0)+f2(0,1)+f3(1,0)+f4(0,1)+f5(1,0)
The NK model defines a combinatorial phase space, consisting of every string (chosen from a given alphabet) of length
N
Fitness values are defined according to the specific incarnation of the model, but the key feature of the NK model is that the fitness of a given string
S
fi(S)
F(S)=\sumi\tilde{f}i(S),
and the contribution from each locus in general depends on its state and the state of
K
\tilde{f}i(S)=fi(Si,
S | |
ki1 |
,...,
S | |
kiK |
),
where
kij
j
i
Hence, the fitness function
fi
The 1D Ising model of spin glass is usually written aswhere
H
We can reformulate it as a special case of the NK model with K=1:by definingIn general, the m-dimensional Ising model on a square grid
\{1,2,...,n\}m
N=nm,K=m
Since K roughly measures "ruggedness" of the fitness landscape (see below), we see that as the dimension of Ising model increases, its ruggedness also increases.
When
\mu=0
The Sherrington–Kirkpatrick model generalizes the Ising model by allowing all possible pairs of spins to interact (instead of a grid graph, use the complete graph), thus it is also an NK model with
K=N-1
Allowing all possible subsequences of spins to interact, instead of merely pairs, we obtain the infinite-range model, which is also an NK model with
K=N-1
The value of K controls the degree of epistasis in the NK model, or how much other loci affect the fitness contribution of a given locus. With K = 0, the fitness of a given string is a simple sum of individual contributions of loci: for nontrivial fitness functions, a global optimum is present and easy to locate (the genome of all 0s if f(0) > f(1), or all 1s if f(1) > f(0)). For nonzero K, the fitness of a string is a sum of fitnesses of substrings, which may interact to frustrate the system (consider how to achieve optimal fitness in the example above). Increasing K thus increases the ruggedness of the fitness landscape.
The bare NK model does not support the phenomenon of neutral space -- that is, sets of genomes connected by single mutations that have the same fitness value. Two adaptations have been proposed to include this biologically important structure. The NKP model introduces a parameter
P
P
2K
Q
Q
P=0
Q=infty
In 1991, Weinberger published a detailed analysis of the case in which
1<<k\leN
\mu+\sigma\sqrt{{2ln(k+1)}\over{k+1}}
and a variance of approximately
{{(k+1)\sigma2}\over{N[k+1+2(k+2)ln(k+1)]}}
The NK model has found use in many fields, including in the study of spin glasses, collective problem solving,[7] epistasis and pleiotropy in evolutionary biology, and combinatorial optimisation.