HOME

TheInfoList



OR:

Mutation is a
genetic operator A genetic operator is an operator used in genetic algorithms to guide the algorithm towards a solution to a given problem. There are three main types of operators (mutation, crossover and selection), which must work in conjunction with one another i ...
used to maintain
genetic diversity Genetic diversity is the total number of Genetics, genetic characteristics in the genetic makeup of a species, it ranges widely from the number of species to differences within species and can be attributed to the span of survival for a species. ...
of the
chromosomes A chromosome is a long DNA molecule with part or all of the genetic material of an organism. In most chromosomes the very long thin DNA fibers are coated with packaging proteins; in eukaryotic cells the most important of these proteins are ...
of a population of a genetic or, more generally, an
evolutionary algorithm In computational intelligence (CI), an evolutionary algorithm (EA) is a subset of evolutionary computation, a generic population-based metaheuristic optimization algorithm. An EA uses mechanisms inspired by biological evolution, such as rep ...
(EA). It is analogous to biological
mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, m ...
. The classic example of a mutation operator of a binary coded genetic algorithm (GA) involves a probability that an arbitrary
bit The bit is the most basic unit of information in computing and digital communications. The name is a portmanteau of binary digit. The bit represents a logical state with one of two possible values. These values are most commonly represented a ...
in a
genetic sequence A nucleic acid sequence is a succession of bases signified by a series of a set of five different letters that indicate the order of nucleotides forming alleles within a DNA (using GACT) or RNA (GACU) molecule. By convention, sequences are usua ...
will be flipped from its original state. A common method of implementing the mutation operator involves generating a
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the p ...
for each bit in a sequence. This random variable tells whether or not a particular bit will be flipped. This mutation procedure, based on the biological
point mutation A point mutation is a genetic mutation where a single nucleotide base is changed, inserted or deleted from a DNA or RNA sequence of an organism's genome. Point mutations have a variety of effects on the downstream protein product—consequences ...
, is called single point mutation. Other types of mutation operators are commonly used for representations other than binary, such as floating-point encodings or representations for combinatorial problems. The purpose of mutation in EAs is to introduce diversity into the sampled
population Population typically refers to the number of people in a single area, whether it be a city or town, region, country, continent, or the world. Governments typically quantify the size of the resident population within their jurisdiction using ...
. Mutation operators are used in an attempt to avoid
local minima In mathematical analysis, the maxima and minima (the respective plurals of maximum and minimum) of a function, known collectively as extrema (the plural of extremum), are the largest and smallest value of the function, either within a given ...
by preventing the population of chromosomes from becoming too similar to each other, thus slowing or even stopping convergence to the global optimum. This reasoning also leads most EAs to avoid only taking the fittest of the population in generating the next generation, but rather selecting a random (or semi-random) set with a weighting toward those that are fitter. The following requirements apply to all mutation operators used in an EA: # every point in the search space must be reachable by one or more mutations. # there must be no preference for parts or directions in the search space (no drift). # small mutations should be more probable than large ones. For different genome types, different mutation types are suitable. Some mutations are Gaussian, Uniform, Zigzag, Scramble, Insertion, Inversion, Swap, and so on. An overview and more operators than those presented below can be found in the introductory book by Eiben and Smith or in.


Bit string mutation

The mutation of bit strings ensue through bit flips at random positions. Example: The probability of a mutation of a bit is \frac, where l is the length of the binary vector. Thus, a mutation rate of 1 per mutation and individual selected for mutation is reached.


Mutation of real numbers

Many EAs, such as the
evolution strategy In computer science, an evolution strategy (ES) is an optimization technique based on ideas of evolution. It belongs to the general class of evolutionary computation or artificial evolution methodologies. History The 'evolution strategy' optimiza ...
or the real-coded
genetic algorithms In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to gene ...
, work with real numbers instead of bit strings. This is due to the good experiences that have been made with this type of coding. The value of a real-valued gene can either be changed or redetermined. A mutation that implements the latter should only ever be used in conjunction with the value-changing mutations and then only with comparatively low probability, as it can lead to large changes. In practical applications, the decision variables to be changed of the optimisation problem to be solved are usually limited. Accordingly, the values of the associated genes are each restricted to an interval
_, x_ The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline o ...
/math>. Mutations may or may not take these restrictions into account. In the latter case, suitable post-treatment is then required as described below.


Mutation without consideration of restrictions

A real number x can be mutated using
normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu i ...
\mathcal(0,\sigma) by adding the generated random value to the old value of the gene, resulting in the mutated value x':
x' = x + \mathcal(0, \sigma)
In the case of genes with a restricted range of values, it is a good idea to choose the step size of the mutation \sigma so that it reasonably fits the range
_, x_ The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline o ...
/math> of the gene to be changed, e.g.:
\sigma = \frac
The step size can also be adjusted to the smaller permissible change range depending on the current value. In any case, however, it is likely that the new value x' of the gene will be outside the permissible range of values. Such a case must be considered a lethal mutation, since the obvious repair by using the respective violated limit as the new value of the gene would lead to a drift. This is because the limit value would then be selected with the entire probability of the values beyond the limit of the value range. The evolution strategy works with real numbers and mutation based on normal distribution. The step sizes are part of the
chromosome A chromosome is a long DNA molecule with part or all of the genetic material of an organism. In most chromosomes the very long thin DNA fibers are coated with packaging proteins; in eukaryotic cells the most important of these proteins ar ...
and are subject to evolution together with the actual decision variables.


Mutation with consideration of restrictions

One possible form of changing the value of a gene while taking its value range
_, x_ The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline o ...
/math> into account is the mutation ''relative parameter change'' of the evolutionary algorithm GLEAM (General Learning Evolutionary Algorithm and Method), in which, as with the mutation presented earlier, small changes are more likely than large ones. First, an equally distributed decision is made as to whether the current value x should be increased or decreased and then the corresponding total change interval is determined.
Without loss of generality ''Without loss of generality'' (often abbreviated to WOLOG, WLOG or w.l.o.g.; less commonly stated as ''without any loss of generality'' or ''with no loss of generality'') is a frequently used expression in mathematics. The term is used to indicat ...
, an increase is assumed for the explanation and the total change interval is then , x_\max/math>. It is divided into k sub-areas of equal size with the width \delta, from which k sub-change intervals of different size are formed: :i-th sub-change interval: , x + \delta \cdot i/math> with :\delta = \frac and i = 1, \dots, k Subsequently, one of the k sub-change intervals is selected in equal distribution and a random number, also equally distributed, is drawn from it as the new value x' of the gene. The resulting summed probabilities of the sub-change intervals result in the probability distribution of the k sub-areas for the exemplary case of k=10 shown in the adjacent figure. This is not a normal distribution as before, but this distribution also clearly favours small changes over larger ones. This mutation for larger values of k, such as 10, is less well suited for tasks where the optimum lies on one of the value range boundaries. This can be remedied by significantly reducing k when a gene value approaches its limits very closely.


Mutation of permutations

Mutations of permutations are specially designed for genomes that are themselves
permutations In mathematics, a permutation of a set is, loosely speaking, an arrangement of its members into a sequence or linear order, or if the set is already ordered, a rearrangement of its elements. The word "permutation" also refers to the act or pr ...
of a set. These are often used to solve combinatorial tasks. In the two mutations presented, parts of the genome are rotated or inverted.


Rotation to the right

The presentation of the procedure is illustrated by an example on the right:


Inversion

The presentation of the procedure is illustrated by an example on the right:


Variants with preference for smaller changes

The requirement raised at the beginning for mutations, according to which small changes should be more probable than large ones, is only inadequately fulfilled by the two permutation mutations presented, since the lengths of the partial lists and the number of shift positions are determined in an equally distributed manner. However, the longer the partial list and the shift, the greater the change in gene order. This can be remedied by the following modifications. The end index j of the partial lists is determined as the distance d to the start index i: :j = (i+ d) \bmod \left, P_0\ where d is determined randomly according to one of the two procedures for the mutation of real numbers from the interval \left P_0\ - 1\right/math> and rounded. For the ''rotation'', k is determined similarly to the distance d, but the value 0 is forbidden. For the ''inversion'', note that i \ne j must hold, so for d the value 0 must be excluded.


See also

*
Evolutionary algorithm In computational intelligence (CI), an evolutionary algorithm (EA) is a subset of evolutionary computation, a generic population-based metaheuristic optimization algorithm. An EA uses mechanisms inspired by biological evolution, such as rep ...
s *
Genetic algorithm In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to gen ...
s


References


Bibliography

* John Holland (1975). ''Adaptation in Natural and Artificial Systems'', PhD thesis,
University of Michigan Press The University of Michigan Press is part of Michigan Publishing at the University of Michigan Library. It publishes 170 new titles each year in the humanities and social sciences. Titles from the press have earned numerous awards, including ...
, Ann Arbor, Michigan. . * * * * * * {{Evolutionary computation Genetic algorithms Evolutionary algorithms