Fitness proportionate selection, also known as roulette wheel selection, is a
genetic operator A genetic operator is an operator used in genetic algorithms to guide the algorithm towards a solution to a given problem. There are three main types of operators (mutation, crossover and selection), which must work in conjunction with one another i ...
used in
genetic algorithm
In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to gen ...
s for selecting potentially useful solutions for recombination.
In fitness proportionate selection, as in all selection methods, the
fitness function {{no footnotes, date=May 2015
A fitness function is a particular type of objective function that is used to summarise, as a single figure of merit, how close a given design solution is to achieving the set aims. Fitness functions are used in genet ...
assigns a fitness to possible solutions or
chromosome
A chromosome is a long DNA molecule with part or all of the genetic material of an organism. In most chromosomes the very long thin DNA fibers are coated with packaging proteins; in eukaryotic cells the most important of these proteins ar ...
s. This fitness level is used to associate a
probability
Probability is the branch of mathematics concerning numerical descriptions of how likely an Event (probability theory), event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and ...
of selection with each individual chromosome. If
is the fitness of individual
in the population, its probability of being selected is
:
where
is the number of individuals in the population.
This could be imagined similar to a Roulette wheel in a casino. Usually a proportion of the wheel is assigned to each of the possible selections based on their fitness value. This could be achieved by dividing the fitness of a selection by the total fitness of all the selections, thereby normalizing them to 1. Then a random selection is made similar to how the roulette wheel is rotated.
While candidate solutions with a higher fitness will be less likely to be eliminated, there is still a chance that they may be eliminated because their probability of selection is less than 1 (or 100%). Contrast this with a less sophisticated selection algorithm, such as
truncation selection, which will eliminate a fixed percentage of the weakest candidates. With fitness proportionate selection there is a chance some weaker solutions may survive the selection process. This is because even though the probability that the weaker solutions will survive is low, it is not zero which means it is still possible they will survive; this is an advantage, because there is a chance that even weak solutions may have some features or characteristics which could prove useful following the recombination process.
The analogy to a roulette wheel can be envisaged by imagining a roulette wheel in which each candidate solution represents a pocket on the wheel; the size of the pockets are proportionate to the probability of selection of the solution. Selecting N chromosomes from the population is equivalent to playing N games on the roulette wheel, as each candidate is drawn independently.
Other selection techniques, such as
stochastic universal sampling
Stochastic universal sampling (SUS) is a technique used in genetic algorithm
In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger clas ...
or
tournament selection Tournament selection is a method of selecting an individual from a population of individuals in a genetic algorithm
In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural sele ...
, are often used in practice. This is because they have less stochastic noise, or are fast, easy to implement and have a constant selection pressure.
The naive implementation is carried out by first generating the
cumulative probability distribution (CDF) over the list of individuals using a probability proportional to the fitness of the individual. A
uniform random number from the range
binary search
In computer science, binary search, also known as half-interval search, logarithmic search, or binary chop, is a search algorithm that finds the position of a target value within a sorted array. Binary search compares the target value to the m ...
over the elements of the CDF. It takes in the
O(log n) time to choose an individual. A faster alternative that generates individuals in O(1) time will be to use the alias method">Big O notation">O(log n) time to choose an individual. A faster alternative that generates individuals in O(1) time will be to use the alias method.
Recently, a very simple algorithm was introduced that is based on "stochastic acceptance". The algorithm randomly selects an individual (say
) and accepts the selection with probability
, where
is the maximum fitness in the population. Certain analysis indicates that the stochastic acceptance version has a considerably better performance than versions based on linear or binary search, especially in applications where fitness values might change during the run.
Fast Proportional Selection
/ref> While the behavior of this algorithm is typically fast, some fitness distributions (such as exponential distributions) may require iterations in the worst case. This algorithm also requires more random numbers than binary search.
Pseudocode
For example, if you have a population with fitnesses [1, 2, 3, 4], then the sum is (1 + 2 + 3 + 4 = 10). Therefore, you would want the probabilities or chances to be [1/10, 2/10, 3/10, 4/10] or [0.1, 0.2, 0.3, 0.4]. If you were to visually normalize this between 0.0 and 1.0, it would be grouped like below with ed = 1/10, green = 2/10, blue = 3/10, black = 4/10
0.1 ]
0.2 \
0.3 /
0.4 \
0.5 ,
0.6 /
0.7 \
0.8 ,
0.9 ,
1.0 /
Using the above example numbers, this is how to determine the probabilities:
sum_of_fitness = 10
previous_probability = 0.0
= previous_probability + (fitness / sum_of_fitness) = 0.0 + (1 / 10) = 0.1
previous_probability = 0.1
= previous_probability + (fitness / sum_of_fitness) = 0.1 + (2 / 10) = 0.3
previous_probability = 0.3
= previous_probability + (fitness / sum_of_fitness) = 0.3 + (3 / 10) = 0.6
previous_probability = 0.6
= previous_probability + (fitness / sum_of_fitness) = 0.6 + (4 / 10) = 1.0
The last index should always be 1.0 or close to it. Then this is how to randomly select an individual:
random_number # Between 0.0 and 1.0
if random_number < 0.1
select 1
else if random_number < 0.3 # 0.3 − 0.1 = 0.2 probability
select 2
else if random_number < 0.6 # 0.6 − 0.3 = 0.3 probability
select 3
else if random_number < 1.0 # 1.0 − 0.6 = 0.4 probability
select 4
end if
See also
* Reward-based selection
*Stochastic universal sampling
Stochastic universal sampling (SUS) is a technique used in genetic algorithm
In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger clas ...
*Tournament selection Tournament selection is a method of selecting an individual from a population of individuals in a genetic algorithm
In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural sele ...
References
External links
C implementation
(.tar.gz; see selector.cxx) WBL
Example on Roulette wheel selection
{{DEFAULTSORT:Fitness Proportionate Selection
Genetic algorithms