Reward-based Selection
   HOME

TheInfoList



OR:

Reward-based selection is a technique used in
evolutionary algorithm Evolutionary algorithms (EA) reproduce essential elements of the biological evolution in a computer algorithm in order to solve "difficult" problems, at least Approximation, approximately, for which no exact or satisfactory solution methods are k ...
s for selecting potentially useful solutions for recombination. The probability of being selected for an individual is proportional to the cumulative reward obtained by the individual. The cumulative reward can be computed as a sum of the individual reward and the reward inherited from parents.


Description

Reward-based selection can be used within Multi-armed bandit framework for
Multi-objective optimization Multi-objective optimization or Pareto optimization (also known as multi-objective programming, vector optimization, multicriteria optimization, or multiattribute optimization) is an area of MCDM, multiple-criteria decision making that is concerned ...
to obtain a better approximation of the
Pareto front In multi-objective optimization, the Pareto front (also called Pareto frontier or Pareto curve) is the set of all Pareto efficient solutions. The concept is widely used in engineering. It allows the designer to restrict attention to the set of effi ...
. The newborn a'^ and its parents receive a reward r^, if a'^ was selected for new population Q^, otherwise the reward is zero. Several reward definitions are possible: *1. r^=1, if the newborn individual a'^ was selected for new population Q^. *2. r^ = 1 - \frac \mbox a'^ \in Q^ , where rank(a'^) is the rank of newly inserted individual in the population of \mu individuals. Rank can be computed using a well-known non-dominated sorting procedure. *3. r^ = \sum_ \Delta(a,Q^) - \sum_ \Delta(a,Q^), where \Delta(a,Q^) is the hypervolume indicator contribution of the individual a to the population Q^. The reward r^>0 if the newly inserted individual improves the quality of the population, which is measured as its hypervolume contribution in the objective space. *4. A relaxation of the above reward, involving a rank-based penalization for points for k-th dominated Pareto front: r^ = \frac \left( \sum_ \Delta(a,ndom_k(Q^)) - \sum_ \Delta(a,ndom_k(Q^)) \right) Reward-based selection can quickly identify the most fruitful directions of search by maximizing the cumulative reward of individuals.


See also

*
Fitness proportionate selection Fitness proportionate selection, also known as roulette wheel selection or spinning wheel selection, is a selection technique used in evolutionary algorithms for selecting potentially useful solutions for recombination. Method In fitness prop ...
*
Selection (evolutionary algorithm) Selection is a genetic operator in an evolutionary algorithm (EA). An EA is a metaheuristic inspired by biological evolution and aims to solve challenging problems at least approximately. Selection has a dual purpose: on the one hand, it can cho ...
*
Stochastic universal sampling Stochastic universal sampling (SUS) is a selection technique used in evolutionary algorithm Evolutionary algorithms (EA) reproduce essential elements of the biological evolution in a computer algorithm in order to solve "difficult" problems, at ...
*
Tournament selection Tournament selection is a method of selecting an individual from a population of individuals in a evolutionary algorithm. Tournament selection involves running several "tournaments" among a few individuals (or "chromosomes") chosen at random from t ...


References

{{DEFAULTSORT:Rewardbased Selection Selection (evolutionary algorithm)