Schema (genetic algorithms)
   HOME

TheInfoList



OR:

A schema () is a template in
computer science Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includi ...
used in the field of genetic algorithms that identifies a subset of strings with similarities at certain string positions. Schemata are a special case of cylinder sets, forming a
basis Basis may refer to: Finance and accounting * Adjusted basis, the net cost of an asset after adjusting for various tax-related items *Basis point, 0.01%, often used in the context of interest rates * Basis trading, a trading strategy consisting ...
for a
product topology In topology and related areas of mathematics, a product space is the Cartesian product of a family of topological spaces equipped with a natural topology called the product topology. This topology differs from another, perhaps more natural-s ...
on strings. In other words, schemata can be used to generate a
topology In mathematics, topology (from the Greek words , and ) is concerned with the properties of a geometric object that are preserved under continuous deformations, such as stretching, twisting, crumpling, and bending; that is, without closing ...
on a space of strings.


Description

For example, consider binary strings of length 6. The schema 1**0*1 describes the set of all words of length 6 with 1's at the first and sixth positions and a 0 at the fourth position. The * is a
wildcard Wild card most commonly refers to: * Wild card (cards), a playing card that substitutes for any other card in card games * Wild card (sports), a tournament or playoff place awarded to an individual or team that has not qualified through normal pla ...
symbol, which means that positions 2, 3 and 5 can have a value of either 1 or 0. The ''order of a schema'' is defined as the number of fixed positions in the template, while the ''
defining length In genetic algorithms and genetic programming defining length L(H) is the maximum distance between two defining symbols (that is symbols that have a fixed value as opposed to symbols that can take any value, commonly denoted as # or *) in schema H. ...
'' \delta(H) is the distance between the first and last specific positions. The order of 1**0*1 is 3 and its defining length is 5. The ''fitness of a schema'' is the average fitness of all strings matching the schema. The fitness of a string is a measure of the value of the encoded problem solution, as computed by a problem-specific evaluation function.


Length

The length of a schema H, called N(H), is defined as the total number of nodes in the schema. N(H) is also equal to the number of nodes in the programs matching H.


Disruption

If the child of an individual that matches schema H does not ''itself'' match H, the schema is said to have been ''disrupted''.


Propagation of schema

In
evolutionary computing In computer science, evolutionary computation is a family of algorithms for global optimization inspired by biological evolution, and the subfield of artificial intelligence and soft computing studying these algorithms. In technical terms, the ...
such as
genetic algorithms In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to gene ...
and
genetic programming In artificial intelligence, genetic programming (GP) is a technique of evolving programs, starting from a population of unfit (usually random) programs, fit for a particular task by applying operations analogous to natural genetic processes to t ...
, propagation refers to the inheritance of characteristics of one generation by the next. For example, a schema is propagated if individuals in the current generation match it and so do those in the next generation. Those in the next generation may be (but don't have to be) children of parents who matched it.


The Expansion and Compression Operators

Recently schema have been studied using
order theory Order theory is a branch of mathematics that investigates the intuitive notion of order using binary relations. It provides a formal framework for describing statements such as "this is less than that" or "this precedes that". This article int ...
. Two basic operators are defined for schema: expansion and compression. The expansion maps a schema onto a set of words which it represents, while the compression maps a set of words on to a schema. In the following definitions \Sigma denotes an alphabet, \Sigma^l denotes all words of length l over the alphabet \Sigma , \Sigma_* denotes the alphabet \Sigma with the extra symbol *. \Sigma_*^l denotes all schema of length l over the alphabet \Sigma_* as well as the empty schema \epsilon_* . For any schema s \in \Sigma^l_* the following operator s, called the expansion of s, which maps s to a subset of words in \Sigma^l : s := \ Where subscript i denotes the character at position i in a word or schema. When s= \epsilon_* then s = \emptyset. More simply put, s is the set of all words in \Sigma^l that can be made by exchanging the * symbols in s with symbols from \Sigma. For example, if \Sigma=\, l=3 and s=10* then s=\ . Conversely, for any A \subseteq \Sigma^l we define , called the compression of A, which maps A on to a schema s\in \Sigma_*^l: A:= s where s is a schema of length l such that the symbol at position i in s is determined in the following way: if x_i = y_i for all x,y \in A then s_i = x_i otherwise s_i = *. If A = \emptyset then A = \epsilon_*. One can think of this operator as stacking up all the items in A and if all elements in a column are equivalent, the symbol at that position in s takes this value, otherwise there is a wild card symbol. For example, let A = \ then A = **0. Schemata can be
partially ordered In mathematics, especially order theory, a partially ordered set (also poset) formalizes and generalizes the intuitive concept of an ordering, sequencing, or arrangement of the elements of a set. A poset consists of a set together with a binary r ...
. For any a,b \in \Sigma^l_* we say a \leq b if and only if a \subseteq b. It follows that \leq is a
partial ordering In mathematics, especially order theory, a partially ordered set (also poset) formalizes and generalizes the intuitive concept of an ordering, sequencing, or arrangement of the elements of a set. A poset consists of a set together with a binary r ...
on a set of schemata from the reflexivity,
antisymmetry In linguistics, antisymmetry is a syntactic theory presented in Richard S. Kayne's 1994 monograph ''The Antisymmetry of Syntax''. It asserts that grammatical hierarchies in natural language follow a universal order, namely specifier-head-comple ...
and transitivity of the subset relation. For example, \epsilon_* \leq 11 \leq 1* \leq **. This is because \epsilon_* \subseteq 11 \subseteq 1* \subseteq ** = \emptyset \subseteq \ \subseteq \ \subseteq \. The compression and expansion operators form a Galois connection, where \downarrow is the lower adjoint and \uparrow the upper adjoint.


The Schematic Completion and The Schematic Lattice

For a set A \subseteq \Sigma^l, we call the process of calculating the compression on each subset of A, that is \, the schematic completion of A, denoted \mathcal(A). For example, let A = \. The schematic completion of A, results in the following set: \mathcal(A) = \ The
poset In mathematics, especially order theory, a partially ordered set (also poset) formalizes and generalizes the intuitive concept of an ordering, sequencing, or arrangement of the elements of a set. A poset consists of a set together with a binary r ...
(\mathcal(A),\leq) always forms a complete lattice called the schematic lattice. The schematic lattice is similar to the concept lattice found in Formal concept analysis.


See also

*
Holland's schema theorem Holland's schema theorem, also called the fundamental theorem of genetic algorithms, is an inequality that results from coarse-graining an equation for evolutionary dynamics. The Schema Theorem says that short, low-order schemata with above-average ...
* Formal concept analysis


References

{{Reflist Genetic algorithms Genetic programming