A schema () is a template in
computer science
Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includin ...
used in the field of
genetic algorithm
In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to gen ...
s that identifies a
subset
In mathematics, set ''A'' is a subset of a set ''B'' if all elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they are unequal, then ''A'' is a proper subset o ...
of strings with similarities at certain string positions. Schemata are a special case of
cylinder set In mathematics, the cylinder sets form a basis of the product topology on a product of sets; they are also a generating family of the cylinder σ-algebra.
General definition
Given a collection S of sets, consider the Cartesian product X = \prod_ ...
s, forming a
basis
Basis may refer to:
Finance and accounting
*Adjusted basis, the net cost of an asset after adjusting for various tax-related items
*Basis point, 0.01%, often used in the context of interest rates
* Basis trading, a trading strategy consisting o ...
for a
product topology
In topology and related areas of mathematics, a product space is the Cartesian product of a family of topological spaces equipped with a natural topology called the product topology. This topology differs from another, perhaps more natural-seem ...
on strings.
In other words, schemata can be used to generate a
topology
In mathematics, topology (from the Greek words , and ) is concerned with the properties of a geometric object that are preserved under continuous deformations, such as stretching, twisting, crumpling, and bending; that is, without closing ho ...
on a space of strings.
Description
For example, consider binary strings of length 6. The schema 1**0*1 describes the set of all words of length 6 with 1's at the first and sixth positions and a 0 at the fourth position. The * is a
wildcard
Wild card most commonly refers to:
* Wild card (cards), a playing card that substitutes for any other card in card games
* Wild card (sports), a tournament or playoff place awarded to an individual or team that has not qualified through normal pla ...
symbol, which means that positions 2, 3 and 5 can have a value of either 1 or 0. The ''order of a schema'' is defined as the number of fixed positions in the template, while the ''
defining length''
is the distance between the first and last specific positions. The order of 1**0*1 is 3 and its defining length is 5. The ''fitness of a schema'' is the average fitness of all strings matching the schema. The fitness of a string is a measure of the value of the encoded problem solution, as computed by a problem-specific evaluation function.
Length
The length of a schema
, called
, is defined as the total number of nodes in the schema.
is also equal to the number of nodes in the programs matching
.
Disruption
If the child of an individual that matches schema H does not ''itself'' match H, the schema is said to have been ''disrupted''.
Propagation of schema
In
evolutionary computing such as
genetic algorithms
In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to gene ...
and
genetic programming
In artificial intelligence, genetic programming (GP) is a technique of evolving programs, starting from a population of unfit (usually random) programs, fit for a particular task by applying operations analogous to natural genetic processes to t ...
, propagation refers to the inheritance of characteristics of one generation by the next. For example, a schema is propagated if individuals in the current generation match it and so do those in the next generation. Those in the next generation may be (but don't have to be) children of parents who matched it.
The Expansion and Compression Operators
Recently schema have been studied using
order theory
Order theory is a branch of mathematics that investigates the intuitive notion of order using binary relations. It provides a formal framework for describing statements such as "this is less than that" or "this precedes that". This article intr ...
.
[
]
Two basic operators are defined for schema: expansion and compression. The expansion maps a schema onto a set of words which it represents, while the compression maps a set of words on to a schema.
In the following definitions
denotes an alphabet,
denotes all words of length
over the alphabet
,
denotes the alphabet
with the extra symbol
.
denotes all schema of length
over the alphabet
as well as the empty schema
.
For any schema
the following operator
, called the
of
, which maps
to a subset of words in
:
Where subscript
denotes the character at position
in a word or schema. When
then
. More simply put,
is the set of all words in
that can be made by exchanging the
symbols in
with symbols from
. For example, if
,
and
then
.
Conversely, for any
we define
, called the
of
, which maps
on to a schema
:
where
is a schema of length
such that the symbol at position
in
is determined in the following way: if
for all
then
otherwise
. If
then
. One can think of this operator as stacking up all the items in
and if all elements in a column are equivalent, the symbol at that position in
takes this value, otherwise there is a wild card symbol. For example, let
then
.
Schemata can be
partially ordered. For any
we say
if and only if
. It follows that
is a
partial ordering
In mathematics, especially order theory, a partially ordered set (also poset) formalizes and generalizes the intuitive concept of an ordering, sequencing, or arrangement of the elements of a set. A poset consists of a set together with a binary ...
on a set of schemata from the
reflexivity,
antisymmetry
In linguistics, antisymmetry is a syntactic theory presented in Richard S. Kayne's 1994 monograph ''The Antisymmetry of Syntax''. It asserts that grammatical hierarchies in natural language follow a universal order, namely specifier-head-complem ...
and
transitivity of the
subset
In mathematics, set ''A'' is a subset of a set ''B'' if all elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they are unequal, then ''A'' is a proper subset o ...
relation. For example,
.
This is because
.
The compression and expansion operators form a
Galois connection In mathematics, especially in order theory, a Galois connection is a particular correspondence (typically) between two partially ordered sets (posets). Galois connections find applications in various mathematical theories. They generalize the fu ...
, where
is the lower adjoint and
the upper adjoint.
The Schematic Completion and The Schematic Lattice
For a set
, we call the process of calculating the compression on each subset of A, that is
, the schematic completion of
, denoted
.
For example, let
. The schematic completion of
, results in the following set:
The
poset
In mathematics, especially order theory, a partially ordered set (also poset) formalizes and generalizes the intuitive concept of an ordering, sequencing, or arrangement of the elements of a set. A poset consists of a set together with a binary r ...
always forms a
complete lattice
In mathematics, a complete lattice is a partially ordered set in which ''all'' subsets have both a supremum (join) and an infimum (meet). A lattice which satisfies at least one of these properties is known as a ''conditionally complete lattice.'' S ...
called the schematic lattice.

The schematic lattice is similar to the concept lattice found in
Formal concept analysis.
See also
*
Holland's schema theorem
*
Formal concept analysis
References
{{Reflist
Genetic algorithms
Genetic programming