Automata theory is the study of

abstract machine An abstract machine is a computer science theoretical model that allows for a detailed and precise analysis of how a computer system functions. It is analogous to a mathematical function in that it receives inputs and produces outputs based on pr ...

s and

automata An automaton (; plural: automata or automatons) is a relatively self-operating machine, or control mechanism designed to automatically follow a sequence of operations, or respond to predetermined instructions.Automaton – Definition and More ...

, as well as the

computational problem In theoretical computer science, a computational problem is a problem that may be solved by an algorithm. For example, the problem of factoring :"Given a positive integer ''n'', find a nontrivial prime factor of ''n''." is a computational probl ...

s that can be solved using them. It is a theory in

theoretical computer science computer science (TCS) is a subset of general computer science and mathematics that focuses on mathematical aspects of computer science such as the theory of computation, lambda calculus, and type theory. It is difficult to circumscribe the ...

. The word ''automata'' comes from the Greek word αὐτόματος, which means "self-acting, self-willed, self-moving". An automaton (automata in plural) is an abstract self-propelled computing device which follows a predetermined sequence of operations automatically. An automaton with a finite number of states is called a Finite Automaton (FA) or Finite-State Machine (FSM). The figure on the right illustrates a

finite-state machine A finite-state machine (FSM) or finite-state automaton (FSA, plural: ''automata''), finite automaton, or simply a state machine, is a mathematical model of computation. It is an abstract machine that can be in exactly one of a finite number o ...

, which is a well-known type of automaton. This automaton consists of states (represented in the figure by circles) and transitions (represented by arrows). As the automaton sees a symbol of input, it makes a transition (or jump) to another state, according to its transition function, which takes the previous state and current input symbol as its arguments. Automata theory is closely related to

formal language In logic, mathematics, computer science, and linguistics, a formal language consists of words whose letters are taken from an alphabet and are well-formed according to a specific set of rules. The alphabet of a formal language consists of sym ...

theory. In this context, automata are used as finite representations of formal languages that may be infinite. Automata are often classified by the class of formal languages they can recognize, as in the Chomsky hierarchy, which describes a nesting relationship between major classes of automata. Automata play a major role in the theory of computation, compiler construction,

artificial intelligence Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and inferring information—demonstrated by machines, as opposed to intelligence displayed by animals and humans. Example tasks in which this is done include speech ...

parsing Parsing, syntax analysis, or syntactic analysis is the process of analyzing a string of symbols, either in natural language, computer languages or data structures, conforming to the rules of a formal grammar. The term ''parsing'' comes from ...

and

formal verification In the context of hardware and software systems, formal verification is the act of proving or disproving the correctness of intended algorithms underlying a system with respect to a certain formal specification or property, using formal met ...

History

The theory of abstract automata was developed in the mid-20th century in connection with

finite automata A finite-state machine (FSM) or finite-state automaton (FSA, plural: ''automata''), finite automaton, or simply a state machine, is a mathematical model of computation. It is an abstract machine that can be in exactly one of a finite number o ...

. Automata theory was initially considered a branch of mathematical

systems theory Systems theory is the interdisciplinary study of systems, i.e. cohesive groups of interrelated, interdependent components that can be natural or human-made. Every system has causal boundaries, is influenced by its context, defined by its structu ...

, studying the behavior of discrete-parameter systems. Early work in automata theory differed from previous work on systems by using

abstract algebra In mathematics, more specifically algebra, abstract algebra or modern algebra is the study of algebraic structures. Algebraic structures include groups, rings, fields, modules, vector spaces, lattices, and algebras over a field. The ter ...

to describe

information system An information system (IS) is a formal, sociotechnical, organizational system designed to collect, process, store, and distribute information. From a sociotechnical perspective, information systems are composed by four components: task, people ...

s rather than

differential calculus In mathematics, differential calculus is a subfield of calculus that studies the rates at which quantities change. It is one of the two traditional divisions of calculus, the other being integral calculus—the study of the area beneath a curve ...

to describe material systems. The theory of the finite-state transducer was developed under different names by different research communities. The earlier concept of

Turing machine A Turing machine is a mathematical model of computation describing an abstract machine that manipulates symbols on a strip of tape according to a table of rules. Despite the model's simplicity, it is capable of implementing any computer alg ...

was also included in the discipline along with new forms of infinite-state automata, such as

pushdown automata In the theory of computation, a branch of theoretical computer science, a pushdown automaton (PDA) is a type of automaton that employs a stack. Pushdown automata are used in theories about what can be computed by machines. They are more capab ...

. 1956 saw the publication of ''Automata Studies'', which collected work by scientists including

Claude Shannon Claude Elwood Shannon (April 30, 1916 – February 24, 2001) was an American mathematician, electrical engineer, and cryptographer known as a "father of information theory". As a 21-year-old master's degree student at the Massachusetts I ...

, W. Ross Ashby,

John von Neumann John von Neumann (; hu, Neumann János Lajos, ; December 28, 1903 – February 8, 1957) was a Hungarian-American mathematician, physicist, computer scientist, engineer and polymath. He was regarded as having perhaps the widest c ...

Marvin Minsky Marvin Lee Minsky (August 9, 1927 – January 24, 2016) was an American cognitive and computer scientist concerned largely with research of artificial intelligence (AI), co-founder of the Massachusetts Institute of Technology's AI laboratory ...

Edward F. Moore Edward Forrest Moore (November 23, 1925 in Baltimore, Maryland – June 14, 2003 in Madison, Wisconsin) was an American professor of mathematics and computer science, the inventor of the Moore finite state machine, and an early pioneer of artifi ...

, and Stephen Cole Kleene. With the publication of this volume, "automata theory emerged as a relatively autonomous discipline". The book included Kleene's description of the set of regular events, or regular languages, and a relatively stable measure of complexity in Turing machine programs by Shannon. In the same year,

Noam Chomsky Avram Noam Chomsky (born December 7, 1928) is an American public intellectual: a linguist, philosopher, cognitive scientist, historian, social critic, and political activist. Sometimes called "the father of modern linguistics", Chomsky i ...

described the Chomsky hierarchy, a correspondence between automata and formal grammars, and Ross Ashby published '' An Introduction to Cybernetics'', an accessible textbook explaining automata and information using basic

set theory Set theory is the branch of mathematical logic that studies sets, which can be informally described as collections of objects. Although objects of any kind can be collected into a set, set theory, as a branch of mathematics, is mostly concern ...

. The study of

linear bounded automata In computer science, a linear bounded automaton (plural linear bounded automata, abbreviated LBA) is a restricted form of Turing machine. Operation A linear bounded automaton is a nondeterministic Turing machine that satisfies the following th ...

led to the

Myhill–Nerode theorem In the theory of formal languages, the Myhill–Nerode theorem provides a necessary and sufficient condition for a language to be regular. The theorem is named for John Myhill and Anil Nerode, who proved it at the University of Chicago in 1958 ...

, which gives a necessary and sufficient condition for a formal language to be regular, and an exact count of the number of states in a minimal machine for the language. The pumping lemma for regular languages, also useful in regularity proofs, was proven in this period by Michael O. Rabin and

Dana Scott Dana Stewart Scott (born October 11, 1932) is an American logician who is the emeritus Hillman University Professor of Computer Science, Philosophy, and Mathematical Logic at Carnegie Mellon University; he is now retired and lives in Berkeley, Ca ...

, along with the computational equivalence of deterministic and nondeterministic finite automata. In the 1960s, a body of algebraic results known as "structure theory" or "algebraic decomposition theory" emerged, which dealt with the realization of sequential machines from smaller machines by interconnection. While any finite automaton can be simulated using a universal gate set, this requires that the simulating circuit contain loops of arbitrary complexity. Structure theory deals with the "loop-free" realizability of machines. The theory of

computational complexity In computer science, the computational complexity or simply complexity of an algorithm is the amount of resources required to run it. Particular focus is given to computation time (generally measured by the number of needed elementary operations) ...

also took shape in the 1960s. By the end of the decade, automata theory came to be seen as "the pure mathematics of computer science".

Automata

What follows is a general definition of an automaton, which restricts a broader definition of a

system A system is a group of interacting or interrelated elements that act according to a set of rules to form a unified whole. A system, surrounded and influenced by its environment, is described by its boundaries, structure and purpose and express ...

to one viewed as acting in discrete time-steps, with its state behavior and outputs defined at each step by unchanging functions of only its state and input.

Informal description

An automaton ''runs'' when it is given some sequence of ''inputs'' in discrete (individual) ''time steps'' (or just ''steps''). An automaton processes one input picked from a set of ''

symbols A symbol is a mark, sign, or word that indicates, signifies, or is understood as representing an idea, object, or relationship. Symbols allow people to go beyond what is known or seen by creating linkages between otherwise very different co ...

'' or ''letters'', which is called an ''input

alphabet An alphabet is a standardized set of basic written graphemes (called letters) that represent the phonemes of certain spoken languages. Not all writing systems represent language in this way; in a syllabary, each character represents a syllab ...

''. The symbols received by the automaton as input at any step are a sequence of symbols called ''words''. An automaton has a set of ''states''. At each moment during a run of the automaton, the automaton is ''in'' one of its states. When the automaton receives new input it moves to another state (or ''transitions'') based on a ''transition function'' that takes the previous state and current input symbol as parameters. At the same time, another function called the ''output function'' produces symbols from the ''output alphabet'', also according to the previous state and current input symbol. The automaton reads the symbols of the input word and transitions between states until the word is read completely, if it is finite in length, at which point the automaton ''halts''. A state at which the automaton halts is called the ''final state''. To investigate the possible state/input/output sequences in an automaton using

theory, a machine can be assigned a ''starting state'' and a set of ''accepting states''. Then, depending on whether a run starting from the starting state ends in an accepting state, the automaton can be said to ''accept'' or ''reject'' an input sequence. The set of all the words accepted by an automaton is called the ''language recognized by the automaton''. A familiar example of a machine recognizing a language is an electronic lock, which accepts or rejects attempts to enter the correct code.

Formal definition

;Automaton :An automaton can be represented formally by a 5-tuple

M = \langle \Sigma, \Gamma, Q, \delta, \lambda \rangle

, where: :*

\Sigma

is a finite set of ''symbols'', called the ''input alphabet'' of the automaton, :*

\Gamma

is another finite set of symbols, called the ''output alphabet'' of the automaton, :*

Q

is a set of ''states'', :*

\delta

is the ''next-state function'' or ''transition function''

\delta : Q \times \Sigma \to Q

mapping state-input pairs to successor states, :*

\lambda

is the ''next-output function''

\lambda : Q \times \Sigma \to \Gamma

mapping state-input pairs to outputs. :If

Q

is finite, then

M

is a

finite automaton A finite-state machine (FSM) or finite-state automaton (FSA, plural: ''automata''), finite automaton, or simply a state machine, is a mathematical model of computation. It is an abstract machine that can be in exactly one of a finite number o ...

. ;Input word :An automaton reads a finite string of symbols

a_1a_2...a_n

, where

a_i \in \Sigma

, which is called an ''input word''. The set of all words is denoted by

\Sigma^*

. ;Run :A sequence of states

q_0,q_1,...,q_n

, where

q_i \in Q

such that

q_i = \delta(q_, a_i)

for

0 < i \le n

, is a ''run'' of the automaton on an input

a_1a_2...a_n \in \Sigma^*

starting from state

q_0

. In other words, at first the automaton is at the start state

q_0

, and receives input

a_1

. For

a_1

and every following

a_i

in the input string, the automaton picks the next state

q_i

according to the transition function

\delta(q_,a_i)

, until the last symbol

a_n

has been read, leaving the machine in the ''final state'' of the run,

q_n

. Similarly, at each step, the automaton emits an output symbol according to the output function

\lambda(q_,a_i)

. :The transition function

\delta

is extended inductively into

\overline\delta: Q \times \Sigma^* \to Q

to describe the machine's behavior when fed whole input words. For the empty string

\varepsilon

\overline\delta(q, \varepsilon) = q

for all states

q

, and for strings

wa

where

a

is the last symbol and

w

is the (possibly empty) rest of the string,

\overline\delta(q, wa) = \delta(\overline\delta(q,w),a)

. The output function

\lambda

may be extended similarly into

\overline\lambda(q,w)

, which gives the complete output of the machine when run on word

w

from state

q

. ;Acceptor :In order to study an automaton with the theory of

formal languages In logic, mathematics, computer science, and linguistics, a formal language consists of words whose letters are taken from an alphabet and are well-formed according to a specific set of rules. The alphabet of a formal language consists of sy ...

, an automaton may be considered as an ''acceptor'', replacing the output alphabet and function

\Gamma

and

\lambda

with :*

q_0 \in Q

, a designated ''start state'', and :*

F

, a set of states of

Q

(i.e.

F \subseteq Q

) called ''accept states''. :This allows the following to be defined: ;Accepting word :A word

w = a_1a_2...a_n \in \Sigma^*

is an ''accepting word'' for the automaton if

\overline\delta(q_0,w) \in F

, that is, if after consuming the whole string

w

the machine is in an accept state. ;Recognized language :The language

L \subseteq \Sigma^*

''recognized'' by an automaton is the set of all the words that are accepted by the automaton,

L = \

. ;Recognizable languages :The recognizable languages are the set of languages that are recognized by some automaton. For ''finite automata'' the recognizable languages are regular languages. For different types of automata, the recognizable languages are different.

Variant definitions of automata

Automata are defined to study useful machines under mathematical formalism. So the definition of an automaton is open to variations according to the "real world machine" that we want to model using the automaton. People have studied many variations of automata. The following are some popular variations in the definition of different components of automata. ;Input * ''Finite input'': An automaton that accepts only finite sequences of symbols. The above introductory definition only encompasses finite words. * ''Infinite input'': An automaton that accepts infinite words ( ω-words). Such automata are called '' ω-automata''. * ''Tree input'': The input may be a '' tree of symbols'' instead of sequence of symbols. In this case after reading each symbol, the automaton ''reads'' all the successor symbols in the input tree. It is said that the automaton ''makes one copy'' of itself for each successor and each such copy starts running on one of the successor symbols from the state according to the transition relation of the automaton. Such an automaton is called a

tree automaton A tree automaton is a type of state machine. Tree automata deal with tree structures, rather than the strings of more conventional state machines. The following article deals with branching tree automata, which correspond to regular languages ...

. * ''Infinite tree input'' : The two extensions above can be combined, so the automaton reads a tree structure with (in)finite branches. Such an automaton is called an infinite tree automaton. ;States * ''Single state'': An automaton with one state, also called a ''combinational circuit'', performs a transformation which may implement combinational logic. * ''Finite states'': An automaton that contains only a finite number of states. * ''Infinite states'': An automaton that may not have a finite number of states, or even a

countable In mathematics, a set is countable if either it is finite or it can be made in one to one correspondence with the set of natural numbers. Equivalently, a set is ''countable'' if there exists an injective function from it into the natural numbers ...

number of states. Different kinds of abstract memory may be used to give such machines finite descriptions. * ''Stack memory'': An automaton may also contain some extra memory in the form of a

stack Stack may refer to: Places * Stack Island, an island game reserve in Bass Strait, south-eastern Australia, in Tasmania’s Hunter Island Group * Blue Stack Mountains, in Co. Donegal, Ireland People * Stack (surname) (including a list of people ...

in which symbols can be pushed and popped. This kind of automaton is called a '' pushdown automaton''. * ''Queue memory'': An automaton may have memory in the form of a queue. Such a machine is called '' queue machine'' and is Turing-complete. * ''Tape memory'': The inputs and outputs of automata are often described as input and output ''tapes''. Some machines have additional ''working tapes'', including the

, linear bounded automaton, and

log-space transducer In computational complexity theory, a log space transducer (LST) is a type of Turing machine used for log-space reductions. A log space transducer, M, has three tapes: * A read-only ''input'' tape. * A read/write ''work'' tape (bounded to at most ...

. ;Transition function * ''Deterministic'': For a given current state and an input symbol, if an automaton can only jump to one and only one state then it is a '' deterministic automaton''. * ''Nondeterministic'': An automaton that, after reading an input symbol, may jump into any of a number of states, as licensed by its transition relation. Notice that the term transition function is replaced by transition relation: The automaton ''non-deterministically'' decides to jump into one of the allowed choices. Such automata are called ''nondeterministic automata''. * ''Alternation'': This idea is quite similar to tree automata but orthogonal. The automaton may run its ''multiple copies'' on the ''same'' next read symbol. Such automata are called '' alternating automata''. The acceptance condition must be satisfied on all runs of such ''copies'' to accept the input. ;Acceptance condition * ''Acceptance of finite words'': Same as described in the informal definition above. * ''Acceptance of infinite words'': an ''ω-automaton'' cannot have final states, as infinite words never terminate. Rather, acceptance of the word is decided by looking at the infinite sequence of visited states during the run. * ''Probabilistic acceptance'': An automaton need not strictly accept or reject an input. It may accept the input with some

probability Probability is the branch of mathematics concerning numerical descriptions of how likely an event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and 1, where, roughly speaking, ...

between zero and one. For example, quantum finite automata, geometric automata and metric automata have probabilistic acceptance. Different combinations of the above variations produce many classes of automata. Automata theory is a subject matter that studies properties of various types of automata. For example, the following questions are studied about a given type of automata. * Which class of formal languages is recognizable by some type of automata? (Recognizable languages) * Are certain automata ''closed'' under union, intersection, or complementation of formal languages? (Closure properties) * How expressive is a type of automata in terms of recognizing a class of formal languages? And, their relative expressive power? (Language hierarchy) Automata theory also studies the existence or nonexistence of any effective algorithms to solve problems similar to the following list: * Does an automaton accept at least one input word? (Emptiness checking) * Is it possible to transform a given non-deterministic automaton into a deterministic automaton without changing the language recognized? (Determinization) * For a given formal language, what is the smallest automaton that recognizes it? ( Minimization)

Types of automata

The following is an incomplete list of types of automata.

Discrete, continuous, and hybrid automata

Normally automata theory describes the states of abstract machines but there are discrete automata, analog automata or continuous automata, or hybrid discrete-continuous automata, which use digital data, analog data or continuous time, or digital ''and'' analog data, respectively.

Hierarchy in terms of powers

The following is an incomplete hierarchy in terms of powers of different types of virtual machines. The hierarchy reflects the nested categories of languages the machines are able to accept.

Applications

Each model in automata theory plays important roles in several applied areas.

Finite automata A finite-state machine (FSM) or finite-state automaton (FSA, plural: ''automata''), finite automaton, or simply a state machine, is a mathematical model of computation. It is an abstract machine that can be in exactly one of a finite number o ...

are used in text processing, compilers, and hardware design.

Context-free grammar In formal language theory, a context-free grammar (CFG) is a formal grammar whose production rules are of the form :A\ \to\ \alpha with A a ''single'' nonterminal symbol, and \alpha a string of terminals and/or nonterminals (\alpha can be em ...

(CFGs) are used in

programming language A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language. The description of a programming ...

s and artificial intelligence. Originally, CFGs were used in the study of

human language Language is a structured system of communication. The structure of a language is its grammar and the free components are its vocabulary. Languages are the primary means by which humans communicate, and may be conveyed through a variety of met ...

s. Cellular automata are used in the field of artificial life, the most famous example being John Conway's Game of Life. Some other examples which could be explained using automata theory in biology include mollusk and pine cone growth and pigmentation patterns. Going further, a theory suggesting that the whole universe is computed by some sort of a discrete automaton, is advocated by some scientists. The idea originated in the work of

Konrad Zuse Konrad Ernst Otto Zuse (; 22 June 1910 – 18 December 1995) was a German civil engineer, pioneering computer scientist, inventor and businessman. His greatest achievement was the world's first programmable computer; the functional program- ...

, and was popularized in America by Edward Fredkin. Automata also appear in the theory of

finite field In mathematics, a finite field or Galois field (so-named in honor of Évariste Galois) is a field that contains a finite number of elements. As with any field, a finite field is a set on which the operations of multiplication, addition, subtr ...

s: the set of

irreducible polynomial In mathematics, an irreducible polynomial is, roughly speaking, a polynomial that cannot be factored into the product of two non-constant polynomials. The property of irreducibility depends on the nature of the coefficients that are accepted f ...

s that can be written as composition of degree two polynomials is in fact a regular language. Another problem for which automata can be used is the

induction of regular languages In computational learning theory, induction of regular languages refers to the task of learning a formal description (e.g. grammar) of a regular language from a given set of example strings. Although E. Mark Gold has shown that not every regular la ...

Automata simulators

Automata simulators are pedagogical tools used to teach, learn and research automata theory. An automata simulator takes as input the description of an automaton and then simulates its working for an arbitrary input string. The description of the automaton can be entered in several ways. An automaton can be defined in a symbolic language or its specification may be entered in a predesigned form or its transition diagram may be drawn by clicking and dragging the mouse. Well known automata simulators include Turing's World, JFLAP, VAS, TAGS and SimStudio.

Connection to category theory

One can define several distinct categories of automata following the automata classification into different types described in the previous section. The mathematical category of deterministic automata, sequential machines or ''sequential automata'', and Turing machines with ''automata homomorphisms'' defining the arrows between automata is a Cartesian closed category, it has both categorical

limit Limit or Limits may refer to: Arts and media * ''Limit'' (manga), a manga by Keiko Suenobu * ''Limit'' (film), a South Korean film * Limit (music), a way to characterize harmony * "Limit" (song), a 2016 single by Luna Sea * "Limits", a 2019 ...

s and colimits. An automata homomorphism maps a quintuple of an automaton ''A''_''i'' onto the quintuple of another automaton '' A''_''j''. Automata homomorphisms can also be considered as ''automata transformations'' or as

semigroup In mathematics, a semigroup is an algebraic structure consisting of a set together with an associative internal binary operation on it. The binary operation of a semigroup is most often denoted multiplicatively: ''x''·''y'', or simply ''xy'', ...

homomorphisms, when the state space, ''S'', of the automaton is defined as a semigroup S_g.

Monoid In abstract algebra, a branch of mathematics, a monoid is a set equipped with an associative binary operation and an identity element. For example, the nonnegative integers with addition form a monoid, the identity element being 0. Monoid ...

s are also considered as a suitable setting for automata in monoidal categories.Meseguer, J., Montanari, U.: 1990 Petri nets are monoids. ''Information and Computation'' 88:105–155 ;Categories of variable automata One could also define a ''variable automaton'', in the sense of

Norbert Wiener Norbert Wiener (November 26, 1894 – March 18, 1964) was an American mathematician and philosopher. He was a professor of mathematics at the Massachusetts Institute of Technology (MIT). A child prodigy, Wiener later became an early researcher ...

in his book on '' The Human Use of Human Beings'' ''via'' the endomorphisms

A_\to A_

. Then one can show that such variable automata homomorphisms form a mathematical group. In the case of non-deterministic, or other complex kinds of automata, the latter set of endomorphisms may become, however, a ''variable automaton

groupoid In mathematics, especially in category theory and homotopy theory, a groupoid (less often Brandt groupoid or virtual group) generalises the notion of group in several equivalent ways. A groupoid can be seen as a: *'' Group'' with a partial func ...

''. Therefore, in the most general case, categories of variable automata of any kind are categories of groupoids or groupoid categories. Moreover, the category of reversible automata is then a 2-category, and also a subcategory of the 2-category of groupoids, or the groupoid category.

References

External links

dk.brics.automaton

{{Authority control