Generalized context-free grammar (GCFG) is a grammar formalism that expands on
context-free grammars by adding potentially non-context-free composition functions to rewrite rules.
Head grammar
Head grammar (HG) is a grammar formalism introduced in Carl Pollard (1984) Pollard, C. 1984. ''Generalized Phrase Structure Grammars, Head Grammars, and Natural Language''. Ph.D. thesis, Stanford University, CA. as an extension of the context-f ...
(and its weak equivalents) is an instance of such a GCFG which is known to be especially adept at handling a wide variety of non-CF properties of natural language.
Description
A GCFG consists of two components: a set of composition functions that combine string tuples, and a set of rewrite rules. The composition functions all have the form
, where
is either a single string tuple, or some use of a (potentially different) composition function which reduces to a string tuple. Rewrite rules look like
, where
,
, ... are string tuples or non-terminal symbols.
The rewrite semantics of GCFGs is fairly straightforward. An occurrence of a non-terminal symbol is rewritten using rewrite rules as in a context-free grammar, eventually yielding just compositions (composition functions applied to string tuples or other compositions). The composition functions are then applied, successively reducing the tuples to a single tuple.
Example
A simple translation of a context-free grammar into a GCFG can be performed in the following fashion. Given the grammar in (), which generates the palindrome language
, where
is the string reverse of
, we can define the composition function ''conc'' as in () and the rewrite rules as in ().
The CF production of ' is
:
:
:
:
:
and the corresponding GCFG production is
:
:
:
:
:
:
:
:
Linear Context-free Rewriting Systems (LCFRSs)
Weir (1988)
describes two properties of composition functions, linearity and regularity. A function defined as
is linear if and only if each variable appears at most once on either side of the ''='', making
linear but not
. A function defined as
is regular if the left hand side and right hand side have exactly the same variables, making
regular but not
or
.
A grammar in which all composition functions are both linear and regular is called a Linear Context-free Rewriting System (LCFRS). LCFRS is a
proper subclass of the GCFGs, i.e. it has strictly less computational power than the GCFGs as a whole.
On the other hand, LCFRSs are strictly more expressive than
linear-indexed grammars and their
weakly equivalent variant
tree adjoining grammar
Tree-adjoining grammar (TAG) is a grammar formalism defined by Aravind Joshi. Tree-adjoining grammars are somewhat similar to context-free grammars, but the elementary unit of rewriting is the tree rather than the symbol. Whereas context-free gr ...
s (TAGs).
Head grammar
Head grammar (HG) is a grammar formalism introduced in Carl Pollard (1984) Pollard, C. 1984. ''Generalized Phrase Structure Grammars, Head Grammars, and Natural Language''. Ph.D. thesis, Stanford University, CA. as an extension of the context-f ...
is another example of an LCFRS that is strictly less powerful than the class of LCFRSs as a whole.
LCFRS are weakly equivalent to (set-local) ''multicomponent'' TAGs (
MCTAGs) and also with
multiple context-free grammar (MCFG
.
and
minimalist grammar
Minimalist grammars are a class of formal grammars that aim to provide a more rigorous, usually proof-theoretic, formalization of Chomskyan Minimalist program than is normally provided in the mainstream Minimalist literature. A variety of particula ...
s (MGs). The languages generated by LCFRS (and their weakly equivalents) can be parsed in
polynomial time
In computer science, the time complexity is the computational complexity that describes the amount of computer time it takes to run an algorithm. Time complexity is commonly estimated by counting the number of elementary operations performed by ...
.
See also
*
Range concatenation grammar
References
{{DEFAULTSORT:Generalized Context-Free Grammar
Formal languages
Grammar frameworks