Pregroup grammar (PG) is a
grammar formalism
A formal grammar is a set of Terminal and nonterminal symbols, symbols and the Production (computer science), production rules for rewriting some of them into every possible string of a formal language over an Alphabet (formal languages), alphabe ...
intimately related to
categorial grammars. Much like categorial grammar (CG), PG is a kind of
type logical grammar. Unlike CG, however, PG does not have a distinguished function type. Rather, PG uses inverse types combined with its monoidal operation.
Definition of a pregroup
A pregroup is a
partially ordered algebra
Algebra is a branch of mathematics that deals with abstract systems, known as algebraic structures, and the manipulation of expressions within those systems. It is a generalization of arithmetic that introduces variables and algebraic ope ...
such that
is a
monoid
In abstract algebra, a monoid is a set equipped with an associative binary operation and an identity element. For example, the nonnegative integers with addition form a monoid, the identity element being .
Monoids are semigroups with identity ...
, satisfying the following relations:
*
(contraction)
*
(expansion)
The contraction and expansion relations are sometimes called
Ajdukiewicz laws.
From this, it can be proven that the following equations hold:
*
*
*
and
are called the
left
Left may refer to:
Music
* ''Left'' (Hope of the States album), 2006
* ''Left'' (Monkey House album), 2016
* ''Left'' (Helmet album), 2023
* "Left", a song by Nickelback from the album ''Curb'', 1996
Direction
* Left (direction), the relativ ...
and
right adjoint
In mathematics, specifically category theory, adjunction is a relationship that two functors may exhibit, intuitively corresponding to a weak form of equivalence between two related categories. Two functors that stand in this relationship are k ...
s of ''x'', respectively.
The symbols
and
are also written
and
respectively. In
category theory
Category theory is a general theory of mathematical structures and their relations. It was introduced by Samuel Eilenberg and Saunders Mac Lane in the middle of the 20th century in their foundational work on algebraic topology. Category theory ...
, pregroups are also known as
autonomous categories or (non-symmetric)
compact closed categories. More typically,
will just be represented by adjacency, i.e. as
.
Definition of a pregroup grammar
A pregroup grammar consists of a
lexicon
A lexicon (plural: lexicons, rarely lexica) is the vocabulary of a language or branch of knowledge (such as nautical or medical). In linguistics, a lexicon is a language's inventory of lexemes. The word ''lexicon'' derives from Greek word () ...
of words (and possibly
morphemes
A morpheme is any of the smallest meaningful constituents within a linguistic expression and particularly within a word. Many words are themselves standalone morphemes, while other words contain multiple morphemes; in linguistic terminology, this ...
) ''L'', a set of atomic types ''T'' which
freely generates a pregroup, and a relation
that relates words to types. In simple pregroup grammars, typing is a function that maps words to only one type each.
Examples
Some simple, intuitive examples using English as the language to model demonstrate the core principles behind pregroups and their use in linguistic domains.
Let ''L'' = , let ''T'' = , and let the following typing relation hold:
:
:
A sentence ''S'' that has type ''T'' is said to be grammatical if
. We can prove this by use of a chain of
. For example, we can prove that
is grammatical by proving that
:
:
by first using contraction on
and then again on
. A more convenient notation exists, however, that indicates contractions by connecting them with a drawn link between the contracting types (provided that the links are nested, i.e. don't cross). Words are also typically placed above their types to make the proof more intuitive. The same proof in this notation is simply

A more complex example proves that ''the dog barked at the cat'' is grammatical:
Historical notes
Pregroup grammars were introduced by
Joachim Lambek
Joachim "Jim" Lambek (5 December 1922 – 23 June 2014) was a Canadian mathematician. He was Peter Redpath Emeritus Professor of Pure Mathematics at McGill University, where he earned his PhD degree in 1950 with Hans Zassenhaus as advisor.
B ...
in 1993 as a development of his
syntactic calculus, replacing the quotients by adjoints. Such adjoints had already been used earlier by
Harris
Harris may refer to:
Places Canada
* Harris, Ontario
* Northland Pyrite Mine (also known as Harris Mine)
* Harris, Saskatchewan
* Rural Municipality of Harris No. 316, Saskatchewan
Scotland
* Harris, Outer Hebrides (sometimes called the Isle ...
but without iterated adjoints and expansion rules.
Adding such adjoints was interesting to handle more complex linguistic cases, where the fact that
is needed. It was also motivated by a more algebraic viewpoint: the definition of a pregroup is a weakening of that of a
group
A group is a number of persons or things that are located, gathered, or classed together.
Groups of people
* Cultural group, a group whose members share the same cultural identity
* Ethnic group, a group whose members share the same ethnic iden ...
, introducing a distinction between the left and right inverses and replacing the equality by an order. This weakening was needed because using types from a
free group
In mathematics, the free group ''F'S'' over a given set ''S'' consists of all words that can be built from members of ''S'', considering two words to be different unless their equality follows from the group axioms (e.g. ''st'' = ''suu''− ...
would not work: an adjective would get the type
, hence it could be inserted at any position in the sentence.
Pregroup grammars have then been defined and studied for various languages (or fragments of them) including
English,
Italian
Italian(s) may refer to:
* Anything of, from, or related to the people of Italy over the centuries
** Italians, a Romance ethnic group related to or simply a citizen of the Italian Republic or Italian Kingdom
** Italian language, a Romance languag ...
,
French,
Persian
Persian may refer to:
* People and things from Iran, historically called ''Persia'' in the English language
** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples
** Persian language, an Iranian language of the ...
and
Sanskrit
Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
.
Languages with a relatively free word order such as Sanskrit required to introduce commutation relations to the pregroup, using precyclicity.
Semantics of pregroup grammars
Because of the lack of function types in PG, the usual method of giving a semantics via the
λ-calculus
In mathematical logic, the lambda calculus (also written as ''λ''-calculus) is a formal system for expressing computation based on function abstraction and application using variable binding and substitution. Untyped lambda calculus, the topic ...
or via function denotations is not available in any obvious way. Instead, two different methods exist, one purely formal method that corresponds to the λ-calculus, and one denotational method analogous to (a fragment of) the tensor mathematics of
quantum mechanics
Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
.
Purely formal semantics
The purely formal semantics for PG consists of a logical language defined according to the following rules:
* Given a set of atomic terms ''T'' = and atomic function symbols ''F'' = (where subscripts are meta-notational indicating arity), and variables ''x'', ''y'', ..., all constants, variables, and well-formed function applications are basic terms (a function application is well-formed when the function symbol is applied to the appropriate number of arguments, which can be drawn from the atomic terms, variables, or can be other basic terms)
* Any basic term is a term
* Given any variable ''x'',
'x''is a term
* Given any terms ''m'' and ''n'',
is a term
Some examples of terms are ''f''(''x''), ''g''(''a'',''h''(''x'',''y'')),