Pregroup grammar (PG) is a

grammar formalism A formal grammar is a set of Terminal and nonterminal symbols, symbols and the Production (computer science), production rules for rewriting some of them into every possible string of a formal language over an Alphabet (formal languages), alphabe ...

intimately related to categorial grammars. Much like categorial grammar (CG), PG is a kind of type logical grammar. Unlike CG, however, PG does not have a distinguished function type. Rather, PG uses inverse types combined with its monoidal operation.

Definition of a pregroup

A pregroup is a partially ordered

algebra Algebra is a branch of mathematics that deals with abstract systems, known as algebraic structures, and the manipulation of expressions within those systems. It is a generalization of arithmetic that introduces variables and algebraic ope ...

(A, 1, \cdot, -^l, -^r, \leq)

such that

(A, 1, \cdot)

is a

monoid In abstract algebra, a monoid is a set equipped with an associative binary operation and an identity element. For example, the nonnegative integers with addition form a monoid, the identity element being . Monoids are semigroups with identity ...

, satisfying the following relations: *

x^l \cdot x \leq 1 \qquad x \cdot x^r \leq 1

(contraction) *

1 \leq x \cdot x^l \qquad 1 \leq x^r \cdot x

(expansion) The contraction and expansion relations are sometimes called Ajdukiewicz laws. From this, it can be proven that the following equations hold: *

1^l = 1 = 1^r

x^ = x = x^

(x\cdot y)^l = y^l \cdot x^l \qquad (x\cdot y)^r = y^r \cdot x^r

x^l

and

x^r

are called the

left Left may refer to: Music * ''Left'' (Hope of the States album), 2006 * ''Left'' (Monkey House album), 2016 * ''Left'' (Helmet album), 2023 * "Left", a song by Nickelback from the album ''Curb'', 1996 Direction * Left (direction), the relativ ...

and

right adjoint In mathematics, specifically category theory, adjunction is a relationship that two functors may exhibit, intuitively corresponding to a weak form of equivalence between two related categories. Two functors that stand in this relationship are k ...

s of ''x'', respectively. The symbols

\cdot

and

\leq

are also written

\otimes

and

\to

respectively. In

category theory Category theory is a general theory of mathematical structures and their relations. It was introduced by Samuel Eilenberg and Saunders Mac Lane in the middle of the 20th century in their foundational work on algebraic topology. Category theory ...

, pregroups are also known as autonomous categories or (non-symmetric) compact closed categories. More typically,

x \cdot y

will just be represented by adjacency, i.e. as

xy

Definition of a pregroup grammar

A pregroup grammar consists of a

lexicon A lexicon (plural: lexicons, rarely lexica) is the vocabulary of a language or branch of knowledge (such as nautical or medical). In linguistics, a lexicon is a language's inventory of lexemes. The word ''lexicon'' derives from Greek word () ...

of words (and possibly

morphemes A morpheme is any of the smallest meaningful constituents within a linguistic expression and particularly within a word. Many words are themselves standalone morphemes, while other words contain multiple morphemes; in linguistic terminology, this ...

) ''L'', a set of atomic types ''T'' which freely generates a pregroup, and a relation

:

that relates words to types. In simple pregroup grammars, typing is a function that maps words to only one type each.

Examples

Some simple, intuitive examples using English as the language to model demonstrate the core principles behind pregroups and their use in linguistic domains. Let ''L'' = , let ''T'' = , and let the following typing relation hold: :

\textit : N \qquad \textit : N \qquad \textit : N \cdot N_0^l \qquad \textit : N_0 \qquad \textit : N_0

\textit : N^r \cdot S \cdot N^l \qquad \textit : N^r \cdot S \qquad \textit : S^r \cdot N^ \cdot N^r \cdot S \cdot N^l

A sentence ''S'' that has type ''T'' is said to be grammatical if

T \leq S

. We can prove this by use of a chain of

\leq

. For example, we can prove that

\textit\ \textit\ \textit : N \cdot N^r \cdot S \cdot N^l \cdot N

is grammatical by proving that

N \cdot N^r \cdot S \cdot N^l \cdot N \leq S

: :

N \cdot N^r \cdot S \cdot N^l \cdot N ~\leq~ S \cdot N^l \cdot N ~\leq~ S

by first using contraction on

N \cdot N^r

and then again on

N^l \cdot N

. A more convenient notation exists, however, that indicates contractions by connecting them with a drawn link between the contracting types (provided that the links are nested, i.e. don't cross). Words are also typically placed above their types to make the proof more intuitive. The same proof in this notation is simply PregroupGrammar-Example-JohnMetMary

A more complex example proves that ''the dog barked at the cat'' is grammatical: PregroupGrammar-Example-TheDogBarkedAtTheCat

Historical notes

Pregroup grammars were introduced by

Joachim Lambek Joachim "Jim" Lambek (5 December 1922 – 23 June 2014) was a Canadian mathematician. He was Peter Redpath Emeritus Professor of Pure Mathematics at McGill University, where he earned his PhD degree in 1950 with Hans Zassenhaus as advisor. B ...

in 1993 as a development of his syntactic calculus, replacing the quotients by adjoints. Such adjoints had already been used earlier by

Harris Harris may refer to: Places Canada * Harris, Ontario * Northland Pyrite Mine (also known as Harris Mine) * Harris, Saskatchewan * Rural Municipality of Harris No. 316, Saskatchewan Scotland * Harris, Outer Hebrides (sometimes called the Isle ...

but without iterated adjoints and expansion rules. Adding such adjoints was interesting to handle more complex linguistic cases, where the fact that

a^ \neq a

is needed. It was also motivated by a more algebraic viewpoint: the definition of a pregroup is a weakening of that of a

group A group is a number of persons or things that are located, gathered, or classed together. Groups of people * Cultural group, a group whose members share the same cultural identity * Ethnic group, a group whose members share the same ethnic iden ...

, introducing a distinction between the left and right inverses and replacing the equality by an order. This weakening was needed because using types from a

free group In mathematics, the free group ''F'S'' over a given set ''S'' consists of all words that can be built from members of ''S'', considering two words to be different unless their equality follows from the group axioms (e.g. ''st'' = ''suu''− ...

would not work: an adjective would get the type

N \cdot N^ = 1

, hence it could be inserted at any position in the sentence. Pregroup grammars have then been defined and studied for various languages (or fragments of them) including English,

Italian Italian(s) may refer to: * Anything of, from, or related to the people of Italy over the centuries ** Italians, a Romance ethnic group related to or simply a citizen of the Italian Republic or Italian Kingdom ** Italian language, a Romance languag ...

, French,

Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...

and

Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...

. Languages with a relatively free word order such as Sanskrit required to introduce commutation relations to the pregroup, using precyclicity.

Semantics of pregroup grammars

Because of the lack of function types in PG, the usual method of giving a semantics via the

λ-calculus In mathematical logic, the lambda calculus (also written as ''λ''-calculus) is a formal system for expressing computation based on function abstraction and application using variable binding and substitution. Untyped lambda calculus, the topic ...

or via function denotations is not available in any obvious way. Instead, two different methods exist, one purely formal method that corresponds to the λ-calculus, and one denotational method analogous to (a fragment of) the tensor mathematics of

quantum mechanics Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...

Purely formal semantics

The purely formal semantics for PG consists of a logical language defined according to the following rules: * Given a set of atomic terms ''T'' = and atomic function symbols ''F'' = (where subscripts are meta-notational indicating arity), and variables ''x'', ''y'', ..., all constants, variables, and well-formed function applications are basic terms (a function application is well-formed when the function symbol is applied to the appropriate number of arguments, which can be drawn from the atomic terms, variables, or can be other basic terms) * Any basic term is a term * Given any variable ''x'', 'x''is a term * Given any terms ''m'' and ''n'',