An attribute grammar is a formal way to supplement a
formal grammar
In formal language theory, a grammar (when the context is not given, often called a formal grammar for clarity) describes how to form strings from a language's alphabet that are valid according to the language's syntax. A grammar does not describe ...
with semantic information processing. Semantic information is stored in
attributes associated with
terminal and nonterminal symbols of the grammar. The values of attributes are result of attribute evaluation rules associated with productions of the grammar. Attributes allow to transfer information from anywhere in the
abstract syntax tree
In computer science, an abstract syntax tree (AST), or just syntax tree, is a tree representation of the abstract syntactic structure of text (often source code) written in a formal language. Each node of the tree denotes a construct occurring ...
to anywhere else, in a controlled and formal way.
Each semantic function deals with attributes of symbols occurring only in one production rule: both semantic function parameters and its result are attributes of symbols from one particular rule. When a semantic function defines the value of an attribute of the symbol on the left hand side of the rule, the attribute is called ''synthesized''; otherwise it is called ''inherited''. Thus, synthesized attributes serve to pass semantic information up the parse tree, while inherited attributes allow values to be passed from the parent nodes down and across the syntax tree.
In simple applications, such as evaluation of arithmetic expressions, attribute grammar may be used to describe the entire task to be performed besides parsing in straightforward way; in complicated systems, for instance, when constructing a language translation tool, such as a compiler, it may be used to validate semantic checks associated with a grammar, representing the rules of a language not explicitly imparted by the syntax definition. It may be also used by
parser
Parsing, syntax analysis, or syntactic analysis is the process of analyzing a string of symbols, either in natural language, computer languages or data structures, conforming to the rules of a formal grammar. The term ''parsing'' comes from Lati ...
s or
compiler
In computing, a compiler is a computer program that translates computer code written in one programming language (the ''source'' language) into another language (the ''target'' language). The name "compiler" is primarily used for programs that ...
s to translate the syntax tree directly into code for some specific machine, or into some
intermediate language.
History
Attribute grammars were invented by
Donald Knuth
Donald Ervin Knuth ( ; born January 10, 1938) is an American computer scientist, mathematician, and professor emeritus at Stanford University. He is the 1974 recipient of the ACM Turing Award, informally considered the Nobel Prize of computer ...
and
Peter Wegner.
[D. E. Knuth]
The genesis of attribute grammars
''Proceedings of the international conference on Attribute grammars and their applications'' (1990), LNCS
vol. 461
1–12. While Donald Knuth is credited for the overall concept, Peter Wegner invented inherited attributes during a conversation with Knuth. Some embryonic ideas trace back
to the work of Edgar T. "Ned" Irons,
the author of
IMP.
Example
The following is a simple
context-free grammar
In formal language theory, a context-free grammar (CFG) is a formal grammar whose production rules are of the form
:A\ \to\ \alpha
with A a ''single'' nonterminal symbol, and \alpha a string of terminals and/or nonterminals (\alpha can be ...
which can describe a language made up of multiplication and addition of integers.
Expr → Expr + Term
Expr → Term
Term → Term * Factor
Term → Factor
Factor → "(" Expr ")"
Factor → ''integer''
The following attribute grammar can be used to calculate the result of an expression written in the grammar. Note that this grammar only uses synthesized values, and is therefore an
S-attributed grammar
S-attributed grammars are a class of attribute grammars characterized by having no inherited attributes, but only synthesized attributes. Inherited attributes, which must be passed down from parent nodes to children nodes of the abstract synta ...
.
Expr
1 → Expr
2 + Term
1.value = Expr2.value + Term.value ">Expr1.value = Expr2.value + Term.value Expr → Term
Expr.value = Term.value Term
1 → Term
2 * Factor
1.value = Term2.value * Factor.value ">Term1.value = Term2.value * Factor.value Term → Factor
Term.value = Factor.value Factor → "(" Expr ")"
Factor.value = Expr.value Factor → ''integer''
Factor.value = strToInt(''integer''.str)
Synthesized attributes
A synthesized attribute is computed from the values of attributes of the children. Since the values of the children must be computed first, this is an example of bottom-up propagation. To formally define a synthesized attribute, let
be a formal grammar, where
*
is the set of non terminal symbols
*
is the set of terminal symbols
*
is the set of
productions
*
is the distinguished, or start, symbol
Then, given a string of nonterminal symbols
and an attribute name
,
is a synthesized attribute if all three of these conditions are met:
*
(i.e.
is one of the rules in the grammar)
*
(i.e. every symbol in the body of the rule is either nonterminal or terminal)
*
, where
(i.e. the value of the attribute is a function
applied to some values from the symbols in the body of the rule)
Inherited attributes
An ''inherited attribute'' at a node in parse tree is defined using the attribute values at the parent or siblings. Inherited attributes are convenient for expressing the dependence of a programming language construct on the context in which it appears. For example, we can use an inherited attribute to keep track of whether an identifier appears on the left or the right side of an assignment in order to decide whether the address or the value of the identifier is needed. In contrast to synthesized attributes, inherited attributes can take values from parent and/or siblings. As in the following production,
: S → ABC
where A can get values from S, B, and C. B can take values from S, A, and C. Likewise, C can take values from S, A, and B.
Special types of attribute grammars
*
L-attributed grammar
L-attributed grammars are a special type of attribute grammars. They allow the attributes to be evaluated in one depth-first left-to-right traversal of the abstract syntax tree. As a result, attribute evaluation in L-attributed grammars can be inco ...
: ''inherited attributes'' can be evaluated in one left-to-right traversal of the abstract syntax tree
*
LR-attributed grammar: an L-attributed grammar whose ''inherited attributes'' can also be evaluated in
bottom-up parsing.
*
ECLR-attributed grammar: a subset of LR-attributed grammars where equivalence classes can be used to optimize the evaluation of inherited attributes.
*
S-attributed grammar
S-attributed grammars are a class of attribute grammars characterized by having no inherited attributes, but only synthesized attributes. Inherited attributes, which must be passed down from parent nodes to children nodes of the abstract synta ...
: a simple type of attribute grammar, using only ''synthesized attributes'', but no ''inherited attributes''
See also
*
Affix grammar
*
Van Wijngaarden grammar
*
Syntax-directed translation
References
* Original paper introducing attributed grammars: {{Cite journal
, url = https://www.csee.umbc.edu/courses/331/fall16/01/resources/papers/Knuth67AG.pdf
, title = Semantics of context-free languages
, first = Donald E.
, surname = Knuth
, year = 1968
, authorlink = Donald Knuth
, volume = 2
, number = 2
, journal = Mathematical Systems Theory
, pages = 127–145
, doi = 10.1007/BF01692511
, s2cid = 5182310
backup
External links
Why Attribute Grammars Matter The Monad Reader, Issue 4, July 5, 2005. (This article narrates on how the formalism of attribute grammars brings
aspect-oriented programming
In computing, aspect-oriented programming (AOP) is a programming paradigm that aims to increase modularity by allowing the separation of cross-cutting concerns. It does so by adding behavior to existing code (an advice) ''without'' modifying ...
to
functional programming
In computer science, functional programming is a programming paradigm where programs are constructed by applying and composing functions. It is a declarative programming paradigm in which function definitions are trees of expressions that ...
by helping writing
catamorphisms
compositionally. It refers to th
Utrecht University Attribute Grammarsystem (see als
as the implementation used in the examples.)
Attribute grammarin relation to
Haskell and
functional programming
In computer science, functional programming is a programming paradigm where programs are constructed by applying and composing functions. It is a declarative programming paradigm in which function definitions are trees of expressions that ...
.
* Jukka Paakki
Attribute grammar paradigms—a high-level methodology in language implementation ''ACM Computing Surveys'' 27:2 (June 1995), 196–255.
Silveris an extensible attribute grammar specification language and system from University of Minnesota. (See also th
GitHub repository)
Formal languages
Compiler construction
Parsing