
Standard Average European (SAE) is a concept originally introduced in 1939 by American linguist
Benjamin Whorf to group the modern
Indo-European
The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
languages of Europe
There are over 250 languages indigenous to Europe, and most belong to the Indo-European language family. Out of a demographics of Europe, total European population of 744 million as of 2018, some 94% are native speakers of an Indo-European lang ...
with shared common features. Whorf argued that the SAE
language
Language is a structured system of communication that consists of grammar and vocabulary. It is the primary means by which humans convey meaning, both in spoken and signed language, signed forms, and may also be conveyed through writing syste ...
s were characterized by a number of similarities, including
syntax
In linguistics, syntax ( ) is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure (constituenc ...
and
grammar
In linguistics, grammar is the set of rules for how a natural language is structured, as demonstrated by its speakers or writers. Grammar rules may concern the use of clauses, phrases, and words. The term may also refer to the study of such rul ...
,
vocabulary
A vocabulary (also known as a lexicon) is a set of words, typically the set in a language or the set known to an individual. The word ''vocabulary'' originated from the Latin , meaning "a word, name". It forms an essential component of languag ...
and its use, as well as the relationship between contrasting words and their origins, idioms, and word order, which all made them stand out from many other language groups around the world which do not share these similarities, in essence creating a continental
sprachbund
A sprachbund (, from , 'language federation'), also known as a linguistic area, area of linguistic convergence, or diffusion area, is a group of languages that share areal features resulting from geographical proximity and language contact. Th ...
. His intention was to argue that the disproportionate amount of SAE-specific knowledge in
linguistics
Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds ...
created a substantial SAE-centric bias, leading to
generalization errors, such as mistaking linguistic features idiosyncratic to the SAE
language group for universal tendencies.
Whorf contrasted what he called the ''SAE tense system'' (which contrasts past, present and future tenses) with that of the
Hopi language
Hopi (Hopi: ) is a Uto-Aztecan language spoken by the Hopi people (a Puebloan group) of northeastern Arizona, United States.
The use of Hopi has gradually declined over the course of the 20th century. In 1990, it was estimated that more than ...
of North America, which Whorf analyzed as being based on a distinction not of
tense, but on things that have ''in fact occurred'' (a
realis mood
A realis mood (abbreviated ) is a grammatical mood which is used principally to indicate that something is a statement of fact; in other words, to express what the speaker considers to be a known state of affairs, as in declarative sentences. Most ...
encompassing SAE past and present) compared to things that have ''as'' ''yet not occurred'', but which may or may not occur in the future (
irrealis mood
In linguistics, irrealis moods (abbreviated ) are the main set of grammatical moods that indicate that a certain situation or action is not known to have happened at the moment the speaker is talking. This contrasts with the realis moods. They a ...
). The accuracy of Whorf's analysis of Hopi tense later became
a point of controversy in linguistics.
Overview
Whorf likely considered
Romance and
West Germanic
The West Germanic languages constitute the largest of the three branches of the Germanic languages, Germanic family of languages (the others being the North Germanic languages, North Germanic and the extinct East Germanic languages, East Germ ...
to form the core of the SAE, i.e. the
literary language
Literary language is the Register (sociolinguistics), register of a language used when writing in a formal, academic writing, academic, or particularly polite tone; when speaking or writing in such a tone, it can also be known as formal language. ...
s of
Europe
Europe is a continent located entirely in the Northern Hemisphere and mostly in the Eastern Hemisphere. It is bordered by the Arctic Ocean to the north, the Atlantic Ocean to the west, the Mediterranean Sea to the south, and Asia to the east ...
which have seen substantial cultural influence from
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
during the
medieval period
In the history of Europe, the Middle Ages or medieval period lasted approximately from the 5th to the late 15th centuries, similarly to the post-classical period of World history (field), global history. It began with the fall of the West ...
. The
North Germanic and
Balto-Slavic languages
The Balto-Slavic languages form a branch of the Indo-European languages, Indo-European family of languages, traditionally comprising the Baltic languages, Baltic and Slavic languages. Baltic and Slavic languages share several linguistic traits ...
tend to be more peripheral members.
Alexander Gode, who was instrumental in the development of
Interlingua
Interlingua (, ) is an international auxiliary language (IAL) developed between 1937 and 1951 by the American International Auxiliary Language Association (IALA). It is a constructed language of the "naturalistic" variety, whose vocabulary, ...
, characterized it as "Standard Average European".
The Romance,
Germanic, and
Slavic control languages of Interlingua are reflective of the language groups most often included in the SAE ''Sprachbund''.
As a ''Sprachbund''
According to , the SAE languages form a ''
Sprachbund
A sprachbund (, from , 'language federation'), also known as a linguistic area, area of linguistic convergence, or diffusion area, is a group of languages that share areal features resulting from geographical proximity and language contact. Th ...
'' characterized by the following features, sometimes called "euroversals" by analogy with
linguistic universals:
* definite and indefinite
articles (e.g. English ''the'' vs. ''a''/''an'')
* postnominal
relative clause
A relative clause is a clause that modifies a noun or noun phrase and uses some grammatical device to indicate that one of the arguments in the relative clause refers to the noun or noun phrase. For example, in the sentence ''I met a man who wasn ...
s with inflected
relative pronoun
A relative pronoun is a pronoun that marks a relative clause. An example is the word ''which'' in the sentence "This is the house which Jack built." Here the relative pronoun ''which'' introduces the relative clause. The relative clause modifies th ...
s that signal the role of the head in the clause (e.g. English ''who'' vs. ''whose'')
* a
periphrastic
In linguistics and literature, periphrasis () is the use of a larger number of words, with an implicit comparison to the possibility of using fewer. The comparison may be within a language or between languages. For example, "more happy" is periph ...
perfect formed with 'have' plus a passive participle (e.g. English ''I have said'');
* a preponderance of generalizing
predicates to encode
experiencers, i.e. experiencers appear as surface subjects in
nominative case
In grammar, the nominative case ( abbreviated ), subjective case, straight case, or upright case is one of the grammatical cases of a noun or other part of speech, which generally marks the subject of a verb, or (in Latin and formal variants ...
(e.g. English ''I like music'' instead of ''Music pleases me,'' though compare Italian ''Mi piace la musica'' and German ''Musik gefällt mir,'' which are of the form "Music pleases me")
* a passive construction formed with a passive participle plus an intransitive
copula-like verb (e.g. English ''I am known'');
* a prominence of
anticausative verbs in
inchoative-
causative
In linguistics, a causative (abbreviated ) is a valency-increasing operationPayne, Thomas E. (1997). Describing morphosyntax: A guide for field linguists'' Cambridge: Cambridge University Press. p. 173–186. that indicates that a subject either ...
pairs (''e.g.'', expansion of Latin reflexive ''se'' in its Spanish
reflex
In biology, a reflex, or reflex action, is an involuntary, unplanned sequence or action and nearly instantaneous response to a stimulus.
Reflexes are found with varying levels of complexity in organisms with a nervous system. A reflex occurs ...
to cover anticausative constructions such as ''ubicarse'', "to
hysicallybe
n some specific location" derived from ''ubicar'', "to locate"; ''cf.'' Russian inchoative anticausative ''izmenit’-sja'', "to change (
intransitive
In grammar, an intransitive verb is a verb, aside from an auxiliary verb, whose context does not entail a transitive object. That lack of an object distinguishes intransitive verbs from transitive verbs, which entail one or more objects. Additi ...
)," derived from causative ''izmenit’', "to change
omething make
omethingchange")
*
dative
In grammar, the dative case (abbreviated , or sometimes when it is a core argument) is a grammatical case used in some languages to indicate the recipient or beneficiary of an action, as in "", Latin for "Maria gave Jacob a drink". In this exampl ...
external possessors (e.g. German ''Die Mutter wusch dem Kind die Haare'' "The mother washed the child's hair" (lit. "The mother washed the hair to the child"), Portuguese ''Ela lavou-lhe o cabelo'' "She washed his hair" (lit. "She washed him the hair")
* negative indefinite pronouns without verbal negation (e.g. German ''Niemand kommt'' "nobody comes" vs. Modern Greek "nobody (lit. not) comes")
* particle comparatives in comparisons of inequality (e.g. English ''bigger than an elephant'')
* equative constructions (i.e. constructions for comparison of equality) based on adverbial relative-clause structures, e.g. Occitan ''tan grand coma un elefant'', Russian ''tak že X kak Y'', where ''coma/kak'' (historically coming from the adverbial interrogative pronoun "how") are "adverbial relative pronouns" according to Haspelmath
* subject person affixes as strict
agreement markers, i.e. the verb is inflected for person and number of the subject, but subject pronouns may not be
dropped even when this would be unambiguous (only in some languages, such as German and French)
* differentiation between intensifiers and
reflexive pronoun
A reflexive pronoun is a pronoun that refers to another noun or pronoun (its antecedent) within the same sentence.
In the English language specifically, a reflexive pronoun will end in ''-self'' or ''-selves'', and refer to a previously n ...
s (e.g. German intensifier ''selbst'' vs. reflexive ''sich'')
Besides these features, which are uncommon outside Europe and thus useful for defining the SAE area, Haspelmath (2001) lists further features characteristic of European languages (but also found elsewhere):
* verb-initial order in yes/no questions;
* comparative inflection of adjectives (e.g. English ''bigger'');
* For
conjunctions of
noun phrases
A noun phrase – or NP or nominal (phrase) – is a phrase that usually has a noun or pronoun as its head, and has the same grammatical functions as a noun. Noun phrases are very common cross-linguistically, and they may be the most frequently ...
, SAE languages prefer "A and-B" instead of "A-and B", "A-and B-and", "A B-and", or the
comitative
In grammar, the comitative case (abbreviated ) is a grammatical case that denotes accompaniment. In English, the preposition "with", in the sense of "in company with" or "together with", plays a substantially similar role. Other uses of "with", l ...
"with";
* syncretism of
instrumental
An instrumental or instrumental song is music without any vocals, although it might include some inarticulate vocals, such as shouted backup vocals in a big band setting. Through Semantic change, semantic widening, a broader sense of the word s ...
and
comitative
In grammar, the comitative case (abbreviated ) is a grammatical case that denotes accompaniment. In English, the preposition "with", in the sense of "in company with" or "together with", plays a substantially similar role. Other uses of "with", l ...
cases (e.g. English ''I cut my food with a knife when eating with my friends'');
*
suppletivism in ''second'' vs. ''two'';
* lack of distinction between alienable (e.g. legal property) and
inalienable (e.g. body part) possession;
* lack of distinction between
inclusive and exclusive first-person plural pronouns ("we and you" vs. "we and not you");
* lack of productive usage of
reduplication
In linguistics, reduplication is a Morphology (linguistics), morphological process in which the Root (linguistics), root or Stem (linguistics), stem of a word, part of that, or the whole word is repeated exactly or with a slight change.
The cla ...
;
*
topic and
focus
Focus (: foci or focuses) may refer to:
Arts
* Focus or Focus Festival, former name of the Adelaide Fringe arts festival in East Australia Film
*Focus (2001 film), ''Focus'' (2001 film), a 2001 film based on the Arthur Miller novel
*Focus (2015 ...
expressed by intonation and word order;
* word order
subject–verb–object;
* only one
converb
In theoretical linguistics, a converb ( abbreviated ) is a nonfinite verb form that serves to express adverbial subordination: notions like 'when', 'because', 'after' and 'while'. Other terms that have been used to refer to converbs include ''adv ...
(e.g. English ''-ing'' form, Romance gerunds), preference for finite rather than non-finite subordinate clauses;
* specific construction for negative coordination (e.g. English ''neither...nor...'');
* phasal adverbs (e.g. English ''already'', ''still'', ''not yet'');
* tendency towards replacement of
past tense
The past tense is a grammatical tense whose function is to place an action or situation in the past. Examples of verbs in the past tense include the English verbs ''sang'', ''went'' and ''washed''. Most languages have a past tense, with some hav ...
by the
perfect.
The ''Sprachbund'' defined this way consists of the following languages:
*
Germanic languages
The Germanic languages are a branch of the Indo-European languages, Indo-European language family spoken natively by a population of about 515 million people mainly in Europe, North America, Oceania, and Southern Africa. The most widely spoke ...
;
*
Romance languages
The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
;
*
Baltic languages
The Baltic languages are a branch of the Indo-European languages, Indo-European language family spoken natively or as a second language by a population of about 6.5–7.0 million people ;
*
Slavic languages
The Slavic languages, also known as the Slavonic languages, are Indo-European languages spoken primarily by the Slavs, Slavic peoples and their descendants. They are thought to descend from a proto-language called Proto-Slavic language, Proto- ...
;
*
Albanian;
*
Greek
Greek may refer to:
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group
*Greek language, a branch of the Indo-European language family
**Proto-Greek language, the assumed last common ancestor of all kno ...
;
* the "westernmost"
Finno-Ugric languages
Finno-Ugric () is a traditional linguistic grouping of all languages in the Uralic language family except for the Samoyedic languages. Its once commonly accepted status as a subfamily of Uralic is based on criteria formulated in the 19th centur ...
.
The
Balkan sprachbund is thus included as a subset of the larger SAE, while Baltic Eastern Europe is a coordinate member.
Not all the languages listed above show all the listed features, so membership in SAE can be described as gradient. Based on nine of the above-mentioned common features, Haspelmath regards French and German as forming the ''nucleus'' of the ''Sprachbund'', surrounded by a ''core'' formed by English, the other Romance languages, the Nordic languages, and the Western and Southern Slavic languages. Hungarian, the Baltic languages, the Eastern Slavic languages, and the
Finnic languages
The Finnic or Baltic Finnic languages constitute a branch of the Uralic language family spoken around the Baltic Sea by the Baltic Finnic peoples. There are around 7 million speakers, who live mainly in Finland and Estonia.
Traditionally, ...
form more peripheral groups.
[Haspelmath, Martin, 1998. How young is Standard Average European? ''Language Sciences''.] All languages identified by Haspelmath as core SAE are
Indo-European languages
The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
, except Hungarian and the Finnic languages. However, not all Indo-European languages are SAE languages: the
Celtic
Celtic, Celtics or Keltic may refer to:
Language and ethnicity
*pertaining to Celts, a collection of Indo-European peoples in Europe and Anatolia
**Celts (modern)
*Celtic languages
**Proto-Celtic language
*Celtic music
*Celtic nations
Sports Foot ...
,
Armenian
Armenian may refer to:
* Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia
* Armenians, the national people of Armenia, or people of Armenian descent
** Armenian diaspora, Armenian communities around the ...
, and
Indo-Iranian languages
The Indo-Iranian languages (also known as Indo-Iranic languages or collectively the Aryan languages) constitute the largest branch of the Indo-European language family. They include over 300 languages, spoken by around 1.7 billion speakers ...
remain outside the SAE ''Sprachbund''.
The Standard Average European ''Sprachbund'' is most likely the result of ongoing
language contact
Language contact occurs when speakers of two or more languages or varieties interact with and influence each other. The study of language contact is called contact linguistics. Language contact can occur at language borders, between adstratum ...
beginning in the time of the
Migration Period
The Migration Period ( 300 to 600 AD), also known as the Barbarian Invasions, was a period in European history marked by large-scale migrations that saw the fall of the Western Roman Empire and subsequent settlement of its former territories ...
.
Inheritance of the SAE features from
Proto-Indo-European
Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Euro ...
can be ruled out because Proto-Indo-European, as currently reconstructed, lacked most of the SAE features.
Furthermore, in some cases younger forms of a language do have an SAE feature which attested older forms lack; for example,
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
does not have a periphrastic perfect, but modern
Romance languages
The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
such as Spanish and French do. Much of the area of SAE was at various times part of the
Roman Empire
The Roman Empire ruled the Mediterranean and much of Europe, Western Asia and North Africa. The Roman people, Romans conquered most of this during the Roman Republic, Republic, and it was ruled by emperors following Octavian's assumption of ...
or the vague concept of a political entity called
Christendom
The terms Christendom or Christian world commonly refer to the global Christian community, Christian states, Christian-majority countries or countries in which Christianity is dominant or prevails.SeMerriam-Webster.com : dictionary, "Christen ...
and thus affected by the religious, political and ideological discourse of these entities and their respective
sphere of influence
In the field of international relations, a sphere of influence (SOI) is a spatial region or concept division over which a state or organization has a level of cultural, economic, military, or political exclusivity.
While there may be a formal a ...
. This discourse and long distance communication among elites generally took place in one of the
linguas francas of the era –
Koine Greek
Koine Greek (, ), also variously known as Hellenistic Greek, common Attic, the Alexandrian dialect, Biblical Greek, Septuagint Greek or New Testament Greek, was the koiné language, common supra-regional form of Greek language, Greek spoken and ...
and
Classical Latin
Classical Latin is the form of Literary Latin recognized as a Literary language, literary standard language, standard by writers of the late Roman Republic and early Roman Empire. It formed parallel to Vulgar Latin around 75 BC out of Old Latin ...
in
Late Antiquity
Late antiquity marks the period that comes after the end of classical antiquity and stretches into the onset of the Early Middle Ages. Late antiquity as a period was popularized by Peter Brown (historian), Peter Brown in 1971, and this periodiza ...
,
Medieval Latin
Medieval Latin was the form of Literary Latin used in Roman Catholic Church, Roman Catholic Western Europe during the Middle Ages. It was also the administrative language in the former Western Roman Empire, Roman Provinces of Mauretania, Numidi ...
in the Middle Ages and finally in the modern era
Modern Latin gradually being replaced by vernaculars such as modern French, German and – in the 20th and 21st century – increasingly English. These languages have left
learned borrowings (also known as
inkhorn term
An inkhorn term is a loanword, or a word coined from existing roots, which is deemed to be unnecessary or over-pretentious.
Etymology
An inkhorn is an inkwell made of horn. It was an important item for many scholars, which soon became symb ...
s) in the
prestige
Prestige may refer to:
Arts, entertainment and media Films
*Prestige (film), ''Prestige'' (film), a 1932 American film directed by Tay Garnett: woman travels to French Indochina to meet up with husband
*The Prestige (film), ''The Prestige'' (fi ...
variants of almost all European languages and continue to provide
loanword
A loanword (also a loan word, loan-word) is a word at least partly assimilated from one language (the donor language) into another language (the recipient or target language), through the process of borrowing. Borrowing is a metaphorical term t ...
s,
calque
In linguistics, a calque () or loan translation is a word or phrase borrowed from another language by literal word-for-word or root-for-root translation. When used as a verb, "to calque" means to borrow a word or phrase from another language ...
s and
idiom
An idiom is a phrase or expression that largely or exclusively carries a Literal and figurative language, figurative or non-literal meaning (linguistic), meaning, rather than making any literal sense. Categorized as formulaic speech, formulaic ...
s.
See also
*
Eurolinguistics
*
Languages of Europe
There are over 250 languages indigenous to Europe, and most belong to the Indo-European language family. Out of a demographics of Europe, total European population of 744 million as of 2018, some 94% are native speakers of an Indo-European lang ...
References
Bibliography
*
Grzega, Joachim (ed.). 2025. ''The Routledge Handbook of Eurolinguistics''. Routledge.
* {{Cite work
, chapter=The European linguistic area: Standard Average European
, last1=Haspelmath
, first1=Martin
, date=2001
, author-link=Martin Haspelmath
, chapter-url=https://www.anglistik.uni-freiburg.de/seminar/abteilungen/sprachwissenschaft/ls_kortmann/Courses/Kortmann/Variation/index_html/2008-05-27.8724094854
, archive-url=https://web.archive.org/web/20190304215519/https://www.anglistik.uni-freiburg.de/seminar/abteilungen/sprachwissenschaft/ls_kortmann/Courses/Kortmann/Variation/index_html/2008-05-27.8724094854
, archive-date=4 March 2019
, title=Language Typology and Language Universals
, location=Berlin
, publisher=De Gruyter
, pages=1492–1510
, isbn=9783110171549
, doi=10.1515/9783110194265-044
, volume=2
, url=https://zenodo.org/record/1236769
, editor-last1=Haspelmath
, editor-first1=Martin
, editor-link1=Martin Haspelmath
, editor-last2=König
, editor-first2=Ekkehard
, editor-link2=Ekkehard König
, editor-last3=Oesterreicher
, editor-first3=Wulf
, editor-last4=Raible
, editor-first4=Wolfgang
* Heine, Bernd and Kuteva, Tania. 2006. ''The Changing Languages of Europe''. Oxford University Press.
* Van der Auwera, Johan. 2011. Standard Average European. In: Kortmann, B. & van der Auwera, J. (eds.) The Languages and Linguistics of Europe: A Comprehensive Guide. (pp. 291–306) Berlin: de Gruyter Mouto
Sprachbund
Languages of Europe
Linguistic typology
1939 introductions