HOME

TheInfoList



OR:

The Indo-European languages are a
language family A language family is a group of languages related through descent from a common ancestor, called the proto-language of that family. The term ''family'' is a metaphor borrowed from biology, with the tree model used in historical linguistics ...
native to the northern
Indian subcontinent The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
, most of
Europe Europe is a continent located entirely in the Northern Hemisphere and mostly in the Eastern Hemisphere. It is bordered by the Arctic Ocean to the north, the Atlantic Ocean to the west, the Mediterranean Sea to the south, and Asia to the east ...
, and the Iranian plateau with additional native branches found in regions such as
Sri Lanka Sri Lanka, officially the Democratic Socialist Republic of Sri Lanka, also known historically as Ceylon, is an island country in South Asia. It lies in the Indian Ocean, southwest of the Bay of Bengal, separated from the Indian subcontinent, ...
, the Maldives, parts of Central Asia (e.g.,
Tajikistan Tajikistan, officially the Republic of Tajikistan, is a landlocked country in Central Asia. Dushanbe is the capital city, capital and most populous city. Tajikistan borders Afghanistan to the Afghanistan–Tajikistan border, south, Uzbekistan to ...
and
Afghanistan Afghanistan, officially the Islamic Emirate of Afghanistan, is a landlocked country located at the crossroads of Central Asia and South Asia. It is bordered by Pakistan to the Durand Line, east and south, Iran to the Afghanistan–Iran borde ...
),
Armenia Armenia, officially the Republic of Armenia, is a landlocked country in the Armenian Highlands of West Asia. It is a part of the Caucasus region and is bordered by Turkey to the west, Georgia (country), Georgia to the north and Azerbaijan to ...
, and areas of southern India. Historically, Indo-European languages were also spoken in
Anatolia Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
. Some European languages of this family— English, French, Portuguese, Russian, Spanish, and Dutch—have expanded through
colonialism Colonialism is the control of another territory, natural resources and people by a foreign group. Colonizers control the political and tribal power of the colonised territory. While frequently an Imperialism, imperialist project, colonialism c ...
in the modern period and are now spoken across several continents. The Indo-European family is divided into several branches or sub-families, including Albanian, Armenian, Balto-Slavic, Celtic, Germanic, Hellenic, Indo-Iranian, and Italic, all of which contain present-day living languages, as well as many more
extinct Extinction is the termination of an organism by the death of its Endling, last member. A taxon may become Functional extinction, functionally extinct before the death of its last member if it loses the capacity to Reproduction, reproduce and ...
branches. Today, the individual Indo-European languages with the most native speakers are English, Spanish, Portuguese, Russian, Hindustani, Bengali, French, and German; many others spoken by smaller groups are in danger of extinction. Over 3.4 billion people (42% of the global population) speak an Indo-European language as a
first language A first language (L1), native language, native tongue, or mother tongue is the first language a person has been exposed to from birth or within the critical period hypothesis, critical period. In some countries, the term ''native language'' ...
—by far the most of any language family. There are about 446 living Indo-European languages, according to an estimate by '' Ethnologue'', of which 313 belong to the Indo-Iranian branch. All Indo-European languages are descended from a single prehistoric language, linguistically reconstructed as
Proto-Indo-European Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Euro ...
, spoken sometime during the
Neolithic The Neolithic or New Stone Age (from Ancient Greek, Greek 'new' and 'stone') is an archaeological period, the final division of the Stone Age in Mesopotamia, Asia, Europe and Africa (c. 10,000 BCE to c. 2,000 BCE). It saw the Neolithic Revo ...
or early
Bronze Age The Bronze Age () was a historical period characterised principally by the use of bronze tools and the development of complex urban societies, as well as the adoption of writing in some areas. The Bronze Age is the middle principal period of ...
(). The geographical location where it was spoken, the Proto-Indo-European homeland, has been the object of many competing hypotheses; the academic consensus supports the Kurgan hypothesis, which posits the homeland to be the Pontic–Caspian steppe in what is now
Ukraine Ukraine is a country in Eastern Europe. It is the List of European countries by area, second-largest country in Europe after Russia, which Russia–Ukraine border, borders it to the east and northeast. Ukraine also borders Belarus to the nor ...
and Southern Russia, associated with the Yamnaya culture and other related archaeological cultures during the 4th and early 3rd millennia BC. By the time the first written records appeared, Indo-European had already evolved into numerous languages spoken across much of Europe,
South Asia South Asia is the southern Subregion#Asia, subregion of Asia that is defined in both geographical and Ethnicity, ethnic-Culture, cultural terms. South Asia, with a population of 2.04 billion, contains a quarter (25%) of the world's populatio ...
, and part of Western Asia. Written evidence of Indo-European appeared during the Bronze Age in the form of
Mycenaean Greek Mycenaean Greek is the earliest attested form of the Greek language. It was spoken on the Greek mainland and Crete in Mycenaean Greece (16th to 12th centuries BC). The language is preserved in inscriptions in Linear B, a script first atteste ...
and the Anatolian languages of Hittite and Luwian. The oldest records are isolated Hittite words and names—interspersed in texts that are otherwise in the unrelated Akkadian language, a Semitic language—found in texts of the Assyrian colony of Kültepe in eastern
Anatolia Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
dating to the 20th century BC. Although no older written records of the original Proto-Indo-European population remain, some aspects of their culture and their religion can be reconstructed from later evidence in the daughter cultures. The Indo-European family is significant to the field of
historical linguistics Historical linguistics, also known as diachronic linguistics, is the scientific study of how languages change over time. It seeks to understand the nature and causes of linguistic change and to trace the evolution of languages. Historical li ...
as it possesses the second-longest
recorded history Recorded history or written history describes the historical events that have been recorded in a written form or other documented communication which are subsequently evaluated by historians using the historical method. For broader world h ...
of any known family after Egyptian and the
Semitic languages The Semitic languages are a branch of the Afroasiatic languages, Afroasiatic language family. They include Arabic, Amharic, Tigrinya language, Tigrinya, Aramaic, Hebrew language, Hebrew, Maltese language, Maltese, Modern South Arabian language ...
, which belong to the Afroasiatic language family. The analysis of the family relationships between the Indo-European languages, and the reconstruction of their common source, was central to the development of the methodology of historical linguistics as an academic discipline in the 19th century. The Indo-European language family is not considered by the current academic consensus in the field of linguistics to have any genetic relationships with other language families, although several disputed hypotheses propose such relations.


History of Indo-European linguistics

During the 16th century, European visitors to the
Indian subcontinent The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
began to notice similarities among Indo-Aryan, Iranian, and European languages. In 1583, English
Jesuit The Society of Jesus (; abbreviation: S.J. or SJ), also known as the Jesuit Order or the Jesuits ( ; ), is a religious order (Catholic), religious order of clerics regular of pontifical right for men in the Catholic Church headquartered in Rom ...
missionary and
Konkani __NOTOC__ Konkani may refer to: Language * Konkani language is an Indo-Aryan language spoken in the Konkan region of India. * Konkani alphabets, different scripts used to write the language **Konkani in the Roman script, one of the scripts used to ...
scholar Thomas Stephens wrote a letter from Goa to his brother (not published until the 20th century) in which he noted similarities between Indian languages and Greek and
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
. Another account was made by Filippo Sassetti, a merchant born in
Florence Florence ( ; ) is the capital city of the Italy, Italian region of Tuscany. It is also the most populated city in Tuscany, with 362,353 inhabitants, and 989,460 in Metropolitan City of Florence, its metropolitan province as of 2025. Florence ...
in 1540, who travelled to the Indian subcontinent. Writing in 1585, he noted some word similarities between
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
and Italian (these included ''devaḥ''/''dio'' 'God', ''sarpaḥ''/''serpe'' 'serpent', ''sapta''/''sette'' 'seven', ''aṣṭa''/''otto'' 'eight', and ''nava''/''nove'' 'nine'). However, neither Stephens' nor Sassetti's observations led to further scholarly inquiry. In 1647, Dutch linguist and scholar Marcus Zuerius van Boxhorn noted the similarity among certain Asian and European languages and theorized that they were derived from a primitive common language that he called Scythian. He included in his hypothesis Dutch, Albanian, Greek,
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, Persian, and German, later adding Slavic, Celtic, and Baltic languages. However, Van Boxhorn's suggestions did not become widely known and did not stimulate further research. Ottoman Turkish traveller Evliya Çelebi visited Vienna in 1665–1666 as part of a diplomatic mission and noted a few similarities between words in German and in Persian. Gaston Coeurdoux and others made observations of the same type. Coeurdoux made a thorough comparison of Sanskrit, Latin, and Greek conjugations in the late 1760s to suggest a relationship among them. Meanwhile, Mikhail Lomonosov compared different language groups, including Slavic, Baltic (" Kurlandic"), Iranian (" Medic"), Finnish, Chinese, "Hottentot" ( Khoekhoe), and others, noting that related languages (including Latin, Greek, German, and Russian) must have separated in antiquity from common ancestors.M. V. Lomonosov (drafts for ''Russian Grammar'', published 1755). In: Complete Edition, Moscow, 1952, vol. 7, pp. 652–659
: Представимъ долготу времени, которою сіи языки раздѣлились. ... Польской и россійской языкъ коль давно раздѣлились! Подумай же, когда курляндской! Подумай же, когда латинской, греч., нѣм., росс. О глубокая древность! magine the depth of time when these languages separated! ... Polish and Russian separated so long ago! Now think how long ago [this happened toKurlandic! Think when [this happened to] Latin, Greek, German, and Russian! Oh, great antiquity!]
The hypothesis reappeared in 1786 when Sir William Jones first lectured on the striking similarities among three of the oldest languages known in his time:
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, Greek, and
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
, to which he tentatively added Gothic, Celtic, and Persian, though his classification contained some inaccuracies and omissions. In one of the most famous quotations in linguistics, Jones made the following prescient statement in a lecture to the Asiatic Society of Bengal in 1786, conjecturing the existence of an earlier ancestor language, which he called "a common source" but did not name: Thomas Young first used the term ''Indo-European'' in 1813, deriving it from the geographical extremes of the language family: from
Western Europe Western Europe is the western region of Europe. The region's extent varies depending on context. The concept of "the West" appeared in Europe in juxtaposition to "the East" and originally applied to the Western half of the ancient Mediterranean ...
to North India. A synonym is Indo-Germanic (''Idg.'' or ''IdG.''), specifying the family's southeasternmost and northwesternmost branches. This first appeared in French (''indo-germanique'') in 1810 in the work of Conrad Malte-Brun; in most languages this term is now dated or less common than ''Indo-European'', although in German ''indogermanisch'' remains the standard scientific term. A number of other synonymous terms have also been used. Franz Bopp wrote in 1816 ''On the conjugational system of the Sanskrit language compared with that of Greek, Latin, Persian and Germanic'' and between 1833 and 1852 he wrote ''Comparative Grammar''. This marks the beginning of Indo-European studies as an academic discipline. The classical phase of Indo-European comparative linguistics leads from this work to August Schleicher's 1861 ''Compendium'' and up to Karl Brugmann's '' Grundriss'', published in the 1880s. Brugmann's neogrammarian reevaluation of the field and Ferdinand de Saussure's development of the laryngeal theory may be considered the beginning of "modern" Indo-European studies. The generation of Indo-Europeanists active in the last third of the 20th century (such as Calvert Watkins, Jochem Schindler, and Helmut Rix) developed a better understanding of morphology and of ablaut in the wake of Kuryłowicz's 1956 ''Apophony in Indo-European'', who in 1927 pointed out the existence of the Hittite consonant ḫ. Kuryłowicz's discovery supported Ferdinand de Saussure's 1879 proposal of the existence of , elements de Saussure reconstructed to account for vowel length alternations in Indo-European languages. This led to the so-called laryngeal theory, a major step forward in Indo-European linguistics and a confirmation of de Saussure's theory.


Classification

The various subgroups of the Indo-European language family include ten major branches, listed below in alphabetical order: * Albanian, attested from the 13th century; Proto-Albanian evolved from an ancient Paleo-Balkan language, traditionally thought to be Illyrian, or otherwise a totally unattested Balkan Indo-European language that was closely related to Illyrian and Messapic. * Anatolian, extinct by
Late Antiquity Late antiquity marks the period that comes after the end of classical antiquity and stretches into the onset of the Early Middle Ages. Late antiquity as a period was popularized by Peter Brown (historian), Peter Brown in 1971, and this periodiza ...
, spoken in
Anatolia Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
, attested in isolated terms in Luwian/ Hittite mentioned in Semitic Old Assyrian texts from the 20th and 19th centuries BC, Hittite texts from about 1650 BC. * Armenian, attested from the early 5th century AD. It evolved from the Proto-Armenian language which, according to the Armenian hypothesis, developed ''in situ'' from the
Proto-Indo-European language Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Eu ...
of the 3rd millennium BC. * Balto-Slavic, believed by most Indo-Europeanists to form a phylogenetic unit, while a minority ascribes similarities to prolonged language-contact. ** Slavic (from Proto-Slavic), attested from the 9th century AD ( possibly earlier), earliest texts in
Old Church Slavonic Old Church Slavonic or Old Slavonic ( ) is the first Slavic languages, Slavic literary language and the oldest extant written Slavonic language attested in literary sources. It belongs to the South Slavic languages, South Slavic subgroup of the ...
. Slavic languages include Bulgarian, Russian, Polish, Czech, Slovak, Silesian, Kashubian, Macedonian,
Serbo-Croatian Serbo-Croatian ( / ), also known as Bosnian-Croatian-Montenegrin-Serbian (BCMS), is a South Slavic language and the primary language of Serbia, Croatia, Bosnia and Herzegovina, and Montenegro. It is a pluricentric language with four mutually i ...
( Bosnian, Croatian, Montenegrin, Serbian), Sorbian, Slovenian, Ukrainian, Belarusian, and Rusyn. ** Baltic, attested from the 14th century; although attested relatively recently, they retain many archaic features attributed to
Proto-Indo-European Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Euro ...
(PIE). Living examples are Lithuanian and Latvian. * Celtic (from
Proto-Celtic Proto-Celtic, or Common Celtic, is the hypothetical ancestral proto-language of all known Celtic languages, and a descendant of Proto-Indo-European. It is not attested in writing but has been partly Linguistic reconstruction, reconstructed throu ...
), attested since the 6th century BC; Lepontic inscriptions date as early as the 6th century BC; Celtiberian from the 2nd century BC; Primitive Irish Ogham inscriptions from the 4th or 5th century AD, earliest inscriptions in
Old Welsh Old Welsh () is the stage of the Welsh language from about 800 AD until the early 12th century when it developed into Middle Welsh.Koch, p. 1757. The preceding period, from the time Welsh became distinct from Common Brittonic around 550, ha ...
from the 7th century AD. Modern Celtic languages include Welsh, Cornish, Breton,
Scottish Gaelic Scottish Gaelic (, ; Endonym and exonym, endonym: ), also known as Scots Gaelic or simply Gaelic, is a Celtic language native to the Gaels of Scotland. As a member of the Goidelic language, Goidelic branch of Celtic, Scottish Gaelic, alongs ...
, Irish and Manx. * Germanic (from
Proto-Germanic Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the linguistic reconstruction, reconstructed proto-language of the Germanic languages, Germanic branch of the Indo-European languages. Proto-Germanic eventually developed from ...
), earliest attestations in runic inscriptions from around the 2nd century AD, earliest coherent texts in Gothic, 4th century AD.
Old English Old English ( or , or ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the Early Middle Ages. It developed from the languages brought to Great Britain by Anglo-S ...
manuscript tradition from about the 8th century AD. Includes English, Frisian, German, Dutch, Scots, Danish, Swedish, Norwegian,
Afrikaans Afrikaans is a West Germanic languages, West Germanic language spoken in South Africa, Namibia and to a lesser extent Botswana, Zambia, Zimbabwe and also Argentina where there is a group in Sarmiento, Chubut, Sarmiento that speaks the Pat ...
,
Yiddish Yiddish, historically Judeo-German, is a West Germanic language historically spoken by Ashkenazi Jews. It originated in 9th-century Central Europe, and provided the nascent Ashkenazi community with a vernacular based on High German fused with ...
,
Low German Low German is a West Germanic languages, West Germanic language variety, language spoken mainly in Northern Germany and the northeastern Netherlands. The dialect of Plautdietsch is also spoken in the Russian Mennonite diaspora worldwide. "Low" ...
, Icelandic, Elfdalian, and Faroese. * Hellenic (from Proto-Greek, see also History of Greek); fragmentary records in Mycenaean Greek from between 1450 and 1350 BC have been found.
Homer Homer (; , ; possibly born ) was an Ancient Greece, Ancient Greek poet who is credited as the author of the ''Iliad'' and the ''Odyssey'', two epic poems that are foundational works of ancient Greek literature. Despite doubts about his autho ...
ic texts date to the 8th century BC. * Indo-Iranian, attested , descended from Proto-Indo-Iranian (dated to the late 3rd millennium BC). ** Indo-Aryan, attested from around 1400 BC in Hittite texts from
Anatolia Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
, showing traces of Indo-Aryan words. Epigraphically from the 3rd century BC in the form of Prakrit ( Edicts of Ashoka). The
Rigveda The ''Rigveda'' or ''Rig Veda'' (, , from wikt:ऋच्, ऋच्, "praise" and wikt:वेद, वेद, "knowledge") is an ancient Indian Miscellany, collection of Vedic Sanskrit hymns (''sūktas''). It is one of the four sacred canoni ...
is assumed to preserve intact records via oral tradition dating from c. the mid-2nd millennium BC in the form of
Vedic Sanskrit Vedic Sanskrit, also simply referred as the Vedic language, is the most ancient known precursor to Sanskrit, a language in the Indo-Aryan languages, Indo-Aryan subgroup of the Indo-European languages, Indo-European language family. It is atteste ...
. Includes a wide range of modern languages from North India, Eastern Pakistan and Bangladesh, including Hindustani (
Hindi Modern Standard Hindi (, ), commonly referred to as Hindi, is the Standard language, standardised variety of the Hindustani language written in the Devanagari script. It is an official language of India, official language of the Government ...
,
Urdu Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
), Bengali, Odia, Assamese, Punjabi, Kashmiri, Gujarati, Marathi, Sindhi and Nepali, as well as Sinhala of
Sri Lanka Sri Lanka, officially the Democratic Socialist Republic of Sri Lanka, also known historically as Ceylon, is an island country in South Asia. It lies in the Indian Ocean, southwest of the Bay of Bengal, separated from the Indian subcontinent, ...
and Dhivehi of the Maldives and Minicoy. ** Iranian or Iranic, attested from roughly 1000 BC in the form of Avestan. Epigraphically from 520 BC in the form of Old Persian ( Behistun inscription). Includes Persian, Pashto, Kurdish, Balochi, Luri, Tajik, and Ossetian. ** Nuristani, attested since the 20th century, are among the newest Indo-European languages to be studied. Includes Katë, Prasun, Ashkun, Nuristani Kalasha, Tregami, and Zemiaki. * Italic (from Proto-Italic), attested from the 7th century BC. Includes the ancient Osco-Umbrian languages, Faliscan, as well as
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
and its descendants, the
Romance languages The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
, such as Italian and French. * Tocharian, with proposed links to the Afanasevo culture of Southern Siberia. Extant in two dialects (Turfanian and Kuchean, or Tocharian A and B), attested during roughly the 6th–9th centuries AD. Marginalized by the Old Turkic Uyghur Khaganate and probably extinct by the 10th century. In addition to the classical ten branches listed above, several extinct and little-known languages and language-groups have existed or are proposed to have existed: * Ancient Belgian: hypothetical language associated with the proposed Nordwestblock cultural area. Speculated to be connected to Italic or Venetic, and to have certain phonological features in common with Lusitanian. * Cimmerian: possibly Iranic, Thracian, or Celtic * Dacian: possibly very close to Thracian * Elymian: Poorly-attested language spoken by the Elymians, one of the three indigenous (i.e. pre-Greek and pre-Punic) tribes of Sicily. Indo-European affiliation widely accepted, possibly related to Italic or Anatolian. * Illyrian: possibly related to Albanian, Messapian, or both * Liburnian: evidence too scant and uncertain to determine anything with certainty * Ligurian: possibly close to or part of Celtic. * Lusitanian: possibly related to (or part of) Celtic, Ligurian, or Italic * Ancient Macedonian: proposed relationship to Greek. * Messapic: not conclusively deciphered, often considered to be related to Albanian as the available fragmentary linguistic evidence shows common characteristic innovations and a number of significant lexical correspondences between the two languages * Paionian: extinct language once spoken north of Macedon * Phrygian: language of the ancient Phrygians. Very likely, but not certainly, a sister group to Hellenic. * Sicel: an ancient language spoken by the Sicels (Greek Sikeloi, Latin Siculi), one of the three indigenous (i.e. pre-Greek and pre-Punic) tribes of Sicily. Proposed relationship to Latin or Proto-Illyrian (Pre-Indo-European) at an earlier stage. * Sorothaptic: proposed, pre-Celtic, Iberian language * Thracian: possibly including Dacian * Venetic: shares several similarities with Latin and the Italic languages, but also has some affinities with other IE languages, especially Germanic and Celtic. Membership of languages in the Indo-European language family is determined by genealogical relationships, meaning that all members are presumed descendants of a common ancestor,
Proto-Indo-European Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Euro ...
. Membership in the various branches, groups, and subgroups of Indo-European is also genealogical, but here the defining factors are ''shared innovations'' among various languages, suggesting a common ancestor that split off from other Indo-European groups. For example, what makes the Germanic languages a branch of Indo-European is that much of their structure and phonology can be stated in rules that apply to all of them. Many of their common features are presumed innovations that took place in
Proto-Germanic Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the linguistic reconstruction, reconstructed proto-language of the Germanic languages, Germanic branch of the Indo-European languages. Proto-Germanic eventually developed from ...
, the source of all the Germanic languages. In the 21st century, several attempts have been made to model the phylogeny of Indo-European languages using Bayesian methodologies similar to those applied to problems in biological phylogeny. Although there are differences in absolute timing between the various analyses, there is much commonality between them, including the result that the first known language groups to diverge were the Anatolian and Tocharian language families, in that order.


Tree versus wave model

The " tree model" is considered an appropriate representation of the genealogical history of a language family if communities do not remain in contact after their languages have started to diverge. In this case, subgroups defined by shared innovations form a nested pattern. The tree model is not appropriate in cases where languages remain in contact as they diversify; in such cases subgroups may overlap, and the " wave model" is a more accurate representation. Most approaches to Indo-European subgrouping to date have assumed that the tree model is by-and-large valid for Indo-European; however, there is also a long tradition of wave-model approaches. In addition to genealogical changes, many of the early changes in Indo-European languages can be attributed to language contact. It has been asserted, for example, that many of the more striking features shared by Italic languages (Latin, Oscan, Umbrian, etc.) might well be areal features. More certainly, very similar-looking alterations in the systems of long vowels in the West Germanic languages greatly postdate any possible notion of a
proto-language In the tree model of historical linguistics, a proto-language is a postulated ancestral language from which a number of attested languages are believed to have descended by evolution, forming a language family. Proto-languages are usually unatte ...
innovation (and cannot readily be regarded as "areal", either, because English and continental West Germanic were not a linguistic area). In a similar vein, there are many similar innovations in Germanic and Balto-Slavic that are far more likely areal features than traceable to a common proto-language, such as the uniform development of a high vowel (*''u'' in the case of Germanic, *''i/u'' in the case of Baltic and Slavic) before the PIE syllabic resonants *''ṛ, *ḷ, *ṃ, *ṇ'', unique to these two groups among IE languages, which is in agreement with the wave model. The Balkan sprachbund even features areal convergence among members of very different branches. An extension to the '' Ringe- Warnow model of language evolution'' suggests that early IE had featured limited contact between distinct lineages, with only the Germanic subfamily exhibiting a less treelike behaviour as it acquired some characteristics from neighbours early in its evolution. The internal diversification of especially West Germanic is cited to have been radically non-treelike.


Proposed subgroupings

Specialists have postulated the existence of higher-order subgroups such as Italo-Celtic, Graeco-Armenian, Graeco-Aryan or Graeco-Armeno-Aryan, and Balto-Slavo-Germanic. However, unlike the ten traditional branches, these are all controversial to a greater or lesser degree. The Italo-Celtic subgroup was at one point uncontroversial, considered by Antoine Meillet to be even better established than Balto-Slavic. The main lines of evidence included the genitive suffix ''-ī''; the superlative suffix ''-m̥mo''; the change of /p/ to /kʷ/ before another /kʷ/ in the same word (as in ''penkʷe'' > ''*kʷenkʷe'' > Latin , Old Irish ); and the subjunctive morpheme ''-ā-''. This evidence was prominently challenged by Calvert Watkins, while Michael Weiss has argued for the subgroup. Evidence for a relationship between Greek and Armenian includes the regular change of the second laryngeal to ''a'' at the beginnings of words, as well as terms for "woman" and "sheep". Greek and Indo-Iranian share innovations mainly in verbal morphology and patterns of nominal derivation. Relations have also been proposed between Phrygian and Greek, and between Thracian and Armenian. Some fundamental shared features, like the aorist (a verb form denoting action without reference to duration or completion) having the perfect active particle -s fixed to the stem, link this group closer to Anatolian languages and Tocharian. Shared features with Balto-Slavic languages, on the other hand (especially present and preterit formations), might be due to later contacts. The Indo-Hittite hypothesis proposes that the Indo-European language family consists of two main branches: one represented by the Anatolian languages and another branch encompassing all other Indo-European languages. Features that separate Anatolian from all other branches of Indo-European (such as the gender or the verb system) have been interpreted alternately as archaic debris or as innovations due to prolonged isolation. Points proffered in favour of the Indo-Hittite hypothesis are the (non-universal) Indo-European agricultural terminology in Anatolia and the preservation of laryngeals. However, in general this hypothesis is considered to attribute too much weight to the Anatolian evidence. According to another view, the Anatolian subgroup left the Indo-European parent language comparatively late, approximately at the same time as Indo-Iranian and later than the Greek or Armenian divisions. A third view, especially prevalent in the so-called French school of Indo-European studies, holds that extant similarities in non- satem languages in general—including Anatolian—might be due to their peripheral location in the Indo-European language-area and to early separation, rather than indicating a special ancestral relationship. Hans J. Holm, based on lexical calculations, arrives at a picture roughly replicating the general scholarly opinion and refuting the Indo-Hittite hypothesis.


Satem and centum languages

The division of the Indo-European languages into satem and centum groups was put forward by Peter von Bradke in 1890, although Karl Brugmann did propose a similar type of division in 1886. In the satem languages, which include the Balto-Slavic and Indo-Iranian branches, as well as (in most respects) Albanian and Armenian, the reconstructed Proto-Indo-European palatovelars remained distinct and were fricativized, while the labiovelars merged with the 'plain velars'. In the centum languages, the palatovelars merged with the plain velars, while the labiovelars remained distinct. The results of these alternative developments are exemplified by the words for "hundred" in Avestan () and Latin ()—the initial palatovelar developed into a fricative in the former, but became an ordinary velar in the latter. Rather than being a genealogical separation, the centum–satem division is commonly seen as resulting from innovative changes that spread across PIE dialect-branches over a particular geographical area; the centum–satem isogloss intersects a number of other isoglosses that mark distinctions between features in the early IE branches. It may be that the centum branches in fact reflect the original state of affairs in PIE, and only the satem branches shared a set of innovations, which affected all but the peripheral areas of the PIE dialect continuum. Kortlandt proposes that the ancestors of Balts and Slavs took part in satemization before being drawn later into the western Indo-European sphere.


Proposed external relations

From the very beginning of Indo-European studies, there have been attempts to link the Indo-European languages genealogically to other languages and language families. However, these theories remain highly controversial, and most specialists in Indo-European linguistics are sceptical or agnostic about such proposals. Proposals linking the Indo-European languages with a single language family include: * Indo-Uralic, joining Indo-European with Uralic * Pontic, postulated by John Colarusso, which joins Indo-European with Northwest Caucasian Other proposed families include: * Nostratic, comprising all or some of the Eurasiatic languages and the Kartvelian, Dravidian (or wider, Elamo-Dravidian) and Afroasiatic language families * Eurasiatic, a theory championed by Joseph Greenberg, comprising the Uralic, Altaic and various ' Paleosiberian' families ( Ainu, Yukaghir, Nivkh, Chukotko-Kamchatkan, Eskimo–Aleut) and possibly others Nostratic and Eurasiatic, in turn, have been included in even wider groupings, such as Borean, a language family separately proposed by Harold C. Fleming and Sergei Starostin that encompasses almost all of the world's natural languages with the exception of those native to
sub-Saharan Africa Sub-Saharan Africa is the area and regions of the continent of Africa that lie south of the Sahara. These include Central Africa, East Africa, Southern Africa, and West Africa. Geopolitically, in addition to the list of sovereign states and ...
,
New Guinea New Guinea (; Hiri Motu: ''Niu Gini''; , fossilized , also known as Papua or historically ) is the List of islands by area, world's second-largest island, with an area of . Located in Melanesia in the southwestern Pacific Ocean, the island is ...
,
Australia Australia, officially the Commonwealth of Australia, is a country comprising mainland Australia, the mainland of the Australia (continent), Australian continent, the island of Tasmania and list of islands of Australia, numerous smaller isl ...
, and the Andaman Islands.


Evolution


Proto-Indo-European

The proposed Proto-Indo-European language (PIE) is the reconstructed common ancestor of the Indo-European languages, spoken by the Proto-Indo-Europeans. From the 1960s, knowledge of Anatolian became certain enough to establish its relationship to PIE. Using the method of internal reconstruction, an earlier stage, called Pre-Proto-Indo-European, has been proposed. PIE is an inflected language, in which the grammatical relationships between words were signalled through inflectional morphemes (usually endings). The roots of PIE are basic
morpheme A morpheme is any of the smallest meaningful constituents within a linguistic expression and particularly within a word. Many words are themselves standalone morphemes, while other words contain multiple morphemes; in linguistic terminology, this ...
s carrying a lexical meaning. By addition of suffixes, they form stems, and by addition of endings, these form grammatically inflected words ( nouns or verbs). The reconstructed Indo-European verb system is complex and, like the noun, exhibits a system of ablaut.


Diversification

The diversification of the parent language into the attested branches of daughter languages is historically unattested. The timeline of the evolution of the various daughter languages, on the other hand, is mostly undisputed, quite regardless of the question of Indo-European origins. Using a mathematical analysis borrowed from evolutionary biology, Donald Ringe and Tandy Warnow propose the following evolutionary tree of Indo-European branches: * Pre- Anatolian (before 3500 BC) * Pre- Tocharian * Pre-Italic and Pre-Celtic (before 2500 BC) * Pre-Armenian and Pre-Greek (after 2500 BC) * Proto- Indo-Iranian (2000 BC) * Pre-Germanic and Pre-Balto-Slavic; Proto-Germanic David Anthony proposes the following sequence: * Pre- Anatolian (4200 BC) * Pre- Tocharian (3700 BC) * Pre-Germanic (3300 BC) * Pre-Italic and Pre-Celtic (3000 BC) * Pre-Armenian (2800 BC) * Pre-Balto-Slavic (2800 BC) * Pre-Greek (2500 BC) * Proto- Indo-Iranian (2200 BC); split into Iranian and Old Indic 1800 BC From 1500 BC the following sequence may be given: * 1500–1000 BC: The Nordic Bronze Age of
Scandinavia Scandinavia is a subregion#Europe, subregion of northern Europe, with strong historical, cultural, and linguistic ties between its constituent peoples. ''Scandinavia'' most commonly refers to Denmark, Norway, and Sweden. It can sometimes also ...
develops pre-Proto-Germanic, and the (pre-) Proto-Celtic Urnfield and Hallstatt cultures emerge in Central Europe, introducing the
Iron Age The Iron Age () is the final epoch of the three historical Metal Ages, after the Chalcolithic and Bronze Age. It has also been considered as the final age of the three-age division starting with prehistory (before recorded history) and progre ...
. Migration of the Proto- Italic speakers into the Italian peninsula ( Bagnolo stele). Migration of Aryans to India followed by the redaction of the
Rigveda The ''Rigveda'' or ''Rig Veda'' (, , from wikt:ऋच्, ऋच्, "praise" and wikt:वेद, वेद, "knowledge") is an ancient Indian Miscellany, collection of Vedic Sanskrit hymns (''sūktas''). It is one of the four sacred canoni ...
; rise of the Vedic civilization and beginning of Iron Age in the Punjab. The Mycenaean civilization gives way to the Greek Dark Ages. Hittite goes extinct. Iranian speakers start migrating southwards to Greater Iran. Balto-Slavic splits into ancestors of modern Baltic and Slavic. * 1000–500 BC: The
Celtic languages The Celtic languages ( ) are a branch of the Indo-European language family, descended from the hypothetical Proto-Celtic language. The term "Celtic" was first used to describe this language group by Edward Lhuyd in 1707, following Paul-Yve ...
spread over Central and Western Europe, including Britain. Baltic languages are spoken in a huge area from present-day Poland to
Moscow Moscow is the Capital city, capital and List of cities and towns in Russia by population, largest city of Russia, standing on the Moskva (river), Moskva River in Central Russia. It has a population estimated at over 13 million residents with ...
. Pre-Proto-Germanic gives rise to
Proto-Germanic Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the linguistic reconstruction, reconstructed proto-language of the Germanic languages, Germanic branch of the Indo-European languages. Proto-Germanic eventually developed from ...
in southern Scandinavia.
Homer Homer (; , ; possibly born ) was an Ancient Greece, Ancient Greek poet who is credited as the author of the ''Iliad'' and the ''Odyssey'', two epic poems that are foundational works of ancient Greek literature. Despite doubts about his autho ...
and the beginning of
Classical Antiquity Classical antiquity, also known as the classical era, classical period, classical age, or simply antiquity, is the period of cultural History of Europe, European history between the 8th century BC and the 5th century AD comprising the inter ...
. The Vedic civilization gives way to the Mahajanapadas as the Indo-Aryan tongue reaches eastwards, giving rise to the Greater Magadha cultural sphere, where Mahavira preaches
Jainism Jainism ( ), also known as Jain Dharma, is an Indian religions, Indian religion whose three main pillars are nonviolence (), asceticism (), and a rejection of all simplistic and one-sided views of truth and reality (). Jainism traces its s ...
and Siddhartha Gautama preaches
Buddhism Buddhism, also known as Buddhadharma and Dharmavinaya, is an Indian religion and List of philosophies, philosophical tradition based on Pre-sectarian Buddhism, teachings attributed to the Buddha, a wandering teacher who lived in the 6th or ...
. Zoroaster composes the Gathas, rise of the Achaemenid Empire, replacing the Elamites and
Babylonia Babylonia (; , ) was an Ancient history, ancient Akkadian language, Akkadian-speaking state and cultural area based in the city of Babylon in central-southern Mesopotamia (present-day Iraq and parts of Kuwait, Syria and Iran). It emerged as a ...
. Separation of Proto-Italic into Osco-Umbrian, Latin-Faliscan, and possibly Venetic and Siculian. A variety of Paleo-Balkan languages besides Greek are spoken in Southern Europe, including Thracian, Dacian and Illyrian, and in
Anatolia Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
( Phrygian). Development of Prakrits across the northern Indian subcontinent, as well as migration of Indo-Aryan speakers to
Sri Lanka Sri Lanka, officially the Democratic Socialist Republic of Sri Lanka, also known historically as Ceylon, is an island country in South Asia. It lies in the Indian Ocean, southwest of the Bay of Bengal, separated from the Indian subcontinent, ...
and the Maldives. * 500–1 BC:
Classical Antiquity Classical antiquity, also known as the classical era, classical period, classical age, or simply antiquity, is the period of cultural History of Europe, European history between the 8th century BC and the 5th century AD comprising the inter ...
: spread of Greek and
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
throughout the Mediterranean and, during the
Hellenistic period In classical antiquity, the Hellenistic period covers the time in Greek history after Classical Greece, between the death of Alexander the Great in 323 BC and the death of Cleopatra VII in 30 BC, which was followed by the ascendancy of the R ...
( Indo-Greeks), to Central Asia and the Hindukush. The Magadhan power and influence rises in ancient India, especially with the conquests of the Nandan and Mauryan empires. Germanic speakers start migrating southwards to occupy formerly Celtic territories. Scythian cultures extend from Eastern Europe ( Pontic Scythians) to Northwest China ( Ordos culture). * 1 BC – AD 500:
Late Antiquity Late antiquity marks the period that comes after the end of classical antiquity and stretches into the onset of the Early Middle Ages. Late antiquity as a period was popularized by Peter Brown (historian), Peter Brown in 1971, and this periodiza ...
, Gupta period; attestation of Armenian. Proto-Slavic. The
Roman Empire The Roman Empire ruled the Mediterranean and much of Europe, Western Asia and North Africa. The Roman people, Romans conquered most of this during the Roman Republic, Republic, and it was ruled by emperors following Octavian's assumption of ...
and then the Germanic migrations marginalize the Celtic languages to the British Isles. Sogdian, an eastern Iranian language, becomes the ''
lingua franca A lingua franca (; ; for plurals see ), also known as a bridge language, common language, trade language, auxiliary language, link language or language of wider communication (LWC), is a Natural language, language systematically used to make co ...
'' of the Silk Road in Central Asia leading to China, due to the proliferation of Sogdian merchants there. Greek settlements and
Byzantine The Byzantine Empire, also known as the Eastern Roman Empire, was the continuation of the Roman Empire centred on Constantinople during late antiquity and the Middle Ages. Having survived the events that caused the fall of the Western Roman E ...
rule make the last Anatolian languages
extinct Extinction is the termination of an organism by the death of its Endling, last member. A taxon may become Functional extinction, functionally extinct before the death of its last member if it loses the capacity to Reproduction, reproduce and ...
. Turkic languages start replacing Scythian languages. * 500–1000: Early Middle Ages. The
Viking Age The Viking Age (about ) was the period during the Middle Ages when Norsemen known as Vikings undertook large-scale raiding, colonising, conquest, and trading throughout Europe and reached North America. The Viking Age applies not only to their ...
forms an Old Norse koine spanning Scandinavia, the British Isles and Iceland. Phrygian becomes extinct. The Islamic conquests and the Turkic expansion result in the Arabization and Turkification of significant areas where Indo-European languages were spoken, but Persian still develops under Islamic rule and extends into
Afghanistan Afghanistan, officially the Islamic Emirate of Afghanistan, is a landlocked country located at the crossroads of Central Asia and South Asia. It is bordered by Pakistan to the Durand Line, east and south, Iran to the Afghanistan–Iran borde ...
and
Tajikistan Tajikistan, officially the Republic of Tajikistan, is a landlocked country in Central Asia. Dushanbe is the capital city, capital and most populous city. Tajikistan borders Afghanistan to the Afghanistan–Tajikistan border, south, Uzbekistan to ...
. Due to further Turkic migrations, Tocharian becomes fully extinct while Scythian languages are overwhelmingly replaced. Slavic languages spread over wide areas in central, eastern and southeastern Europe, largely replacing Romance in the Balkans (with the exception of Romanian) and whatever was left of the Paleo-Balkan languages with the exception of Albanian. Pannonian Basin is taken by the
Magyars Hungarians, also known as Magyars, are an ethnic group native to Hungary (), who share a common culture, language and history. They also have a notable presence in former parts of the Kingdom of Hungary. The Hungarian language belongs to the ...
from the western
Slavs The Slavs or Slavic people are groups of people who speak Slavic languages. Slavs are geographically distributed throughout the northern parts of Eurasia; they predominantly inhabit Central Europe, Eastern Europe, Southeastern Europe, and ...
. * 1000–1500:
Late Middle Ages The late Middle Ages or late medieval period was the Periodization, period of History of Europe, European history lasting from 1300 to 1500 AD. The late Middle Ages followed the High Middle Ages and preceded the onset of the early modern period ( ...
: Attestation of Albanian and Baltic. Modern dialects of Indo-European languages start emerging. * 1500–2000:
early modern period The early modern period is a Periodization, historical period that is defined either as part of or as immediately preceding the modern period, with divisions based primarily on the history of Europe and the broader concept of modernity. There i ...
to present: Colonialism results in the spread of Indo-European languages to every habitable continent, most notably Romance (North, Central and South America, North and Sub-Saharan Africa, West Asia), West Germanic ( English in North America, Sub-Saharan Africa, East Asia and Australia; to a lesser extent Dutch and German), and Russian to Central Asia and North Asia.


Key languages for reconstruction

In reconstructing the history of the Indo-European languages and the form of the
Proto-Indo-European language Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Eu ...
, some languages have been of particular importance. These generally include the ancient Indo-European languages that are both well-attested and documented at an early date, although some languages from later periods are important if they are particularly linguistically conservative (most notably, Lithuanian). Early poetry is of special significance because of the rigid poetic meter normally employed, which makes it possible to reconstruct a number of features (e.g.
vowel length In linguistics, vowel length is the perceived or actual length (phonetics), duration of a vowel sound when pronounced. Vowels perceived as shorter are often called short vowels and those perceived as longer called long vowels. On one hand, many ...
) that were either unwritten or corrupted in the process of transmission down to the earliest extant written
manuscript A manuscript (abbreviated MS for singular and MSS for plural) was, traditionally, any document written by hand or typewritten, as opposed to mechanically printed or reproduced in some indirect or automated way. More recently, the term has ...
s. Most noticeably: *
Vedic Sanskrit Vedic Sanskrit, also simply referred as the Vedic language, is the most ancient known precursor to Sanskrit, a language in the Indo-Aryan languages, Indo-Aryan subgroup of the Indo-European languages, Indo-European language family. It is atteste ...
(). This language is unique in that its source documents were all composed orally, and were passed down through
oral tradition Oral tradition, or oral lore, is a form of human communication in which knowledge, art, ideas and culture are received, preserved, and transmitted orally from one generation to another.Jan Vansina, Vansina, Jan: ''Oral Tradition as History'' (19 ...
( shakha schools) for c. 2,000 years before ever being written down. The oldest documents are all in poetic form; oldest and most important of all is the
Rigveda The ''Rigveda'' or ''Rig Veda'' (, , from wikt:ऋच्, ऋच्, "praise" and wikt:वेद, वेद, "knowledge") is an ancient Indian Miscellany, collection of Vedic Sanskrit hymns (''sūktas''). It is one of the four sacred canoni ...
()). *
Ancient Greek Ancient Greek (, ; ) includes the forms of the Greek language used in ancient Greece and the classical antiquity, ancient world from around 1500 BC to 300 BC. It is often roughly divided into the following periods: Mycenaean Greek (), Greek ...
().
Mycenaean Greek Mycenaean Greek is the earliest attested form of the Greek language. It was spoken on the Greek mainland and Crete in Mycenaean Greece (16th to 12th centuries BC). The language is preserved in inscriptions in Linear B, a script first atteste ...
() is the oldest recorded form, but its value is lessened by the limited material, restricted subject matter, and highly ambiguous writing system. More important is Ancient Greek, documented extensively beginning with the two Homeric poems (the ''
Iliad The ''Iliad'' (; , ; ) is one of two major Ancient Greek epic poems attributed to Homer. It is one of the oldest extant works of literature still widely read by modern audiences. As with the ''Odyssey'', the poem is divided into 24 books and ...
'' and the '' Odyssey'', ). * Hittite (). This is the earliest-recorded of all Indo-European languages, and highly divergent from the others due to the early separation of the Anatolian languages from the remainder. It possesses some highly archaic features found only fragmentarily, if at all, in other languages. At the same time, however, it appears to have undergone many early phonological and grammatical changes which, combined with the ambiguities of its writing system, hinder its usefulness somewhat. Other primary sources: *
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, attested in a huge amount of poetic and prose material in the Classical period () and limited Old Latin material from as early as . * Gothic (the most archaic well-documented
Germanic language The Germanic languages are a branch of the Indo-European language family spoken natively by a population of about 515 million people mainly in Europe, North America, Oceania, and Southern Africa. The most widely spoken Germanic language, ...
, ), along with the combined witness of the other old Germanic languages: most importantly,
Old English Old English ( or , or ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the Early Middle Ages. It developed from the languages brought to Great Britain by Anglo-S ...
(),
Old High German Old High German (OHG; ) is the earliest stage of the German language, conventionally identified as the period from around 500/750 to 1050. Rather than representing a single supra-regional form of German, Old High German encompasses the numerous ...
() and
Old Norse Old Norse, also referred to as Old Nordic or Old Scandinavian, was a stage of development of North Germanic languages, North Germanic dialects before their final divergence into separate Nordic languages. Old Norse was spoken by inhabitants ...
(, with limited earlier sources dating to ). * Old Avestan () and Younger Avestan ()). Documentation is sparse, but nonetheless quite important due to its highly archaic nature. * Modern Lithuanian, with limited records in Old Lithuanian (). *
Old Church Slavonic Old Church Slavonic or Old Slavonic ( ) is the first Slavic languages, Slavic literary language and the oldest extant written Slavonic language attested in literary sources. It belongs to the South Slavic languages, South Slavic subgroup of the ...
(). Other secondary sources, due to poor attestation: * Luwian, Lycian, Lydian and other Anatolian languages (). * Oscan, Umbrian and other Old Italic languages ()). * Old Persian (). * Old Prussian (); even more archaic than Lithuanian. Other secondary sources, due to extensive phonological changes and relatively limited attestation: * Old Irish (). * Tocharian (), underwent large phonetic shifts and mergers in the proto-language, and has an almost entirely reworked declension system. *
Classical Armenian Classical Armenian (, , ; meaning "literary anguage; also Old Armenian or Liturgical Armenian) is the oldest attested form of the Armenian language. It was first written down at the beginning of the 5th century, and most Armenian literature fro ...
(). * Albanian (present).


Sound changes

As speakers of Proto-Indo-European (PIE) dispersed, the language's sound system diverged as well, changing according to various sound laws evidenced in the daughter languages. PIE is normally reconstructed with a complex system of 15 stop consonants, including an unusual three-way
phonation The term phonation has slightly different meanings depending on the subfield of phonetics. Among some phoneticians, ''phonation'' is the process by which the vocal folds produce certain sounds through quasi-periodic vibration. This is the defi ...
( voicing) distinction between voiceless, voiced and " voiced aspirated" (i.e. breathy voiced) stops, and a three-way distinction among velar consonants (''k''-type sounds) between "palatal" ''ḱ ǵ ǵh'', "plain velar" ''k g gh'' and labiovelar ''kʷ gʷ gʷh''. (The correctness of the terms ''palatal'' and ''plain velar'' is disputed; see Proto-Indo-European phonology.) All daughter languages have reduced the number of distinctions among these sounds, often in divergent ways. As an example, in English, one of the
Germanic language The Germanic languages are a branch of the Indo-European language family spoken natively by a population of about 515 million people mainly in Europe, North America, Oceania, and Southern Africa. The most widely spoken Germanic language, ...
s, the following are some of the major changes that happened: None of the daughter-language families (except possibly Anatolian, particularly Luvian) reflect the plain velar stops differently from the other two series, and there is even a certain amount of dispute whether this series existed at all in PIE. The major distinction between ''centum'' and ''satem'' languages corresponds to the outcome of the PIE plain velars: * The "central" ''satem'' languages ( Indo-Iranian, Balto-Slavic, Albanian, and Armenian) reflect both "plain velar" and labiovelar stops as plain velars, often with secondary palatalization before a front vowel (''e i ē ī''). The "palatal" stops are palatalized and often appear as sibilants (usually but not always distinct from the secondarily palatalized stops). * The "peripheral" ''centum'' languages ( Germanic, Italic, Celtic, Greek, Anatolian and Tocharian) reflect both "palatal" and "plain velar" stops as plain velars, while the labiovelars continue unchanged, often with later reduction into plain labial or velar consonants. The three-way PIE distinction between voiceless, voiced and voiced aspirated stops is considered extremely unusual from the perspective of linguistic typology—particularly in the existence of voiced aspirated stops without a corresponding series of voiceless aspirated stops. None of the various daughter-language families continue it unchanged, with numerous "solutions" to the apparently unstable PIE situation: * The Indo-Aryan languages preserve the three series unchanged but have evolved a fourth series of voiceless aspirated consonants. * The Iranian languages probably passed through the same stage, subsequently changing the aspirated stops into fricatives. * Greek converted the voiced aspirates into voiceless aspirates. * Italic probably passed through the same stage, but reflects the voiced aspirates as voiceless fricatives, especially ''f'' (or sometimes plain voiced stops in
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
). * Celtic, Balto-Slavic, Anatolian, and Albanian merge the voiced aspirated into plain voiced stops. * Germanic and Armenian change all three series in a chain shift (e.g. with ''bh b p'' becoming ''b p f'' (known as ''
Grimm's law Grimm's law, also known as the First Germanic Consonant Shift or First Germanic Sound Shift, is a set of sound laws describing the Proto-Indo-European (PIE) stop consonants as they developed in Proto-Germanic in the first millennium BC, first d ...
'' in Germanic)). Among the other notable changes affecting consonants are: * The Ruki sound law (''s'' becomes before ''r, u, k, i'') in the '' satem'' languages. * Loss of prevocalic ''p'' in
Proto-Celtic Proto-Celtic, or Common Celtic, is the hypothetical ancestral proto-language of all known Celtic languages, and a descendant of Proto-Indo-European. It is not attested in writing but has been partly Linguistic reconstruction, reconstructed throu ...
. * Development of prevocalic ''s'' to ''h'' in Proto-Greek, with later loss of ''h'' between vowels. * Verner's law in
Proto-Germanic Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the linguistic reconstruction, reconstructed proto-language of the Germanic languages, Germanic branch of the Indo-European languages. Proto-Germanic eventually developed from ...
. * Grassmann's law (dissimilation of aspirates) independently in Proto-Greek and Proto-Indo-Iranian. The following table shows the basic outcomes of PIE consonants in some of the most important daughter languages for the purposes of reconstruction. For a fuller table, see Indo-European sound laws. :Notes: * C- At the beginning of a word. * -C- Between vowels. * -C At the end of a word. * `-C- Following an unstressed vowel ( Verner's law). * -C-(rl) Between vowels, or between a vowel and (on either side). * CT Before a (PIE) stop (). * CT− After a (PIE) obstruent (, etc.; ). * C(T) Before or after an obstruent (, etc.; ). * CH Before an original laryngeal. * CE Before a (PIE) front vowel (). * CE' Before secondary (post-PIE) front-vowels. * Ce Before . * C(u) Before or after a (PIE) ( boukólos rule). * C(O) Before or after a (PIE) ( boukólos rule). * Cn− After . * CR Before a sonorant (). * C(R) Before or after a sonorant (). * C(r),l,u− Before or after . * Cruki− After ( Ruki sound law). * C..Ch Before an aspirated consonant in the next syllable ( Grassmann's law, also known as dissimilation of aspirates). * CE..Ch Before a (PIE) front vowel () as well as before an aspirated consonant in the next syllable ( Grassmann's law, also known as dissimilation of aspirates). * C(u)..Ch Before or after a (PIE) as well as before an aspirated consonant in the next syllable ( Grassmann's law, also known as dissimilation of aspirates).


Comparison of conjugations

The following table presents a comparison of conjugations of the thematic present indicative of the verbal root * of the English verb '' to bear'' and its reflexes in various early attested IE languages and their modern descendants or relatives, showing that all languages had in the early stage an inflectional verb system. While similarities are still visible between the modern descendants and relatives of these ancient languages, the differences have increased over time. Some IE languages have moved from synthetic verb systems to largely periphrastic systems. In addition, the pronouns of periphrastic forms are in parentheses when they appear. Some of these verbs have undergone a change in meaning as well. * In Modern Irish ''beir'' usually only carries the meaning ''to bear'' in the sense of bearing a child; its common meanings are ''to catch, grab''. Apart from the first person, the forms given in the table above are dialectical or obsolete. The second and third person forms are typically instead conjugated periphrastically by adding a pronoun after the verb: ''beireann tú, beireann sé/sí, beireann sibh, beireann siad''. * The Hindustani (
Hindi Modern Standard Hindi (, ), commonly referred to as Hindi, is the Standard language, standardised variety of the Hindustani language written in the Devanagari script. It is an official language of India, official language of the Government ...
and
Urdu Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
) verb ''bʰarnā'', the continuation of the Sanskrit verb, can have a variety of meanings, but the most common is "to fill". The forms given in the table, although etymologically derived from the present indicative, now have the meaning of future subjunctive. The loss of the present indicative in Hindustani is roughly compensated by the periphrastic habitual indicative construction, using the habitual participle (etymologically from the Sanskrit present participle ''bʰarant-'') and an auxiliary: ''ma͠i bʰartā hū̃, tū bʰartā hai, vah bʰartā hai, ham bʰarte ha͠i, tum bʰarte ho, ve bʰarte ha͠i'' (masculine forms). * German is not directly descended from Gothic, but the Gothic forms are a close approximation of what the early West Germanic forms of would have looked like. The descendant of Proto-Germanic ''*beraną'' (English ''bear'') survives in German only in the compound ''gebären'', meaning "bear (a child)". * The Latin verb ''ferre'' is irregular, and not a good representative of a normal thematic verb. In most Romance languages such as Portuguese, other verbs now mean "to carry" (e.g. Pt. ''portar'' < Lat. ''portare'') and ''ferre'' was borrowed and nativized only in compounds such as "to suffer" (from Latin ''sub-'' and ''ferre'') and "to confer" (from Latin "con-" and "ferre"). * In Modern Greek, ''phero'' φέρω (modern transliteration ''fero'') "to bear" is still used but only in specific contexts and is most common in such compounds as αναφέρω, διαφέρω, εισφέρω, εκφέρω, καταφέρω, προφέρω, προαναφέρω, προσφέρω etc. The form that is (very) common today is ''pherno'' φέρνω (modern transliteration ''ferno'') meaning "to bring". Additionally, the perfective form of ''pherno'' (used for the subjunctive voice and also for the future tense) is also ''phero''. * The dual forms are archaic in standard Lithuanian, and are only presently used in some dialects (e.g. Samogitian). * Among modern Slavic languages, only Slovene continues to have a dual number in the standard variety.


Comparison of cognates


Present distribution

Today, Indo-European languages are spoken by billions of native speakers across all inhabited continents, the largest number by far for any recognized language family. Of the 20 languages with the largest numbers of speakers according to ''Ethnologue'', 10 are Indo-European: English, Hindustani, Spanish, Bengali, French, Russian, Portuguese, German, Persian and Punjabi, each with 100 million speakers or more. Additionally, hundreds of millions of persons worldwide study Indo-European languages as secondary or tertiary languages, including in cultures which have completely different language families and historical backgrounds—there are around 600 million learners of English alone. The success of the language family, including the large number of speakers and the vast portions of the Earth that they inhabit, is due to several factors. The ancient Indo-European migrations and widespread dissemination of Indo-European culture throughout
Eurasia Eurasia ( , ) is a continental area on Earth, comprising all of Europe and Asia. According to some geographers, Physical geography, physiographically, Eurasia is a single supercontinent. The concept of Europe and Asia as distinct continents d ...
, including that of the Proto-Indo-Europeans themselves, and that of their daughter cultures including the Indo-Aryans,
Iranian peoples Iranian peoples, or Iranic peoples, are the collective ethnolinguistic groups who are identified chiefly by their native usage of any of the Iranian languages, which are a branch of the Indo-Iranian languages within the Indo-European langu ...
,
Celts The Celts ( , see Names of the Celts#Pronunciation, pronunciation for different usages) or Celtic peoples ( ) were a collection of Indo-European languages, Indo-European peoples. "The Celts, an ancient Indo-European people, reached the apoge ...
,
Greeks Greeks or Hellenes (; , ) are an ethnic group and nation native to Greece, Greek Cypriots, Cyprus, Greeks in Albania, southern Albania, Greeks in Turkey#History, Anatolia, parts of Greeks in Italy, Italy and Egyptian Greeks, Egypt, and to a l ...
, Romans,
Germanic peoples The Germanic peoples were tribal groups who lived in Northern Europe in Classical antiquity and the Early Middle Ages. In modern scholarship, they typically include not only the Roman-era ''Germani'' who lived in both ''Germania'' and parts of ...
, and
Slavs The Slavs or Slavic people are groups of people who speak Slavic languages. Slavs are geographically distributed throughout the northern parts of Eurasia; they predominantly inhabit Central Europe, Eastern Europe, Southeastern Europe, and ...
, led to these peoples' branches of the language family already taking a dominant foothold in virtually all of
Eurasia Eurasia ( , ) is a continental area on Earth, comprising all of Europe and Asia. According to some geographers, Physical geography, physiographically, Eurasia is a single supercontinent. The concept of Europe and Asia as distinct continents d ...
except for swathes of the
Near East The Near East () is a transcontinental region around the Eastern Mediterranean encompassing the historical Fertile Crescent, the Levant, Anatolia, Egypt, Mesopotamia, and coastal areas of the Arabian Peninsula. The term was invented in the 20th ...
,
North North is one of the four compass points or cardinal directions. It is the opposite of south and is perpendicular to east and west. ''North'' is a noun, adjective, or adverb indicating Direction (geometry), direction or geography. Etymology T ...
and
East Asia East Asia is a geocultural region of Asia. It includes China, Japan, Mongolia, North Korea, South Korea, and Taiwan, plus two special administrative regions of China, Hong Kong and Macau. The economies of Economy of China, China, Economy of Ja ...
, replacing many (but not all) of the previously-spoken pre-Indo-European languages of this extensive area. However
Semitic languages The Semitic languages are a branch of the Afroasiatic languages, Afroasiatic language family. They include Arabic, Amharic, Tigrinya language, Tigrinya, Aramaic, Hebrew language, Hebrew, Maltese language, Maltese, Modern South Arabian language ...
remain dominant in much of the
Middle East The Middle East (term originally coined in English language) is a geopolitical region encompassing the Arabian Peninsula, the Levant, Turkey, Egypt, Iran, and Iraq. The term came into widespread usage by the United Kingdom and western Eur ...
and
North Africa North Africa (sometimes Northern Africa) is a region encompassing the northern portion of the African continent. There is no singularly accepted scope for the region. However, it is sometimes defined as stretching from the Atlantic shores of t ...
, and Caucasian languages in much of the Caucasus region. Similarly in
Europe Europe is a continent located entirely in the Northern Hemisphere and mostly in the Eastern Hemisphere. It is bordered by the Arctic Ocean to the north, the Atlantic Ocean to the west, the Mediterranean Sea to the south, and Asia to the east ...
and the Urals the Uralic languages (such as Hungarian, Finnish, Estonian etc.) remain, as does
Basque Basque may refer to: * Basques, an ethnic group of Spain and France * Basque language, their language Places * Basque Country (greater region), the homeland of the Basque people with parts in both Spain and France * Basque Country (autonomous co ...
, a pre-Indo-European isolate. Despite being unaware of their common linguistic origin, diverse groups of Indo-European speakers continued to culturally dominate and often replace the indigenous languages of the western two-thirds of Eurasia. By the beginning of the
Common Era Common Era (CE) and Before the Common Era (BCE) are year notations for the Gregorian calendar (and its predecessor, the Julian calendar), the world's most widely used calendar era. Common Era and Before the Common Era are alternatives to the ...
, Indo-European peoples controlled almost the entirety of this area: the Celts western and central Europe, the Romans southern Europe, the Germanic peoples northern Europe, the Slavs eastern Europe, the Iranian peoples most of western and central Asia and parts of eastern Europe, and the Indo-Aryan peoples in the
Indian subcontinent The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
, with the Tocharians inhabiting the Indo-European frontier in western China. By the medieval period, only the Semitic, Dravidian, Caucasian, and Uralic languages, and the language isolate
Basque Basque may refer to: * Basques, an ethnic group of Spain and France * Basque language, their language Places * Basque Country (greater region), the homeland of the Basque people with parts in both Spain and France * Basque Country (autonomous co ...
remained of the (relatively) indigenous languages of Europe and the western half of Asia. Despite medieval invasions by
Eurasian nomads Eurasian nomads form groups of nomad, nomadic peoples who have lived in various areas of the Eurasian Steppe. History largely knows them via frontier historical sources from Europe and Asia. The steppe nomads had no permanent abode, but travelle ...
, a group to which the Proto-Indo-Europeans had once belonged, Indo-European expansion reached another peak in the
early modern period The early modern period is a Periodization, historical period that is defined either as part of or as immediately preceding the modern period, with divisions based primarily on the history of Europe and the broader concept of modernity. There i ...
with the dramatic increase in the population of the
Indian subcontinent The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
and European expansionism throughout the globe during the Age of Discovery, as well as the continued replacement and assimilation of surrounding non-Indo-European languages and peoples due to increased state centralization and nationalism. These trends compounded throughout the modern period due to the general global
population growth Population growth is the increase in the number of people in a population or dispersed group. The World population, global population has grown from 1 billion in 1800 to 8.2 billion in 2025. Actual global human population growth amounts to aroun ...
and the results of European colonization of the Western Hemisphere and Oceania, leading to an explosion in the number of Indo-European speakers as well as the territories inhabited by them. Due to colonization and the modern dominance of Indo-European languages in the fields of politics, global science, technology, education, finance, and sports, even many modern countries whose populations largely speak non-Indo-European languages have Indo-European languages as official languages, and the majority of the global population speaks at least one Indo-European language. The overwhelming majority of languages used on the Internet are Indo-European, with English continuing to lead the group; English in general has in many respects become the ''lingua franca'' of global communication.


See also


Indo-European Swadesh lists
* Grammatical conjugation * '' The Horse, the Wheel, and Language'' (book) * Indo-European copula * Indo-European sound laws * Indo-European studies * Indo-Semitic languages * Indo-Uralic languages * Eurasiatic languages *
Language family A language family is a group of languages related through descent from a common ancestor, called the proto-language of that family. The term ''family'' is a metaphor borrowed from biology, with the tree model used in historical linguistics ...
*
Languages of Asia Asia is home to hundreds of languages comprising several families and some unrelated isolates. The most spoken language families on the continent include Austroasiatic languages, Austroasiatic, Austronesian languages, Austronesian, Japonic langua ...
*
Languages of Europe There are over 250 languages indigenous to Europe, and most belong to the Indo-European language family. Out of a demographics of Europe, total European population of 744 million as of 2018, some 94% are native speakers of an Indo-European lang ...
*
Languages of India Languages of India belong to several list of language families, language families, the major ones being the Indo-Aryan languages spoken by 78.05% of Indian people, Indians and the Dravidian languages spoken by 19.64% of Indians; both fami ...
*
Linguistics Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds ...
* List of Indo-European languages * Proto-Indo-European root * Proto-Indo-European religion


Notes


References


Citations


Sources

* * * * * Paperback: . * * * * * * * * * * * * *
Part II via Internet Archive
* ** Reprinted in * *


Further reading

* * * * * * * * * * * * * * Asadpour, Hiwa, and Thomas Jügel, eds. Word Order Variation: Semitic, Turkic and Indo-European Languages in Contact. Vol. 31. Walter de Gruyter GmbH & Co KG, 2022.


External links


Encyclopedia of Indo-European Culture (1997)


Databases

* * * * . * *

an online collection of introductory videos to Ancient Indo-European languages produced by the University of Göttingen


Lexica

* * * * {{Authority control Language families