Indo Aryan Languages
   HOME

TheInfoList



OR:

The Indo-Aryan languages, or sometimes Indic languages, are a branch of the
Indo-Iranian languages The Indo-Iranian languages (also known as Indo-Iranic languages or collectively the Aryan languages) constitute the largest branch of the Indo-European language family. They include over 300 languages, spoken by around 1.7 billion speakers ...
in the
Indo-European language family The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
. As of 2024, there are more than 1.5 billion speakers, primarily concentrated east of the
Indus river The Indus ( ) is a transboundary river of Asia and a trans-Himalayas, Himalayan river of South Asia, South and Central Asia. The river rises in mountain springs northeast of Mount Kailash in the Western Tibet region of China, flows northw ...
in
Bangladesh Bangladesh, officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world and among the List of countries and dependencies by ...
, Northern
India India, officially the Republic of India, is a country in South Asia. It is the List of countries and dependencies by area, seventh-largest country by area; the List of countries by population (United Nations), most populous country since ...
, Eastern
Pakistan Pakistan, officially the Islamic Republic of Pakistan, is a country in South Asia. It is the List of countries and dependencies by population, fifth-most populous country, with a population of over 241.5 million, having the Islam by country# ...
,
Sri Lanka Sri Lanka, officially the Democratic Socialist Republic of Sri Lanka, also known historically as Ceylon, is an island country in South Asia. It lies in the Indian Ocean, southwest of the Bay of Bengal, separated from the Indian subcontinent, ...
,
Maldives The Maldives, officially the Republic of Maldives, and historically known as the Maldive Islands, is an Archipelagic state, archipelagic country in South Asia located in the Indian Ocean. The Maldives is southwest of Sri Lanka and India, abou ...
and
Nepal Nepal, officially the Federal Democratic Republic of Nepal, is a landlocked country in South Asia. It is mainly situated in the Himalayas, but also includes parts of the Indo-Gangetic Plain. It borders the Tibet Autonomous Region of China Ch ...
. Moreover, apart from the
Indian subcontinent The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
, large immigrant and expatriate Indo-Aryan–speaking communities live in
Northwestern Europe Northwestern Europe, or Northwest Europe, is a loosely defined subregion of Europe, overlapping Northern and Western Europe. The term is used in geographic, history, and military contexts. Geographic definitions Geographically, Northwestern ...
,
Western Asia West Asia (also called Western Asia or Southwest Asia) is the westernmost region of Asia. As defined by most academics, UN bodies and other institutions, the subregion consists of Anatolia, the Arabian Peninsula, Iran, Mesopotamia, the Armenian ...
,
North America North America is a continent in the Northern Hemisphere, Northern and Western Hemisphere, Western hemispheres. North America is bordered to the north by the Arctic Ocean, to the east by the Atlantic Ocean, to the southeast by South Ameri ...
, the
Caribbean The Caribbean ( , ; ; ; ) is a region in the middle of the Americas centered around the Caribbean Sea in the Atlantic Ocean, North Atlantic Ocean, mostly overlapping with the West Indies. Bordered by North America to the north, Central America ...
,
Southeast Africa Southeast Africa, or Southeastern Africa, is an African region that is intermediate between East Africa and Southern Africa. It comprises the countries Botswana, Eswatini, Kenya, Lesotho, Malawi, Mozambique, Namibia, Rwanda, South Africa, Tanza ...
,
Polynesia Polynesia ( , ) is a subregion of Oceania, made up of more than 1,000 islands scattered over the central and southern Pacific Ocean. The indigenous people who inhabit the islands of Polynesia are called Polynesians. They have many things in ...
and
Australia Australia, officially the Commonwealth of Australia, is a country comprising mainland Australia, the mainland of the Australia (continent), Australian continent, the island of Tasmania and list of islands of Australia, numerous smaller isl ...
, along with several million speakers of
Romani language Romani ( ; also Romanes , Romany, Roma; ) is an Indo-Aryan languages, Indo-Aryan macrolanguage of the Romani people. The largest of these are Vlax Romani language, Vlax Romani (about 500,000 speakers), Balkan Romani (600,000), and Sinte Roma ...
s primarily concentrated in
Southeastern Europe Southeast Europe or Southeastern Europe is a geographical sub-region of Europe, consisting primarily of the region of the Balkans, as well as adjacent regions and Archipelago, archipelagos. There are overlapping and conflicting definitions of t ...
. There are over 200 known Indo-Aryan languages. Modern Indo-Aryan languages descend from Old Indo-Aryan languages such as early
Vedic Sanskrit Vedic Sanskrit, also simply referred as the Vedic language, is the most ancient known precursor to Sanskrit, a language in the Indo-Aryan languages, Indo-Aryan subgroup of the Indo-European languages, Indo-European language family. It is atteste ...
, through
Middle Indo-Aryan languages The Middle Indo-Aryan languages (or Middle Indic languages, sometimes conflated with the Prakrits, which are a stage of Middle Indic) are a historical group of languages of the Indo-Aryan family. They are the descendants of Old Indo-Aryan (OI ...
(or
Prakrit Prakrit ( ) is a group of vernacular classical Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 5th century BCE to the 12th century CE. The term Prakrit is usually applied to the middle period of Middle Ind ...
s). The largest such languages in terms of first-speakers are
Hindi–Urdu Hindustani is an Indo-Aryan language spoken in North India and Pakistan as the lingua franca of the region. It is also spoken by the Deccani people, Deccani-speaking community in the Deccan plateau. Hindustani is a pluricentric language w ...
(),Standard Hindi first language: 260.3 million (2001), as second language: 120 million (1999). Urdu L1: 68.9 million (2001–2014), L2: 94 million (1999): ''Ethnologue'' 19.
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
(242 million), Punjabi (about 150 million),
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India **Marathi people (Uttar Pradesh), the Marathi people in the Indian state of Uttar Pradesh *Marathi language, the Indo-Aryan language spoken by the Mar ...
(112 million), and Gujarati (60 million). A 2005 estimate placed the total number of native speakers of the Indo-Aryan languages at nearly 900 million people. Other estimates are higher, suggesting a figure of 1.5 billion speakers of Indo-Aryan languages.


Classification


Theories

The Indo-Aryan family as a whole is thought to represent a
dialect continuum A dialect continuum or dialect chain is a series of Variety (linguistics), language varieties spoken across some geographical area such that neighboring varieties are Mutual intelligibility, mutually intelligible, but the differences accumulat ...
, where languages are often transitional towards neighbouring varieties. Because of this, the division into languages vs. dialects is in many cases somewhat arbitrary. The classification of the Indo-Aryan languages is controversial, with many transitional areas that are assigned to different branches depending on classification. There are concerns that a
tree model In historical linguistics, the tree model (also Stammbaum, genetic, or cladistic model) is a model of the evolution of languages analogous to the concept of a family tree, particularly a phylogenetic tree in the biological evolution of species. ...
is insufficient for explaining the development of New Indo-Aryan, with some scholars suggesting the
wave model In historical linguistics, the wave model or wave theory () is a model of language change in which a new language feature (innovation) or a new combination of language features spreads from its region of origin, being adopted by a gradually expa ...
.


Subgroups

The following table of proposals is expanded from (from Hoernlé to Turner), and also includes subsequent classification proposals. The table lists only some modern Indo-Aryan languages. Anton I. Kogan, in 2016, conducted a
lexicostatistical Lexicostatistics is a method of comparative linguistics that involves comparing the percentage of lexical cognates between languages to determine their relationship. Lexicostatistics is related to the comparative method but does not reconstruct a ...
study of the New Indo-Aryan languages based on a 100-word
Swadesh list A Swadesh list () is a compilation of cultural universal, tentatively universal concepts for the purposes of lexicostatistics. That is, a Swadesh list is a list of forms and concepts which all languages, without exception, have terms for, such as ...
, using techniques developed by the glottochronologist and comparative linguist
Sergei Starostin Sergei Anatolyevich Starostin (; March 24, 1953 – September 30, 2005) was a Russian historical linguistics, historical linguist and philology, philologist, perhaps best known for his reconstructions of hypothetical proto-languages, including hi ...
. That grouping system is notable for Kogan's exclusion of Dardic from Indo-Aryan on the basis of his previous studies showing low lexical similarity to Indo-Aryan (43.5%) and negligible difference with similarity to Iranian (39.3%). He also calculated Sinhala–Dhivehi to be the most divergent Indo-Aryan branch. Nevertheless, the modern consensus of Indo-Aryan linguists tends towards the inclusion of Dardic based on morphological and grammatical features.


Inner–Outer hypothesis

The Inner–Outer hypothesis argues for a core and periphery of Indo-Aryan languages, with Outer Indo-Aryan (generally including Eastern and Southern Indo-Aryan, and sometimes Northwestern Indo-Aryan,
Dardic The Dardic languages (also Dardu or Pisaca), or Hindu-Kush Indo-Aryan languages, are a group of several Indo-Aryan languages spoken in northern Pakistan, northwestern India and parts of northeastern Afghanistan. This region has sometimes been re ...
and Pahari) representing an older stratum of Old Indo-Aryan that has been mixed to varying degrees with the newer stratum that is Inner Indo-Aryan. It is a contentious proposal with a long history, with varying degrees of claimed phonological and morphological evidence. Since its proposal by
Rudolf Hoernlé Augustus Frederic Rudolf Hoernlé CIE (1841 – 1918), also referred to as Rudolf Hoernle or A. F. Rudolf Hoernle, was a German Indologist and philologist. He is famous for his studies on the Bower Manuscript (1891), Weber Manuscript (1893) an ...
in 1880 and refinement by George Grierson it has undergone numerous revisions and a great deal of debate, with the most recent iteration by Franklin Southworth and Claus Peter Zoller based on robust linguistic evidence (particularly an Outer past tense in ''-l-''). Some of the theory's sceptics include
Suniti Kumar Chatterji Suniti Kumar Chatterji (26 November 1890 – 29 May 1977) was an Indian linguist, educationist and litterateur. He was a recipient of the second highest Indian civilian honour of Padma Vibhushan. Life Childhood Chatterji was born on 26 Novem ...
and Colin P. Masica.


Groups

The below classification follows , and .


Dardic

The Dardic languages (also Dardu or Pisaca) are a group of Indo-Aryan languages largely spoken in the northwestern extremities of the Indian subcontinent. Dardic was first formulated by
George Abraham Grierson Sir George Abraham Grierson (7 January 1851 – 9 March 1941) was an Irish administrator and linguist in British India. He worked in the Indian Civil Service but an interest in philology and linguistics led him to pursue studies in the languag ...
in his
Linguistic Survey of India The Linguistic Survey of India (LSI) is a comprehensive survey of the languages of British India, describing 364 languages and dialects. The Survey was first proposed by George Abraham Grierson, a member of the Indian Civil Service and a lingu ...
but he did not consider it to be a subfamily of Indo-Aryan. The Dardic group as a genetic grouping (rather than areal) has been scrutinised and questioned to a degree by recent scholarship: Southworth, for example, says "the viability of Dardic as a genuine subgroup of Indo-Aryan is doubtful" and "the similarities among ardic languagesmay result from subsequent convergence". The Dardic languages are thought to be transitional with Punjabi and Pahari (e.g. Zoller describes Kashmiri as "an interlink between Dardic and West Pahāṛī"), as well as non-Indo-Aryan Nuristani; and are renowned for their relatively conservative features in the context of
Proto-Indo-Aryan Proto-Indo-Aryan (sometimes Proto-Indic) is the reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the Indo-Aryans, who had migrated into the Indian subcontinent. Being descended from Proto-I ...
. * Kashmiri: Kashmiri,
Kishtwari Kishtwari or Kashtwari is an Indo-Aryan language closely related to the Kashmiri language, with strong influences from neighbouring Western Pahari varieties. It is spoken by Hindus in Kishtwar district of Jammu division in Jammu and Kashmir, In ...
,
Poguli Pogali, or Poguli is a language spoken in Jammu and Kashmir, primarily in the Ramban district and Chenani. It is a part of the Western Pahari group, which falls under the larger umbrella of Indo-Aryan languages. The language has a rich histo ...
; * Shina: Brokskad, Kundal Shahi, Shina, Ushojo, Kalkoti, Palula,
Savi Savi is a town and arrondissement in the Atlantique Department of southern Benin. It is an administrative division under the jurisdiction of the commune of Ouidah. According to the population census conducted by the Institut National de la Sta ...
; * Chitrali:
Kalasha A kalasha, also called Pūrṇa-Kalaśa, Pūrṇa-Kumbha, Pūrṇa-Ghaṭa, also called ghat or ghot or kumbh ( , Telugu: కలశము Kannada: ಕಳಶ literally "pitcher, pot"), is a metal (brass, copper, silver or gold) pot with a large ...
,
Khowar Khowar (, ''Khōwār'', ), also known by its common exonym Chitrali, is an Indo-Aryan language of the Dardic group, primarily spoken by the Kho (Chitrali) people, native to the Chitral region and surrounding areas of Pakistan. Khowar is th ...
; * Kohistani: Bateri, Chilisso, Gowro,
Indus Kohistani Indus Kohistani or simply Kohistani (, Kōstaiñ) is an Indo-Aryan language spoken in the former Kohistan District of Pakistan. The language was referred to as Maiyã (Mayon) or Shuthun by early researchers, but subsequent observations have n ...
, Kalami,
Tirahi Tirahi () are Indo-Aryan people who are native and original inhabitants of Tirah valley. They are closely related to their Dardic neighbours and speak Tirahi language, a nearly extinct if not already extinct Indo-Aryan language which may still ...
, Torwali, Wotapuri-Katarqalai; * Pashayi * Kunar: Dameli, Gawar-Bati, Nangalami, Shumashti.


Northern Zone

The Northern Indo-Aryan languages, also known as the Pahari ('hill') languages, are spoken throughout the Himalayan regions of the subcontinent. * Eastern Pahari: Nepali, Jumli,
Doteli Doteli, or Dotyali ( Doteli-Devanagari: ) is an Indo-Aryan language spoken by about 495,000 people, most of whom live in Nepal. It is a dialect of Khas, which is an ancient form of the modern Nepali language Nepali (; , ), or ''Gorkhali'' ...
; * Central Pahari: Garhwali, Kumaoni; *
Western Pahari The Western Pahari languages are a range of languages and dialects of Northern Indo-Aryan languages spoken in the western parts of the Himalayan range, primarily in the Indian state of Himachal Pradesh. They are also spoken in Jammu and Jaunsar ...
: Dogri, Kangri, Bhadarwahi, Churahi, Bhateali, Bilaspuri, Chambeali, Gaddi, Pangwali,
Mandeali Mandeali (Takri script, Takri: ) is a Western Pahari language, spoken in northern India, predominantly in the Mandi district of Himachal Pradesh by the people of the Mandi Valley and particularly in the major city of Mandi, Himachal Pradesh, Man ...
,
Mahasu Pahari Mahasu Pahari (Takri: ) is a Western Pahari language spoken in Himachal Pradesh. It is also known as Mahasui or Mahasuvi. The speaking population is about 1,000,000 (2001). It is more commonly spoken in the Himachal Pradesh, Shimla (Simla) and Sol ...
, Jaunsari, Kullui, Pahari Kinnauri, Hinduri, Sarazi, Sirmauri.


Northwestern Zone

Northwestern Indo-Aryan languages are spoken in the northwestern region of India and eastern region of Pakistan. Punjabi is spoken predominantly in the
Punjab region Punjab (; ; also romanised as Panjāb or Panj-Āb) is a geopolitical, cultural, and historical region in South Asia. It is located in the northwestern part of the Indian subcontinent, comprising areas of modern-day eastern Pakistan and no ...
and is the official language of the northern Indian state of Punjab, in addition to being the most widely-spoken language in Pakistan. Sindhi and its variants are spoken natively in the Pakistani province of
Sindh Sindh ( ; ; , ; abbr. SD, historically romanized as Sind (caliphal province), Sind or Scinde) is a Administrative units of Pakistan, province of Pakistan. Located in the Geography of Pakistan, southeastern region of the country, Sindh is t ...
and neighbouring regions. Northwestern languages are ultimately thought to be descended from
Shauraseni Prakrit Shauraseni Prakrit () was a Middle Indo-Aryan language and a Dramatic Prakrit. Shauraseni was the chief language used in drama in medieval northern India. Most of the material in this language originates from the 3rd to 10th centuries, and re ...
, with influence from
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
and
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
. * Punjabi ** '' Central'': Majhi ** '' Central-
Western Western may refer to: Places *Western, Nebraska, a village in the US *Western, New York, a town in the US *Western Creek, Tasmania, a locality in Australia *Western Junction, Tasmania, a locality in Australia *Western world, countries that id ...
'': Dhanni, Jhangvi,
Pahari-Pothwari Pahari Pothwari is an Indo-Aryan language variety of the Lahnda group, spoken in the northern half of Pothohar Plateau, in Punjab, Pakistan, as well as in the most of Pakistan-administered Azad Kashmir and in the western areas of Indian-admini ...
, Shahpuri,
Thali Thali (meaning "plate" or "tray") or Bhojanam (meaning "full meal") is a round Platter (dishware), platter used to serve food in South Asia, Southeast Asia and the Caribbean. Thali is also used to refer to an Indian-style meal made up of a sel ...
** '' Eastern'': Doabi, Malwai, Puadhi ** ''
Standard Standard may refer to: Symbols * Colours, standards and guidons, kinds of military signs * Standard (emblem), a type of a large symbol or emblem used for identification Norms, conventions or requirements * Standard (metrology), an object ...
'' ** ''
Western Western may refer to: Places *Western, Nebraska, a village in the US *Western, New York, a town in the US *Western Creek, Tasmania, a locality in Australia *Western Junction, Tasmania, a locality in Australia *Western world, countries that id ...
'':
Hindko Hindko (, , ) is an Indo-Aryan languages, Indo-Aryan language spoken by several million people of various ethnic backgrounds in northwestern Pakistan, primarily in the provinces of Khyber Pakhtunkhwa Police, Khyber Pakhtunkhwa and northern Pun ...
( Kohati, Ghebi, Awankari, Chachhi, Peshawari, Hazarvi), Inku, Saraiki ( Multani, Riasti, Derawali) * Sindhi: Sindhi, Jadgali, Kutchi, Luwati, Memoni, Khetrani,
Kholosi Kholosi is an Indo-Aryan language spoken in two villages in southern Iran that was first described in 2008. At its current status, the language is considered endangered. In 2008, it was only spoken in the neighboring villages of Kholus and Got ...
.


Western Zone

Western Indo-Aryan languages are spoken in central and western India, in states such as
Madhya Pradesh Madhya Pradesh (; ; ) is a state in central India. Its capital is Bhopal and the largest city is Indore, Indore. Other major cities includes Gwalior, Jabalpur, and Sagar, Madhya Pradesh, Sagar. Madhya Pradesh is the List of states and union te ...
and
Rajasthan Rajasthan (; Literal translation, lit. 'Land of Kings') is a States and union territories of India, state in northwestern India. It covers or 10.4 per cent of India's total geographical area. It is the List of states and union territories of ...
, in addition to contiguous regions in Pakistan. Gujarati is the official language of
Gujarat Gujarat () is a States of India, state along the Western India, western coast of India. Its coastline of about is the longest in the country, most of which lies on the Kathiawar peninsula. Gujarat is the List of states and union territories ...
, and is spoken by over 50 million people. In Europe, various Romani languages are spoken by the
Romani people {{Infobox ethnic group , group = Romani people , image = , image_caption = , flag = Roma flag.svg , flag_caption = Romani flag created in 1933 and accepted at the 1971 World Romani Congress , po ...
, an itinerant community who historically migrated from India. The Western Indo-Aryan languages are thought to have diverged from their northwestern counterparts, although they have a common antecedent in
Shauraseni Prakrit Shauraseni Prakrit () was a Middle Indo-Aryan language and a Dramatic Prakrit. Shauraseni was the chief language used in drama in medieval northern India. Most of the material in this language originates from the 3rd to 10th centuries, and re ...
. *
Rajasthani Rajasthani may refer to: * something of, from, or related to Rajasthan, a state of India * Rajasthani languages, a group of Indic languages spoken there * Rajasthani people, the native inhabitants of the state * Rajasthani architecture, Indian ar ...
: Bagri,
Marwari Marwari may refer to: * anything of, from, or related to the Marwar region of Rajasthan, India, largely in the Thar Desert ** Marwari people, an Indian ethnic group originating in the Marwar region ** Marwari language, the language of the Marwari ...
, Mewati, Dhundari, Harauti, Mewari,
Shekhawati Shekhawati is a region in the northern part of the States and union territories of India, Indian state of Rajasthan, comprising the districts of Sikar, Jhunjhunu, and Churu district, Churu. The language of this region is also called Shekhawat ...
,
Dhatki Dhatki (धाटकी; ڍاٽڪي), also known as Dhatti (धाटी; ڍاٽي), Thari (थारी; ٿَري), is a Indo-Aryan Language of the Indo-European language family. It is mainly spoken in Tharparkar and Umerkot districts of Sindh ...
, Malvi, Nimadi,
Gujari Gujari (also spelt Gojri, Gujri, or Gojari; ) is a Rajasthani Indo-Aryan language spoken by most of the Gujjar people in the northern parts of India and Pakistan, as well as in Afghanistan. In India, the language is spoken by 16.3 million pe ...
, Goaria, Loarki,
Bhoyari Bhoyari or Pawari is an Indo-Aryan dialect of central India, exclusively spoken by the people of the Kshatriya Pawar (Pawar / Bhoyar Pawar) caste. It is a dialect of the Rajasthani Malvi language, primarily spoken in the regions of Betul, Chhi ...
/ Pawari, Kanjari, Od,
Lambadi Lambadi, Lambani, Lamani or Banjari is a Western Indo-Aryan language spoken by the Banjara people across India. The language does not have a native script. Regional dialects are divided between the Banjara of Maharashtra (written in Devanaga ...
; * Gujarati: Gujarati, Jandavra,
Saurashtra Saurashtra, Sourashtra, or variants may refer to: ** Kathiawar, also called Saurashtra Peninsula, a peninsula in western India ** Saurashtra (state), alias United State of Kathiawar, a former Indian state, merged into Bombay State and since its d ...
, Aer, Vaghri, Parkari Koli,
Kachi Koli Kachi Koli is an Indo-Aryan languages, Indo-Aryan language spoken in India. There is a small population of Koli who live across the border in eastern Sindh province in neighbouring Pakistan. Part of the Gujarati languages, Gujarati subfamily, K ...
,
Wadiyara Koli Wadiyara Koli is an Indo-Aryan language of the Gujarati group. It is spoken by the Wadiyara people, who originate from Wadiyar in Gujarat; many of whom are thought to have migrated to Sindh in the early twentieth century, following the onset ...
; *
Bhil Bhil or Bheel refer to the various Indigenous peoples, indigenous groups inhabiting western India, including parts of Rajasthan and Madhya Pradesh and are also found in distant places such as Bengal and Tripura. Though they now speak the Bhili ...
: Kalto, Vasavi, Wagdi, Gamit, Vaagri Booli; ** Northern Bhil:
Bauria ''Bauria'' is an extinct genus of the suborder Therocephalia that existed during the Early Triassic, Early and Middle Triassic period, around 246-251 million years ago. It belonged to the family Bauriidae. ''Bauria'' was probably a herbivore o ...
, Bhilori, Magari; ** Central Bhil: Bhili proper, Bhilali, Chodri,
Dhodia Dhodia are an Adivasi people who have been placed in the Indian communities recognition, under Schedule Tribes. The majority of the Dhodia tribes are located in the southern part of Gujarat (Navsari, Surat and Valsad districts), Dadra and Nagar H ...
, Dhanki, Dubli; ** Bareli: Palya Bareli, Pauri Bareli, Rathwi Bareli,
Pardhi Pardhi is a Hindu tribe in India. The tribe is found mostly in Maharashtra and parts of Madhya Pradesh however small numbers can be found in Gujarat and Andhra Pradesh. The word Pardhi is derived from the Marathi language, Marathi word ‘''Para ...
; * Khandeshi * Domaaki * Domari *
Romani Romani may refer to: Ethnic groups * Romani people, or Roma, an ethnic group of Indo-Aryan origin ** Romani language, an Indo-Aryan macrolanguage of the Romani communities ** Romanichal, Romani subgroup in the United Kingdom * Romanians (Romanian ...
: Carpathian Romani,
Balkan Romani Balkan Roma, Balkaniko Romanes, or Balkan Gypsy is a specific non- Vlax dialect of the Romani language, spoken by groups within the Balkans, which include countries such as Albania, Bosnia-Herzegovina, Bulgaria, Greece, Kosovo, North Macedonia, ...
,
Vlax Romani Vlax Romani varieties are spoken mainly in Southeastern Europe by the Romani people.Norbert Boretzky and Birgit Igla. Kommentierter Dialektatlas des Romani. Wiesbaden: Harrassowitz Verlag 2004. Teil 1: Vergleich der Dialekte. Vlax Romani can also ...
, Baltic Romani; ** Northern Romani *** British Romani:
Angloromani Angloromani or Anglo-Romani (literally "English Romani"; also known as Angloromany, Rummaness, or ) is a Para-Romani dialect spoken by the Romanichal, a subgroup of the Romani people in the United Kingdom and other parts of the English-speaking w ...
, Welsh Romani *** Northwestern Romani:
Sinte Romani Sinte Romani (also known as Sintitikes, Manuš) is the variety of Romani spoken by the Sinti people in Germany, France, Austria, Belgium, the Netherlands, some parts of Northern Italy and other adjacent regions. Sinte Romani is characterized by ...
, Finnish Kalo


Zone

Within India,
Central Indo-Aryan languages The Central Indo-Aryan languages or Hindi languages are a group of Indo-Aryan languages spoken across Northern and Central India. They historically form a dialect continuum that descends from the Middle Prakrits. Located in the Hindi Belt, ...
are spoken primarily in the western
Gangetic plains The Indo-Gangetic Plain, also known as the Northern Plain or North Indian River Plain, is a fertile plain spanning across the northern and north-eastern part of the Indian subcontinent. It encompasses northern and eastern India, eastern Pakist ...
, including
Delhi Delhi, officially the National Capital Territory (NCT) of Delhi, is a city and a union territory of India containing New Delhi, the capital of India. Straddling the Yamuna river, but spread chiefly to the west, or beyond its Bank (geography ...
and parts of the Central Highlands, where they are often transitional with neighbouring lects. Many of these languages, including
Braj Braj, also known as Vraj, Vraja, Brij or Brijbhumi, is a region in India on both sides of the Yamuna river with its centre at Mathura-Vrindavan in Uttar Pradesh state encompassing the area which also includes Palwal, Ballabhgarh and Nuh in ...
and
Awadhi Awadhi may refer to: * Something of, from, or related to the Awadh or Oudh region in Uttar Pradesh, northern India ** Awadhi people, ethnic group of India *** Awadhi language Awadhi, also known as Audhi, is an Indo-Aryan language belonging ...
, have rich literary and poetic traditions.
Urdu Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
, a Persianised derivative of Dehlavi descended from
Shauraseni Prakrit Shauraseni Prakrit () was a Middle Indo-Aryan language and a Dramatic Prakrit. Shauraseni was the chief language used in drama in medieval northern India. Most of the material in this language originates from the 3rd to 10th centuries, and re ...
, is the official language of
Pakistan Pakistan, officially the Islamic Republic of Pakistan, is a country in South Asia. It is the List of countries and dependencies by population, fifth-most populous country, with a population of over 241.5 million, having the Islam by country# ...
and also has strong
historical History is the systematic study of the past, focusing primarily on the human past. As an academic discipline, it analyses and interprets evidence to construct narratives about what happened and explain why it happened. Some theorists categ ...
connections to
India India, officially the Republic of India, is a country in South Asia. It is the List of countries and dependencies by area, seventh-largest country by area; the List of countries by population (United Nations), most populous country since ...
, where it also has been designated with official status.
Hindi Modern Standard Hindi (, ), commonly referred to as Hindi, is the Standard language, standardised variety of the Hindustani language written in the Devanagari script. It is an official language of India, official language of the Government ...
, a standardised and Sanskritised register of Dehlavi, is the official language of the
Government of India The Government of India (ISO 15919, ISO: Bhārata Sarakāra, legally the Union Government or Union of India or the Central Government) is the national authority of the Republic of India, located in South Asia, consisting of States and union t ...
(along with English). Together with Urdu, it is the third most-spoken language in the world. *
Western Hindi The Western Hindi languages, are a branch of the Indo-Aryan language family spoken chiefly in Haryana, western Uttar Pradesh and Bundelkhand region of Madhya Pradesh, in Northwest and Central India. The Western Hindi languages evolved from Saur ...
: Hindustani (including
Standard Hindi Modern Standard Hindi (, ), commonly referred to as Hindi, is the standardised variety of the Hindustani language written in the Devanagari script. It is an official language of the Government of India, alongside English, and is the ''lin ...
and
Standard Urdu Urdu (; , , ) is an Indo-Aryan language spoken chiefly in South Asia. It is the national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule language, the status and cultural heritage of which are recognised by ...
), Khariboli,
Braj Braj, also known as Vraj, Vraja, Brij or Brijbhumi, is a region in India on both sides of the Yamuna river with its centre at Mathura-Vrindavan in Uttar Pradesh state encompassing the area which also includes Palwal, Ballabhgarh and Nuh in ...
,
Haryanvi Haryanvi (हरियाणवी or हरयाणवी) is an Indo-Aryan language spoken primarily in the Indian state of Haryana and the territory of Delhi. Haryanvi is considered to be part of the dialect group of Western Hindi, which a ...
, Bundeli, Kannauji, Parya, Sansi. *
Eastern Hindi The Eastern Hindi languages, are a branch of the Indo-Aryan language family spoken chiefly in Awadh region of Uttar Pradesh, Baghelkhand region of Madhya Pradesh, and Chhattisgarh, in Northern and Central India. Eastern Hindi languages evolved ...
:
Bagheli Bagheli (Devanagari: बघेली) or Baghelkhandi is a Central Indo-Aryan language spoken in the Baghelkhand region of central India. Classification A language belonging to the Eastern Hindi subgroup, Bagheli is one of the languages d ...
,
Chhattisgarhi Chhattisgarhi () is an Indo-Aryan language spoken by approximately 16.25 million people from Chhattisgarh, Odisha, Madhya Pradesh and Maharashtra in India. It is the official language of Chhattisgarh. It is grouped within the Eastern Hindi lan ...
, Surgujia,
Awadhi Awadhi may refer to: * Something of, from, or related to the Awadh or Oudh region in Uttar Pradesh, northern India ** Awadhi people, ethnic group of India *** Awadhi language Awadhi, also known as Audhi, is an Indo-Aryan language belonging ...
(
Fiji Hindi Fiji Hindi (Devanagari: ; Kaithi: ; Perso-Arabic: ) is an Indo-Aryan language spoken by Indo-Fijians. It is considered to be a ''koiné'' language based on Awadhi that has also been subject to considerable influence by other Eastern Hindi ...
,
Caribbean Hindustani Caribbean Hindustani () is an Indo-Aryan language spoken by Indo-Caribbean people and the Indo-Caribbean diaspora. It is a koiné language mainly based on the Bhojpuri and Awadhi dialects. These Hindustani dialects were the most-spoken dialec ...
).


Eastern Zone

The Eastern Indo-Aryan languages, also known as Magadhan languages, are spoken throughout the eastern subcontinent, alongside other regions surrounding the northwestern Himalayan corridor.
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
is the seventh most-spoken language in the world, and has a strong literary tradition; the
national anthem A national anthem is a patriotic musical composition symbolizing and evoking eulogies of the history and traditions of a country or nation. The majority of national anthems are marches or hymns in style. American, Central Asian, and European ...
s of
India India, officially the Republic of India, is a country in South Asia. It is the List of countries and dependencies by area, seventh-largest country by area; the List of countries by population (United Nations), most populous country since ...
and
Bangladesh Bangladesh, officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world and among the List of countries and dependencies by ...
are written in Bengali. Assamese and Odia are the official languages of
Assam Assam (, , ) is a state in Northeast India, northeastern India, south of the eastern Himalayas along the Brahmaputra Valley, Brahmaputra and Barak River valleys. Assam covers an area of . It is the second largest state in Northeast India, nor ...
and
Odisha Odisha (), formerly Orissa (List of renamed places in India, the official name until 2011), is a States and union territories of India, state located in East India, Eastern India. It is the List of states and union territories of India by ar ...
, respectively. The Eastern Indo-Aryan languages descend from Magadhan
Apabhraṃśa Apabhraṃśa (, , Prakrit: ) is a term used by '' vaiyākaraṇāḥ'' (native grammarians) since Patañjali to refer to languages spoken in North India before the rise of the modern languages. In Indology, it is used as an umbrella term for ...
and ultimately from Magadhi Prakrit. Eastern Indo-Aryan languages display many morphosyntactic features similar to those of Munda languages, which are largely absent in western Indo-Aryan languages. It is suggested that "proto-Munda" languages may have once dominated the eastern Indo-Gangetic Plain, and were then absorbed by Indo-Aryan languages at an early date as Indo-Aryan spread east. * Bihari languages, Bihari: ** Bhojpuri language, Bhojpuri,
Caribbean Hindustani Caribbean Hindustani () is an Indo-Aryan language spoken by Indo-Caribbean people and the Indo-Caribbean diaspora. It is a koiné language mainly based on the Bhojpuri and Awadhi dialects. These Hindustani dialects were the most-spoken dialec ...
,
Fiji Hindi Fiji Hindi (Devanagari: ; Kaithi: ; Perso-Arabic: ) is an Indo-Aryan language spoken by Indo-Fijians. It is considered to be a ''koiné'' language based on Awadhi that has also been subject to considerable influence by other Eastern Hindi ...
; ** Magahi language, Magahi, Khortha language, Khortha; ** Maithili language, Maithili, Angika, Bajjika, Thēthi, Thethi, Dehati; ** Sadanic languages, Sadanic: Nagpuri language, Nagpuri, Kurmali language, Kurmali (Panchpargania); ** Tharu languages, Tharu: Kochila Tharu, Rana Tharu language, Rana Tharu, Kathariya Tharu, Sonha language, Sonha Tharu, Dangaura Tharu, Chitwania Buksa language, Buksa, Majhi language, Majhi, Musasa language, Musasa; ** Kumhali language, Kumhali, Kuswaric: Danwar language, Danwar, Bote-Darai language, Bote-Darai; * Halbic languages, Halbic: Halbi language, Halbi, Kamar language, Kamar, Bhunjia language (Halbic), Bhunjia, Nahari language, Nahari; * Odia: Baleswari Odia, Baleswari, Kataki, Ganjami Odia, Ganjami, Sundargadi Odia, Sundargadi, Sambalpuri language, Sambalpuri, Desia language, Desia; ** Bodo Parja language, Bodo Parja, Bhatri language, Bhatri, Reli language, Reli, Kupia language, Kupia; * Bengali–Assamese languages, Bengali–Assamese ** Bengali group of languages, Gaudi:
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
(Bangali dialect, Bangali, Rarhi dialect, Rarhi, Varendri dialect, Varendri, Manbhumi dialect, Manbhumi, Dhakaiya Kutti, Mymensinghi language, Mymensinghi, Dobhashi), Chittagonian language, Chittagonian, Sylheti language, Sylheti, Bishnupriya Manipuri language, Bishnupriya Manipuri, Hajong language, Hajong, Noakhali language, Noakhailla, Chakma language, Chakma, Tanchangya language, Tanchangya, Rohingya language, Rohingya; ** Gaudi-Kamarupi Apabhraṃśa, Gaudi-Kamarupic: Goalpariya dialects, Goalpariya, Rangpuri language, Rangpuri, Surjapuri language, Surjapuri, Rajbanshi language (Nepal), Rajbanshi; ** Kamarupi Prakrit, Kamarupic: Assamese, Nagamese creole, Nagamese;


Southern Zone

Marathi-Konkani languages are ultimately descended from Maharashtri Prakrit, whereas Insular Indo-Aryan languages are descended from Elu, Elu Prakrit and possess several characteristics that markedly distinguish them from most of their mainland Indo-Aryan counterparts. Insular Indo-Aryan languages (of
Sri Lanka Sri Lanka, officially the Democratic Socialist Republic of Sri Lanka, also known historically as Ceylon, is an island country in South Asia. It lies in the Indian Ocean, southwest of the Bay of Bengal, separated from the Indian subcontinent, ...
and
Maldives The Maldives, officially the Republic of Maldives, and historically known as the Maldive Islands, is an Archipelagic state, archipelagic country in South Asia located in the Indian Ocean. The Maldives is southwest of Sri Lanka and India, abou ...
) started developing independently and diverging from the continental Indo-Aryan languages from around 5th century BCE. * Marathi-Konkani languages, Marathi-Konkani ** Marathic:
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India **Marathi people (Uttar Pradesh), the Marathi people in the Indian state of Uttar Pradesh *Marathi language, the Indo-Aryan language spoken by the Mar ...
, Varhadi dialect, Varhadi, Andh language, Andh, Agri dialect, Agri, Zadi Boli, Thanjavur Marathi (language), Thanjavur, Berar-Deccan Marathi, Phudagi language, Phudagi, Judeo-Marathi, Judeo, Katkari language, Katkari, Varli language, Varli, Kadodi language, Kadodi; ** Konkanic: Konkani language, Konkani, Canarese Konkani, Maharashtrian Konkani. * Insular Indo-Aryan ** Sinhala language, Sinhala: Vedda language, Vedda ** Maldivian language, Maldivian: Dhivehi (Addu), Dhivehi


Unclassified

The following languages are otherwise unclassified within Indo-Aryan: * Chinali-Lahuli languages, Chinali–Lahul Lohar: Chinali language, Chinali, Lahul Lohar language, Lahul Lohar. * Bazigar language, Bazigar * Sheikhgal


History


Indian subcontinent

Dates indicate only a rough time frame. *
Proto-Indo-Aryan Proto-Indo-Aryan (sometimes Proto-Indic) is the reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the Indo-Aryans, who had migrated into the Indian subcontinent. Being descended from Proto-I ...
(before 1500 BCE, reconstructed) * Old Indo-Aryan (–300 BCE) ** early Old Indo-Aryan: includes
Vedic Sanskrit Vedic Sanskrit, also simply referred as the Vedic language, is the most ancient known precursor to Sanskrit, a language in the Indo-Aryan languages, Indo-Aryan subgroup of the Indo-European languages, Indo-European language family. It is atteste ...
( to 500 BCE) ** late Old Indo-Aryan: Epic Sanskrit, Classical Sanskrit ( to 1300 CE) ** Indo-Aryan superstrate in Mitanni, Mitanni Indo-Aryan () * Middle Indo-Aryan languages, Middle Indo-Aryan or
Prakrit Prakrit ( ) is a group of vernacular classical Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 5th century BCE to the 12th century CE. The term Prakrit is usually applied to the middle period of Middle Ind ...
s ( to 1500 CE) ** early Jain and Buddhist texts ( or 5th century BCE) ** early Middle Indo-Aryan: e.g. Ashokan Prakrits, Pali, Gandhari language, Gandhari, ( to 200 BCE) ** middle Middle Indo-Aryan: e.g. Dramatic Prakrits, Elu ( to 700 CE) ** late Middle Indo-Aryan: e.g. Abahattha ( to 1500 CE) * Early Modern Indo-Aryan (Late Medieval India): e.g. early Dakhini and emergence of the Dehlavi dialect


Proto-Indo-Aryan

Proto-Indo-Aryan (or sometimes Proto-Indic) is the Linguistic reconstruction, reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the Indo-Aryan peoples#History, pre-Vedic Indo-Aryans. Proto-Indo-Aryan is meant to be the predecessor of #Old Indo-Aryan, Old Indo-Aryan (1500–300 BCE), which is directly attested as Vedic Sanskrit, Vedic and Indo-Aryan superstrate in Mitanni, Mitanni-Aryan. Despite the great archaicity of Vedic, however, the other Indo-Aryan languages preserve a small number of Proto-Indo-Aryan language#Differences from Vedic, conservative features lost in Vedic.


Mitanni-Aryan hypothesis

Some theonyms, proper names, and other terminology of the Late Bronze Age Mitanni civilisation of Upper Mesopotamia exhibit an Indo-Aryan superstrate. While what few written records left by the Mittani are either in Hurrian language, Hurrian (which appears to have been the predominant language of their kingdom) or Akkadian language, Akkadian (the main diplomatic language of the Late Bronze Age Near East), these apparently Indo-Aryan names suggest that an Indo-Aryan elite imposed itself over the Hurrians in the course of the Indo-Aryan migration, Indo-Aryan expansion. If these traces are Indo-Aryan, they would be the earliest known direct evidence of Indo-Aryan, and would increase the precision in dating the split between the Indo-Aryan and Iranian languages (as the texts in which the apparent Indicisms occur can be dated with some accuracy). In a treaty between the Hittites and the Mitanni, the deities Mitra, Varuna, Indra, and the Ashvins (Nasatya) are invoked. Kikkuli's horse training text includes technical terms such as ''aika'' (cf. Sanskrit ''eka'', "one"), ''tera'' (''tri'', "three"), ''panza'' (''panca'', "five"), ''satta'' (''sapta'', seven), ''na'' (''nava'', "nine"), ''vartana'' (''vartana'', "turn", round in the horse race). The numeral ''aika'' "one" is of particular importance because it places the superstrate in the vicinity of Indo-Aryan proper as opposed to Indo-Iranian in general or early Iranian (which has ''aiva''). Another text has ''babru'' (''babhru'', "brown"), ''parita'' (''palita'', "grey"), and (''pingala'', "red"). Their chief festival was the celebration of the solstice (''vishuva'') which was common in most cultures in the ancient world. The Mitanni warriors were called ''marya'', the term for "warrior" in Sanskrit as well; note ''mišta-nnu'' (= ''miẓḍha'', ≈ Sanskrit ''mīḍha'') "payment (for catching a fugitive)" (M. Mayrhofer, ''Etymologisches Wörterbuch des Altindoarischen'', Heidelberg, 1986–2000; Vol. II:358). Sanskritic interpretations of Mitanni royal names render Artashumara (''artaššumara'') as ''Ṛtasmara'' "who thinks of Ṛta" (Mayrhofer II 780), Biridashva (''biridašṷa, biriiašṷ''a) as ''Prītāśva'' "whose horse is dear" (Mayrhofer II 182), Priyamazda (''priiamazda'') as ''Priyamedha'' "whose wisdom is dear" (Mayrhofer II 189, II378), Citrarata as ''Citraratha'' "whose chariot is shining" (Mayrhofer I 553), Indaruda/Endaruta as ''Indrota'' "helped by Indra" (Mayrhofer I 134), Shativaza (''šattiṷaza'') as ''Sātivāja'' "winning the race price" (Mayrhofer II 540, 696), Šubandhu as ''Subandhu'' "having good relatives" (a name in Palestine (region), Palestine, Mayrhofer II 209, 735), Tushratta (''tṷišeratta, tušratta'', etc.) as *tṷaiašaratha, Vedic Tvastar "whose chariot is vehement" (Mayrhofer, Etym. Wb., I 686, I 736).


Old Indo-Aryan

The earliest evidence of the group is from
Vedic Sanskrit Vedic Sanskrit, also simply referred as the Vedic language, is the most ancient known precursor to Sanskrit, a language in the Indo-Aryan languages, Indo-Aryan subgroup of the Indo-European languages, Indo-European language family. It is atteste ...
, that is used in the ancient preserved texts of the
Indian subcontinent The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
, the foundational canon of the Hindu synthesis known as the Vedas. The Indo-Aryan superstrate in Mitanni is of similar age to the language of the Rigveda, but the only evidence of it is a few proper names and specialised loanwords. While Old Indo-Aryan is the earliest stage of the Indo-Aryan branch, from which all known languages of the later stages Middle and New Indo-Aryan are derived, some documented Middle Indo-Aryan variants cannot fully be derived from the documented form of Old Indo-Aryan (on which Vedic and Classical Sanskrit are based), but betray features that must go back to other undocumented dialects of Old Indo-Aryan. From Vedic Sanskrit, "Sanskrit" (literally 'put together, perfected, elaborated') developed as the prestige language of culture, science and religion, as well as the court, theatre, etc. Sanskrit of the later Vedic texts is comparable to Classical Sanskrit, but is largely mutually unintelligible with Vedic Sanskrit.


Middle Indo-Aryan (Prakrits)

Outside the learned sphere of Sanskrit, vernacular dialects (
Prakrit Prakrit ( ) is a group of vernacular classical Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 5th century BCE to the 12th century CE. The term Prakrit is usually applied to the middle period of Middle Ind ...
s) continued to evolve. The oldest attested Prakrits are the Buddhism, Buddhist and Jainism, Jain canonical languages Pali and Ardhamagadhi Prakrit, respectively. Inscriptions in Ashokan Prakrit were also part of this early Middle Indo-Aryan stage. By medieval times, the Prakrits had diversified into various
Middle Indo-Aryan languages The Middle Indo-Aryan languages (or Middle Indic languages, sometimes conflated with the Prakrits, which are a stage of Middle Indic) are a historical group of languages of the Indo-Aryan family. They are the descendants of Old Indo-Aryan (OI ...
. ''
Apabhraṃśa Apabhraṃśa (, , Prakrit: ) is a term used by '' vaiyākaraṇāḥ'' (native grammarians) since Patañjali to refer to languages spoken in North India before the rise of the modern languages. In Indology, it is used as an umbrella term for ...
'' is the conventional cover term for transitional dialects connecting late Middle Indo-Aryan with early Modern Indo-Aryan, spanning roughly the 6th to 13th centuries. Some of these dialects showed considerable literary production; the ''Śravakacāra'' of Devasena (dated to the 930s) is now considered to be the first Hindi book. The next major milestone occurred with the Muslim conquests in the Indian subcontinent in the 13th–16th centuries. Under the flourishing Turco-Mongol tradition, Turco-Mongol Mughal Empire, Persian language in the Indian subcontinent, Persian became very influential as the language of prestige of the Islamic courts due to adoption of the foreign language by the Mughal emperors. The largest languages that formed from Apabhraṃśa were
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
, Bhojpuri language, Bhojpuri, Hindustani, Assamese, Sindhi, Gujarati, Odia,
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India **Marathi people (Uttar Pradesh), the Marathi people in the Indian state of Uttar Pradesh *Marathi language, the Indo-Aryan language spoken by the Mar ...
, and Punjabi.


New Indo-Aryan


= Medieval Hindustani

= In the Central Zone (Hindi), Central Zone Hindi-speaking areas, for a long time the prestige dialect was Braj Bhasha, but this was replaced in the 13th century by Dehlavi dialect, Dehlavi-based Hindustani. Hindustani was strongly influenced by
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
, with these and later Sanskrit influence leading to the emergence of Modern Standard Hindi and Modern Standard
Urdu Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
as register (sociolinguistics), registers of the Hindustani language. This state of affairs continued until the division of the British Indian Empire in 1947, when Hindi became the official language in India and
Urdu Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
became official in Pakistan. Despite the different script the fundamental grammar remains identical, the difference is more sociolinguistics, sociolinguistic than purely linguistic. Today it is widely understood/spoken as a second or third language throughout South Asia and one of the most widely known languages in the world in terms of number of speakers.


Outside the Indian subcontinent


Domari

Domari is an Indo-Aryan language spoken by older Dom people scattered across the Middle East. The language is reported to be spoken as far north as Azerbaijan and as far south as central Sudan.*Matras, Y. (2012). ''A grammar of Domari''. Berlin: De Gruyter Mouton (Mouton Grammar Library). Based on the systematicity of sound changes, linguists have concluded that the ethnonyms ''Domari'' and ''Romani people, Romani'' derive from the Indo-Aryan word ''ḍom''.


Lomavren

Lomavren is a nearly extinct mixed language, spoken by the Lom people, that arose from language contact between a language related to
Romani Romani may refer to: Ethnic groups * Romani people, or Roma, an ethnic group of Indo-Aryan origin ** Romani language, an Indo-Aryan macrolanguage of the Romani communities ** Romanichal, Romani subgroup in the United Kingdom * Romanians (Romanian ...
and Domari and the Armenian language.


Parya

Parya is spoken in Tajikistan and Uzbekistan by the descendants of migrants from the Indian subcontinent. The language retains many features similar to Punjabi and the Western Hindi dialects, while also bearing some influence from Tajik Persian.


Romani

The Romani language is usually included in the Western Indo-Aryan languages. Romani varieties, which are mainly spoken throughout Europe, are noted for their relatively conservative nature; maintaining the Middle Indo-Aryan present-tense person concord markers, alongside consonantal endings for nominal case. Indeed, these features are no longer evident in most other modern Central Indo-Aryan languages. Moreover, Romani shares an innovative pattern of past-tense person, which corresponds to Dardic languages, such as Kashmiri and Shina. This is believed to be further indication that proto-Romani speakers were originally situated in central regions of the subcontinent, before migrating to northwestern regions. However, there are no known historical sources regarding the development of the Romani language specifically within India. Research conducted by nineteenth-century scholars Pott (1845) and Miklosich (1882–1888) demonstrated that the Romani language is most aptly designated as a New Indo-Aryan language (NIA), as opposed to Middle Indo-Aryan (MIA); establishing that proto-Romani speakers could not have left India significantly earlier than AD 1000. The principal argument favouring a migration during or after the transition period to NIA is the loss of the old system of nominal case, coupled with its reduction to a two-way nominative-oblique case system. A secondary argument concerns the system of gender differentiation, due to the fact that Romani has only two genders (masculine and feminine). Middle Indo-Aryan languages (named MIA) generally employed three genders (masculine, feminine and neuter), and some modern Indo-Aryan languages retain this aspect today. It is suggested that loss of the neuter gender did not occur until the transition to NIA. During this process, most of the neuter nouns became masculine, while several became feminine. For example, the neuter ''aggi'' "fire" in Prakrit morphed into the feminine ''āg'' in Hindi, and ''jag'' in Romani. The parallels in grammatical gender evolution between Romani and other NIA languages have additionally been cited as indications that the forerunner of Romani remained on the Indian subcontinent until a later period, possibly as late as the tenth century.


Sindhic migrations

Kholosi, Jadgali, and Luwati represent offshoots of the Sindhic subfamily of Indo-Aryan that have established themselves in the Persian Gulf region, perhaps through sea-based migrations. These are of a later origin than the Rom and Dom migrations which represent a different part of Indo-Aryan as well.


Indentured labourer migrations

The use by the British East India Company of indentured labourers led to the transplanting of Indo-Aryan languages around the world, leading to locally influenced lects that diverged from the source language, such as
Fiji Hindi Fiji Hindi (Devanagari: ; Kaithi: ; Perso-Arabic: ) is an Indo-Aryan language spoken by Indo-Fijians. It is considered to be a ''koiné'' language based on Awadhi that has also been subject to considerable influence by other Eastern Hindi ...
and
Caribbean Hindustani Caribbean Hindustani () is an Indo-Aryan language spoken by Indo-Caribbean people and the Indo-Caribbean diaspora. It is a koiné language mainly based on the Bhojpuri and Awadhi dialects. These Hindustani dialects were the most-spoken dialec ...
.


Phonology


Consonants


Stop positions

The normative system of New Indo-Aryan stops consists of five places of articulation: Labial consonant, labial, Dental consonant, dental, "Retroflex consonant, retroflex", palatal consonant, palatal, and velar consonant, velar, which is the same as that of Sanskrit. The "retroflex" position may involve retroflexion, or curling the tongue to make the contact with the underside of the tip, or merely retraction. The point of contact may be alveolar consonant, alveolar or postalveolar, and the distinctive quality may arise more from the shaping than from the position of the tongue. Palatal stops have affricate consonant, affricated release and are traditionally included as involving a distinctive tongue position (blade in contact with hard palate). Widely transcribed as , claims to be a more accurate rendering. Moving away from the normative system, some languages and dialects have alveolar affricates instead of palatal, though some among them retain in certain positions: before front vowels (esp. ), before , or when geminated. Alveolar as an ''additional'' point of articulation occurs in
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India **Marathi people (Uttar Pradesh), the Marathi people in the Indian state of Uttar Pradesh *Marathi language, the Indo-Aryan language spoken by the Mar ...
and Konkani people, Konkani where dialect mixture and others factors upset the aforementioned complementation to produce minimal environments, in some West Pahari dialects through internal developments (, > ), and in Kashmiri. The addition of a Voiceless retroflex affricate, retroflex affricate to this in some Dardic languages maxes out the number of stop positions at seven (barring borrowed ), while a reduction to the inventory involves *ts > , which has happened in Assamese, Chittagonian language, Chittagonian, Sinhala language, Sinhala (though there have been other sources of a secondary ), and Southern Mewari. Further reductions in the number of stop articulations are in Assamese and
Romani Romani may refer to: Ethnic groups * Romani people, or Roma, an ethnic group of Indo-Aryan origin ** Romani language, an Indo-Aryan macrolanguage of the Romani communities ** Romanichal, Romani subgroup in the United Kingdom * Romanians (Romanian ...
, which have lost the characteristic dental/retroflex contrast, and in Chittagonian, which may lose its labial and velar articulations through spirantisation in many positions (> ). /q x ɣ f/ are restricted to Perso-Arabic loanwords in most IA languages but they occur natively in Khowar. According to Masica (1991) some dialects of Pashayi have a /θ/ which is unusual for IA languages. Domari which is spoken in the Middle East and had high contact with Middle Eastern languages has /q ħ ʕ ʔ/ and emphatic consonants from loanwords.


Nasals

Sanskrit was noted as having five nasal stop, nasal-stop articulations corresponding to its oral stops, and among modern languages and dialects Dogri, Kacchi, Kalasha, Rudhari, Shina, Saurashtri, and Sindhi have been analysed as having this full complement of phonemic nasals , with the last two generally as the result of the loss of the stop from a homorganic nasal + stop cluster ( > and > ), though there are other sources as well. In languages that lack phonemic nasals at some places of articulation, they can still occur allophonically from place assimilation in a nasal + stop culture, e.g. Hindi > .


Aspiration and breathy-voice

Most Indo-Aryan languages have contrastive Aspirated consonant, aspiration (), and some retain historical breathy voice on voiced consonants (). Sometimes both phenomena are analysed as a single aspiration contrast. The places and manners of articulation which allow contrastive aspiration vary by language; e.g. Sindhi permits phonemic , but the phonemic status of this sound in Hindi is uncertain, and many "Dardic" languages lack aspirated retroflex sibilants despite having unaspirated equivalents. In languages that have lost breathy-voice, the contrast has often been replaced with tone.


Regional developments

Some of these are mentioned in . * Implosive consonant, Implosives: Languages in the Sindhi languages, Sindhic subfamily, as well as Saraiki, western
Marwari Marwari may refer to: * anything of, from, or related to the Marwar region of Rajasthan, India, largely in the Thar Desert ** Marwari people, an Indian ethnic group originating in the Marwar region ** Marwari language, the language of the Marwari ...
dialects, and some dialects of Gujarati have developed implosive consonants from historical intervocalic geminates and word-initial stops. Sindhi has a full implosive series except for the dental implosive: . It has been claimed that Wadiyari Koli has the dental implosive too. Other languages have less complete implosive series, e.g. Kacchi has just . * Prenasalized stops: Sinhala and Maldivian (Dhivehi) have a series of prenasalised stops covering all places except for palatal: . * Palatalization (phonetics), Palatalization: Kashmiri (natively) and some Romani dialects (from contact with Slavic languages) have contrastive palatalisation. * ɬ, Voiceless lateral In Gawarbati, some Pashai dialects, partly Bashkarik and some Shina dialects have /ɬ/ from clusters of tr kr or sometimes pr; dr gr and br merged with /l/ in these languages. * Lateral affricates: Bhadarwahi has an unusual series of lateral retroflex affricates ( derived from historical clusters.


Vowels

Vowel typologies are varied across Indo-Aryan due to diachronic mergers and (in some cases) splits, as well as different accounts by linguists for even the widely-spoken languages. Vowel systems per are listed below. Many languages also have phonemic nasal vowels. Sylheti language is one of the few Tone (linguistics), tonal Indo-Aryan languages, others being Punjabi and a few Dardic languages. The vowels of Sylheti language listed below.


Charts

The following are consonant systems of major and representative New Indo-Aryan languages, mostly following , though here they are in International Phonetic Alphabet, IPA. Parentheses indicate those consonants found only in loanwords: square brackets indicate those with "very low functional load". The arrangement is roughly geographical.


Sociolinguistics


Register

In many Indo-Aryan languages, the literary register is often more archaic and utilises a different lexicon (Sanskrit or Perso-Arabic) than spoken vernacular. One example is Bengali's high literary form, Sadhu bhasha, Sādhū bhāṣā, as opposed to the more modern Calita bhasa, Calita bhāṣā (Cholito-bhasha). This distinction approaches diglossia.


Language and dialect

In the context of South Asia, the choice between the appellations Language or dialect, "language" and "dialect" is a difficult one, and any distinction made using these terms is obscured by their ambiguity. In one general colloquial sense, a language is a "developed" dialect: one that is standardised, has a written tradition and enjoys Prestige (sociolinguistics), social prestige. As there are degrees of development, the boundary between a language and a dialect thus defined is not clear-cut, and there is a large middle ground where assignment is contestable. There is a second meaning of these terms, in which the distinction is drawn on the basis of linguistic similarity. Though seemingly a "proper" linguistics sense of the terms, it is still problematic: methods that have been proposed for quantifying difference (for example, based on mutual intelligibility) have not been seriously applied in practice; and any relationship established in this framework is relative.


See also

* Indo-Aryans * Iranic languages * Proto-Vedic Continuity * Brahmic scripts, The family of Brahmic scripts * Indo-Aryan loanwords in Tamil * Dravidian languages * Languages of Bangladesh * Languages of India * Maldives#Languages, Languages of Maldives * Languages of Nepal * Languages of Pakistan * Languages of Sri Lanka * Languages of South Asia


Notes


References


Further reading

*Morgenstierne, Georg. "Early Iranic Influence upon Indo-Aryan." Acta Iranica, I. série, Commemoration Cyrus. Vol. I. Hommage universel (1974): 271–279. * John Beames, ''A comparative grammar of the modern Aryan languages of India: to wit, Hindi, Panjabi, Sindhi, Gujarati, Marathi, Oriya, and Bangali''. Londinii: Trübner, 1872–1879. 3 vols. * Madhav Deshpande (1979). ''Sociolinguistic attitudes in India: An historical reconstruction''. Ann Arbor: Karoma Publishers. , (pbk). * Byomkes Chakrabarti, Chakrabarti, Byomkes (1994). ''A comparative study of Santali and Bengali''. Calcutta: K.P. Bagchi & Co. * Erdosy, George. (1995). ''The Indo-Aryans of ancient South Asia: Language, material culture and ethnicity''. Berlin: Walter de Gruyter. . * * Kobayashi, Masato.; & George Cardona (2004). ''Historical phonology of old Indo-Aryan consonants''. Tokyo: Research Institute for Languages and Cultures of Asia and Africa, Tokyo University of Foreign Studies. . * . * Misra, Satya Swarup. (1980). ''Fresh light on Indo-European classification and chronology''. Varanasi: Ashutosh Prakashan Sansthan. * Misra, Satya Swarup. (1991–1993). ''The Old-Indo-Aryan, a historical & comparative grammar'' (Vols. 1–2). Varanasi: Ashutosh Prakashan Sansthan. * Sen, Sukumar. (1995). ''Syntactic studies of Indo-Aryan languages''. Tokyo: Institute for the Study of Languages and Foreign Cultures of Asia and Africa, Tokyo University of Foreign Studies. * Vacek, Jaroslav. (1976). ''The sibilants in Old Indo-Aryan: A contribution to the history of a linguistic area''. Prague: Charles University.


External links


The Indo-Aryan languages
25 October 2009
The Indo-Aryan languages
Colin P.Masica
Survey of the syntax of the modern Indo-Aryan languages
(Rajesh Bhatt), 7 February 2003. {{DEFAULTSORT:Indo-Aryan Languages Indo-European languages Indo-Aryan languages,