HOME

TheInfoList



OR:

Bengali ( ), generally known by its endonym Bangla (, ), is an
Indo-Aryan language The Indo-Aryan languages (or sometimes Indic languages) are a branch of the Indo-Iranian languages in the Indo-European language family. As of the early 21st century, they have more than 800 million speakers, primarily concentrated in India, Pa ...
native to the Bengal region of
South Asia South Asia is the southern subregion of Asia, which is defined in both geographical Geography (from Greek: , ''geographia''. Combination of Greek words ‘Geo’ (The Earth) and ‘Graphien’ (to describe), literally "earth descr ...
. It is the official, national, and most widely spoken language of Bangladesh and the second most widely spoken of the 22
scheduled languages of India There is no national language in India. However, article 343(1) of the Indian constitution specifically mentions that, "The official language of the Union shall be Hindi in Devanagari script. The form of numerals to be used for the official pu ...
. With approximately 300 million
native speakers A first language, native tongue, native language, mother tongue or L1 is the first language or dialect that a person has been exposed to from birth or within the critical period. In some countries, the term ''native language'' or ''mother tongu ...
and another 37 million as second language speakers, Bengali is the fifth most-spoken native language and the seventh most spoken language by total number of speakers in the world. Bengali is the fifth most spoken Indo-European language. Bengali is the official and national language of Bangladesh, with 98% of Bangladeshis using Bengali as their first language. Within India, Bengali is the official language of the states of West Bengal,
Tripura Tripura (, Bengali: ) is a state in Northeast India. The third-smallest state in the country, it covers ; and the seventh-least populous state with a population of 36.71 lakh ( 3.67 million). It is bordered by Assam and Mizoram to the ea ...
and the
Barak Valley The Barak Valley is located in the southern region of the Indian state of Assam. The region is named after the Barak river. The Barak valley consists of three administrative districts of Assam - namely Cachar, Karimganj, and Hailakandi. The ...
region of the state of Assam. It is also a second official language of the Indian state of Jharkhand since September 2011. It is the most widely spoken language in the Andaman and Nicobar Islands in the Bay of Bengal, and is spoken by significant populations in other states including Bihar,
Arunachal Pradesh Arunachal Pradesh (, ) is a state in Northeastern India. It was formed from the erstwhile North-East Frontier Agency (NEFA) region, and became a state on 20 February 1987. It borders the states of Assam and Nagaland to the south. It shares ...
,
Delhi Delhi, officially the National Capital Territory (NCT) of Delhi, is a city and a union territory of India containing New Delhi, the capital of India. Straddling the Yamuna river, primarily its western or right bank, Delhi shares borders ...
, Chhattisgarh, Meghalaya, Mizoram, Nagaland,
Odisha Odisha (English: , ), formerly Orissa ( the official name until 2011), is an Indian state located in Eastern India. It is the 8th largest state by area, and the 11th largest by population. The state has the third largest population of ...
and Uttarakhand. Bengali is also spoken by the Bengali diasporas (
Bangladeshi diaspora The Bangladeshi diaspora ( bn, প্রবাসী বাংলাদেশী) are people of Bangladeshi birth or descent who live outside of Bangladesh. First-generation migrants may have moved abroad from Bangladesh for various reasons in ...
and Indian Bengalis) in Europe, the United States, the Middle East and other countries. Bengali has developed over the course of more than 1,300 years. Bengali literature, with its millennium-old literary history, extensively developed during the Bengali Renaissance and is one of the most prolific and diverse literary traditions in Asia. The Bengali language movement from 1948 to 1956 demanding Bengali to be an official language of Pakistan fostered
Bengali nationalism Bengalism or Bengali nationalism () was a form of nationalism that focused on Bengalis as a singular nation. The people of Bengali ethnicity speak Bengali language. Bengalis mostly live across Bangladesh and the Indian states of Tripura an ...
in East Bengal leading to the emergence of Bangladesh in 1971. In 1999,
UNESCO The United Nations Educational, Scientific and Cultural Organization is a specialized agency of the United Nations (UN) aimed at promoting world peace and security through international cooperation in education, arts, sciences and culture. It ...
recognised 21 February as
International Mother Language Day International Mother Language Day is a worldwide annual observance held on 21 February to promote awareness of linguistic and cultural diversity and to promote multilingualism. First announced by UNESCO on 17 November 1999, it was formal ...
in recognition of the language movement.


Etymology

The first native name for Bengali was ''Gauda-bhasa'' in 16th century. In the 19th century, it was known as ''Vanga-bhasa'' or ''Bangala-bhasa''. In the 21st century, it is known as ''Bangla-bhasa''.


History


Ancient

Although
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
was practised by Hindu
Brahmins Brahmin (; sa, ब्राह्मण, brāhmaṇa) is a varna as well as a caste within Hindu society. The Brahmins are designated as the priestly class as they serve as priests ( purohit, pandit, or pujari) and religious teachers (guru ...
in Bengal since the
first millennium BC The 1st millennium BC, also known as the last millennium BC, was the period of time lasting from the years 1000 BC to 1 BC ( 10th to 1st centuries BC; in astronomy: JD – ). It encompasses the Iron Age in the Old World and sees the transiti ...
, the local Buddhist population were speaking in some varieties of the Prakrit. These varieties generally referred to as "eastern Magadhi Prakrit", as coined by linguist Suniti Kumar Chatterji, as the Middle Indo-Aryan dialects were influential in the
first millennium File:1st millennium montage.png, From top left, clockwise: Depiction of Jesus, the central figure in Christianity; The Colosseum, a landmark of the once-mighty Roman Empire; Kaaba, the Great Mosque of Mecca, the holiest site of Islam; Chess, ...
when Bengal was a part of the Magadhan realm. The local varieties had no official status during the Gupta Empire, and with Bengal increasingly becoming a hub of Sanskrit literature for Hindu priests, the vernacular of Bengal gained a lot of influence from Sanskrit. Magadhi Prakrit was also spoken in modern-day Bihar and Assam, and this vernacular eventually evolved into Ardha Magadhi. Ardha Magadhi began to give way to what is known as Apabhraṃśa, by the end of the first millennium. The Bengali language evolved as a distinct language over the course of time.


Early

Though some archeologists claim that some 10th-century texts were in Bengali, it is not certain whether they represent a differentiated language or whether they represent a stage when
Eastern Indo-Aryan languages The Eastern Indo-Aryan languages, also known as Māgadhan languages, are spoken throughout the eastern Indian subcontinent (East India and Assam, Bangladesh), including Bihar, Uttar Pradesh, Jharkhand, Bengal, Tripura, Assam, and Odisha; alongs ...
were differentiating. The local Apabhraṃśa of the eastern subcontinent, Purbi Apabhraṃśa or
Abahatta Abahaṭ‌ṭha, Abahatta or Avahaṭṭha ( Prakrit: ''abasaṭ‌ṭa'', ultimately from Sanskrit ''apaśabda'' 'meaningless sound') is a stage in the evolution of the Eastern group of the Indo-Aryan languages. The eastern group consists of ...
(lit. 'meaningless sounds'), eventually evolved into regional dialects, which in turn formed three groups, the
Bengali–Assamese languages The Bengali–Assamese languages (also Gauda–Kamarupa languages) is a grouping of several languages. This group belongs to the Eastern zone of Indo-Aryan languages. The languages in this group as per Glottolog includes Assamese, Bengali, Bi ...
, the Bihari languages, and the Odia language. The language was not static: different varieties coexisted and authors often wrote in multiple dialects in this period. For example, Ardhamagadhi is believed to have evolved into Abahatta around the 6th century, which competed with the ancestor of Bengali for some time. The ancestor of Bengali was the language of the
Pala Empire The Pāla Empire (r. 750-1161 CE) was an imperial power during the post-classical period in the Indian subcontinent, which originated in the region of Bengal. It is named after its ruling dynasty, whose rulers bore names ending with the suffi ...
and the
Sena dynasty The Sena dynasty was a Hindu dynasty during the early medieval period on the Indian subcontinent, that ruled from Bengal through the 11th and 12th centuries. The empire at its peak covered much of the north-eastern region of the Indian subcont ...
.


Medieval

During the medieval period, Middle Bengali was characterised by the elision of word-final '' ô'', the spread of compound verbs, which is originate from
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
Schwa. Slowly word-final ''ô'' was disappeared from many words as influenced from the Arabic,
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
and Turkic languages. The arrival of merchants and traders from the Middle East and Turkestan into the Buddhist-ruling
Pala Empire The Pāla Empire (r. 750-1161 CE) was an imperial power during the post-classical period in the Indian subcontinent, which originated in the region of Bengal. It is named after its ruling dynasty, whose rulers bore names ending with the suffi ...
, from as early as the 7th century, gave birth to Islamic influence in the region. In the 13th century, the subsequent Muslim expeditions to Bengal greatly encouraged the migratory movements of Arab Muslims and
Turco-Persian The composite Turko-Persian, Turco-Persian
''Turko-Persia in historical perspective'', Cambridge University Press, ...
s, who heavily influenced the local vernacular by settling among the native population. Bengali acquired prominence, over Persian, in the court of the
Sultans of Bengal The Sultanate of Bengal ( Middle Bengali: শাহী বাঙ্গালা ''Shahī Baṅgala'', Classical Persian: ''Saltanat-e-Bangālah'') was an empire based in Bengal for much of the 14th, 15th and 16th centuries. It was the dominan ...
with the ascent of Jalaluddin Muhammad Shah. Subsequent Muslim rulers actively promoted the literary development of Bengali, allowing it to become the most spoken
vernacular A vernacular or vernacular language is in contrast with a "standard language". It refers to the language or dialect that is spoken by people that are inhabiting a particular country or region. The vernacular is typically the native language, n ...
language in the Sultanate. Bengali adopted many words from Arabic and
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
, which was a manifestation of Islamic culture on the language. Major texts of Middle Bengali (1400–1800) include Yusuf-Zulekha by Shah Muhammad Sagir and Srikrishna Kirtana by the
Chandidas Chandidas (born 1408) was a medieval poet of Bengal, or possibly more than one. Over 1250 poems related to the love of Radha and Krishna in Bengali with the ''bhanita'' of Chandidas are found with three different sobriquets along with his name, ...
poets. Court support for Bengali culture and language waned when the Mughal Empire conquered Bengal in the late 16th and early 17th century.


Modern

The modern literary form of Bengali was developed during the 19th and early 20th centuries based on the dialect spoken in the Nadia region, a west-central Bengali dialect. Bengali presents a strong case of diglossia, with the literary and standard form differing greatly from the colloquial speech of the regions that identify with the language. The modern Bengali vocabulary contains the vocabulary base from Magadhi Prakrit and Pali, also tatsamas and reborrowings from Sanskrit and other major borrowings from
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
, Arabic,
Austroasiatic languages The Austroasiatic languages , , are a large language family in Mainland Southeast Asia and South Asia. These languages are scattered throughout parts of Thailand, Laos, India, Myanmar, Malaysia, Bangladesh, Nepal, and southern China and are t ...
and other languages in contact with. During this period, there were two main forms of written Bengali: * ''Chôlitôbhasha''; colloquial form of Bengali using simplified inflections * '' Sadhubhasha''; Sanskritised form of Bengali. In 1948, the Government of Pakistan tried to impose
Urdu Urdu (;"Urdu"
''
The Bengali Language Movement was a popular ethno-linguistic movement in the former East Bengal (today Bangladesh), which was a result of the strong linguistic consciousness of the Bengalis to gain and protect spoken and written Bengali's recognition as a state language of the then Dominion of Pakistan. On 21 February 1952, five students and political activists were killed during protests near the campus of the University of Dhaka, whom were the first ever
Martyrs A martyr (, ''mártys'', "witness", or , ''marturia'', stem , ''martyr-'') is someone who suffers persecution and death for advocating, renouncing, or refusing to renounce or advocate, a religious belief or other cause as demanded by an external ...
to die for their rights on speaking their mother language. In 1956, Bengali was made a state language of Pakistan. The day has since been observed as Language Movement Day in Bangladesh and is also commemorated as
International Mother Language Day International Mother Language Day is a worldwide annual observance held on 21 February to promote awareness of linguistic and cultural diversity and to promote multilingualism. First announced by UNESCO on 17 November 1999, it was formal ...
by
UNESCO The United Nations Educational, Scientific and Cultural Organization is a specialized agency of the United Nations (UN) aimed at promoting world peace and security through international cooperation in education, arts, sciences and culture. It ...
every year since 2000. In 2010, the parliament of Bangladesh and the legislative assembly of West Bengal proposed that Bengali be made an official UN language, though no further action was taken on this matter.


Geographical distribution

The Bengali language is native to the region of Bengal, which comprises the present-day nation of Bangladesh and the Indian state of West Bengal. Besides the native region it is also spoken by the Bengalis living in
Tripura Tripura (, Bengali: ) is a state in Northeast India. The third-smallest state in the country, it covers ; and the seventh-least populous state with a population of 36.71 lakh ( 3.67 million). It is bordered by Assam and Mizoram to the ea ...
, southern Assam and the Bengali population in the Indian union territory of Andaman and Nicobar Islands. Bengali is also spoken in the neighbouring states of
Odisha Odisha (English: , ), formerly Orissa ( the official name until 2011), is an Indian state located in Eastern India. It is the 8th largest state by area, and the 11th largest by population. The state has the third largest population of ...
, Bihar, and Jharkhand, and sizeable minorities of Bengali speakers reside in Indian cities outside Bengal, including
Delhi Delhi, officially the National Capital Territory (NCT) of Delhi, is a city and a union territory of India containing New Delhi, the capital of India. Straddling the Yamuna river, primarily its western or right bank, Delhi shares borders ...
, Mumbai, Thane, Varanasi, and Vrindavan. There are also significant Bengali-speaking communities in the Middle East, the
United States The United States of America (U.S.A. or USA), commonly known as the United States (U.S. or US) or America, is a country primarily located in North America. It consists of 50 states, a federal district, five major unincorporated territori ...
,
Singapore Singapore (), officially the Republic of Singapore, is a sovereign island country and city-state in maritime Southeast Asia. It lies about one degree of latitude () north of the equator, off the southern tip of the Malay Peninsula, bor ...
,
Malaysia Malaysia ( ; ) is a country in Southeast Asia. The federation, federal constitutional monarchy consists of States and federal territories of Malaysia, thirteen states and three federal territories, separated by the South China Sea into two r ...
, Australia,
Canada Canada is a country in North America. Its ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, covering over , making it the world's second-largest country by tot ...
, the
United Kingdom The United Kingdom of Great Britain and Northern Ireland, commonly known as the United Kingdom (UK) or Britain, is a country in Europe, off the north-western coast of the European mainland, continental mainland. It comprises England, Scotlan ...
, and
Italy Italy ( it, Italia ), officially the Italian Republic, ) or the Republic of Italy, is a country in Southern Europe. It is located in the middle of the Mediterranean Sea, and its territory largely coincides with the homonymous geographical ...
.


Official status

The 3rd article of the
Constitution of Bangladesh The Constitution of Bangladesh ( bn, বাংলাদেশের সংবিধান — ), officially the Constitution of the People's Republic of Bangladesh ( bn, গণপ্রজাতন্ত্রী বাংলাদেশের ...
states Bengali to be the sole
official language An official language is a language given supreme status in a particular country, state, or other jurisdiction. Typically the term "official language" does not refer to the language used by a people or country, but by its government (e.g. judiciary, ...
of Bangladesh. The Bengali Language Implementation Act, 1987 made it mandatory to use Bengali in all records and correspondences, laws, proceedings of court and other legal actions in all courts, government or semi-government offices, and autonomous institutions in Bangladesh. It is also the ''de facto'' national language of the country. In India, Bengali is one of the 23 official languages. It is the official language of the Indian states of West Bengal,
Tripura Tripura (, Bengali: ) is a state in Northeast India. The third-smallest state in the country, it covers ; and the seventh-least populous state with a population of 36.71 lakh ( 3.67 million). It is bordered by Assam and Mizoram to the ea ...
and in
Barak Valley The Barak Valley is located in the southern region of the Indian state of Assam. The region is named after the Barak river. The Barak valley consists of three administrative districts of Assam - namely Cachar, Karimganj, and Hailakandi. The ...
of Assam. Bengali has been a second official language of the Indian state of Jharkhand since September 2011. In Pakistan, Bengali is a recognised secondary language in the city of Karachi. The Department of Bengali in the University of Karachi also offers regular programs of studies at the Bachelors and at the Masters levels for Bengali Literature. The national anthems of both Bangladesh (''
Amar Sonar Bangla "" ( bn, আমার সোনার বাংলা, lit=My Golden Bengal, ) is the national anthem of Bangladesh. An ode to Mother Bengal, the lyrics were written by Bengali polymath Rabindranath Tagore in 1905, while the melody of the hym ...
'') and India ('' Jana Gana Mana'') were written in Bengali by the Bengali Nobel laureate Rabindranath Tagore. Additionally, the first two verses of '' Vande Mataram'', a patriotic song written in Bengali by Bankim Chandra Chatterjee, was adopted as the "national song" of India in both the colonial period and later in 1950 in independent India. Furthermore, it is believed by many that the national anthem of Sri Lanka ( Sri Lanka Matha) was inspired by a Bengali poem written by Rabindranath Tagore, while some even believe the anthem was originally written in Bengali and then translated into Sinhala. After the contribution made by the
Bangladesh UN Peacekeeping Force The Bangladesh Armed Forces and the Bangladesh Police have been actively involved in a number of United Nations Peace Support Operations (UNPSO) since 1988. Currently Bangladesh is the largest contributor in the UN peacekeeping missions. Banglad ...
in the
Sierra Leone Civil War The Sierra Leone Civil War (1991–2002), or the Sierra Leonean Civil War, was a civil war in Sierra Leone that began on 23 March 1991 when the Revolutionary United Front (RUF), with support from the special forces of Liberian dictator Char ...
under the
United Nations Mission in Sierra Leone The United Nations Mission in Sierra Leone (UNAMSIL) was a United Nations peacekeeping operation in Sierra Leone from 1999 to 2006. It was created by the United Nations Security Council in October 1999 to help with the implementation of the Lom� ...
, the government of
Ahmad Tejan Kabbah Alhaji Ahmad Tejan Kabbah (February 16, 1932 – March 13, 2014) was a Sierra Leonean politician who served twice as the 3rd President of Sierra Leone, from 1996 to 1997 and again from 1998 to 2007. An economist and attorney by profession, K ...
declared Bengali as an honorary official language in December 2002. In 2009, elected representatives in both Bangladesh and West Bengal called for Bengali to be made an official language of the United Nations.


Dialects

Regional variation in spoken Bengali constitutes a
dialect continuum A dialect continuum or dialect chain is a series of language varieties spoken across some geographical area such that neighboring varieties are mutually intelligible, but the differences accumulate over distance so that widely separated varie ...
. Linguist Suniti Kumar Chatterji grouped the dialects of Bengali language into four large clusters- Rarhi, Vangiya, Kamrupi and Varendri; but many alternative grouping schemes have also been proposed. The south-western dialects ( Rarhi or
Nadia Nadia is a female name. Variations include Nadja, Nadya, Nadine, Nadiya, and Nadiia. Most variations of the name are derived from Arabic, Slavic languages, or both. In Slavic, names similar to ''Nadia'' mean "hope" in many Slavic languages: ...
dialect) form the basis of modern standard colloquial Bengali. In the dialects prevalent in much of eastern and south-eastern Bangladesh ( Barisal, Chittagong,
Dhaka Dhaka ( or ; bn, ঢাকা, Ḍhākā, ), formerly known as Dacca, is the capital and largest city of Bangladesh, as well as the world's largest Bengali-speaking city. It is the eighth largest and sixth most densely populated city i ...
and
Sylhet Division Sylhet Division ( bn, সিলেট বিভাগ) is the northeastern division of Bangladesh. It is bordered by the Indian states of Meghalaya, Assam and Tripura to the north, east and south respectively, and by the Bangladeshi divisions ...
s of Bangladesh), many of the stops and affricates heard in West Bengal are pronounced as
fricatives A fricative is a consonant produced by forcing air through a narrow channel made by placing two articulators close together. These may be the lower lip against the upper teeth, in the case of ; the back of the tongue against the soft palate in t ...
. Western alveolo-palatal affricates , , correspond to eastern , , . The influence of Tibeto-Burman languages on the phonology of Eastern Bengali is seen through the lack of nasalised vowels and an alveolar articulation of what are categorised as the "cerebral" consonants (as opposed to the postalveolar articulation of West Bengal). Some variants of Bengali, particularly Chittagonian and Chakma, have contrastive tone; differences in the pitch of the speaker's voice can distinguish words. Kharia Thar and Mal Paharia are closely related to Western Bengali dialects, but are typically classified as separate languages. Similarly, Hajong is considered a separate language, although it shares similarities to Northern Bengali dialects. During the standardisation of Bengali in the 19th century and early 20th century, the cultural centre of Bengal was in Kolkata, a city founded by the British. What is accepted as the standard form today in both West Bengal and Bangladesh is based on the West-Central dialect of Nadia District, located next to the border of Bangladesh and 76 miles north of Kolkata. There are cases where speakers of Standard Bengali in West Bengal will use a different word from a speaker of Standard Bengali in Bangladesh, even though both words are of native Bengali descent. For example, the word salt is ''nun'' in the west which corresponds to ''lôbôṇ'' in the east. Bengali exhibits diglossia, though some scholars have proposed triglossia or even n-glossia or heteroglossia between the written and spoken forms of the language. Two styles of writing have emerged, involving somewhat different vocabularies and syntax: # '' Shadhu-bhasha'' ( "uptight language") was the written language, with longer verb inflections and more of a Pali and
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
-derived '' Tatsama'' vocabulary. Songs such as India's national anthem '' Jana Gana Mana'' (by Rabindranath Tagore) were composed in this style. Its use in modern writing however is uncommon, restricted to some official signs and documents in Bangladesh as well as for achieving particular literary effects. # ''Cholito-bhasha'' ( "running language"), known by linguists as Standard Colloquial Bengali, is a written Bengali style exhibiting a preponderance of colloquial idiom and shortened verb forms, and is the standard for written Bengali now. This form came into vogue towards the turn of the 19th century, promoted by the writings of Peary Chand Mitra ('' Alaler Gharer Dulal'', 1857), Pramatha Chaudhuri (''Sabujpatra'', 1914) and in the later writings of Rabindranath Tagore. It is modelled on the dialect spoken in the
Shantipur Santipur is a city and a municipality in the Ranaghat subdivision of Nadia district in the Indian States and territories of India, state of West Bengal. The fort area of this city, also known as Daak-Garh (calling or gathering room) is thought ...
region in Nadia district, West Bengal. This form of Bengali is often referred to as the "Nadia standard", "Nadia dialect", "Southwestern/West-Central dialect" or "Shantipuri Bangla". Linguist Prabhat Ranjan Sarkar categorises the language as: * Madhya Rarhi dialect * Kanthi (Contai) dialect * Kolkata dialect * Shantipuriya (Nadia) dialect *
Shershahabadia Shershabadia is a community found in the state of West Bengal, Bihar and Jharkhand in India. They belong to Shaikh community and also form a significant part of the Shaikhs of West Bengal and Bihar. Common surnames used by the community include Sh ...
(Maldahiya/ Jangipuri) dialect * Barendri dialect * Rangapuriya dialect * Shrihatti dialect * Dhakiya (Bikrampuri) dialect * Jashore/Jessoriya dialect * Barisal (Chandradwip) dialect *
Chattal Chattal ( ur, چتال) is a village located in Chak Malook union council of Chakwal District in the Punjab Province of Pakistan, it is part of Chakwal Tehsil. It is a village located approximately Chakwal, Punjab, Pakistan on the Chakwal-Jehlu ...
(Chittagong) dialect While most writing is in Standard Colloquial Bengali (SCB), spoken dialects exhibit a greater variety. People in southeastern West Bengal, including Kolkata, speak in SCB. Other dialects, with minor variations from Standard Colloquial, are used in other parts of West Bengal and western Bangladesh, such as the Midnapore dialect, characterised by some unique words and constructions. However, a majority in Bangladesh speak in dialects notably different from SCB. Some dialects, particularly those of the Chittagong region, bear only a superficial resemblance to SCB. The dialect in the Chittagong region is least widely understood by the general body of Bengalis. The majority of Bengalis are able to communicate in more than one
variety Variety may refer to: Arts and entertainment Entertainment formats * Variety (radio) * Variety show, in theater and television Films * ''Variety'' (1925 film), a German silent film directed by Ewald Andre Dupont * ''Variety'' (1935 film), ...
– often, speakers are fluent in ''Cholitobhasha'' (SCB) and one or more regional dialects. Even in SCB, the vocabulary may differ according to the speaker's religion: Muslims are more likely to use words of Persian and Arabic origin, along with more words naturally derived from Sanskrit (
tadbhava (Sanskrit: तद्भव, , lit. "arising from that") is the Sanskrit word for one of three etymological classes defined by native grammarians of Middle Indo-Aryan languages, alongside tatsama and deśi words. at pp. 67-69. A "tadbhava" is a w ...
), whereas Hindus are more likely to use tatsama (words directly borrowed from Sanskrit). For example:


Phonology

The
phonemic In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west ...
inventory of standard Bengali consists of 29 consonants and 7 vowels, as well as 7 nasalised vowels. The inventory is set out below in the
International Phonetic Alphabet The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standardized representation ...
(upper grapheme in each box) and romanisation (lower grapheme). Bengali is known for its wide variety of
diphthongs A diphthong ( ; , ), also known as a gliding vowel, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of the speech ...
, combinations of
vowel A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (leng ...
s occurring within the same syllable. Two of these, and , are the only ones with representation in script, as and respectively. may all form the glide part of a diphthong. The total number of diphthongs is not established, with bounds at 17 and 31. An incomplete chart is given by Sarkar (1985) of the following:


Stress

In standard Bengali,
stress Stress may refer to: Science and medicine * Stress (biology), an organism's response to a stressor such as an environmental condition * Stress (linguistics), relative emphasis or prominence given to a syllable in a word, or to a word in a phrase ...
is predominantly initial. Bengali words are virtually all
trochaic In English poetic metre and modern linguistics, a trochee () is a metrical foot consisting of a stressed syllable followed by an unstressed one. But in Latin and Ancient Greek poetic metre, a trochee is a heavy syllable followed by a light one (al ...
; the primary stress falls on the initial syllable of the word, while secondary stress often falls on all odd-numbered syllables thereafter, giving strings such as in ''shô-hô-jo-gi-ta'' "cooperation", where the boldface represents primary and secondary stress.


Consonant clusters

Native Bengali words do not allow initial consonant clusters; the maximum syllabic structure is CVC (i.e. one vowel flanked by a consonant on each side). Many speakers of Bengali restrict their phonology to this pattern, even when using Sanskrit or English borrowings, such as ''geram'' (CV.CVC) for ''gram'' (CCVC) "village" or ''iskul'' (VC.CVC) for ''skul'' (CCVC) "school".


Writing system

Bengali-Assamese script is an
abugida An abugida (, from Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental writing system in which consonant-vowel sequences are written as units; each unit is based on a consonant letter, and vowel n ...
, a script with letters for consonants, diacritics for vowels, and in which an
inherent vowel An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol. For example, if the Latin alphabet used 'i' as an inherent vowel, "Wikipedia" could be rendered as "W ...
(অ ''ô'') is assumed for consonants if no vowel is marked. The Bengali alphabet is used throughout Bangladesh and eastern India (Assam, West Bengal, Tripura). The Bengali alphabet is believed to have evolved from a modified
Brahmic script The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India ...
around 1000 CE (or 10th–11th century). in It is a cursive script with eleven graphemes or signs denoting nine vowels and two diphthongs, and thirty-nine graphemes representing
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced wi ...
s and other modifiers. There are no distinct upper and lower case letter forms. The letters run from left to right and spaces are used to separate orthographic words. Bengali script has a distinctive horizontal line running along the tops of the graphemes that links them together called ''matra''. Since the Bengali script is an abugida, its consonant graphemes usually do not represent phonetic segments, but carry an "inherent" vowel and thus are
syllabic Syllabic may refer to: *Syllable, a unit of speech sound, considered the building block of words **Syllabic consonant, a consonant that forms the nucleus of a syllable *Syllabary, writing system using symbols for syllables *Abugida, writing system ...
in nature. The inherent vowel is usually a back vowel, either as in "opinion" or , as in "mind", with variants like the more open . To emphatically represent a consonant sound without any inherent vowel attached to it, a special diacritic, called the '' hôsôntô'' , may be added below the basic consonant grapheme (as in ). This diacritic, however, is not common, and is chiefly employed as a guide to pronunciation. The abugida nature of Bengali consonant graphemes is not consistent, however. Often, syllable-final consonant graphemes, though not marked by a ''hôsôntô'', may carry no inherent vowel sound (as in the final in or the medial in ). A consonant sound followed by some vowel sound other than the inherent is orthographically realised by using a variety of vowel allographs above, below, before, after, or around the consonant sign, thus forming the ubiquitous consonant-vowel typographic ligatures. These allographs, called ''kar'', are diacritical vowel forms and cannot stand on their own. For example, the graph represents the consonant followed by the vowel , where is represented as the diacritical allograph (called ''i-kar'') and is placed ''before'' the default consonant sign. Similarly, the graphs , , , , , , , and represent the same consonant combined with seven other vowels and two diphthongs. In these consonant-vowel ligatures, the so-called "inherent" vowel is first expunged from the consonant before adding the vowel, but this intermediate expulsion of the inherent vowel is not indicated in any visual manner on the basic consonant sign . The vowel graphemes in Bengali can take two forms: the independent form found in the basic inventory of the script and the dependent, abridged, allograph form (as discussed above). To represent a vowel in isolation from any preceding or following consonant, the independent form of the vowel is used. For example, in "ladder" and in "Hilsa fish", the independent form of the vowel is used (cf. the dependent form. A vowel at the beginning of a word is always realised using its independent form. In addition to the inherent-vowel-suppressing ''hôsôntô'', three more diacritics are commonly used in Bengali. These are the superposed ''chôndrôbindu'' , denoting a suprasegmental for
nasalisation In phonetics, nasalization (or nasalisation) is the production of a sound while the velum is lowered, so that some air escapes through the nose during the production of the sound by the mouth. An archetypal nasal sound is . In the Internatio ...
of vowels (as in "moon"), the postposed ''ônusbar'' indicating the
velar nasal The voiced velar nasal, also known as agma, from the Greek word for 'fragment', is a type of consonantal sound used in some spoken languages. It is the sound of ''ng'' in English ''sing'' as well as ''n'' before velar consonants as in ''Englis ...
(as in "Bengali") and the postposed ''bisôrgô'' indicating the voiceless glottal fricative (as in "ouch!") or the gemination of the following consonant (as in "sorrow"). The Bengali consonant clusters ( ''juktôbênjôn'') are usually realised as ligatures, where the consonant which comes first is put on top of or to the left of the one that immediately follows. In these ligatures, the shapes of the constituent consonant signs are often contracted and sometimes even distorted beyond recognition. In the Bengali writing system, there are nearly 285 such ligatures denoting consonant clusters. Although there exist a few visual formulas to construct some of these ligatures, many of them have to be learned by rote. Recently, in a bid to lessen this burden on young learners, efforts have been made by educational institutions in the two main Bengali-speaking regions (West Bengal and Bangladesh) to address the opaque nature of many consonant clusters, and as a result, modern Bengali textbooks are beginning to contain more and more "transparent" graphical forms of consonant clusters, in which the constituent consonants of a cluster are readily apparent from the graphical form. However, since this change is not as widespread and is not being followed as uniformly in the rest of the Bengali printed literature, today's Bengali-learning children will possibly have to learn to recognise both the new "transparent" and the old "opaque" forms, which ultimately amounts to an increase in learning burden. Bengali punctuation marks, apart from the downstroke ''daṛi'' – the Bengali equivalent of a full stop – have been adopted from western scripts and their usage is similar. Unlike in western scripts (Latin, Cyrillic, etc.) where the letter-forms stand on an invisible baseline, the Bengali letter-forms instead hang from a visible horizontal left-to-right headstroke called ''matra''. The presence and absence of this matra can be important. For example, the letter ''tô'' and the numeral "3" are distinguishable only by the presence or absence of the ''matra'', as is the case between the consonant cluster ''trô'' and the independent vowel ''e'', also the letter ''hô'' and Bengali Ôbogroho ''(~ô)'' and letter ''o'' and consonant cluster ''ttô''. The letter-forms also employ the concepts of letter-width and letter-height (the vertical space between the visible matra and an invisible baseline). There is yet to be a uniform standard collating sequence (sorting order of graphemes to be used in dictionaries, indices, computer sorting programs, etc.) of Bengali graphemes. Experts in both Bangladesh and India are currently working towards a common solution for this problem.


Alternative and historic scripts

Throughout history there have been instances of the Bengali language being written in different scripts, though these employments were never popular on a large scale and were communally limited. Owing to Bengal's geographic location, Bengali areas bordering non-Bengali regions have been influenced by each other. Small numbers of people in Midnapore, which borders
Odisha Odisha (English: , ), formerly Orissa ( the official name until 2011), is an Indian state located in Eastern India. It is the 8th largest state by area, and the 11th largest by population. The state has the third largest population of ...
, have used the Odia script to write in Bengali. In the border areas between West Bengal and Bihar, some Bengali communities historically wrote Bengali in Devanagari,
Kaithi Kaithi (), also called Kayathi () or Kayasthi (), is a historical Brahmic script that was used widely in parts of Northern and Eastern India, primarily in the present-day states of Uttar Pradesh, Jharkhand and Bihar. In particular, it was us ...
and Tirhuta. In Sylhet and Bankura, modified versions of the Kaithi script had some historical prominence, mainly among Muslim communities. The variant in Sylhet was identical to the Baitali Kaithi script of Hindustani with the exception of Sylhet Nagri possessing ''matra''. Sylhet Nagri was standardised for printing in . Up until the 19th century, numerous variations of the Arabic script had been used across Bengal from Chittagong in the east to Meherpur in the west. The 14th-century court scholar of Bengal, Nur Qutb Alam, composed Bengali poetry using the Persian alphabet. After the Partition of India in the 20th century, the Pakistani government attempted to institute the Perso-Arabic script as the standard for Bengali in East Pakistan; this was met with resistance and contributed to the Bengali language movement. In the 16th century, Portuguese missionaries began a tradition of using the Roman alphabet to transcribe the Bengali language. Though the Portuguese-standard did not receive much growth, a few Roman Bengali works relating to Christianity and Bengali grammar were printed as far as Lisbon in 1743. The Portuguese were followed by the English and French respectively, whose works were mostly relating to Bengali grammar and transliteration. The first version of the
Aesop's Fables Aesop's Fables, or the Aesopica, is a collection of fables credited to Aesop, a slave and storyteller believed to have lived in ancient Greece between 620 and 564 BCE. Of diverse origins, the stories associated with his name have descended to ...
in Bengali was printed using Roman letters based on
English phonology Like many other languages, English has wide variation in pronunciation, both historically and from dialect to dialect. In general, however, the regional dialects of English share a largely similar (but not identical) phonological system. Amon ...
by the Scottish linguist John Gilchrist. Consecutive attempts to establish a Roman Bengali has continued across every century since these times, and have been supported by the likes of Suniti Kumar Chatterji, Muhammad Qudrat-i-Khuda and Muhammad Enamul Haq. The Digital Revolution has also played a part in the adoption of the
English alphabet The alphabet for Modern English is a Latin-script alphabet consisting of 26 letters, each having an upper- and lower-case form. The word ''alphabet'' is a compound of the first two letters of the Greek alphabet, ''alpha'' and '' beta''. ...
to write Bengali, with certain social media influencers publishing entire novels in Roman Bengali.


Orthographic depth

The Bengali script in general has a comparatively shallow orthography, i.e., in most cases there is a one-to-one correspondence between the sounds (phonemes) and the letters (graphemes) of Bengali. But grapheme-phoneme inconsistencies do occur in certain cases. One kind of inconsistency is due to the presence of several letters in the script for the same sound. In spite of some modifications in the 19th century, the Bengali spelling system continues to be based on the one used for Sanskrit, and thus does not take into account some sound mergers that have occurred in the spoken language. For example, there are three letters (, , and ) for the voiceless postalveolar fricative , although the letter retains the voiceless alveolar sibilant sound when used in certain consonant conjuncts as in "fall", "beat", etc. The letter also retains the voiceless retroflex sibilant sound when used in certain consonant conjuncts as in "suffering", "clan", etc. Similarly, there are two letters ( and ) for the
voiced postalveolar affricate The voiced palato-alveolar sibilant affricate, voiced post-alveolar affricate or voiced domed postalveolar sibilant affricate, is a type of consonantal sound, used in some spoken languages. The sound is transcribed in the International Phonetic ...
. Moreover, what was once pronounced and written as a retroflex nasal is now pronounced as an alveolar when in conversation (the difference is heard when reading) (unless conjoined with another retroflex consonant such as , , and ), although the spelling does not reflect this change. The near-open front unrounded vowel is orthographically realised by multiple means, as seen in the following examples: "so much", "academy", "amoeba", "to see", "busy", "grammar". Another kind of inconsistency is concerned with the incomplete coverage of phonological information in the script. The inherent vowel attached to every consonant can be either or depending on vowel harmony () with the preceding or following vowel or on the context, but this phonological information is not captured by the script, creating ambiguity for the reader. Furthermore, the inherent vowel is often not pronounced at the end of a syllable, as in "less", but this omission is not generally reflected in the script, making it difficult for the new reader. Many consonant clusters have different sounds than their constituent consonants. For example, the combination of the consonants and is graphically realised as and is pronounced (as in "coarse"), (as in "capability") or even (as in "harm"), depending on the position of the cluster in a word. The Bengali writing system is, therefore, not always a true guide to pronunciation.


Uses

The script used for Bengali, Assamese and other languages is known as Bengali script. The script is known as the Bengali alphabet for Bengali and its dialects and the Assamese alphabet for
Assamese language Assamese (), also Asamiya ( ), is an Indo-Aryan language spoken mainly in the north-east Indian state of Assam, where it is an official language, and it serves as a '' lingua franca'' of the wider region. The easternmost Indo-Iranian langua ...
with some minor variations. Other related languages in the nearby region also make use of the Bengali script like the Meitei language in the Indian state of Manipur, where the Meitei language has been written in the Bengali script for centuries, though the
Meitei script ) , altname = , type = Abugida , languages = Meitei language (officially known as Manipuri language) , region = * Manipur , sample = "Meitei Mayek" (literally meaning "Meitei script" in Meitei language) written ...
has been promoted in recent times.


Number system

Bengali digits are as follows. There are additional digits for fractions and prices, though they are little used any longer.


Romanisation

There are various Romanisation systems used for Bengali created in recent years which have failed to represent the true Bengali phonetic sound. The Bengali alphabet has often been included with the group of Brahmic scripts for romanisation where the true phonetic value of Bengali is never represented. Some of them are the International Alphabet of Sanskrit Transliteration, or IAST system (based on diacritics); "Indian languages Transliteration", or
ITRANS The "Indian languages TRANSliteration" (ITRANS) is an ASCII transliteration scheme for Indic scripts, particularly for the Devanagari script. The need for a simple encoding scheme that used only keys available on an ordinary keyboard was felt i ...
(uses upper case letters suited for
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because ...
keyboards); and the
National Library at Kolkata romanisation The National Library of India, National Library at Kolkata romanisationSee p 24-26 for table comparing Indic languages, and p 33-34 for Devanagari alphabet listing. is a widely used transliteration scheme in dictionaries and grammars of Language ...
. In the context of Bengali
romanisation Romanization or romanisation, in linguistics, is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and ...
, it is important to distinguish transliteration from transcription. Transliteration is orthographically accurate (i.e. the original spelling can be recovered), whereas transcription is phonetically accurate (the pronunciation can be reproduced). Although it might be desirable to use a transliteration scheme where the original Bengali orthography is recoverable from the Latin text, Bengali words are currently Romanized on Wikipedia using a phonemic transcription, where the true phonetic pronunciation of Bengali is represented with no reference to how it is written. The most recent attempt has been by publishers Mitra and Ghosh with the launch of three popular children's books, ''Abol Tabol'', ''Hasi Khusi'' and ''Sahoj Path'' in Roman script at the Kolkata Book Fair 2018. Published under the imprint of Benglish Books, these are based on phonetic transliteration and closely follow spellings used in social media but for using an underline to describe soft consonants.


Grammar

Bengali nouns are not assigned gender, which leads to minimal changing of adjectives ( inflection). However, nouns and pronouns are moderately declined (altered depending on their function in a sentence) into four cases while verbs are heavily conjugated, and the verbs do not change form depending on the gender of the nouns.


Word order

As a
head-final In linguistics, head directionality is a proposed parameter that classifies languages according to whether they are head-initial (the head of a phrase precedes its complements) or head-final (the head follows its complements). The head is the ...
language, Bengali follows a subject–object–verb word order, although variations on this theme are common. Bengali makes use of
postpositions Prepositions and postpositions, together called adpositions (or broadly, in traditional grammar, simply prepositions), are a class of words used to express spatial or temporal relations (''in'', ''under'', ''towards'', ''before'') or mark various ...
, as opposed to the prepositions used in English and other European languages. Determiners follow the noun, while numerals, adjectives, and possessors precede the noun. Yes-no questions do not require any change to the basic word order; instead, the low (L) tone of the final syllable in the utterance is replaced with a falling (HL) tone. Additionally, optional
particles In the physical sciences, a particle (or corpuscule in older texts) is a small localized object which can be described by several physical or chemical properties, such as volume, density, or mass. They vary greatly in size or quantity, from s ...
(e.g. ''-ki'', ''-na'', etc.) are often encliticised onto the first or last word of a yes-no question. Wh-questions are formed by fronting the wh-word to
focus Focus, or its plural form foci may refer to: Arts * Focus or Focus Festival, former name of the Adelaide Fringe arts festival in South Australia Film *''Focus'', a 1962 TV film starring James Whitmore * ''Focus'' (2001 film), a 2001 film based ...
position, which is typically the first or second word in the utterance.


Nouns

Nouns and pronouns are inflected for case, including nominative,
objective Objective may refer to: * Objective (optics), an element in a camera or microscope * ''The Objective'', a 2008 science fiction horror film * Objective pronoun, a personal pronoun that is used as a grammatical object * Objective Productions, a Brit ...
, genitive (possessive), and locative. The case marking pattern for each noun being inflected depends on the noun's degree of animacy. When a
definite article An article is any member of a class of dedicated words that are used with noun phrases to mark the identifiability of the referents of the noun phrases. The category of articles constitutes a part of speech. In English, both "the" and "a(n)" a ...
such as ''-ṭa'' (singular) or ''-gulo'' (plural) is added, as in the tables below, nouns are also inflected for
number A number is a mathematical object used to count, measure, and label. The original examples are the natural numbers 1, 2, 3, 4, and so forth. Numbers can be represented in language with number words. More universally, individual numbers c ...
. In most of the Bengali grammar books, cases are divided into 6 categories and an additional possessive case (possessive form is not recognised as a type of case by Bengali grammarians). But in terms of usages, cases are generally grouped into only 4 categories. When counted, nouns take one of a small set of
measure word In linguistics, measure words are words (or morphemes) that are used in combination with a numeral to indicate an amount of something represented by some noun. Description Measure words denote a unit or measurement and are used with mass nouns ( ...
s. Nouns in Bengali cannot be counted by adding the numeral directly adjacent to the noun. An appropriate measure word (MW), a classifier, must be used between the numeral and the noun (most languages of the
Mainland Southeast Asia linguistic area The Mainland Southeast Asia linguistic area is a sprachbund including languages of the Sino-Tibetan, Hmong–Mien (or Miao–Yao), Kra–Dai, Austronesian and Austroasiatic families spoken in an area stretching from Thailand to China. Neighbou ...
are similar in this respect). Most nouns take the generic measure word ''-ṭa'', though other measure words indicate semantic classes (e.g. ''-jôn'' for humans). There is also the classifier ''-khana,'' and its diminutive form ''-khani'', which attach only to nouns denoting something flat, long, square, or thin. These are the least common of the classifiers. Measuring nouns in Bengali without their corresponding measure words (e.g. ''aṭ biṛal'' instead of ''aṭ-ṭa biṛal'' "eight cats") would typically be considered ungrammatical. However, when the semantic class of the noun is understood from the measure word, the noun is often omitted and only the measure word is used, e.g. ''Shudhu êk-jôn thakbe.'' (lit. "Only one-MW will remain.") would be understood to mean "Only one person will remain.", given the semantic class implicit in ''-jôn''. In this sense, all nouns in Bengali, unlike most other Indo-European languages, are similar to mass nouns.


Verbs

There are two classes of verbs:
finite Finite is the opposite of infinite. It may refer to: * Finite number (disambiguation) * Finite set, a set whose cardinality (number of elements) is some natural number * Finite verb, a verb form that has a subject, usually being inflected or marke ...
and non-finite. Non-finite verbs have no inflection for tense or person, while finite verbs are fully inflected for person (first, second, third), tense (present, past, future),
aspect Aspect or Aspects may refer to: Entertainment * ''Aspect magazine'', a biannual DVD magazine showcasing new media art * Aspect Co., a Japanese video game company * Aspects (band), a hip hop group from Bristol, England * ''Aspects'' (Benny Carter ...
(simple, perfect, progressive), and
honour Honour (British English) or honor (American English; see spelling differences) is the idea of a bond between an individual and a society as a quality of a person that is both of social teaching and of personal ethos, that manifests itself as a ...
(intimate, familiar, and formal), but ''not'' for number. Conditional, imperative, and other special inflections for mood can replace the tense and aspect suffixes. The number of inflections on many verb roots can total more than 200. Inflectional suffixes in the
morphology Morphology, from the Greek and meaning "study of shape", may refer to: Disciplines * Morphology (archaeology), study of the shapes or forms of artifacts * Morphology (astronomy), study of the shape of astronomical objects such as nebulae, galaxies ...
of Bengali vary from region to region, along with minor differences in syntax. Bengali differs from most Indo-Aryan Languages in the zero copula, where the copula or connective ''be'' is often missing in the present tense. in Thus, "he is a teacher" is ''se shikkhôk'', (literally "he teacher").Among Bengali speakers brought up in neighbouring linguistic regions (e.g. Hindi), the lost copula may surface in utterances such as ''she shikkhôk hocche''. This is viewed as ungrammatical by other speakers, and speakers of this variety are sometimes (humorously) referred as "hocche-Bangali". In this respect, Bengali is similar to Russian and Hungarian. Romani grammar is also the closest to Bengali grammar.


Vocabulary

Bengali has as many as 100,000 separate words, of which 50,000 are considered
Tadbhava (Sanskrit: तद्भव, , lit. "arising from that") is the Sanskrit word for one of three etymological classes defined by native grammarians of Middle Indo-Aryan languages, alongside tatsama and deśi words. at pp. 67-69. A "tadbhava" is a w ...
s, 21,100 are Tatsamas and the remainder
loanword A loanword (also loan word or loan-word) is a word at least partly assimilated from one language (the donor language) into another language. This is in contrast to cognates, which are words in two or more languages that are similar because t ...
s from Austroasiatic and other foreign languages. However, these figures do not take into account the large proportion of archaic or highly technical words that are very rarely used. Furthermore, different dialects use more
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
and Arabic vocabulary especially in different areas of Bangladesh and Muslim majority areas of West Bengal. Hindus, on the other hand, use more Sanskrit vocabulary than Muslims. Standard Bengali is based on the Nadia dialect spoken in the Hindu majority states of West Bengal and parts of Muslim majority division of Khulna in Bangladesh. About 90% of Bengalis in Bangladesh (ca. 148 million) and 27% of Bengalis in West Bengal and 10% in Assam (ca. 36 million) are Muslim and the Bangladeshi Muslims and some of the Indian Bengali Muslims speak a more "persio-arabised" version of Bengali instead of the more Sanskrit influenced Standard Nadia dialect although majority of the Indian Bengalis of West Bengal speaks in Rarhi dialect irrespective of religion. The productive vocabulary used in modern literary works, in fact, is made up mostly (67%) of tadbhavas, while tatsamas make up only 25% of the total. in in Loanwords from non-Indic languages account for the remaining 8% of the vocabulary used in modern Bengali literature. According to Suniti Kumar Chatterji, dictionaries from the early 20th century attributed to a little more 50% of the Bengali vocabulary to native words (i.e., naturally modified
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
words, corrupted forms of Sanskrit words, and loanwords non-Indo-European languages). About 45% percent of Bengali words are unmodified Sanskrit, and the remaining words are from foreign languages. Dominant in the last group was
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
, which was also the source of some grammatical forms. More recent studies suggest that the use of native and foreign words has been increasing, mainly because of the preference of Bengali speakers for the colloquial style. Because of centuries of contact with Europeans,
Turkic peoples The Turkic peoples are a collection of diverse ethnic groups of West, Central, East, and North Asia as well as parts of Europe, who speak Turkic languages.. "Turkic peoples, any of various peoples whose members speak languages belonging t ...
, and Persians, Bengali has absorbed numerous words from foreign languages, often totally integrating these borrowings into the core vocabulary. The most common borrowings from foreign languages come from three different kinds of contact. After close contact with several indigenous Austroasiatic languages, and later the Mughal invasion whose court language was Persian, numerous Chagatai, Arabic, and Persian words were absorbed into the lexicon. Later, East Asian travellers and lately European colonialism brought words from
Portuguese Portuguese may refer to: * anything of, from, or related to the country and nation of Portugal ** Portuguese cuisine, traditional foods ** Portuguese language, a Romance language *** Portuguese dialects, variants of the Portuguese language ** Portu ...
, French,
Dutch Dutch commonly refers to: * Something of, from, or related to the Netherlands * Dutch people () * Dutch language () Dutch may also refer to: Places * Dutch, West Virginia, a community in the United States * Pennsylvania Dutch Country People E ...
, and most significantly English during the colonial period.


Sample text

The following is a sample text in Bengali of Article 1 of the Universal Declaration of Human Rights:


See also

*
Bangla Academy The Bangla Academy ( bn, বাংলা একাডেমি, ) is an autonomous institution funded by the Bangladesh government to foster the Bengali language, literature and culture, to develop and implement national language policy and to d ...
* Bengali dialects *
Bengali numerals Bengali numerals ( bn, সংখ্যা ''sôṅkhya'', as, সংখ্যা ''xoiŋkha''), are the units of the numeral system, originating from the Indian subcontinent, used in Bengali, Sylheti, Chittagonian, Assamese, Bishnupriya Manipur ...
* Bengali-language newspapers *
Chittagonian language Chittagonian ( ''saṭgãia'' or ''siʈaiŋga'') is an Indo-Aryan language spoken in parts of the Chittagong Division in Bangladesh. Its speakers identify with Bengali culture and the Bengali language, but Chittagonian and Bengali are not mut ...
* Languages of Bangladesh * Rangpuri language * Romani people *
Sylheti language Sylheti ( Sylheti Nāgarī: ; bn, সিলেটি ) is an Indo-Aryan language spoken by an estimated 11 million people, primarily in the Sylhet Division of Bangladesh and in parts of Northeast India."Sylheti is an Indo-Aryan language spok ...


Notes


References

* * * * * * * Chakraborty, Byomkes, A Comparative Study of Santali and Bengali, K.P. Bagchi & Co., Kolkata, 1994, . * * * * * * * * * * * * * * * * * * * Shaw, Rameswar ''Sadharan Bhasabigna O Bangal Bhasa'', Pustak Bipani, Kolkata, 1997. * Haldar, Narayan ''Bengali Bhasa Prsanga: Banan Kathan Likhanriti'', Pustak Bipani, Kolkata, 2007. *


Further reading

* Thompson, Hanne-Ruth (2012).
Bengali
'. Volume 18 of London Oriental and African Language Library. John Benjamins Publishing. .


External links

*

{{Authority control Eastern Indo-Aryan languages Languages of Bangladesh Official languages of India Subject–object–verb languages Languages of West Bengal Languages of Tripura Languages of Assam Languages officially written in Indic scripts Sahitya Akademi recognised languages Languages of Jharkhand Languages of Bihar