HOME

TheInfoList



OR:

Bengali, also known by its
endonym An endonym (also known as autonym ) is a common, name for a group of people, individual person, geographical place, language, or dialect, meaning that it is used inside a particular group or linguistic community to identify or designate them ...
Bangla (, , ), is an Indo-Aryan language belonging to the Indo-Iranian branch of the
Indo-European The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
language family. It is native to the Bengal region (
Bangladesh Bangladesh, officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world and among the List of countries and dependencies by ...
,
India India, officially the Republic of India, is a country in South Asia. It is the List of countries and dependencies by area, seventh-largest country by area; the List of countries by population (United Nations), most populous country since ...
's
West Bengal West Bengal (; Bengali language, Bengali: , , abbr. WB) is a States and union territories of India, state in the East India, eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabi ...
and Tripura) of
South Asia South Asia is the southern Subregion#Asia, subregion of Asia that is defined in both geographical and Ethnicity, ethnic-Culture, cultural terms. South Asia, with a population of 2.04 billion, contains a quarter (25%) of the world's populatio ...
. With over 242 million native speakers and another 43 million as
second language A second language (L2) is a language spoken in addition to one's first language (L1). A second language may be a neighbouring language, another language of the speaker's home country, or a foreign language. A speaker's dominant language, which ...
speakers as of 2025, Bengali is the sixth most spoken native language and the seventh most spoken language by the total number of speakers in the world. Bengali is the
official An official is someone who holds an office (function or Mandate (politics), mandate, regardless of whether it carries an actual Office, working space with it) in an organization or government and participates in the exercise of authority (eithe ...
, national, and most widely spoken language of
Bangladesh Bangladesh, officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world and among the List of countries and dependencies by ...
, with 98% of Bangladeshis using Bengali as their first language. It is the second-most widely spoken language in India. It is the official language of the Indian states of
West Bengal West Bengal (; Bengali language, Bengali: , , abbr. WB) is a States and union territories of India, state in the East India, eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabi ...
, Tripura and the Barak Valley region of the state of
Assam Assam (, , ) is a state in Northeast India, northeastern India, south of the eastern Himalayas along the Brahmaputra Valley, Brahmaputra and Barak River valleys. Assam covers an area of . It is the second largest state in Northeast India, nor ...
. It is also the second official language of the Indian state of
Jharkhand Jharkhand (; ) is a States and union territories of India, state in East India, eastern India. The state shares its border with the states of West Bengal to the east, Chhattisgarh to the west, Uttar Pradesh to the northwest, Bihar to the north ...
since September 2011. It is the most widely spoken language in the
Andaman and Nicobar Islands The Andaman and Nicobar Islands is a union territory of India comprising 572 islands, of which only 38 are inhabited. The islands are grouped into two main clusters: the northern Andaman Islands and the southern Nicobar Islands, separated by a ...
in the
Bay of Bengal The Bay of Bengal is the northeastern part of the Indian Ocean. Geographically it is positioned between the Indian subcontinent and the Mainland Southeast Asia, Indochinese peninsula, located below the Bengal region. Many South Asian and Southe ...
, and is spoken by significant populations in other states including
Bihar Bihar ( ) is a states and union territories of India, state in Eastern India. It is the list of states and union territories of India by population, second largest state by population, the List of states and union territories of India by are ...
,
Arunachal Pradesh Arunachal Pradesh (; ) is a States and union territories of India, state in northeast India. It was formed from the North-East Frontier Agency (NEFA) region, and India declared it as a state on 20 February 1987. Itanagar is its capital and la ...
,
Delhi Delhi, officially the National Capital Territory (NCT) of Delhi, is a city and a union territory of India containing New Delhi, the capital of India. Straddling the Yamuna river, but spread chiefly to the west, or beyond its Bank (geography ...
, Chhattisgarh,
Meghalaya Meghalaya (; "the abode of clouds") is a states and union territories of India, state in northeast India. Its capital is Shillong. Meghalaya was formed on 21 January 1972 by carving out two districts from the Assam: the United Khasi Hills an ...
, Mizoram,
Nagaland Nagaland () is a States and union territories of India, state in the northeast India, north-eastern region of India. It is bordered by the Indian states of Arunachal Pradesh to the north, Assam to the west, Manipur to the south, and the Naga Sel ...
,
Odisha Odisha (), formerly Orissa (List of renamed places in India, the official name until 2011), is a States and union territories of India, state located in East India, Eastern India. It is the List of states and union territories of India by ar ...
and
Uttarakhand Uttarakhand (, ), also known as Uttaranchal ( ; List of renamed places in India, the official name until 2007), is a States and union territories of India, state in North India, northern India. The state is bordered by Himachal Pradesh to the n ...
. Bengali is also spoken by the Bengali diasporas (
Bangladeshi diaspora The Bangladeshi diaspora () are people of Bangladeshi birth, descent or origin who live outside of Bangladesh. First-generation migrants may have moved abroad from Bangladesh for various reasons including better living conditions, to escape pove ...
and Indian Bengalis) across Europe, North America, the Middle East and other regions. Bengali was accorded the status of a classical language by the
government of India The Government of India (ISO 15919, ISO: Bhārata Sarakāra, legally the Union Government or Union of India or the Central Government) is the national authority of the Republic of India, located in South Asia, consisting of States and union t ...
on 3 October 2024. It is the second most spoken and fifth fastest growing language in India, following
Hindi Modern Standard Hindi (, ), commonly referred to as Hindi, is the Standard language, standardised variety of the Hindustani language written in the Devanagari script. It is an official language of India, official language of the Government ...
, Kashmiri, Gujarati, and Meitei ( Manipuri), according to the 2011 census of India. Bengali has developed over more than 1,400 years.
Bengali literature Bengali literature () denotes the body of writings in the Bengali language and which covers Old Bengali, Middle Bengali and Modern Bengali with the changes through the passage of time and dynastic patronization or non-patronization. Bengali h ...
, with its millennium-old literary history, was extensively developed during the Bengali Renaissance and is one of the most prolific and diverse literary traditions in Asia. The Bengali language movement from 1948 to 1956 demanding that Bengali be an official language of Pakistan fostered Bengali nationalism in
East Bengal East Bengal (; ''Purbô Bangla/Purbôbongo'') was the eastern province of the Dominion of Pakistan, which covered the territory of modern-day Bangladesh. It consisted of the eastern portion of the Bengal region, and existed from 1947 until 195 ...
leading to the emergence of Bangladesh in 1971. In 1999,
UNESCO The United Nations Educational, Scientific and Cultural Organization (UNESCO ) is a List of specialized agencies of the United Nations, specialized agency of the United Nations (UN) with the aim of promoting world peace and International secur ...
recognised 21 February as International Mother Language Day in recognition of the language movement.


History


Ancient

Although
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
has been spoken by Hindu
Brahmins Brahmin (; ) is a ''Varna (Hinduism), varna'' (theoretical social classes) within Hindu society. The other three varnas are the ''Kshatriya'' (rulers and warriors), ''Vaishya'' (traders, merchants, and farmers), and ''Shudra'' (labourers). Th ...
in
Bengal Bengal ( ) is a Historical geography, historical geographical, ethnolinguistic and cultural term referring to a region in the Eastern South Asia, eastern part of the Indian subcontinent at the apex of the Bay of Bengal. The region of Benga ...
since the 3rd century BC, the local
Buddhist Buddhism, also known as Buddhadharma and Dharmavinaya, is an Indian religion and List of philosophies, philosophical tradition based on Pre-sectarian Buddhism, teachings attributed to the Buddha, a wandering teacher who lived in the 6th or ...
population spoke varieties of the Prakrit. These varieties are generally referred to as "eastern Magadhi Prakrit", as coined by linguist Suniti Kumar Chatterji, as the Middle Indo-Aryan dialects were influential in the first millennium when Bengal was a part of the Greater Magadhan realm. The local varieties had no official status during the
Gupta Empire The Gupta Empire was an Indian empire during the classical period of the Indian subcontinent which existed from the mid 3rd century to mid 6th century CE. At its zenith, the dynasty ruled over an empire that spanned much of the northern Indian ...
, and with Bengal increasingly becoming a hub of Sanskrit literature for Hindu priests, the vernacular of Bengal gained much influence from Sanskrit. Magadhi Prakrit was also spoken in modern-day
Bihar Bihar ( ) is a states and union territories of India, state in Eastern India. It is the list of states and union territories of India by population, second largest state by population, the List of states and union territories of India by are ...
and
Assam Assam (, , ) is a state in Northeast India, northeastern India, south of the eastern Himalayas along the Brahmaputra Valley, Brahmaputra and Barak River valleys. Assam covers an area of . It is the second largest state in Northeast India, nor ...
, and this vernacular eventually evolved into Ardha Magadhi. Ardha Magadhi began to give way to what is known as Apabhraṃśa, by the end of the first millennium. The Bengali language evolved as a distinct language over the course of time.


Early

A Sanskrit-Chinese dictionary compiled by the Chinese poet Li-Yen in 782 AD shows the presence of Bengali. A research document ''Classical Bangla'' published in 2024 by the Kolkata-based institute " Institute of Language Studies and Research" (ILSR), mentions the presence of 51 Bengali words in the dictionary. The lexicon strongly supports the existence of Old Bengali in the 8th century or earlier. Though some archaeologists claim that some 10th-century texts were in Bengali, it is not certain whether they represent a differentiated language or whether they represent a stage when Eastern Indo-Aryan languages were differentiating. The local Apabhraṃśa of the eastern subcontinent, Purbi Apabhraṃśa or Abahatta (), eventually evolved into regional dialects, which in turn formed three groups, the Bengali–Assamese languages, the Bihari languages, and the
Odia language Odia (;"Odia"
''Lexico''.
, ISO 15919, ISO: , ; formerly rendere ...
. The language was not static: different varieties coexisted and authors often wrote in multiple dialects in this period. For example, Ardhamagadhi is believed to have evolved into Abahatta around the 6th century, which competed with the ancestor of Bengali for some time. The ancestor of Bengali was the language of the Pala Empire and the Sena dynasty.


Medieval

During the medieval period, Middle Bengali was characterised by the
elision In linguistics, an elision or deletion is the omission of one or more sounds (such as a vowel, a consonant, or a whole syllable) in a word or phrase. However, these terms are also used to refer more narrowly to cases where two words are run to ...
of the word-final '' ô'' and the spread of compound verbs, which originated from the
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
Schwa. Slowly, the word-final ''ô'' disappeared from many words influenced by the
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
, Persian, and
Turkic languages The Turkic languages are a language family of more than 35 documented languages, spoken by the Turkic peoples of Eurasia from Eastern Europe and Southern Europe to Central Asia, East Asia, North Asia (Siberia), and West Asia. The Turkic langua ...
. The arrival of merchants and traders from the Middle East and Turkestan into the
Buddhist Buddhism, also known as Buddhadharma and Dharmavinaya, is an Indian religion and List of philosophies, philosophical tradition based on Pre-sectarian Buddhism, teachings attributed to the Buddha, a wandering teacher who lived in the 6th or ...
-ruling Pala Empire, from as early as the 7th century, gave birth to Islamic influence in the region. In the 13th century, subsequent Arab Muslim and Turco-Persian expeditions to Bengal heavily influenced the local vernacular by settling among the native population. Bengali absorbed Arabic and Persian influences in its vocabulary and dialect, including the development of Dobhashi. Bengali acquired prominence, over Persian, in the court of the
Sultans of Bengal The Bengal Sultanate ( Middle Bengali: , Classical Persian: ) was a late medieval sultanate based in the Bengal region in the eastern South Asia between the 14th and 16th century. It was the dominant power of the Ganges-Brahmaputra Delta, ...
with the ascent of Jalaluddin Muhammad Shah. Subsequent Muslim rulers actively promoted the literary development of Bengali, allowing it to become the most spoken
vernacular Vernacular is the ordinary, informal, spoken language, spoken form of language, particularly when perceptual dialectology, perceived as having lower social status or less Prestige (sociolinguistics), prestige than standard language, which is mor ...
language in the Sultanate. Bengali adopted many words from
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
and Persian, which was a manifestation of
Islamic culture Islamic cultures or Muslim cultures refers to the historic cultural practices that developed among the various peoples living in the Muslim world. These practices, while not always religious in nature, are generally influenced by aspects of Islam ...
on the language. Major texts of Middle Bengali (1400–1800) include Yusuf-Zulekha by Shah Muhammad Sagir and Srikrishna Kirtana by the Chandidas poets. Court support for Bengali culture and language waned when the
Mughal Empire The Mughal Empire was an Early modern period, early modern empire in South Asia. At its peak, the empire stretched from the outer fringes of the Indus River Basin in the west, northern Afghanistan in the northwest, and Kashmir in the north, to ...
conquered Bengal in the late 16th and early 17th century.


Modern

The standard literary form of Modern Bengali was developed during the 19th and early 20th centuries based on the west-central dialect spoken in Shantipur region of the Nadia district. Modern Bengali shows a high degree of
diglossia In linguistics, diglossia ( , ) is where two dialects or languages are used (in fairly strict compartmentalization) by a single language community. In addition to the community's everyday or vernacular language variety (labeled "L" or "low" v ...
, with the literary and standard form differing greatly from the colloquial speech of the regions that identify with the language. Modern Bengali vocabulary is based on words inherited from Magadhi Prakrit and Pali, along with tatsamas and reborrowings from Sanskrit and borrowings from Persian,
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
,
Austroasiatic languages The Austroasiatic languages ( ) are a large language family spoken throughout Mainland Southeast Asia, South Asia and East Asia. These languages are natively spoken by the majority of the population in Vietnam and Cambodia, and by minority popu ...
and other languages with which it has historically been in contact. In the 19th and 20th centuries, there were two standard forms of written Bengali: * ''Chôlitôbhasha'', a colloquial form of Bengali using simplified inflections. * '' Sadhubhasha'', a formal and genteel form of Bengali. In 1948, the government of Pakistan tried to impose
Urdu Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
as the sole state language in Pakistan, giving rise to the Bengali language movement. This was a popular ethnolinguistic movement in the former
East Bengal East Bengal (; ''Purbô Bangla/Purbôbongo'') was the eastern province of the Dominion of Pakistan, which covered the territory of modern-day Bangladesh. It consisted of the eastern portion of the Bengal region, and existed from 1947 until 195 ...
(today
Bangladesh Bangladesh, officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world and among the List of countries and dependencies by ...
), which arose as a result of the strong linguistic consciousness of the
Bengalis Bengalis ( ), also rendered as endonym and exonym, endonym Bangalee, are an Indo-Aryan peoples, Indo-Aryan ethnolinguistic group originating from and culturally affiliated with the Bengal region of South Asia. The current population is divi ...
and their desire to promote and protect spoken and written Bengali's recognition as a state language of the then
Dominion of Pakistan The Dominion of Pakistan, officially Pakistan, was an independent federal dominion in the British Commonwealth of Nations, which existed from 14 August 1947 to Pakistan Day, 23 March 1956. It was created by the passing of the Indian Independence ...
. On 21 February 1952, five students and political activists were killed during protests near the campus of the University of Dhaka; they were the first ever martyrs to die for their right to speak their mother tongue. In 1956, Bengali was made a state language of Pakistan. 21 February has since been observed as Language Movement Day in Bangladesh and has also been commemorated as International Mother Language Day by
UNESCO The United Nations Educational, Scientific and Cultural Organization (UNESCO ) is a List of specialized agencies of the United Nations, specialized agency of the United Nations (UN) with the aim of promoting world peace and International secur ...
every year since 2000. In 2010, the parliament of Bangladesh and the legislative assembly of West Bengal proposed that Bengali be made an official UN language. As of January 2023, no further action has been yet taken on this matter. However, in 2022, the UN did adopt Bangla as an unofficial language, after a resolution tabled by India. In 2024, the
government of India The Government of India (ISO 15919, ISO: Bhārata Sarakāra, legally the Union Government or Union of India or the Central Government) is the national authority of the Republic of India, located in South Asia, consisting of States and union t ...
conferred Bengali with the status of classical language.


Geographical distribution

The Bengali language is native to the region of
Bengal Bengal ( ) is a Historical geography, historical geographical, ethnolinguistic and cultural term referring to a region in the Eastern South Asia, eastern part of the Indian subcontinent at the apex of the Bay of Bengal. The region of Benga ...
, which comprises the present-day nation of
Bangladesh Bangladesh, officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world and among the List of countries and dependencies by ...
and the Indian state of
West Bengal West Bengal (; Bengali language, Bengali: , , abbr. WB) is a States and union territories of India, state in the East India, eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabi ...
. Besides the native region it is also spoken by the Bengalis living in Tripura, southern
Assam Assam (, , ) is a state in Northeast India, northeastern India, south of the eastern Himalayas along the Brahmaputra Valley, Brahmaputra and Barak River valleys. Assam covers an area of . It is the second largest state in Northeast India, nor ...
and the Bengali population in the Indian union territory of
Andaman and Nicobar Islands The Andaman and Nicobar Islands is a union territory of India comprising 572 islands, of which only 38 are inhabited. The islands are grouped into two main clusters: the northern Andaman Islands and the southern Nicobar Islands, separated by a ...
. Bengali is also spoken in the neighbouring states of
Odisha Odisha (), formerly Orissa (List of renamed places in India, the official name until 2011), is a States and union territories of India, state located in East India, Eastern India. It is the List of states and union territories of India by ar ...
,
Bihar Bihar ( ) is a states and union territories of India, state in Eastern India. It is the list of states and union territories of India by population, second largest state by population, the List of states and union territories of India by are ...
, and
Jharkhand Jharkhand (; ) is a States and union territories of India, state in East India, eastern India. The state shares its border with the states of West Bengal to the east, Chhattisgarh to the west, Uttar Pradesh to the northwest, Bihar to the north ...
, and sizeable minorities of Bengali speakers reside in Indian cities outside Bengal, including
Delhi Delhi, officially the National Capital Territory (NCT) of Delhi, is a city and a union territory of India containing New Delhi, the capital of India. Straddling the Yamuna river, but spread chiefly to the west, or beyond its Bank (geography ...
,
Mumbai Mumbai ( ; ), also known as Bombay ( ; its official name until 1995), is the capital city of the Indian state of Maharashtra. Mumbai is the financial capital and the most populous city proper of India with an estimated population of 12 ...
,
Thane Thane (; previously known as Thana, List of renamed Indian cities and states#Maharashtra, the official name until 1996) is a metropolitan city located on the northwestern side of the list of Indian states, state of Maharashtra in India and on ...
,
Varanasi Varanasi (, also Benares, Banaras ) or Kashi, is a city on the Ganges river in northern India that has a central place in the traditions of pilgrimage, death, and mourning in the Hindu world.* * * * The city has a syncretic tradition of I ...
, and Vrindavan. There are also significant Bengali-speaking communities in the
Middle East The Middle East (term originally coined in English language) is a geopolitical region encompassing the Arabian Peninsula, the Levant, Turkey, Egypt, Iran, and Iraq. The term came into widespread usage by the United Kingdom and western Eur ...
, the
United States The United States of America (USA), also known as the United States (U.S.) or America, is a country primarily located in North America. It is a federal republic of 50 U.S. state, states and a federal capital district, Washington, D.C. The 48 ...
,
Singapore Singapore, officially the Republic of Singapore, is an island country and city-state in Southeast Asia. The country's territory comprises one main island, 63 satellite islands and islets, and one outlying islet. It is about one degree ...
,
Malaysia Malaysia is a country in Southeast Asia. Featuring the Tanjung Piai, southernmost point of continental Eurasia, it is a federation, federal constitutional monarchy consisting of States and federal territories of Malaysia, 13 states and thre ...
,
Australia Australia, officially the Commonwealth of Australia, is a country comprising mainland Australia, the mainland of the Australia (continent), Australian continent, the island of Tasmania and list of islands of Australia, numerous smaller isl ...
,
Canada Canada is a country in North America. Its Provinces and territories of Canada, ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, making it the world's List of coun ...
, the
United Kingdom The United Kingdom of Great Britain and Northern Ireland, commonly known as the United Kingdom (UK) or Britain, is a country in Northwestern Europe, off the coast of European mainland, the continental mainland. It comprises England, Scotlan ...
, and
Italy Italy, officially the Italian Republic, is a country in Southern Europe, Southern and Western Europe, Western Europe. It consists of Italian Peninsula, a peninsula that extends into the Mediterranean Sea, with the Alps on its northern land b ...
.


Official status

The 3rd article of the Constitution of Bangladesh states Bengali to be the sole
official language An official language is defined by the Cambridge English Dictionary as, "the language or one of the languages that is accepted by a country's government, is taught in schools, used in the courts of law, etc." Depending on the decree, establishmen ...
of Bangladesh. The Bengali Language Implementation Act, 1987, made it mandatory to use Bengali in all records and correspondences, laws, proceedings of court and other legal actions in all courts, government or semi-government offices, and autonomous institutions in Bangladesh. It is also the ''de facto'' national language of the country. In India, Bengali is one of the 23 official languages. It is the official language of the Indian states of
West Bengal West Bengal (; Bengali language, Bengali: , , abbr. WB) is a States and union territories of India, state in the East India, eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabi ...
, Tripura and in Barak Valley of
Assam Assam (, , ) is a state in Northeast India, northeastern India, south of the eastern Himalayas along the Brahmaputra Valley, Brahmaputra and Barak River valleys. Assam covers an area of . It is the second largest state in Northeast India, nor ...
. Bengali has been a second official language of the
Indian state India is a federal union comprising 28 states and 8 union territories, for a total of 36 subnational entities. The states and union territories are further subdivided into 800 districts and smaller administrative divisions by the respe ...
of
Jharkhand Jharkhand (; ) is a States and union territories of India, state in East India, eastern India. The state shares its border with the states of West Bengal to the east, Chhattisgarh to the west, Uttar Pradesh to the northwest, Bihar to the north ...
since September 2011. In
Pakistan Pakistan, officially the Islamic Republic of Pakistan, is a country in South Asia. It is the List of countries and dependencies by population, fifth-most populous country, with a population of over 241.5 million, having the Islam by country# ...
, Bengali is a recognised secondary language in the city of
Karachi Karachi is the capital city of the Administrative units of Pakistan, province of Sindh, Pakistan. It is the List of cities in Pakistan by population, largest city in Pakistan and 12th List of largest cities, largest in the world, with a popul ...
mainly spoken by stranded Bengalis of Pakistan. The Department of Bengali in the University of Karachi (established by East Pakistani politicians before Independence of Bangladesh) also offers regular programs of studies at the Bachelors and at the Masters levels for Bengali Literature. The national anthems of both Bangladesh ('' Amar Sonar Bangla'') and India ('' Jana Gana Mana'') were written in Bengali by the Bengali Nobel laureate
Rabindranath Tagore Rabindranath Thakur (; anglicised as Rabindranath Tagore ; 7 May 1861 – 7 August 1941) was a Bengalis, Bengali polymath who worked as a poet, writer, playwright, composer, philosopher, social reformer, and painter of the Bengal Renai ...
. Notuner Gaan known as "''Chol Chol Chol"'' is Bangladesh's national march, written by ''The National Poet'' Kazi Nazrul Islam in Bengali in 1928. It was adopted as the national marching song by the Bangladeshi government in 1972. Additionally, the first two verses of '' Vande Mataram'', a patriotic song written in Bengali by Bankim Chandra Chatterjee, was adopted as the "national song" of India in both the colonial period and later in 1950 in independent India. Furthermore, it is believed by many that the national anthem of Sri Lanka ( Sri Lanka Matha) was inspired by a Bengali poem written by
Rabindranath Tagore Rabindranath Thakur (; anglicised as Rabindranath Tagore ; 7 May 1861 – 7 August 1941) was a Bengalis, Bengali polymath who worked as a poet, writer, playwright, composer, philosopher, social reformer, and painter of the Bengal Renai ...
, while some even believe the anthem was originally written in Bengali and then translated into Sinhala. After the contribution made by the Bangladesh UN Peacekeeping Force in the Sierra Leone Civil War under the United Nations Mission in Sierra Leone, the government of Ahmad Tejan Kabbah declared Bengali as an honorary official language in December 2002. In 2009, elected representatives in both Bangladesh and West Bengal called for Bengali to be made an official language of the United Nations.


Dialects

Regional varieties in spoken Bengali constitute a
dialect continuum A dialect continuum or dialect chain is a series of Variety (linguistics), language varieties spoken across some geographical area such that neighboring varieties are Mutual intelligibility, mutually intelligible, but the differences accumulat ...
. Linguist Suniti Kumar Chatterji grouped the dialects of Bengali language into four large clusters: Rarhi, Vangiya, Kamrupi and Varendri; but many alternative grouping schemes have also been proposed. The West-Central dialects ( Rarhi or Nadia dialect) form the basis of modern standard colloquial Bengali. In the dialects prevalent in much of eastern and south-eastern Bangladesh ( Barisal, Chittagong,
Dhaka Dhaka ( or ; , ), List of renamed places in Bangladesh, formerly known as Dacca, is the capital city, capital and list of cities and towns in Bangladesh, largest city of Bangladesh. It is one of the list of largest cities, largest and list o ...
and
Sylhet Division Sylhet Division () is a northeastern Divisions of Bangladesh, division of Bangladesh, renowned for its lush tea gardens, rolling hills and vibrant cultural heritage. Covering an area of approximately 12,298 square kilometres, it is bordered by t ...
s of Bangladesh), many of the stops and affricates heard in
West Bengal West Bengal (; Bengali language, Bengali: , , abbr. WB) is a States and union territories of India, state in the East India, eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabi ...
and western Bangladesh are pronounced as fricatives. Western alveolo-palatal affricates , , correspond to eastern , , . The influence of
Tibeto-Burman languages The Tibeto-Burman languages are the non- Sinitic members of the Sino-Tibetan language family, over 400 of which are spoken throughout the Southeast Asian Massif ("Zomia") as well as parts of East Asia and South Asia. Around 60 million people spe ...
on the
phonology Phonology (formerly also phonemics or phonematics: "phonemics ''n.'' 'obsolescent''1. Any procedure for identifying the phonemes of a language from a corpus of data. 2. (formerly also phonematics) A former synonym for phonology, often pre ...
of Eastern Bengali is seen through the lack of nasalised vowels and an alveolar articulation of what are categorised as the "cerebral" consonants (as opposed to the postalveolar articulation of western Bengal). Some varieties of Bengali, particularly Sylheti, Chittagonian and Chakma, have contrastive tone; differences in the pitch of the speaker's voice can distinguish words. Kharia Thar and Mal Paharia are closely related to Western Bengali dialects, but are typically classified as separate languages. Similarly, Hajong is considered a separate language, although it shares similarities to Northern Bengali dialects. During the standardisation of Bengali in the 19th century and early 20th century, the cultural centre of Bengal was in
Kolkata Kolkata, also known as Calcutta ( its official name until 2001), is the capital and largest city of the Indian state of West Bengal. It lies on the eastern bank of the Hooghly River, west of the border with Bangladesh. It is the primary ...
, a city founded by the British. What is accepted as the standard form today in both West Bengal and Bangladesh is based on the West-Central dialect of Nadia and Kushtia District. There are cases where speakers of Standard Bengali in
West Bengal West Bengal (; Bengali language, Bengali: , , abbr. WB) is a States and union territories of India, state in the East India, eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabi ...
will use a different word from a speaker of Standard Bengali in Bangladesh, even though both words are of native Bengali descent. For example, the word salt is ''lôbôṇ'' in the east which corresponds to ''nun'' in the west. Bengali exhibits
diglossia In linguistics, diglossia ( , ) is where two dialects or languages are used (in fairly strict compartmentalization) by a single language community. In addition to the community's everyday or vernacular language variety (labeled "L" or "low" v ...
, though some scholars have proposed triglossia or even n-glossia or heteroglossia between the written and spoken forms of the language. Two styles of writing have emerged, involving somewhat different vocabularies and
syntax In linguistics, syntax ( ) is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure (constituenc ...
: # '' Sadhu bhasha'' ( "upright language") was the written language, with longer verb inflections and more of a
Pali Pāli (, IAST: pāl̤i) is a Classical languages of India, classical Middle Indo-Aryan languages, Middle Indo-Aryan language of the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pali Canon, Pāli Can ...
and
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
-derived '' Tatsama'' vocabulary. Songs such as India's national anthem '' Jana Gana Mana'' (by
Rabindranath Tagore Rabindranath Thakur (; anglicised as Rabindranath Tagore ; 7 May 1861 – 7 August 1941) was a Bengalis, Bengali polymath who worked as a poet, writer, playwright, composer, philosopher, social reformer, and painter of the Bengal Renai ...
) were composed in this style. Its use in modern writing however is uncommon, restricted to some official signs and documents as well as for achieving particular literary effects. # ''Chôlito bhasha'' ( "running language"), known by linguists as Standard Colloquial Bengali, is a written Bengali style exhibiting a preponderance of colloquial idiom and shortened verb forms and is the standard for written Bengali now. This form came into vogue towards the turn of the 19th century, promoted by the writings of Peary Chand Mitra ('' Alaler Gharer Dulal'', 1857), Pramatha Chaudhuri (''Sabujpatra'', 1914) and in the later writings of
Rabindranath Tagore Rabindranath Thakur (; anglicised as Rabindranath Tagore ; 7 May 1861 – 7 August 1941) was a Bengalis, Bengali polymath who worked as a poet, writer, playwright, composer, philosopher, social reformer, and painter of the Bengal Renai ...
. It is modelled on the dialect spoken in the Shantipur and Shilaidaha region in Nadia and Kushtia Districts respectively. This form of Bengali is often referred to as the "Kushtia standard"(Bangladesh), "Nadia standard" (West Bengal), "West-Central dialect", "Shantipuri Bangla" or "Shilaidahi Bangla". Linguist
Prabhat Ranjan Sarkar Prabhat Ranjan Sarkar (21 May 1921 – 21 October 1990), also known by his spiritual name Shrii Shrii Ánandamúrti (Ánanda Múrti meaning "Bliss Embodiment"), and known as Bábá ("Father") to his disciples, was a spiritual guru, philos ...
categorises the language as: * Madhya Rarhi dialect * Kanthi (Contai) dialect *
Kolkata Kolkata, also known as Calcutta ( its official name until 2001), is the capital and largest city of the Indian state of West Bengal. It lies on the eastern bank of the Hooghly River, west of the border with Bangladesh. It is the primary ...
dialect * Shantipuriya (Nadia) dialect * Shershahabadia (Maldahiya/ Jangipuri) dialect * Barendri dialect * Rangapuriya dialect * Sylheti dialect * Dhakiya (Bikrampuri) dialect * Jashore/Jessoriya dialect * Barisal (Chandradwip) dialect * Chattal (Chittagong) dialect While most writing is in Standard Colloquial Bengali (SCB), spoken dialects exhibit a greater variety. People in southeastern West Bengal, including Kolkata, speak in SCB. Other dialects, with minor variations from Standard Colloquial, are used in other parts of West Bengal and western Bangladesh, such as the Midnapore dialect, characterised by some unique words and constructions. However, a majority in Bangladesh speaks dialects notably different from SCB. Some dialects, particularly those of the Chittagong region, bear only a superficial resemblance to SCB. The dialect in the Chittagong region is least widely understood by the general body of Bengalis. The majority of Bengalis are able to communicate in more than one variety – often, speakers are fluent in ''Cholitobhasha'' (SCB) and one or more regional dialects. Even in SCB, the vocabulary may differ according to the speaker's religion: Muslims are more likely to use words of Persian and Arabic origin, along with more words naturally derived from Sanskrit ( tadbhava), whereas Hindus are more likely to use tatsama (words directly borrowed from Sanskrit). For example:


Phonology

The phonemic inventory of standard Bengali consists of 29 consonants and 7 vowels, as well as 7 nasalised vowels. The inventory is set out below in the
International Phonetic Alphabet The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standard written representation ...
(upper grapheme in each box) and romanisation (lower grapheme). Bengali is known for its wide variety of
diphthongs A diphthong ( ), also known as a gliding vowel or a vowel glide, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of ...
, combinations of
vowel A vowel is a speech sound pronounced without any stricture in the vocal tract, forming the nucleus of a syllable. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness a ...
s occurring within the same
syllable A syllable is a basic unit of organization within a sequence of speech sounds, such as within a word, typically defined by linguists as a ''nucleus'' (most often a vowel) with optional sounds before or after that nucleus (''margins'', which are ...
. Two of these, and , are the only ones with representation in script, as and respectively. may all form the glide part of a diphthong. The total number of diphthongs is not established, with bounds at 17 and 31. An incomplete chart is given by Sarkar (1985) of the following:


Stress

In standard Bengali, stress is predominantly initial. Bengali words are virtually all trochaic; the primary stress falls on the initial
syllable A syllable is a basic unit of organization within a sequence of speech sounds, such as within a word, typically defined by linguists as a ''nucleus'' (most often a vowel) with optional sounds before or after that nucleus (''margins'', which are ...
of the word, while secondary stress often falls on all odd-numbered syllables thereafter, giving strings such as in ''shô-hô-jo-gi-ta'' "cooperation", where the boldface represents primary and secondary stress.


Consonant clusters

Native Bengali words do not allow initial consonant clusters; the maximum syllabic structure is CVC (i.e., one vowel flanked by a consonant on each side). Many speakers of Bengali restrict their phonology to this pattern, even when using Sanskrit or English borrowings, such as ''geram'' (CV.CVC) for ''gram'' (CCVC) "village" or ''iskul'' (VC.CVC) for ''skul'' (CCVC) "school".


Writing system

The Bengali-Assamese script is an
abugida An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
, a script with letters for consonants, with diacritics for vowels, and in which an inherent vowel (অ ''ô'') is assumed for consonants if no vowel is marked. The Bengali alphabet is used throughout Bangladesh and eastern India (Assam, West Bengal, Tripura). The Bengali alphabet is believed to have evolved from a modified Brahmic script around 1000 CE (or 10th–11th century). in It is a
cursive Cursive (also known as joined-up writing) is any style of penmanship in which characters are written joined in a flowing manner, generally for the purpose of making writing faster, in contrast to block letters. It varies in functionality and m ...
script with eleven
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived from Ancient Greek ('write'), and the suffix ''-eme'' by analogy with ''phoneme'' and other emic units. The study of graphemes ...
s or signs denoting nine vowels and two
diphthong A diphthong ( ), also known as a gliding vowel or a vowel glide, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of ...
s, and thirty-nine graphemes representing
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract, except for the h sound, which is pronounced without any stricture in the vocal tract. Examples are and pronou ...
s and other modifiers. There are no distinct upper and lower case letter forms. The letters run from left to right and spaces are used to separate orthographic words. Bengali script has a distinctive horizontal line running along the tops of the graphemes that links them together called ''matra''. Since the Bengali script is an abugida, its consonant graphemes usually do not represent phonetic segments, but carry an "inherent" vowel and thus are syllabic in nature. The inherent vowel is usually a
back vowel A back vowel is any in a class of vowel sound used in spoken languages. The defining characteristic of a back vowel is that the highest point of the tongue is positioned relatively back in the mouth without creating a constriction that would be c ...
, either as in "opinion" or , as in "mind", with variants like the more open . To emphatically represent a consonant sound without any inherent vowel attached to it, a special diacritic, called the '' hôsôntô'' , may be added below the basic consonant grapheme (as in ). This diacritic, however, is not common and is chiefly employed as a guide to pronunciation. The abugida nature of Bengali consonant graphemes is not consistent, however. Often, syllable-final consonant graphemes, though not marked by a ''hôsôntô'', may carry no inherent vowel sound (as in the final in or the medial in ). A consonant sound followed by some vowel sound other than the inherent is orthographically realised by using a variety of vowel
allograph In graphemics and typography, the term allograph is used of a glyph that is a design variant of a letter or other grapheme, such as a letter, a number, an ideograph, a punctuation mark or other typographic symbol. In graphemics, an obvious exa ...
s above, below, before, after, or around the consonant sign, thus forming the ubiquitous consonant-vowel
typographic ligature In writing and typography, a ligature occurs where two or more graphemes or letters are joined to form a single glyph. Examples are the characters and used in English and French, in which the letters and are joined for the first ligature ...
s. These allographs, called ''kar'', are
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
al vowel forms and cannot stand on their own. For example, the graph represents the consonant followed by the vowel , where is represented as the diacritical allograph (called ''i-kar'') and is placed ''before'' the default consonant sign. Similarly, the graphs , , , , , , , and represent the same consonant combined with seven other vowels and two diphthongs. In these consonant-vowel ligatures, the so-called "inherent" vowel is first expunged from the consonant before adding the vowel, but this intermediate expulsion of the inherent vowel is not indicated in any visual manner on the basic consonant sign . The vowel graphemes in Bengali can take two forms: the independent form found in the basic inventory of the script and the dependent, abridged, allograph form (as discussed above). To represent a vowel in isolation from any preceding or following consonant, the independent form of the vowel is used. For example, in "ladder" and in "Hilsa fish", the independent form of the vowel is used (cf. the dependent form. A vowel at the beginning of a word is always realised using its independent form. In addition to the inherent-vowel-suppressing ''hôsôntô'', three more diacritics are commonly used in Bengali. These are the superposed ''chôndrôbindu'' , denoting a suprasegmental for nasalisation of vowels (as in "moon"), the postposed ''ônusbar'' indicating the
velar nasal The voiced velar nasal, also known as eng, engma, or agma (from Greek 'fragment'), is a type of consonantal sound used in some spoken languages. It is the sound of ''ng'' in English ''sing'' as well as ''n'' before velar consonants as in ''E ...
(as in "Bengali") and the postposed ''bisôrgô'' indicating the
voiceless glottal fricative The voiceless glottal fricative, sometimes called voiceless glottal transition or the aspirate, is a type of sound used in some spoken languages that patterns like a fricative or approximant consonant '' phonologically'', but often lacks the ...
(as in "ouch!") or the gemination of the following consonant (as in "sorrow"). The Bengali consonant clusters ( ''juktôbênjôn'') are usually realised as ligatures, where the consonant which comes first is put on top of or to the left of the one that immediately follows. In these ligatures, the shapes of the constituent consonant signs are often contracted and sometimes even distorted beyond recognition. As in, ক্ষ (ক+ষ) or হ্ম (হ+ম) In the Bengali writing system, there are nearly 285 such ligatures denoting consonant clusters. Although there exist a few visual formulas to construct some of these ligatures, many of them have to be learned by rote. Recently, in a bid to lessen this burden on young learners, efforts have been made by educational institutions in the two main Bengali-speaking regions (West Bengal and Bangladesh) to address the opaque nature of many consonant clusters, and as a result, modern Bengali textbooks are beginning to contain more and more "transparent" graphical forms of consonant clusters, in which the constituent consonants of a cluster are readily apparent from the graphical form. However, since this change is not as widespread and is not being followed as uniformly in the rest of the Bengali printed literature, today's Bengali-learning children will possibly have to learn to recognise both the new "transparent" and the old "opaque" forms, which ultimately amounts to an increase in learning burden. Bengali punctuation marks, apart from the downstroke ''daṛi'' – the Bengali equivalent of a
full stop The full stop ( Commonwealth English), period (North American English), or full point is a punctuation mark used for several purposes, most often to mark the end of a declarative sentence (as distinguished from a question or exclamation). A ...
– have been adopted from Western scripts and their usage is similar. Unlike in Western scripts (Latin, Cyrillic, etc.) where the letter forms stand on an invisible baseline, the Bengali letter-forms instead hang from a visible horizontal left-to-right headstroke called ''matra''. The presence and absence of this matra can be important. For example, the letter ''tô'' and the numeral "3" are distinguishable only by the presence or absence of the ''matra'', as is the case between the consonant cluster ''trô'' and the independent vowel ''e'', also the letter ''hô'' and Bengali Ôbogroho ''(~ô)'' and letter ''o'' and consonant cluster ''ttô''. The letter-forms also employ the concepts of letter-width and letter-height (the vertical space between the visible matra and an invisible baseline). There is yet to be a uniform standard collating sequence (sorting order of graphemes to be used in dictionaries, indices, computer sorting programs, etc.) of Bengali graphemes. Experts in both Bangladesh and India are currently working towards a common solution for this problem.


Alternative and historic scripts

Throughout history, there have been instances of the Bengali language being written in different scripts, though these employments were never popular on a large scale and were communally limited. Owing to Bengal's geographic location, Bengali areas bordering non-Bengali regions have been influenced by each other. Small numbers of people in Midnapore, which borders
Odisha Odisha (), formerly Orissa (List of renamed places in India, the official name until 2011), is a States and union territories of India, state located in East India, Eastern India. It is the List of states and union territories of India by ar ...
, have used the Odia script to write in Bengali. In the border areas between
West Bengal West Bengal (; Bengali language, Bengali: , , abbr. WB) is a States and union territories of India, state in the East India, eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabi ...
and
Bihar Bihar ( ) is a states and union territories of India, state in Eastern India. It is the list of states and union territories of India by population, second largest state by population, the List of states and union territories of India by are ...
, some Bengali communities historically wrote Bengali in
Devanagari Devanagari ( ; in script: , , ) is an Indic script used in the Indian subcontinent. It is a left-to-right abugida (a type of segmental Writing systems#Segmental systems: alphabets, writing system), based on the ancient ''Brāhmī script, Brā ...
,
Kaithi Kaithi (), also called Kayathi (), Kayasthi (), or Kayastani, is a Brahmic script historically used across parts of Northern and Eastern India. It was prevalent in regions corresponding to modern-day Uttar Pradesh, Bihar, and Jharkhand. The s ...
and Tirhuta. In
Sylhet Sylhet (; ) is a Metropolis, metropolitan city in the north eastern region of Bangladesh. It serves as the administrative center for both the Sylhet District and the Sylhet Division. The city is situated on the banks of the Surma River and, as o ...
and
Bankura Bankura () is a city and a municipality in the state of West Bengal, India. It is the headquarters of the Bankura district. Etymology It comes from the old Austric word ráŕhá or ráŕho which means “land of red soil”.P.R. Sarkar Rarh - ...
, modified versions of the Kaithi script had some historical prominence, mainly among Muslim communities. The variant in Sylhet was identical to the Baitali Kaithi script of Hindustani with the exception of Sylhet Nagri possessing ''matra''. Sylhet Nagri was standardised for printing in . Up until the 19th century, numerous variations of the
Arabic script The Arabic script is the writing system used for Arabic (Arabic alphabet) and several other languages of Asia and Africa. It is the second-most widely used alphabetic writing system in the world (after the Latin script), the second-most widel ...
had been used across Bengal from Chittagong in the east to Meherpur in the west. The 14th-century court scholar of Bengal, Nur Qutb Alam, composed Bengali poetry using the Persian alphabet. After the
Partition of India The partition of India in 1947 was the division of British India into two independent dominion states, the Dominion of India, Union of India and Dominion of Pakistan. The Union of India is today the Republic of India, and the Dominion of Paki ...
in the 20th century, the Pakistani government attempted to institute the Perso-Arabic script as the standard for Bengali in
East Pakistan East Pakistan was the eastern province of Pakistan between 1955 and 1971, restructured and renamed from the province of East Bengal and covering the territory of the modern country of Bangladesh. Its land borders were with India and Burma, wit ...
; this was met with resistance and contributed to the Bengali language movement. In the 16th century, Portuguese missionaries began a tradition of using the Roman alphabet to transcribe the Bengali language. Though the Portuguese standard did not receive much growth, a few Roman Bengali works relating to Christianity and Bengali grammar were printed as far as
Lisbon Lisbon ( ; ) is the capital and largest city of Portugal, with an estimated population of 567,131, as of 2023, within its administrative limits and 3,028,000 within the Lisbon Metropolitan Area, metropolis, as of 2025. Lisbon is mainlan ...
in 1743. The Portuguese were followed by the English and French respectively, whose works were mostly related to Bengali grammar and transliteration. The first version of the
Aesop's Fables Aesop's Fables, or the Aesopica, is a collection of fables credited to Aesop, a Slavery in ancient Greece, slave and storyteller who lived in ancient Greece between 620 and 564 Before the Common Era, BCE. Of varied and unclear origins, the stor ...
in Bengali was printed using Roman letters based on English phonology by the Scottish linguist John Gilchrist. Consecutive attempts to establish a Roman Bengali have continued across every century since these times, and have been supported by the likes of Suniti Kumar Chatterji, Muhammad Qudrat-i-Khuda, and Muhammad Enamul Haq. The
Digital Revolution The Information Age is a History by period, historical period that began in the mid-20th century. It is characterized by a rapid shift from traditional industries, as established during the Industrial Revolution, to an economy centered on info ...
has also played a part in the adoption of the
English alphabet Modern English is written with a Latin-script alphabet consisting of 26 Letter (alphabet), letters, with each having both uppercase and lowercase forms. The word ''alphabet'' is a Compound (linguistics), compound of ''alpha'' and ''beta'', t ...
to write Bengali, with certain social media influencers publishing entire novels in Roman Bengali. Bengali script like others does have Schwa deletion. It does not mark when the inherent vowel is not used (mainly at the end of words)


Orthographic depth

The Bengali script in general has a comparatively shallow orthography when compared to the Latin script used for English and French, i.e., in many cases there is a one-to-one correspondence between the sounds (phonemes) and the letters (graphemes) of Bengali. But grapheme-phoneme inconsistencies do occur in many other cases. In fact, Bengali-Assamese script has the deepest orthography ( deep orthography) among the Indian scripts. In general, the Bengali-Assamese script is fairly transparent for grapheme-to-phoneme conversion, i.e., it is easier to predict the pronunciation from spelling of the words. But the script is fairly opaque for phoneme-to-grapheme conversion, i.e., it is more difficult to predict the spelling from the pronunciation of the words. One kind of inconsistency is due to the presence of several letters in the script for the same sound. In spite of some modifications in the 19th century, the Bengali spelling system continues to be based on the one used for Sanskrit, and thus does not take into account some sound mergers that have occurred in the spoken language. For example, there are three letters (, , and ) for the
voiceless postalveolar fricative A voiceless postalveolar fricative is a type of consonantal sound used in some Speech, spoken languages. The International Phonetic Association uses the term ''voiceless postalveolar fricative'' only for the sound #Voiceless palato-alveolar frica ...
, although the letter retains the voiceless alveolar sibilant sound when used in certain consonant conjuncts as in "fall", "beat", etc. The letter also, sometimes, retains the voiceless retroflex sibilant sound when used in certain consonant conjuncts as in "suffering", "clan", etc. Similarly, there are two letters ( and ) for the voiced postalveolar affricate . Moreover, what was once pronounced and written as a retroflex nasal is now pronounced as an alveolar when in conversation (the difference is heard when reading) (unless conjoined with another
retroflex consonant A retroflex () or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated between the alveolar ridge and the hard palate. They are sometimes referred to as cerebral consona ...
such as , , and ), although the spelling does not reflect this change. The
near-open front unrounded vowel The near-open front unrounded vowel, or near-low front unrounded vowel, is a type of vowel sound. The symbol in the International Phonetic Alphabet that represents this sound is , a lowercase of the ligature. Both the symbol and the sound ar ...
is orthographically realised by multiple means, as seen in the following examples: "so much", "academy", "amoeba", "to see", "busy", "grammar". Another kind of inconsistency is concerned with the incomplete coverage of phonological information in the script. The inherent vowel attached to every consonant can be either or depending on
vowel harmony In phonology, vowel harmony is a phonological rule in which the vowels of a given domain – typically a phonological word – must share certain distinctive features (thus "in harmony"). Vowel harmony is typically long distance, meaning tha ...
() with the preceding or following vowel or on the context, but this phonological information is not captured by the script, creating ambiguity for the reader. Furthermore, the inherent vowel is often not pronounced at the end of a syllable, as in "less", but this omission is not generally reflected in the script, making it difficult for the new reader. Many consonant clusters have different sounds than their constituent consonants. For example, the combination of the consonants and is graphically realised as and is pronounced (as in "coarse"), (as in "capability") or even (as in "harm"), depending on the position of the cluster in a word. Another example is that there are around 7 or more graphemes to represent the sound . These are '' as in (''śabda'', pronounced as ''śôbdo'' "word"), '' as in (''ṣaṛayantra'', pronounced as ''śoṛōjontrō'' "conspiracy"), '' as in (''sarakāra'', pronounced as ''śorkar'' "government"), '' as in (written as ''śbaśura'' but pronounced with the ব ''b'' silent, i.e., as ''śōśur'' "father-in-law"), '' as in (written as ''śmaśāna'' but pronounced with the 'm' silent, i.e., as ''śośan'' "crematorium"), '' as in (written as "sbapna" but pronounced with the 'b' silent, i.e., as ''śopnō'' "dream"), '' as in (written as ''smaraṇa'' but pronounced with the 'm' silent, i.e., as ''śorōn'' "remembrance"), '' as in (written as ''grīṣma'' but pronounced with the 'm' silent, i.e., as ''griśśō'' "summer") and so on. In most of the consonant clusters, only the first consonant is pronounced and rest of the consonants are silent. Examples are (written as ''lakṣmaṇa'' but pronounced as ''lokkhōn'' " Lakshman"), (written as ''biśbāsa'' but pronounced as ''biśśaś'' "belief"), (written as ''bādhya'' but pronounced as ''baddhō'' "obliged") and (written as ''sbāsthya'' but pronounced as ''śasthō'' "health"). Some consonant clusters have completely different pronunciation as compared to the constituent consonants. For example, '' as in (meaning "heritage") where ''hy'' is pronounced as ''jjh'' (written as ''aitihya'' but pronounced as ''ōitijjhō''). The same হ্য is pronounced as 'hæ' as in (meaning "yes") (written as ''hyām̐'' but pronounced as nasalised "hæ"). Another example of inconsistency in the script is that of words like, (written as ''anya'' but pronounced as ''ōnnō'' "other, different") and (written as ''anna'' but pronounced as ''onnō'' "cooked rice, food"); in these words, the letter is combining with two different consonant clusters (''nya'') and (''nna''), and while the same letter has two different pronunciations, ''ō'' and ''o'', the two different consonant clusters have the same pronunciation. Thus, same letters and graphemes can often have different pronunciations depending on their position in a word and different graphemes and letters often have the same pronunciation. The main reason for these numerous inconsistencies is that there have been lots of sound mergers in Bengali, but the script has failed to account for the sound shifts and consonant mergers in the language. Bengali has lots of tatsam words (words directly derived from Sanskrit) and in all these words, the original spelling has been preserved but the pronunciations have changed due to consonant mergers and sound shifts. In fact, most of the tatsam words have many grapheme-to-phoneme inconsistencies while most of the tadbhav words (native Bengali words) have fairly consistent grapheme-to-phoneme correspondence. The Bengali writing system is, therefore, not often a true guide to pronunciation.


Uses

The script used for Bengali, Assamese, and other languages is known as
Bengali script The Bengali script or Bangla alphabet (, Romanization of Bengali, romanized: ''Bāṅlā bôrṇômālā'') is the standard writing system used to write the Bengali language, and has historically been used to write Sanskrit within Bengal. ...
. The script is known as the Bengali alphabet for Bengali and its dialects and the Assamese alphabet for
Assamese language Assamese () or Asamiya ( ) is an Indo-Aryan language spoken mainly in the north-eastern Indian state of Assam, where it is an official language. It has long served as a ''lingua franca'' in parts of Northeast India."Axomiya is the major langu ...
with some minor variations. Other related languages in the nearby region also make use of the Bengali script like the
Meitei language Meitei (; ) also known as Manipuri ), is a Tibeto-Burman language of northeast India. It is the official language and the lingua franca of Manipur and an additional official language in four districts of Assam. It is one of the scheduled ...
in the Indian state of
Manipur Manipur () is a state in northeastern India with Imphal as its capital. It borders the Indian states of Assam to the west, Mizoram to the south, and Nagaland to the north and shares the international border with Myanmar, specifically t ...
, where the Meitei language has been written in the Bengali script for centuries, though the
Meitei script The Meitei script (), also known as the Kanglei script () or the Kok Sam Lai script (), after its first three letters is an abugida in the Brahmic scripts family used to write the Meitei language, the official language of Manipur, Assam an ...
has been promoted in recent times.


Number system

Bengali digits are as follows: Some 19th-century grammars note additional signs for fractions, quarters and sixteenths in particular.


Romanisation

There are various romanisation systems used for Bengali created in recent years which have failed to represent the true Bengali phonetic sound. The Bengali alphabet has often been included with the group of Brahmic scripts for romanisation where the true phonetic value of Bengali is never represented. Some of them are the International Alphabet of Sanskrit Transliteration, or IAST system (based on diacritics); "Indian languages Transliteration", or ITRANS (uses upper case letters suited for
ASCII ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
keyboards); and the National Library at Kolkata romanisation. In the context of Bengali romanisation, it is important to distinguish
transliteration Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus '' trans-'' + '' liter-'') in predictable ways, such as Greek → and → the digraph , Cyrillic → , Armenian → or L ...
from transcription. Transliteration is orthographically accurate (i.e. the original spelling can be recovered), whereas transcription is phonetically accurate (the pronunciation can be reproduced). As the spelling often doesn't reflect the actual pronunciation,
transliteration Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus '' trans-'' + '' liter-'') in predictable ways, such as Greek → and → the digraph , Cyrillic → , Armenian → or L ...
and transcription are often different. Although it might be desirable to use a transliteration scheme where the original Bengali orthography is recoverable from the Latin text, Bengali words are currently romanised on Wikipedia using a phonemic transcription, where the true phonetic pronunciation of Bengali is represented with no reference to how it is written. The most recent attempt has been by publishers Mitra and Ghosh with the launch of three popular children's books, ''Abol Tabol'', ''Hasi Khusi'' and ''Sahoj Path'', in Roman script at the Kolkata Book Fair 2018. Published under the imprint of Benglish Books, these are based on phonetic transliteration and closely follow spellings used in social media but for using an underline to describe soft consonants.


Grammar

Bengali nouns are not assigned gender, which leads to minimal changing of adjectives (
inflection In linguistic Morphology (linguistics), morphology, inflection (less commonly, inflexion) is a process of word formation in which a word is modified to express different grammatical category, grammatical categories such as grammatical tense, ...
). However, nouns and pronouns are moderately declined (altered depending on their function in a sentence) into four cases while verbs are heavily conjugated, and the verbs do not change form depending on the gender of the nouns.


Word order

As a head-final language, Bengali follows a subject–object–verb
word order In linguistics, word order (also known as linear order) is the order of the syntactic constituents of a language. Word order typology studies it from a cross-linguistic perspective, and examines how languages employ different orders. Correlatio ...
, although variations on this theme are common. Bengali makes use of postpositions, as opposed to the prepositions used in English and other European languages.
Determiner Determiner, also called determinative ( abbreviated ), is a term used in some models of grammatical description to describe a word or affix belonging to a class of noun modifiers. A determiner combines with a noun to express its reference. Examp ...
s follow the
noun In grammar, a noun is a word that represents a concrete or abstract thing, like living creatures, places, actions, qualities, states of existence, and ideas. A noun may serve as an Object (grammar), object or Subject (grammar), subject within a p ...
, while numerals,
adjective An adjective (abbreviations, abbreviated ) is a word that describes or defines a noun or noun phrase. Its semantic role is to change information given by the noun. Traditionally, adjectives are considered one of the main part of speech, parts of ...
s, and possessors precede the noun. Yes–no questions do not require any change to the basic word order; instead, the low (L) tone of the final syllable in the utterance is replaced with a falling (HL) tone. Additionally, optional particles (e.g. ''-ki'', ''-na'', etc.) are often encliticised onto the first or last word of a yes–no question. Wh-questions are formed by fronting the wh-word to focus position, which is typically the first or second word in the utterance.


Nouns

Nouns and pronouns are inflected for case, including
nominative In grammar, the nominative case ( abbreviated ), subjective case, straight case, or upright case is one of the grammatical cases of a noun or other part of speech, which generally marks the subject of a verb, or (in Latin and formal variants of E ...
, objective, genitive (possessive), and locative. The case marking pattern for each noun being inflected depends on the noun's degree of animacy. When a
definite article In grammar, an article is any member of a class of dedicated words that are used with noun phrases to mark the identifiability of the referents of the noun phrases. The category of articles constitutes a part of speech. In English, both "the" ...
such as ''-ṭa'' (singular) or ''-gulo'' (plural) is added, as in the tables below, nouns are also inflected for
number A number is a mathematical object used to count, measure, and label. The most basic examples are the natural numbers 1, 2, 3, 4, and so forth. Numbers can be represented in language with number words. More universally, individual numbers can ...
. In most of Bengali grammar books, cases are divided into 6 categories and an additional possessive case (the possessive form is not recognised as a type of case by Bengali grammarians). But in terms of usage, cases are generally grouped into only 4 categories. When counted, nouns take one of a small set of measure words. Nouns in Bengali cannot be counted by adding the numeral directly adjacent to the noun. An appropriate measure word (MW), a classifier, must be used between the numeral and the noun (most languages of the Mainland Southeast Asia linguistic area are similar in this respect). Most nouns take the generic measure word ''-ṭa'', though other measure words indicate semantic classes (e.g. ''-jôn'' for humans). There is also the classifier ''-khana,'' and its diminutive form ''-khani'', which attaches only to nouns denoting something flat, long, square, or thin. These are the least common of the classifiers. Measuring nouns in Bengali without their corresponding measure words (e.g. ''aṭ biṛal'' instead of ''aṭ-ṭa biṛal'' "eight cats") would typically be considered ungrammatical. However, when the semantic class of the noun is understood from the measure word, the noun is often omitted and only the measure word is used, e.g. ''Shudhu êk-jôn thakbe.'' (MW will remain.") would be understood to mean "Only one person will remain.", given the semantic class implicit in ''-jôn''. In this sense, all nouns in Bengali, unlike most other Indo-European languages, are similar to
mass noun In linguistics, a mass noun, uncountable noun, non-count noun, uncount noun, or just uncountable, is a noun with the syntactic property that any quantity of it is treated as an undifferentiated unit, rather than as something with discrete eleme ...
s.


Verbs

There are two classes of verbs: finite and non-finite. Non-finite verbs have no inflection for tense or person, while finite verbs are fully inflected for
person A person (: people or persons, depending on context) is a being who has certain capacities or attributes such as reason, morality, consciousness or self-consciousness, and being a part of a culturally established form of social relations suc ...
(first, second, third), tense (present, past, future), aspect (simple, perfect, progressive), and
honour Honour (Commonwealth English) or honor (American English; American and British English spelling differences#-our, -or, see spelling differences) is a quality of a person that is of both social teaching and personal ethos, that manifests itself ...
(intimate, familiar, and formal), but ''not'' for number. Conditional, imperative, and other special inflections for mood can replace the tense and aspect suffixes. The number of inflections on many verb roots can total more than 200.
Inflection In linguistic Morphology (linguistics), morphology, inflection (less commonly, inflexion) is a process of word formation in which a word is modified to express different grammatical category, grammatical categories such as grammatical tense, ...
al suffixes in the morphology of Bengali vary from region to region, along with minor differences in
syntax In linguistics, syntax ( ) is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure (constituenc ...
. Bengali differs from most Indo-Aryan Languages in the
zero copula Zero copula, also known as null copula, is a linguistic phenomenon whereby the subject is joined to the predicate without overt marking of this relationship (like the copula (linguistics), copula ''to be'' in English). One can distinguish languag ...
, where the copula or connective ''be'' is often missing in the present tense. in Thus, "he is a teacher" is ''se shikkhôk'', (literally "he teacher").Among Bengali speakers brought up in neighbouring linguistic regions (e.g. Hindi), the lost copula may surface in utterances such as ''she shikkhôk hocche''. This is viewed as ungrammatical by other speakers, and speakers of this variety are sometimes (humorously) referred as "hocche-Bangali". In this respect, Bengali is similar to Russian and Hungarian. Romani grammar is also the closest to Bengali grammar.


Vocabulary

Bengali is typically thought to have around 100,000 separate words, of which 16,000 (16%) are considered to be তদ্ভব ''tôdbhôbô, or Tadbhava'' (inherited Indo-Aryan vocabulary), 40,000 (40%) are তৎসম ''tôtśômô'' or '' Tatsama'' (words directly borrowed from
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
), and borrowings from দেশী ''deśi,'' or "indigenous" words, which are at around 16,000 (16%) of the Bengali vocabulary. The rest are বিদেশী ''bideśi'' or "foreign" sources, including Persian, Turkish,
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
, and English among others, accounting for around 28,000 (28%) of all Bengali words, highlighting the significant influence that foreign languages and cultures have had on the Bengali language throughout Bengal's long history of contact with different peoples and the cultural exchanges that came with such interactions. Bengali is reportedly similar to Assamese and has a lexical similarity of 40 per cent with Nepali. According to Suniti Kumar Chatterji, dictionaries from the early 20th century attributed a little more than 50% of the Bengali vocabulary to native words (i.e., naturally modified
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
words, corrupted forms of Sanskrit words, and loanwords non-Indo-European languages). About 45% per cent of Bengali words are unmodified Sanskrit, and the remaining words are from foreign languages. However, more modern sources cite that this is not the case with Bengali vocabulary, as there are far more dominant foreign influences that accurately reflect the way modern Bengalis speak and utilise Bengali. Persian is also thought to have influenced many grammatical forms. More recent studies suggest that the use of foreign words has been increasing, mainly because of the preference of Bengali speakers for the colloquial style. Because of centuries of contact with Europeans,
Turkic peoples Turkic peoples are a collection of diverse ethnic groups of West Asia, West, Central Asia, Central, East Asia, East, and North Asia as well as parts of Europe, who speak Turkic languages.. "Turkic peoples, any of various peoples whose members ...
, and
Persians Persians ( ), or the Persian people (), are an Iranian ethnic group from West Asia that came from an earlier group called the Proto-Iranians, which likely split from the Indo-Iranians in 1800 BCE from either Afghanistan or Central Asia. They ...
, Bengali has absorbed numerous words from foreign languages, often totally integrating these borrowings into the core vocabulary. Persian influence was significant for the development of Bengali up to the modern day, and was the primary official language in the region for 600 years, until British rule, when it was changed to English in 1836. In fact, there was so much Persian influence that a register of highly Persianized Bengali, known as Dobhashi appeared in medieval Bengal. The most common borrowings from foreign languages come from three types of contact. After close contact with several indigenous Austroasiatic languages, and later the
Delhi Sultanate The Delhi Sultanate or the Sultanate of Delhi was a Medieval India, late medieval empire primarily based in Delhi that stretched over large parts of the Indian subcontinent for more than three centuries.
, the
Bengal Sultanate The Bengal Sultanate (Middle Bengali: , Classical Persian: ) was a Post-classical history, late medieval sultanate based in the Bengal region in the eastern South Asia between the 14th and 16th century. It was the dominant power of the Ganges- ...
, and the
Mughal Empire The Mughal Empire was an Early modern period, early modern empire in South Asia. At its peak, the empire stretched from the outer fringes of the Indus River Basin in the west, northern Afghanistan in the northwest, and Kashmir in the north, to ...
, whose court language was Persian, numerous
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
, Persian, and Chaghatai words were absorbed into the lexicon. Later, East Asian travellers and lately European
colonialism Colonialism is the control of another territory, natural resources and people by a foreign group. Colonizers control the political and tribal power of the colonised territory. While frequently an Imperialism, imperialist project, colonialism c ...
brought words from Portuguese, French, Dutch, and most significantly English during the colonial period.


Sample text

The following is a sample text in Bengali of Article 1 of the
Universal Declaration of Human Rights The Universal Declaration of Human Rights (UDHR) is an international document adopted by the United Nations General Assembly that enshrines the Human rights, rights and freedoms of all human beings. Drafted by a UN Drafting of the Universal D ...
:


See also

* Bangla Academy (Bangladesh) * Bangla Academy (West Bengal) * Bengali numerals * Bengali-language newspapers


Notes


References

* * * * * * * Chakraborty, Byomkes, A Comparative Study of Santali and Bengali, K.P. Bagchi & Co., Kolkata, 1994, . * * * * * * * * * * * * * * * * * * * Shaw, Rameswar ''Sadharan Bhasabigna O Bangal Bhasa'', Pustak Bipani, Kolkata, 1997. * Haldar, Narayan ''Bengali Bhasa Prsanga: Banan Kathan Likhanriti'', Pustak Bipani, Kolkata, 2007. *


Further reading

* Thompson, Hanne-Ruth (2012).
Bengali
'. Volume 18 of London Oriental and African Language Library. John Benjamins Publishing. .


External links



at the
Library of Congress The Library of Congress (LOC) is a research library in Washington, D.C., serving as the library and research service for the United States Congress and the ''de facto'' national library of the United States. It also administers Copyright law o ...
{{Authority control Eastern Indo-Aryan languages Languages of Bangladesh Official languages of India Subject–object–verb languages Languages of West Bengal Languages of Tripura Official languages of Assam Languages written in Brahmic scripts Languages of Jharkhand Languages of Bihar Classical Language in India