Bangla alphabet
   HOME

TheInfoList



OR:

The Bengali script or Bangla alphabet ( bn, বাংলা বর্ণমালা, ''Bangla bôrṇômala'') is the alphabet used to write the
Bengali language Bengali ( ), generally known by its endonym Bangla (, ), is an Indo-Aryan language native to the Bengal region of South Asia. It is the official, national, and most widely spoken language of Bangladesh and the second most widely spoken o ...
based on the Bengali-Assamese script, and has historically been used to write
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
within
Bengal Bengal ( ; bn, বাংলা/বঙ্গ, translit=Bānglā/Bôngô, ) is a geopolitical, cultural and historical region in South Asia, specifically in the eastern part of the Indian subcontinent at the apex of the Bay of Bengal, predom ...
. It is one of the most widely adopted writing systems in the world (used by over 265 million people). From a classificatory point of view, the Bengali writing system is an
abugida An abugida (, from Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental writing system in which consonant-vowel sequences are written as units; each unit is based on a consonant letter, and vowel n ...
, i.e. its vowel
graphemes In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called ''graphemic ...
are mainly realised not as independent letters, but as
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s modifying the vowel inherent in the base letter they are added to. The Bengali writing system is written from left to right and uses a single
letter case Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
, which makes it a unicameral script, as opposed to a bicameral one like the Latin script. It is recognisable, as are some other
Brahmic scripts The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient Ind ...
, by a distinctive horizontal line known as a '' mātrā'' () running along the tops of the letters that links them together. The Bengali writing system is less blocky, however, and presents a more sinuous shape than the
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental writing system), based on the ...
script.George Cardona and Danesh Jain (2003), The Indo-Aryan Languages, Routledge,


Characters

The Bengali script can be divided into vowels and vowel diacritics, consonants and consonant conjuncts, diacritical and other symbols, digits and punctuation marks. Vowels and consonants are used as letters and also as diacritical marks.


Vowels

The Bengali script has a total of 9 vowel
graphemes In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called ''graphemic ...
, each of which is called a ''swôrôbôrnô'' "vowel letter". The ''swôrôbôrnô''s represent six of the seven main vowel sounds of Bengali, along with two vowel
diphthongs A diphthong ( ; , ), also known as a gliding vowel, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of the speech ...
. All of them are used in both Bengali and Assamese languages. * "" ''ô'' ('' shôrô ô'', "vocalic ô") sounds as the default
inherent vowel An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol. For example, if the Latin alphabet used 'i' as an inherent vowel, "Wikipedia" could be rendered as "W ...
for the entire Bengali script. Bengali, Assamese and Odia which are Eastern languages have this value for the inherent vowel, while other languages using
Brahmic scripts The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient Ind ...
have ''a'' for their inherent vowel. * Even though the near-open front unrounded vowel is one of the seven main vowel sounds in the standard Bengali language, no distinct vowel symbol has been allotted for it in the script since there is no sound in Sanskrit, the primary written language when the script was conceived. As a result, the sound is orthographically realised by multiple means in modern Bengali orthography, usually using some combination of "" ''e'' ('' shôrô e'', "vocalic e") , "", "" ''a'' ('' shôrô a'') and the ''jôphôla'' (diacritic form of the consonant grapheme ''jô''). * There are two graphemes for the vowel sound and two graphemes for the vowel sound . The redundancy stems from the time when this script was used to write
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
, a language that had short and long vowels: "" ''i'' ('' rôshshô i'', "short i") and "" ''ī'' (''দীর্ঘ ঈ dirghô ī'', "long ī") , and "" ''u'' (''হ্রস্ব উ rôshshô u'') and "" ''ū'' (''দীর্ঘ ঊ dirghô ū'') . The letters are preserved in the Bengali script with their traditional names despite the fact that they are no longer pronounced differently in ordinary speech. These graphemes serve an etymological function, however, in preserving the original Sanskrit spelling in '' tôtsômô'' Bengali words (words borrowed from Sanskrit). * The grapheme called "" ''ṛ'' (or ''হ্রস্ব ঋ rôshshô ri'', "short ri", as it used to be) does not really represent a vowel phoneme in Bengali but the consonant-vowel combination . Nevertheless, it is included in the vowel section of the inventory of the Bengali script. This inconsistency is also a remnant from
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
, where the grapheme represents the vocalic equivalent of a
retroflex approximant The voiced retroflex approximant is a type of consonant used in some languages. The symbol in the International Phonetic Alphabet that represents this sound is , and the equivalent X-SAMPA symbol is r\`. The IPA symbol is a turned lowercase lett ...
(possibly an r-colored vowel). Another grapheme called "" ''ḷ'' (or ''হ্রস্ব ঌ rôshshô li'' as it used to be) representing the vocalic equivalent of a
dental approximant The voiced dental fricative is a consonant sound used in some spoken languages. It is familiar to English-speakers as the ''th'' sound in ''father''. Its symbol in the International Phonetic Alphabet is eth, or and was taken from the Old Engl ...
in Sanskrit but actually representing the consonant-vowel combination in Bengali instead of a vowel phoneme, was also included in the vowel section but unlike "", it was recently discarded from the inventory since its usage was extremely limited even in Sanskrit. * When a vowel sound occurs syllable-initially or when it follows another vowel, it is written using a distinct letter. When a vowel sound follows a consonant (or a consonant cluster), it is written with a diacritic which, depending on the vowel, can appear above, below, before or after the consonant. These vowel marks cannot appear without a consonant and are called ''kar''. * An exception to the above system is the vowel , which has no vowel mark but is considered inherent in every consonant letter. To denote the absence of the inherent vowel following a consonant, a diacritic called the ''hôsôntô'' (্) may be written underneath the consonant. * Although there are only two diphthongs in the inventory of the script: "" ''oi'' (''স্বর ঐ shôrô oi'', "vocalic oi") and "" ''ou'' (''স্বর ঔ shôrô ou'') , the Bengali phonetic system has, in fact, many diphthongs.Different Bengali linguists give different numbers of Bengali diphthongs in their works depending on methodology, e.g. 25 (Chatterji 1939: 40), 31 (Hai 1964), 45 (Ashraf and Ashraf 1966: 49), 28 (Kostic and Das 1972:6–7) and 17 (Sarkar 1987). Most diphthongs are represented by juxtaposing the graphemes of their forming vowels, as in ''keu'' . * There also used to be two long vowels: "" ''ṝ'' (''দীর্ঘ ৠ dirghô rri'', "long rri") and "" ''ḹ'' (''দীর্ঘ ৡ dirghô lli''), which were removed from the inventory during the Vidyasagarian reform of the script due to peculiarity to Sanskrit. The table below shows the vowels present in the modern (since the late nineteenth century) inventory of the Bengali alphabet:


Notes


Consonants

Consonant letters are called ''bænjônbôrnô'' "consonant letter" in Bengali. The names of the letters are typically just the consonant sound plus the inherent vowel '' ô''. Since the inherent vowel is assumed and not written, most letters' names look identical to the letter itself (the name of the letter is itself ''ghô'', not ''gh''). * Some letters that have lost their distinctive pronunciation in modern Bengali are called by more elaborate names. For example, since the consonant phoneme is written as both and , the letters are not called simply ''nô''; instead, they are called ''dôntyô nô'' (" dental nô") and ''murdhônyô nô'' ("
retroflex A retroflex (Help:IPA/English, /ˈɹɛtʃɹoːflɛks/), apico-domal (Help:IPA/English, /əpɪkoːˈdɔmɪnəl/), or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated betw ...
nô"). What was once pronounced and written as a retroflex nasal ণ is now pronounced as an alveolar (unless conjoined with another
retroflex consonant A retroflex ( /ˈɹɛtʃɹoːflɛks/), apico-domal ( /əpɪkoːˈdɔmɪnəl/), or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated between the alveolar ridge and the h ...
such as ট, ঠ, ড and ঢ) although the spelling does not reflect the change. * Although still named ''murdhônyô'' when they are being taught,
retroflex consonants A retroflex ( /ˈɹɛtʃɹoːflɛks/), apico-domal ( /əpɪkoːˈdɔmɪnəl/), or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated between the alveolar ridge and the har ...
do not exist in Bengali and are instead fronted to their postalveolar and alveolar equivalents. * The
voiceless palato-alveolar sibilant A voiceless postalveolar fricative is a type of consonantal sound used in some spoken languages. The International Phonetic Association uses the term ''voiceless postalveolar fricative'' only for the sound , but it also describes the voiceless ...
phoneme can be written as , ('' talôbyô shô'', "palatal shô"), ('' murdhônyô shô'', "retroflex shô"), or ('' dôntyô sô'', "dental sô"
voiceless alveolar fricative The voiceless alveolar fricatives are a type of fricative consonant pronounced with the tip or blade of the tongue against the alveolar ridge (gum line) just behind the teeth. This refers to a class of sounds, not a single sound. There are at lea ...
), depending on the word. * The
voiced palato-alveolar affricate The voiced palato-alveolar sibilant affricate, voiced post-alveolar affricate or voiced domed postalveolar sibilant affricate, is a type of consonantal sound, used in some spoken languages. The sound is transcribed in the International Phonetic ...
phoneme can be written in two ways, as ('' ôntôsthô jô'') or ( ''bôrgiyô jô''). In many varieties of Bengali, are not distinct from this phoneme, but speakers who distinguish them may use the letters and with contrast. * Post-
reform Reform ( lat, reformo) means the improvement or amendment of what is wrong, corrupt, unsatisfactory, etc. The use of the word in this way emerges in the late 18th century and is believed to originate from Christopher Wyvill's Association movement ...
, the letter য় was introduced to distinguish it from য
note Note, notes, or NOTE may refer to: Music and entertainment * Musical note, a pitched sound (or a symbol for a sound) in music * ''Notes'' (album), a 1987 album by Paul Bley and Paul Motian * ''Notes'', a common (yet unofficial) shortened version ...
]: ** The semivowel yô cannot occur at the beginning of a word . The name of য় is ''ôntôsthô ô ('semi-vowel y')'' [the y is silent in the pronunciation of its name]. The pronunciation of yô varies between ⟨w⟩ and ⟨j⟩ w' and 'y' ** The name of is '' ôntôsthô jô ('semi-vowel j').'' It is found almost entirely at the beginning of words. ** When present in the middle of words, in conjuncts, য is represented as a distinct letter: (যফলা ''jôphôla'') which is mostly silent or semi-silent (see below). ''Jôphôla'' may alter the pronunciation of the surrounding vowel or double the preceding consonant or be completely silent. * Since the
nasals In phonetics, a nasal, also called a nasal occlusive or nasal stop in contrast with an oral stop or nasalized consonant, is an occlusive consonant produced with a lowered velum, allowing air to escape freely through the nose. The vast majorit ...
''ñô'' and ''ngô'' cannot occur at the beginning of a word in Bengali, their names are not ''ñô'' and ''ngô'' respectively but ''ungô'' (pronounced by some as ''umô'' or ''ũô'') and ''iñô'' (pronounced by some as ''niyô'' or ''ingô'') respectively. * There is a difference in the pronunciation of ''ṛô'' ('' ḍô-e shunyô ṛô'', "ṛô (as) ḍô with a zero (the figure is used analogous to the
ring below A ring diacritic may appear above or below letters. It may be combined with some Letter (alphabet), letters of the extended Latin alphabets in various contexts. Rings Distinct letter The character Å (å) is derived from an A with a ring. It ...
diacritic as the Bengali equivalent of the
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental writing system), based on the ...
nuqta ় The nuqta (Hindi–Urdu: //, fa, , noqte; from ar, نقطة, nuqṭa, dot; sometimes also spelled nukta) is a diacritic mark that was introduced in Devanagari and some other Indic scripts to represent sounds not present in the original scri ...
, which is again analogous to the
underdot When used as a diacritic mark, the term dot is usually reserved for the ''interpunct'' ( · ), or to the glyphs "combining dot above" ( ◌̇ ) and "combining dot below" ( ◌̣ ) which may be combined with some letters of th ...
)") and ''ṛhô'' ('' ḍhô-e shunyô ṛhô'') with that of ''rô'' (sometimes called '' bô-e shunyô rô'' for distinguishing purpose) - similar to other Indic languages. This is especially true in the parlance of western and southern part of Bengal but lesser on the dialects of the eastern side of the
Padma River The Padma ( bn, পদ্মা ''Pôdma'') is a major river in Bangladesh. It is the main distributary of the Ganges, flowing generally southeast for to its confluence with the Meghna River near the Bay of Bengal. The city of Rajshahi is sit ...
. and were introduced to the inventory during the Vidyasagarian reform to indicate the
retroflex flap The voiced retroflex flap is a type of consonantal sound, used in some spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is , and the equivalent X-SAMPA symbol is r`. Features Features of the voice ...
in the pronunciation of ''ḍô'' and ''ḍhô'' in the middle or end of a word. It is an
allophonic In phonology, an allophone (; from the Greek , , 'other' and , , 'voice, sound') is a set of multiple possible spoken soundsor '' phones''or signs used to pronounce a single phoneme in a particular language. For example, in English, (as in '' ...
development in some Indic languages not present in Sanskrit. Yet in ordinary speech these letters are pronounced the same as in modern Bengali.


Notes


Consonant conjuncts

Clusters of up to four consonants can be orthographically represented as a
typographic ligature In writing and typography, a ligature occurs where two or more graphemes or letters are joined to form a single glyph. Examples are the characters æ and œ used in English and French, in which the letters 'a' and 'e' are joined for the firs ...
called a consonant conjunct ( bn, যুক্তাক্ষর/যুক্তবর্ণ ''juktakkhôr''/''juktôbôrnô'' or more specifically ). Typically, the first consonant in the conjunct is shown above and/or to the left of the following consonants. Many consonants appear in an abbreviated or compressed form when serving as part of a conjunct. Others simply take exceptional forms in conjuncts, bearing little or no resemblance to the base character. Often, consonant conjuncts are not actually pronounced as would be implied by the pronunciation of the individual components. For example, adding ''lô'' underneath ''shô'' in Bengali creates the conjunct , which is not pronounced ''shlô'' but ''slô'' in Bengali. Many conjuncts represent Sanskrit sounds that were lost centuries before modern Bengali was ever spoken as in . It is a combination of ǰô and ñô but it is not pronounced "ǰñô" or "jnô". Instead, it is pronounced ''ggô'' in modern Bengali. Thus, as conjuncts often represent (combinations of) sounds that cannot be easily understood from the components, the following descriptions are concerned only with the construction of the conjunct, and not the resulting pronunciation. (Some graphemes may appear in a form other than the mentioned form due to the font used)


Fused forms

Some consonants fuse in such a way that one stroke of the first consonant also serves as a stroke of the next. * The consonants can be placed on top of one another, sharing their vertical line: kkô gnô glô nnô pnô ppô llô etc. * As the last member of a conjunct, ব bô can hang on the vertical line under the preceding consonants, taking the shape of ব bô (includes বফলা ''bôphôla''): gbô "ṇbô" "dbô" lbô "shbô". * The consonants can also be placed side-by-side, sharing their vertical line: ddô ndô bdô bǰô pṭô sṭô shchô shchhô, etc.


Approximated forms

Some consonants are written closer to one another simply to indicate that they are in a conjunct together. * The consonants can be placed side-by-side, appearing unaltered: dgô dghô ḍḍô. * As the last member of a conjunct, bô can appear immediately to the right of the preceding consonant, taking the shape of bô (includes বফলা ''bôphôla''): "dhbô" bbô "hbô".


Compressed forms

Some consonants are compressed (and often simplified) when appearing as the first member of a conjunct. * As the first member of a conjunct, the consonants ngô chô ḍô and bô are often compressed and placed at the top-left of the following consonant, with little or no change to the basic shape: "ngkṣô" ngkhô ngghô ngmô chchô chchhô "chnô" ḍḍhô bbô. * As the first member of a conjunct, tô is compressed and placed above the following consonant, with little or no change to the basic shape: tnô "tmô" "tbô". * As the first member of a conjunct, mô is compressed and simplified to a curved shape. It is placed above or to the top-left of the following consonant: mnô mpô mfô mbô mbhô mmô mlô. * As the first member of a conjunct, ṣô is compressed and simplified to an oval shape with a diagonal stroke through it. It is placed to the top-left of the following consonants: ṣkô ṣṭô ṣṭhô ṣpô ṣfô ṣmô. * As the first member of a conjunct, sô is compressed and simplified to a ribbon shape. It is placed above or to the top-left of the following consonant: skô skhô stô sthô snô spô sfô "sbô" "smô" slô.


Abbreviated forms

Some consonants are abbreviated when appearing in conjuncts and lose part of their basic shape. * As the first member of a conjunct, ǰô can lose its final down-stroke: ǰǰô "ǰñô" "jbô". * As the first member of a conjunct, ñô can lose its bottom half: ñchô ñchhô ñǰô ñǰhô. * As the last member of a conjunct, ñô can lose its left half (the part): "ǰñô". * As the first member of a conjunct, ṇô and pô can lose their down-stroke: ṇṭhô ṇḍô ptô psô. * As the first member of a conjunct, tô and bhô can lose their final upward tail: ttô tthô trô bhrô. * As the last member of a conjunct, thô can lose its final upstroke, taking the form of hô instead: nthô sthô mthô * As the last member of a conjunct, mô can lose its initial down-stroke: "kmô" "gmô" ngmô "ṭmô" "ṇmô" "tmô" "dmô" nmô mmô "shmô" ṣmô "smô". * As the last member of a conjunct, sô can lose its top half: ksô. * As the last member of a conjunct ṭô, ḍô and ḍhô can lose their matra: pṭô ṇḍô ṇṭô ṇḍhô. * As the last member of a conjunct ḍô can change its shape: ṇḍô


Variant forms

Some consonants have forms that are used regularly but only within conjuncts. * As the first member of a conjunct, ঙ ngô can appear as a loop and curl: ঙ্ক ngkô ঙ্গ nggô. * As the last member of a conjunct, the curled top of ধ dhô is replaced by a straight downstroke to the right, taking the form of ঝ ǰhô instead: গ্ধ gdhô দ্ধ ddhô ন্ধ ndhô ব্ধ bdhô. * As the first member of a conjunct, র rô appears as a diagonal stroke (called রেফ ''ref'') above the following member: র্ক rkô র্খ rkhô র্গ rgô র্ঘ rghô, etc. * As the last member of a conjunct, র rô appears as a wavy horizontal line (called রফলা ''rôphôla'') under the previous member: খ্র khrô গ্র grô ঘ্র ghrô ব্র brô, etc. ** In some fonts, certain conjuncts with রফলা ''rôphôla'' appear using the compressed (and often simplified) form of the previous consonant: জ্র ǰrô ট্র ṭrô ঠ্র ṭhrô ড্র ḍrô ম্র mrô স্র srô. ** In some fonts, certain conjuncts with রফলা ''rôphôla'' appear using the abbreviated form of the previous consonant: ক্র krô ত্র trô ভ্র bhrô. * As the last member of a conjunct, য jô appears as a wavy vertical line (called যফলা ''jôphôla'') to the right of the previous member: ক্য "kyô" খ্য "khyô" গ্য "gyô" ঘ্য "ghyô" etc. ** In some fonts, certain conjuncts with যফলা ''jôphôla'' appear using special fused forms: দ্য "dyô" ন্য "nyô" শ্য "shyô" ষ্য "ṣyô" স্য "syô" হ্য "hyô".


Exceptions

* When followed by র rô or ত tô, ক kô takes on the same form as ত tô would with the addition of a curl to the right: ক্র krô, ক্ত ktô. * When preceded by the abbreviated form of ঞ ñô, চ chô takes the shape of ব bô: ঞ্চ ñchô * When preceded by another ট ṭô, ট is reduced to a leftward curl: ট্ট ṭṭô. * When preceded by ষ ṣô, ণ ṇô appears as two loops to the right: ষ্ণ ṣṇô. * As the first member of a conjunct, or when at the end of a word and followed by no vowel, ত tô can appear as : ৎস "tsô" ৎপ tpô ৎক tkô etc. * When preceded by হ hô, ন nô appears as a curl to the right: হ্ন "hnô". * Certain combinations must be memorised: ক্ষ "kṣô" হ্ম "hmô".


Certain compounds

When serving as a vowel mark, উ u, ঊ u, and ঋ ri take on many exceptional forms. * উ u ** When following গ gô or শ shô, it takes on a variant form resembling the final tail of ও o: গু gu শু shu. ** When following a ত tô that is already part of a conjunct with প pô, ন nô or স sô, it is fused with the ত to resemble ও o: ন্তু ntu স্তু stu প্তু ptu. ** When following র rô, and in many fonts also following the variant রফলা ''rôphôla'', it appears as an upward curl to the right of the preceding consonant as opposed to a downward loop below: রু ru গ্রু gru ত্রু tru থ্রু thru দ্রু dru ধ্রু dhru ব্রু bru ভ্রু bhru শ্রু shru. ** When following হ hô, it appears as an extra curl: হু hu. * ঊ u ** When following র rô, and in many fonts also following the variant রফলা ''rôphôla'', it appears as a downstroke to the right of the preceding consonant as opposed to a downward hook below: রূ rū গ্রূ grū থ্রূ thrū দ্রূ drū ধ্রূ dhrū ভ্রূ bhrū শ্রূ shrū. * ঋ ri ** When following হ hô, it takes the variant shape of ঊ u: হৃ hri. * Conjuncts of three consonants also exist, and follow the same rules as above: স sô + ত tô +র rô = স্ত্র strô, ম mô + প pô + র rô = ম্প্র mprô, জ ǰô + জ ǰô + ব bô = জ্জ্ব "ǰǰbô", ক্ষ "kṣô" + ম mô = ক্ষ্ম "kṣmô". * Theoretically, four-consonant conjuncts can also be created, as in র rô + স sô + ট ṭô + র rô = র্স্ট্র rsṭrô, but they are not found in native words. * Also theoretically, 5-letter conjuncts can be created, as র rô + স sô + ট tô + র rô + ঁ = র্স্ট্রঁ (pronounced rsṭrô but nasalised: rsṭrôñ). Here ঁ is a
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
which nasalises the previous vowel. A theoretical 6-letter conjunct would be র্স্ট্রাঁ (rsṭrañ/rsṭra), with the addition of a (আ) to র্স্ট্রঁ, and a theoretical 7-letter conjunct would be like র্স্ট্র‍্যাঁ (rsṭrya/rsṭryañ) with the addition of য to র্স্ট্রাঁ.


Diacritics and other symbols

These are mainly the Brahmi-Sanskrit diacritics, phones and punctuation marks present in languages with Sanskrit influence or Brahmi-derived scripts.


Notes


Digits and numerals

The Bengali script has ten
numerical digit A numerical digit (often shortened to just digit) is a single symbol used alone (such as "2") or in combinations (such as "25"), to represent numbers in a positional numeral system. The name "digit" comes from the fact that the ten digits (Latin ...
s (graphemes or symbols indicating the numbers from 0 to 9). Bengali numerals have no horizontal headstroke or মাত্রা "matra". Numbers larger than 9 are written in Bengali using a positional base 10 numeral system (the decimal system). A period or dot is used to denote the
decimal separator A decimal separator is a symbol used to separate the integer part from the fractional part of a number written in decimal form (e.g., "." in 12.45). Different countries officially designate different symbols for use as the separator. The choi ...
, which separates the integral and the fractional parts of a decimal number. When writing large numbers with many digits, commas are used as delimiters to
group A group is a number of persons or things that are located, gathered, or classed together. Groups of people * Cultural group, a group whose members share the same cultural identity * Ethnic group, a group whose members share the same ethnic ide ...
digits, indicating the
thousand 1000 or one thousand is the natural number following 999 and preceding 1001. In most English-speaking countries, it can be written with or without a comma or sometimes a period separating the thousands digit: 1,000. A group of one thousand th ...
(হাজার ''hazar''), the hundred thousand or lakh (লাখ ''lakh'' or লক্ষ ''lôkkhô''), and the ten million or hundred lakh or crore (কোটি ''koti'') units. In other words, leftwards from the decimal separator, the first grouping consists of three digits, and the subsequent groupings always consist of two digits. For example, the English number 17,557,345 will be written in traditional Bengali as ১,৭৫,৫৭,৩৪৫.


Punctuation marks

Bengali punctuation marks, apart from the downstroke দাড়ি ''dari'' (।), the Bengali equivalent of a full stop, have been adopted from western scripts and their usage is similar: Commas, semicolons, colons, quotation marks, etc. are the same as in English. Capital letters are absent in the Bengali script so proper names are unmarked.
An apostrophe, known in Bengali as ঊর্ধ্বকমা ''urdhbôkôma'' "upper comma", is sometimes used to distinguish between
homograph A homograph (from the el, ὁμός, ''homós'', "same" and γράφω, ''gráphō'', "write") is a word that shares the same written form as another word but has a different meaning. However, some dictionaries insist that the words must also ...
s, as in পাটা ''pata'' "plank" and পাʼটা ''pa'ta'' "the leg". Sometimes, a hyphen is used for the same purpose (as in পা-টা, an alternative of পাʼটা).


Characteristics of the Bengali text

Bengali text is written and read horizontally, from left to right. The consonant graphemes and the full form of vowel graphemes fit into an imaginary rectangle of uniform size (uniform width and height). The size of a consonant conjunct, regardless of its complexity, is deliberately maintained the same as that of a single consonant grapheme, so that diacritic vowel forms can be attached to it without any distortion. In a typical Bengali text, orthographic words, words as they are written, can be seen as being separated from each other by an even spacing. Graphemes within a word are also evenly spaced, but that spacing is much narrower than the spacing between words. Unlike in western scripts (
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
, Cyrillic, etc.) for which the letter-forms stand on an invisible baseline, the Bengali letter-forms instead hang from a visible horizontal left-to-right headstroke called মাত্রা ''matra''. The presence and absence of this matra can be important. For example, the letter ত ''tô'' and the numeral ৩ "3" are distinguishable only by the presence or absence of the ''matra'', as is the case between the consonant cluster ত্র ''trô'' and the independent vowel এ ''e''. The letter-forms also employ the concepts of letter-width and letter-height (the vertical space between the visible matra and an invisible baseline). According to Bengali linguist
Munier Chowdhury Munier Choudhury (27 November 192514 December 1971) was a Bangladeshi educationist, playwright, literary critic and political dissident. He was a victim of the mass killing of Bangladeshi intellectuals in 1971. He was awarded Independence Day A ...
, there are about nine graphemes that are the most frequent in Bengali texts, shown with its percentage of appearance in the adjacent table.See Chowdhury 1963


Comparison of Bengali script with ancestral and related scripts


Vowels


Consonants


Vowel diacritics


Standardization

In the script, clusters of consonants are represented by different and sometimes quite irregular forms; thus, learning to read is complicated by the sheer size of the full set of letters and letter combinations, numbering about 350. While efforts at standardising the alphabet for the Bengali language continue in such notable centres as the
Bangla Academy The Bangla Academy ( bn, বাংলা একাডেমি, ) is an autonomous institution funded by the Bangladesh government to foster the Bengali language, literature and culture, to develop and implement national language policy and to d ...
at
Dhaka Dhaka ( or ; bn, ঢাকা, Ḍhākā, ), formerly known as Dacca, is the capital and largest city of Bangladesh, as well as the world's largest Bengali-speaking city. It is the eighth largest and sixth most densely populated city i ...
(
Bangladesh Bangladesh (}, ), officially the People's Republic of Bangladesh, is a country in South Asia. It is the eighth-most populous country in the world, with a population exceeding 165 million people in an area of . Bangladesh is among the mos ...
) and the Pôshchimbônggô Bangla Akademi at
Kolkata Kolkata (, or , ; also known as Calcutta , the official name until 2001) is the capital of the Indian state of West Bengal, on the eastern bank of the Hooghly River west of the border with Bangladesh. It is the primary business, comme ...
(
West Bengal West Bengal (, Bengali: ''Poshchim Bongo'', , abbr. WB) is a state in the eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabitants within an area of . West Bengal is the fou ...
, India), it is still not quite uniform yet, as many people continue to use various archaic forms of letters, resulting in concurrent forms for the same sounds.


Romanization

Romanization of Bengali is the representation of the
Bengali language Bengali ( ), generally known by its endonym Bangla (, ), is an Indo-Aryan language native to the Bengal region of South Asia. It is the official, national, and most widely spoken language of Bangladesh and the second most widely spoken o ...
in the
Latin script The Latin script, also known as Roman script, is an alphabetic writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae, in southern I ...
. There are various ways of Romanization systems of Bengali, created in recent years but failed to represent the true Bengali phonetic sound. While different standards for
romanisation Romanization or romanisation, in linguistics, is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and ...
have been proposed for Bengali, they have not been adopted with the degree of uniformity seen in languages such as Japanese or Sanskrit.In Japanese, there is some debate as to whether to accent certain distinctions, such as Tōhoku vs Tohoku. Sanskrit is well-standardized because the speaking community is relatively small, and sound change is not a large concern. The Bengali alphabet has often been included with the group of Brahmic scripts for romanisation in which the true phonetic value of Bengali is never represented. Some of them are the
International Alphabet of Sanskrit Transliteration The International Alphabet of Sanskrit Transliteration (IAST) is a transliteration scheme that allows the lossless romanisation of Indic scripts as employed by Sanskrit and related Indic languages. It is based on a scheme that emerged during ...
or "IAST system" "Indian languages Transliteration" or
ITRANS The "Indian languages TRANSliteration" (ITRANS) is an ASCII transliteration scheme for Indic scripts, particularly for the Devanagari script. The need for a simple encoding scheme that used only keys available on an ordinary keyboard was felt i ...
(uses upper case alphabets suited for
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because ...
keyboards), and the extension of IAST intended for non-Sanskrit languages of the Indian region called the
National Library at Kolkata romanisation The National Library of India, National Library at Kolkata romanisationSee p 24-26 for table comparing Indic languages, and p 33-34 for Devanagari alphabet listing. is a widely used transliteration scheme in dictionaries and grammars of Language ...
.


Sample texts

Article 1 of the Universal Declaration of Human Rights The first line is the Bengali alphabet; the second a phonetic Romanization, the third
IPA IPA commonly refers to: * India pale ale, a style of beer * International Phonetic Alphabet, a system of phonetic notation * Isopropyl alcohol, a chemical compound IPA may also refer to: Organizations International * Insolvency Practitioners ...
.


Unicode

Bengali script was added to the
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, wh ...
Standard in October 1991 with the release of version 1.0. The Unicode block for Bengali is U+0980–U+09FF:


See also

*
Bengali Braille Bengali Braille is used for the Bengali language. According to UNESCO (2013),World Bra ...
* Robert B. Wray movable type for Bengali (1778) *
Bengali phonology The phonology of Bengali, like that of its neighbouring Eastern Indo-Aryan languages, is characterised by a wide variety of diphthongs and inherent back vowels (both and ). Phonemic inventory Phonemically, Bengali features 29 consonants and ...


Notes


References


Bibliography

* * * * * * * * {{list of writing systems Bengali script
Alphabet An alphabet is a standardized set of basic written graphemes (called letters) that represent the phonemes of certain spoken languages. Not all writing systems represent language in this way; in a syllabary, each character represents a syllab ...
Languages of Bangladesh Officially used writing systems of India