Sinhala ( ; , ''siṁhala'', ), also known as Sinhalese, is an Indo-Aryan language primarily spoken by the Buddhist peoples of Sri Lanka, who make up the largest ethnic group on the island, numbering about 16 million. Sinhala is also spoken as the first language by other ethnic groups in Sri Lanka, totalling about 4 million people as of 2001. Sinhala is written using the Sinhala script, which is one of the Brahmic scripts, a descendant of the ancient Indian Brahmi script closely related to the Kadamba script. Sinhala is one of the official and national languages of Sri Lanka. Sinhala, along with Pali, played a major role in the development of Theravada Buddhist literature. The oldest Sinhalese Prakrit inscriptions found are from the third to second century BCE following the arrival of Buddhism in Sri Lanka, while the oldest extant literary works date from the ninth century. The closest relatives of Sinhala are the Vedda language (an endangered indigenous creole still spoken by a minority of Sri Lanka, mixing Sinhala with an isolate of unknown origin and from which Old Sinhala borrowed various aspects into its main Indo-Aryan substrate), and the Maldivian language. Sinhala has two main varieties, written and spoken, and is a conspicuous example of the linguistic phenomenon known as diglossia.


''Sinhala'' () is a Sanskrit term; the corresponding Middle Indo-Aryan (Eḷu) word is ''Sīhala''. The name is a derivation from ', the Sanskrit word for "lion". ' is attested as a Sanskrit name of the island in the ''Bhagavata Purana''. The name is sometimes glossed as "abode of lions", and attributed to a supposed former abundance of lions on the island.


According to the chronicle ''Mahavansa'', written in Pali, Prince Vijaya and his entourage merged with two tribes of the ancient India present in Lanka, the Yakkha and Naga peoples. In the following centuries, there was substantial immigration from Eastern India (Vanga Kingdom (Bengal), Kalinga, Magadha) which led to an admixture of features of Eastern Prakrits.

Stages of historical development

The development of Sinhala is divided into four periods: * Sinhalese Prakrit (until 3rd century CE) * Proto-Sinhala (3rd–7th century CE) * Medieval Sinhala (7th–12th century CE) * Modern Sinhala (12th century – present)

Phonetic development

The most important phonetic developments of Sinhala include * the loss of the aspiration distinction (e.g. ''kanavā'' "to eat" corresponds to Sanskrit ''khādati'', Hindi ''khānā'') * the loss of a vowel length distinction; long vowels in the modern language are due to loanwords (e.g. ''vibāgaya'' "exam" < Sanskrit ''vibhāga'') and sandhi, either after elision of Intervocalic consonants (e.g. ''dānavā'' "to put" < ''damanavā'') or in originally compound words. * the simplification of consonant clusters and geminate consonants into geminates and single consonants respectively (e.g. Sanskrit ''viṣṭā'' "time" > Sinhalese Prakrit ''viṭṭa'' > Modern Sinhala ''viṭa'') * development of to (e.g. ''däla'' "web" corresponds to Sanskrit ''jāla'')

Western vs. Eastern Prakrit features

An example for a Western feature in Sinhala is the retention of initial which developed into in the Eastern languages (e.g. Sanskrit ''viṃśati'' "twenty", Sinhala ''visi-'', Hindi ''bīs''). An example of an Eastern feature is the ending -e for masculine nominative singular (instead of Western -o) in Sinhalese Prakrit. There are several cases of vocabulary doublets, e.g. the words ''mässā'' ("fly") and ''mäkkā'' ("flea"), which both correspond to Sanskrit ''makṣikā'' but stem from two regionally different Prakrit words ''macchiā'' and ''makkhikā'' (as in Pali).

Pre-1815 Sinhalese literature

In 1815 the island of Ceylon came under British rule. During the career of Christopher Reynolds (1922–2015) as a Sinhalese lecturer at the SOAS, University of London, he extensively researched the Sinhalese language and its pre-1815 literature: the Sri Lankan government awarded him the Sri Lanka Ranjana medal for this. He wrote the 377-page ''An anthology of Sinhalese literature up to 1815'', selected by the UNESCO National Commission of Ceylon

Substratum influence in Sinhala

According to Wilhelm Geiger, Sinhala has features that set it apart from other Indo-Aryan languages. Some of the differences can be explained by the substrate influence of the parent stock of the Vedda language. Sinhala has many words that are only found in Sinhala, or shared between Sinhala and Vedda and not etymologically derivable from Middle or Old Indo-Aryan. Common examples are ''kola'' for leaf in Sinhala and Vedda, ''dola'' for pig in Vedda and offering in Sinhala. Other common words are ''rera'' for wild duck, and ''gala'' for stones (in toponyms used throughout the island). There are also high frequency words denoting body parts in Sinhala, such as ''olluva'' for head, ''kakula'' for leg, ''bella'' for neck and ''kalava'' for thighs, that are derived from pre-Sinhalese languages of Sri Lanka. The author of the oldest Sinhala grammar, ''Sidatsangarava'', written in the 13th century CE, recognised a category of words that exclusively belonged to early Sinhala. The grammar lists ''naramba'' (to see) and ''kolamba'' (fort or harbour) as belonging to an indigenous source. ''Kolamba'' is the source of the name of the commercial capital Colombo.

Influences from neighbouring languages

In addition to many Tamil loanwords, several phonetic and grammatical features present in neighbouring Dravidian languages, setting today's spoken Sinhala apart from its Northern Indo-Aryan siblings, bear witness to the close interactions with Dravidian speakers. However, formal Sinhala is more similar to Pali and medieval Sinhala. Some of the features that may be traced to Dravidian influence are – * the distinction between short e, o and long ē, ō * the loss of aspiration * left-branching syntax * the use of the attributive verb of ''kiyana'' "to say" as a subordinating conjunction with the meanings "that" and "if", e.g.: "I know that it is new." "I do not know whether it is new."

European influence

As a result of centuries of colonial rule, interaction, settlement, intermarriage and assimilation, modern Sinhala contains many Portuguese, Dutch and English loanwords.

Influences on other languages

Macanese Patois or Macau Creole (known as ''Patuá'' to its speakers) is a creole language derived mainly from Malay, Sinhala, Cantonese, and Portuguese, which was originally spoken by the Macanese people of the Portuguese colony of Macau. It is now spoken by a few families in Macau and in the Macanese diaspora. The language developed first mainly among the descendants of Portuguese settlers who often married women from Malacca and Sri Lanka rather than from neighbouring China, so the language had strong Malay and Sinhala influence from the beginning.

Accents and dialects

The Sinhala language has different types of variations which are commonly identified as 'dialects and accents'. Among those variations, 'regional variations' are prominent. Some of the well-known regional variations of Sinhala language are: 1. The Uva Province variation (Monaragala, Badulla). 2. The southern variation (Matara, Galle). 3. The up-country variation (Kandy, Matale). 4. The Sabaragamu variation (Kegalle, Balangoda). 1. Uva regional variation in relation to grammar People from Uva province also have a unique linguistic variation in relation to the pronunciation of words. In general, Sinhala singular words are pluralized by adding suffixes like O, hu, wal or waru. But when it comes to Monaragala, the situation is somewhat different as when nouns are pluralized a nasal sound is added.            2. Southern variation with regard to the vocabulary used in ‘Kamath language’ The Kamath language (an indigenous language of paddy culture) used by the Southerners is somewhat different from the ‘Kamath language’ used in other parts (Uva, Kandy) of Sri Lanka as it is marked with a systematic variation; ‘boya’ at the end of the majority of nouns as the examples below show. Crops: ‘Kurakkan boya’ (bran)            ‘Rambakan boya’ (banana) Tools: ‘Thattu boya’ (bucket) Other words: ‘Nivahan boya’ (home) Here the particular word ‘boya’ means ‘a little’ in the Southern region and at the end of most of nouns, 'boya' is added regularly. This particular word 'boya' is added to most words by the Southern villages as a token of respect towards the things (those things can be crops, tools etc.) they are referring to. 3. The contrast among the regional variations used by Kandy, Kegalle and Galle people in relation to pronunciation Even though the Kandy, Kegalle and Galle people pronounce words with slight differences, the Sinhalese can understand the majority of the sentences. Diglossia In Sinhala there is distinctive diglossia, as in many languages of South Asia. The literary language and the spoken language differ from each other in many aspects. The written language is used for all forms of literary texts but also orally at formal occasions (public speeches, TV and radio news broadcasts, etc.), whereas the spoken language is used as the language of communication in everyday life (see also Sinhala slang and colloquialism). As a rule the literary language uses more Sanskrit-based words. Sinhala diglossia can also be described in terms of informal and formal varieties. The variety used for formal purposes is closer to the written/literary variety, whereas the variety used for informal purposes is closer to the spoken variety. It is also used in some modern literature (e.g. Liyanage Amarakeerthi's ''Kurulu Hadawatha''). The most important difference between the two varieties is the lack of inflected verb forms in the spoken language. The children are taught the written language at school almost like a foreign language. Sinhala also has diverse slang. Most slang words and terms were regarded as taboo and most were frowned upon as non-scholarly. However, nowadays Sinhala slang words and terms, even the ones with sexual references, are commonly used among younger Sri Lankans.

Writing system

Sinhala script, ''Sinhala hodiya'', is based on the ancient Brahmi script, as are most Indian scripts. Sinhala script is closely related to South Indian Grantha script and Khmer script taken the elements from the related Kadamba script. The writing system for Sinhala is an abugida, where the consonants are written with letters while the vowels are indicated with diacritics (''pilla'') on those consonants, unlike English where both consonants and vowels are full letters, or Urdu where vowels need not be written at all. Also, when a diacritic is not used, an "inherent vowel", either or , is understood, depending on the position of the consonant within the word. For example, the letter ක ''k'' on its own indicates ''ka'', either or . The various vowels are written කා , කැ , කෑ (after the consonant), කි , කී (above the consonant), කු , කූ (below the consonant), කෙ , කේ (before the consonant), කො , කෝ (surrounding the consonant). There are also a few diacritics for consonants, such as in special circumstances, although the tendency nowadays is to spell words with the full letter ර , plus either a preceding or following ''hal kirima''. One word that is still spelt with an "r" diacritic is ශ්‍රී, as in ශ්‍රී ලංකාව (Sri Lankāwa). The "r" diacritic is the curved line under the first letter ("ශ": "ශ්‍ර"). A second diacritic, this time for the vowel sound completes the word ("ශ්‍ර": "ශ්‍රීී"). For simple without a vowel, a vowel-cancelling diacritic (virama) called හල් කිරීම is used: ක් . Several of these diacritics occur in two forms, which depend on the shape of the consonant letter. Vowels also have independent letters but these are only used at the beginning of words where there is no preceding consonant to add a diacritic to. The complete script consists of about 60 letters, 18 for vowels and 42 for consonants. However, only 57 (16 vowels and 41 consonants) are required for writing colloquial spoken Sinhala (''suddha Sinhala''). The rest indicate sounds that have been merged in the course of linguistic change, such as the aspirates, and are restricted to Sanskrit and Pali loan words. One letter (ඦ), representing the sound /ⁿd͡ʒa/, is attested although no words using this letter are attested. Sinhala is written from left to right and Sinhala script is mainly used for Sinhala, as well as the liturgical languages Pali and Sanskrit. The alphabetic sequence is similar to those of other Brahmic scripts:


Sinhala has so-called prenasalized consonants, or 'half nasal' consonants. A short homorganic nasal occurs before a voiced stop, it is shorter than a sequence of nasal plus stop. The nasal is syllabified with the onset of the following syllable, which means that the moraic weight of the preceding syllable is left unchanged. For example, ''tam̆ba'' 'copper' contrasts with ''tamba'' 'boil'. /f~ɸ/ and /ʃ/ are restricted to loans, typically English or Sanskrit. They are commonly replaced by /p/ and /s/ respectively in colloquial speech. Some speakers use the voiceless labiodental fricative as in English, and some use the voiceless bilabial fricative due to its similarity to the native voiceless bilabial stop /p/. Long /əː/ is restricted to English loans. /a/ and /ə/ are allophones in Sinhala and contrast with each other in stressed and unstressed syllables respectively. In writing, /a/ and /ə/ are both spelt without a vowel sign attached to the consonant letter, so the patterns of stress in the language must be used to determine the correct pronunciation. Most Sinhala syllables are of the form CV. The first syllable of each word is stressed, with the exception of the verb කරනවා /kərənəˈwaː/ ("to do") and all of its infected forms where the first syllable is unstressed. Syllables using long vowels are always stressed. The remainder of the syllables are unstressed if they use a short vowel, unless they are immediately followed by one of: a CCV syllable, final /j(i)/ (-යි), final /wu/ (-වු), or a final consonant without a following vowel. The sound /ha/ is always stressed in nouns, adjectives, and adverbs, and so is not pronounced /hə/ except in the word හතලිහ /ˈhat̪əlihə/ ("forty"), where the initial /ha/ is stressed and the final /hə/ is unstressed.


Nominal morphology

The main features marked on Sinhala nouns are case, number, definiteness and animacy.


Sinhala distinguishes several cases. Next to the cross-linguistically rather common nominative, accusative, genitive, dative and ablative, there are also less common cases like the instrumental. The exact number of these cases depends on the exact definition of cases one wishes to employ. For instance, the endings for the animate instrumental and locative cases, ''atiŋ'' and ''laᵑgə'', are also independent words meaning "with the hand" and "near" respectively, which is why they are not regarded to be actual ''case endings'' by some scholars. Depending on how far an independent word has progressed on a grammaticalisation path, scholars will see it as a case marker or not. The brackets with most of the vowel length symbols indicate the optional shortening of long vowels in certain unstressed syllables.

Number marking

In Sinhala animate nouns, the plural is marked with ''-o(ː)'', a long consonant plus ''-u'', or with ''-la(ː)''. Most inanimates mark the plural through disfixation. Loanwords from English mark the singular with ''ekə'', and do not mark the plural. This can be interpreted as a singulative number. On the left hand side of the table, plurals are longer than singulars. On the right hand side, it is the other way round, with the exception of paːrə "street". Note that animatelexemes are mostly in the classes on the left-hand side, while animatelexemes are most often in the classes on the right hand.

Indefinite article

The indefinite article is ''-ek'' for animates and ''-ak'' for inanimates. The indefinite article exists only in the singular, where its absence marks definiteness. In the plural, (in)definiteness does not receive special marking.

Verbal morphology

Sinhala distinguishes three conjugation classes. Spoken Sinhala does not mark person, number or gender on the verb (literary Sinhala does). In other words, there is no subject–verb agreement.


* Left-branching language (see branching), which means that determining elements are usually put in front of what they determine (see example below). * An exception to this is formed by statements of quantity which usually stand behind what they define. Example: "the four flowers" translates to , literally "flowers four". On the other hand, it can be argued that the numeral is the head in this construction, and the flowers the modifier, so that a more literal English rendering would be "a floral foursome" * SOV (subject–object–verb) word order, common to most left-branching languages. * As is common in left-branching languages, it has no prepositions, only postpositions (see Adposition). Example: "under the book" translates to , literally "book under". * Sinhala has no copula: "I am rich" translates to , literally "I rich". There are two existential verbs, which are used for locative predications, but these verbs are not used for predications of class-membership or property-assignment, unlike English ''is''. * There are almost no conjunctions as English ''that'' or ''whether'', but only non-finite clauses that are formed by the means of participles and verbal adjectives. Example: "The man who writes books" translates to , literally "books writing man".


There is a four-way deictic system (which is rare): There are four demonstrative stems (see demonstrative pronouns) "here, close to the speaker", "there, close to the person addressed", "there, close to a third person, visible" and "there, close to a third person, not visible".

Use of (''thuma)''

Sinhalese has an all-purpose odd suffix (''thuma)'' which when suffixed to a pronoun creates a formal and respectful tone in reference to a person. This is usually used in referring to politicians, nobles, and priests.
e.g. oba thuma (ඔබ තුමා) - you (vocative, when addressing a minister, high-ranking official, or generally showing respect in public etc.) janadhipathi thuma (ජනාධිපති තුමා) - the president (third person)
Discourse Sinhala is a pro-drop language: Arguments of a sentence can be omitted when they can be inferred from context. This is true for subject—as in Italian, for instance—but also objects and other parts of the sentence can be "dropped" in Sinhala if they can be inferred. In that sense, Sinhala can be called a "super pro-drop language", like Japanese. Example: The sentence , literally "where went", can mean "where did I/you/he/she/we... go".

See also

* Sinhala honorifics * Sinhala idioms and proverbs * Sinhala keyboard * Sinhala numerals * Sinhala slang * Sinhalese people



* Gair, James: ''Sinhala and Other South Asian Languages'', New York 1998. * * *

Further reading

* * * * * everal new editions * (''Article on the use of slang amongst Sinhalese Raggers.'')

External links

* Charles Henry Carter
A Sinhalese-English dictionary.
Colombo: The "Ceylon Observer" Printing Works; London: Probsthain & Co., 1924. * Simhala Sabdakosa Karyamsaya
Sanksipta Simhala Sabdakosaya.
Kolamba : Samskrtika Katayutu Pilibanda Departamentuva, 2007–2009.
Madura Online English-Sinhala Dictionary and Language Translator

Kapruka Sinhala dictionary

Sinhala dictionary resources online

Sinhala dictionary (Beta)

Sinhala for iOS

Sinhala Dictionary for Android
{{DEFAULTSORT:Sinhala Language Category:Southern Indo-Aryan languages Category:Fusional languages Category:Languages of Sri Lanka Category:Subject–object–verb languages