Khmer ( ; ,
UNGEGN: ) is an
Austroasiatic language spoken natively by the Khmer people. This language is an
official language
An official language is defined by the Cambridge English Dictionary as, "the language or one of the languages that is accepted by a country's government, is taught in schools, used in the courts of law, etc." Depending on the decree, establishmen ...
and
national language
'' ''
A national language is a language (or language variant, e.g. dialect) that has some connection— de facto or de jure—with a nation. The term is applied quite differently in various contexts. One or more languages spoken as first languag ...
of
Cambodia
Cambodia, officially the Kingdom of Cambodia, is a country in Southeast Asia on the Mainland Southeast Asia, Indochinese Peninsula. It is bordered by Thailand to the northwest, Laos to the north, and Vietnam to the east, and has a coastline ...
. The language is also widely spoken by Khmer people in
Eastern Thailand and
Isan,
Thailand
Thailand, officially the Kingdom of Thailand and historically known as Siam (the official name until 1939), is a country in Southeast Asia on the Mainland Southeast Asia, Indochinese Peninsula. With a population of almost 66 million, it spa ...
, as well as in the
Southeastern and
Mekong Delta
The Mekong Delta ( or simply ), also known as the Western Region () or South-western region (), is the list of regions of Vietnam, region in southwestern Vietnam where the Mekong, Mekong River River delta, approaches and empties into the sea th ...
regions of
Vietnam
Vietnam, officially the Socialist Republic of Vietnam (SRV), is a country at the eastern edge of mainland Southeast Asia, with an area of about and a population of over 100 million, making it the world's List of countries and depende ...
.
Khmer has been influenced considerably by
Sanskrit
Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
and
Pali
Pāli (, IAST: pāl̤i) is a Classical languages of India, classical Middle Indo-Aryan languages, Middle Indo-Aryan language of the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pali Canon, Pāli Can ...
especially in the royal and religious
registers, through
Hinduism and Buddhism,
due to
Old Khmer being the language of the historical empires of
Chenla
Chenla or Zhenla ( zh, t=真臘, s=, 真腊, p=Zhēnlà, w=Chen-la; , ; ) is the Chinese designation for the vassal of the kingdom of Funan preceding the Khmer Empire that existed from around the late 6th to the early 9th century in Indochina. ...
and
Angkor
Angkor ( , 'capital city'), also known as Yasodharapura (; ),Headly, Robert K.; Chhor, Kylin; Lim, Lam Kheng; Kheang, Lim Hak; Chun, Chen. 1977. ''Cambodian-English Dictionary''. Bureau of Special Research in Modern Languages. The Catholic Uni ...
.
The vast majority of Khmer speakers speak ''Central Khmer'', the dialect of the central plain where the Khmer are most heavily concentrated. Within Cambodia, regional accents exist in remote areas but these are regarded as varieties of Central Khmer. Two exceptions are the speech of the capital,
Phnom Penh
Phnom Penh is the Capital city, capital and List of cities and towns in Cambodia, most populous city of Cambodia. It has been the national capital since 1865 and has grown to become the nation's primate city and its political, economic, industr ...
, and that of the
Khmer Khe in
Stung Treng province, both of which differ sufficiently enough from Central Khmer to be considered separate dialects of Khmer.
Outside of Cambodia, three distinct dialects are spoken by ethnic Khmers native to areas that were historically part of the
Khmer Empire
The Khmer Empire was an empire in Southeast Asia, centered on Hydraulic empire, hydraulic cities in what is now northern Cambodia. Known as Kambuja (; ) by its inhabitants, it grew out of the former civilization of Chenla and lasted from 802 t ...
. The
Northern Khmer dialect is spoken by over a million Khmers in the southern regions of
Northeast Thailand and is treated by some linguists as a separate language. Khmer Krom, or Southern Khmer, is the first language of the
Khmer of Vietnam, while the Khmer living in the remote
Cardamom Mountains speak a very conservative dialect that still displays features of the
Middle Khmer language.
Khmer is primarily an
analytic,
isolating language
Social isolation, Isolation is the near or complete lack of social contact by an individual.
Isolation or isolated may also refer to:
Sociology and psychology
*Social isolation
*Isolation (psychology), a defense mechanism in psychoanalytic theo ...
. There are no
inflection
In linguistic Morphology (linguistics), morphology, inflection (less commonly, inflexion) is a process of word formation in which a word is modified to express different grammatical category, grammatical categories such as grammatical tense, ...
s,
conjugations or
case endings. Instead,
particles and auxiliary words are used to indicate grammatical relationships. General word order is
subject–verb–object, and
modifiers follow the word they modify.
Classifiers appear after numbers when used to count nouns, though not always so consistently as in languages like
Chinese. In spoken Khmer,
topic-comment structure is common, and the perceived social relation between participants determines which sets of vocabulary, such as pronouns and honorifics, are proper.
Khmer differs from neighboring languages such as
Burmese,
Thai,
Lao, and
Vietnamese in that it is not a
tonal language. Words are
stressed on the final syllable, hence many words conform to the typical Mon–Khmer pattern of a stressed syllable preceded by a
minor syllable. The language has been written in the
Khmer script, an
abugida
An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
descended from the
Brahmi script
Brahmi ( ; ; ISO 15919, ISO: ''Brāhmī'') is a writing system from ancient India. "Until the late nineteenth century, the script of the Aśokan (non-Kharosthi) inscriptions and its immediate derivatives was referred to by various names such as ...
via the southern Indian
Pallava script, since at least the 7th century. The script's form and use has evolved over the centuries; its modern features include subscripted versions of consonants used to write
clusters and a division of consonants into two series with different
inherent vowel
An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol.
There are many known abugida scripts, including most of the Brahmic scripts and Kharosthi, the c ...
s.
Classification
Khmer is a member of the
Austroasiatic language family, the autochthonous family in an area that stretches from the Malay Peninsula through Southeast Asia to East India.
[Diffloth, Gerard & Zide, Norman]
''Austroasiatic Languages''
. Austroasiatic, which also includes
Mon,
Vietnamese and
Munda, has been studied since 1856 and was first proposed as a language family in 1907. Despite the amount of research, there is still doubt about the internal relationship of the languages of Austroasiatic.
[Sidwell, Paul (2009a)]
The Austroasiatic Central Riverine Hypothesis
Keynote address, SEALS, XIX.
Diffloth places Khmer in an eastern branch of the
Mon-Khmer languages.
[Diffloth, Gérard (2005). "The contribution of linguistic palaeontology and Austroasiatic". in Laurent Sagart, Roger Blench and Alicia Sanchez-Mazas, eds. ''The Peopling of East Asia: Putting Together Archaeology, Linguistics and Genetics.'' 77–80. London: Routledge Curzon.] In these classification schemes Khmer's closest genetic relatives are the
Bahnaric and
Pearic languages.
[Shorto, Harry L. edited by Sidwell, Paul, Cooper, Doug and Bauer, Christian (2006). ''A Mon–Khmer comparative dictionary''. Canberra: Australian National University. Pacific Linguistics. ] More recent classifications doubt the validity of the Mon-Khmer sub-grouping and place the Khmer language as its own branch of Austroasiatic equidistant from the other 12 branches of the family.
Geographic distribution and dialects

Khmer is spoken by some 13 million people in
Cambodia
Cambodia, officially the Kingdom of Cambodia, is a country in Southeast Asia on the Mainland Southeast Asia, Indochinese Peninsula. It is bordered by Thailand to the northwest, Laos to the north, and Vietnam to the east, and has a coastline ...
, where it is the official language. It is also a second language for most of the minority groups and indigenous hill tribes there. Additionally there are a million speakers of Khmer native to southern
Vietnam
Vietnam, officially the Socialist Republic of Vietnam (SRV), is a country at the eastern edge of mainland Southeast Asia, with an area of about and a population of over 100 million, making it the world's List of countries and depende ...
(1999 census) and 1.4 million in northeast
Thailand
Thailand, officially the Kingdom of Thailand and historically known as Siam (the official name until 1939), is a country in Southeast Asia on the Mainland Southeast Asia, Indochinese Peninsula. With a population of almost 66 million, it spa ...
(2006).
Khmer
dialect
A dialect is a Variety (linguistics), variety of language spoken by a particular group of people. This may include dominant and standard language, standardized varieties as well as Vernacular language, vernacular, unwritten, or non-standardize ...
s, although mutually intelligible, are sometimes quite marked. Notable variations are found in speakers from
Phnom Penh
Phnom Penh is the Capital city, capital and List of cities and towns in Cambodia, most populous city of Cambodia. It has been the national capital since 1865 and has grown to become the nation's primate city and its political, economic, industr ...
(Cambodia's capital city), the rural
Battambang area, the areas of Northeast Thailand adjacent to Cambodia such as
Surin province
Surin (, ; , ; ) is one of Thailand's seventy-seven Provinces of Thailand, provinces (''changwat''). It lies in Isan#Administrative divisions, lower northeastern Thailand, also called Isan. Neighboring provinces are (from west clockwise) Burira ...
, the
Cardamom Mountains, and southern Vietnam.
The dialects form a
continuum running roughly north to south. Standard Cambodian Khmer is mutually intelligible with the others but a
Khmer Krom speaker from Vietnam, for instance, may have great difficulty communicating with a Khmer native of
Sisaket Province in Thailand.
The following is a classification scheme showing the development of the modern Khmer dialects.
[Sidwell, Paul (2009). ''Classifying the Austroasiatic languages: history and state of the art''. LINCOM studies in Asian linguistics, 76. Munich: Lincom Europa.]
*
Middle Khmer
** Cardamom (Western) Khmer
** Central Khmer
*** Surin (Northern) Khmer
*** Standard Khmer and related dialects (including Khmer Krom)
Standard Khmer, or Central Khmer, the language as taught in Cambodian schools and used by the media, is based on the dialect spoken throughout the
Central Plain,
a region encompassed by the northwest and central provinces.
Northern Khmer (called in Khmer) refers to the dialects spoken by many in several border provinces of present-day northeast Thailand. After the fall of the Khmer Empire in the early 15th century, the Dongrek Mountains served as a natural border leaving the Khmer north of the mountains under the sphere of influence of the Kingdom of
Lan Xang. The conquests of Cambodia by
Naresuan the Great for
Ayutthaya furthered their political and economic isolation from Cambodia proper, leading to a dialect that developed relatively independently from the midpoint of the Middle Khmer period.
This has resulted in a distinct accent influenced by the surrounding tonal languages
Lao and
Thai, lexical differences, and phonemic differences in both vowels and distribution of consonants. Syllable-final , which has become silent in other dialects of Khmer, is still pronounced in Northern Khmer. Some linguists classify Northern Khmer as a separate but closely related language rather than a dialect.
[Phonetic variation of final trill and final palatals in Khmer dialects of Thailand]
Suwilai, Premsrirat; Mahidol University; '' Mon-Khmer Studies'' 24:1–26; pg 1
Western Khmer, also called Cardamom Khmer or Chanthaburi Khmer, is spoken by a very small, isolated population in the
Cardamom mountain range extending from western Cambodia into eastern
Central Thailand
Central Thailand (Central Plain) (historically also known as Siam or Dvaravati) is one of the regions of Thailand, covering the broad alluvial plain of the Chao Phraya River. It is separated from northeast Thailand (Isan) by the Phetchabun Mount ...
. Although little studied, this variety is unique in that it maintains a definite system of
vocal register that has all but disappeared in other dialects of modern Khmer.
Phnom Penh Khmer is spoken in the capital and surrounding areas. This dialect is characterized by merging or complete
elision
In linguistics, an elision or deletion is the omission of one or more sounds (such as a vowel, a consonant, or a whole syllable) in a word or phrase. However, these terms are also used to refer more narrowly to cases where two words are run to ...
of syllables, which speakers from other regions consider a "relaxed" pronunciation. For instance, "Phnom Penh" is sometimes shortened to "m'Penh". Another characteristic of Phnom Penh speech is observed in words with an "r" either as an initial consonant or as the second member of a
consonant cluster (as in the English word "bread"). The "r",
trilled or
flapped in other dialects, is either pronounced as a
uvular trill or not pronounced at all.
This alters the quality of any preceding consonant, causing a harder, more emphasized pronunciation. Another unique result is that the syllable is spoken with a low-rising or "dipping"
tone much like the "hỏi" tone in
Vietnamese. For example, some people pronounce ('fish') as : the is dropped and the vowel begins by dipping much lower in tone than standard speech and then rises, effectively doubling its length. Another example is the word ('study'), which is pronounced , with the uvular "r" and the same intonation described above.
Khmer Krom or Southern Khmer is spoken by the indigenous Khmer population of the
Mekong Delta
The Mekong Delta ( or simply ), also known as the Western Region () or South-western region (), is the list of regions of Vietnam, region in southwestern Vietnam where the Mekong, Mekong River River delta, approaches and empties into the sea th ...
, formerly controlled by the Khmer Empire but part of Vietnam since 1698. Khmers are persecuted by the Vietnamese government for using their native language and, since the 1950s, have been forced to take Vietnamese names.
[Unrepresented Peoples and Nations Organizatio]
Khmer Krom Profile
Retrieved 19 June 2012 Consequently, very little research has been published regarding this dialect. It has been generally influenced by Vietnamese for three centuries and accordingly displays a pronounced accent, tendency toward monosyllabic words and lexical differences from Standard Khmer.
Khmer Khe is spoken in the
Se San,
Srepok and
Sekong river valleys of
Sesan and
Siem Pang districts in
Stung Treng Province. Following the decline of Angkor, the Khmer abandoned their northern territories, which the Lao then settled. In the 17th century, Chey Chetha XI led a Khmer force into Stung Treng to retake the area. The Khmer Khe living in this area of Stung Treng in modern times are presumed to be the descendants of this group. Their dialect is thought to resemble that of pre-modern Siem Reap.
Historical periods
Linguistic
Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds ...
study of the Khmer language divides its history into four periods, one of which, the Old Khmer period, is subdivided into pre-Angkorian and Angkorian.
[Sak-Humphry, Channy]
''The Syntax of Nouns and Noun Phrases in Dated Pre-Angkorian Inscriptions''
'' Mon-Khmer Studies'' 22: 1–26. Pre-Angkorian Khmer is the Old Khmer language from 600 through 800 CE. Angkorian Khmer is the language as it was spoken in the
Khmer Empire
The Khmer Empire was an empire in Southeast Asia, centered on Hydraulic empire, hydraulic cities in what is now northern Cambodia. Known as Kambuja (; ) by its inhabitants, it grew out of the former civilization of Chenla and lasted from 802 t ...
from the 9th century until the 13th century.
The following centuries saw changes in
morphology,
phonology
Phonology (formerly also phonemics or phonematics: "phonemics ''n.'' 'obsolescent''1. Any procedure for identifying the phonemes of a language from a corpus of data. 2. (formerly also phonematics) A former synonym for phonology, often pre ...
and
lexicon
A lexicon (plural: lexicons, rarely lexica) is the vocabulary of a language or branch of knowledge (such as nautical or medical). In linguistics, a lexicon is a language's inventory of lexemes. The word ''lexicon'' derives from Greek word () ...
. The language of this transition period, from about the 14th to 18th centuries, is referred to as Middle Khmer and saw borrowings from Thai in the literary register.
Modern Khmer is dated from the 19th century to today.
The following table shows the conventionally accepted historical stages of Khmer.
Just as modern Khmer was emerging from the transitional period represented by Middle Khmer, Cambodia fell under the
influence of
French colonialism
Colonialism is the control of another territory, natural resources and people by a foreign group. Colonizers control the political and tribal power of the colonised territory. While frequently an Imperialism, imperialist project, colonialism c ...
.
Thailand, which had for centuries claimed suzerainty over Cambodia and controlled succession to the Cambodian throne, began losing its influence on the language.
In 1887 Cambodia was fully integrated into
French Indochina
French Indochina (previously spelled as French Indo-China), officially known as the Indochinese Union and after 1941 as the Indochinese Federation, was a group of French dependent territories in Southeast Asia from 1887 to 1954. It was initial ...
, which brought in a
French-speaking aristocracy. This led to French becoming the language of higher education and the intellectual class. By 1907, the French had wrested over half of modern-day Cambodia, including the north and northwest where Thai had been the prestige language, back from Thai control and reintegrated it into the country.
Many native scholars in the early 20th century, led by a monk named
Chuon Nath, resisted the French and Thai influences on their language. Forming the government sponsored Cultural Committee to define and standardize the modern language, they championed Khmerization, purging of foreign elements, reviving affixation, and the use of Old Khmer roots and historical Pali and Sanskrit to coin new words for modern ideas.
Opponents, led by
Keng Vannsak, who embraced "total Khmerization" by denouncing the reversion to classical languages and favoring the use of contemporary colloquial Khmer for neologisms, and
Ieu Koeus, who favored borrowing from Thai, were also influential.
Koeus later joined the Cultural Committee and supported Nath. Nath's views and prolific work won out and he is credited with cultivating modern Khmer-language identity and culture, overseeing the translation of the entire Pali Buddhist canon into Khmer. He also created the modern Khmer language dictionary that is still in use today, helping preserve Khmer during the French colonial period.
Phonology
The phonological system described here is the inventory of sounds of the standard spoken language,
[Huffman, Franklin. 1970. ]
Cambodian System of Writing and Beginning Reader
''. Yale University Press. represented using the
International Phonetic Alphabet
The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standard written representation ...
(IPA).
Consonants
The voiceless plosives may occur with or without
aspiration (as vs. , etc.); this difference is contrastive before a vowel. However, the aspirated sounds in that position may be analyzed as sequences of two
phoneme
A phoneme () is any set of similar Phone (phonetics), speech sounds that are perceptually regarded by the speakers of a language as a single basic sound—a smallest possible Phonetics, phonetic unit—that helps distinguish one word fr ...
s: . This analysis is supported by the fact that
infix
An infix is an affix inserted inside a word stem (an existing word or the core of a family of words). It contrasts with '' adfix,'' a rare term for an affix attached to the outside of a stem, such as a prefix or suffix.
When marking text for ...
es can be inserted between the stop and the aspiration; for example ('big') becomes ('size') with a nominalizing infix. When one of these plosives occurs initially before another consonant, aspiration is no longer contrastive and can be regarded as mere phonetic detail:
slight aspiration is expected when the following consonant is not one of (or if the initial plosive is ).
The voiced plosives are pronounced as
implosives by most speakers, but this feature is weak in educated speech, where they become .
In syllable-final position, and approach and respectively. The stops are unaspirated and have
no audible release when occurring as syllable finals.
In addition, the consonants , , and occur occasionally in recent
loan word
A loanword (also a loan word, loan-word) is a word at least partly assimilated from one language (the donor language) into another language (the recipient or target language), through the process of borrowing (linguistics), borrowing. Borrowing ...
s in the speech of Cambodians familiar with French and other languages.
Vowels
Various authors have proposed slightly different analyses of the Khmer vowel system. This may be in part because of the wide degree of variation in pronunciation between individual speakers, even within a dialectal region. The description below follows Huffman (1970).
The number of vowel nuclei and their values vary between dialects; differences exist even between the Standard Khmer system and that of the Battambang dialect on which the standard is based.
[Wayland, Ratree]
"An Acoustic Study of Battambang Khmer Vowels."
'' Mon-Khmer Studies'' 28. (1998): 43–62.

In addition, some diphthongs and triphthongs are analyzed as a vowel nucleus plus a
semivowel
In phonetics and phonology, a semivowel, glide or semiconsonant is a sound that is phonetically similar to a vowel sound but functions as the syllable boundary, rather than as the nucleus of a syllable. Examples of semivowels in English are ''y ...
( or ) coda because they cannot be followed by a final consonant. These include: (with short monophthongs) , , , , ; (with long monophthongs) , ; (with long diphthongs) , , , , and .
The independent vowels are the vowels that can exist without a preceding or trailing consonant. The independent vowels may be used as monosyllabic words, or as the initial syllables in longer words. Khmer words never begin with regular vowels; they can, however, begin with independent vowels. Example: ឰដ៏, ឧទាហរណ៍, ឧត្តម, ឱកាស...។
Syllable structure
A Khmer
syllable
A syllable is a basic unit of organization within a sequence of speech sounds, such as within a word, typically defined by linguists as a ''nucleus'' (most often a vowel) with optional sounds before or after that nucleus (''margins'', which are ...
begins with a single consonant, or else with a
cluster of two, or rarely three, consonants. The only possible clusters of three consonants at the start of a syllable are ,
and (with aspirated consonants analyzed as two-consonant sequences) . There are 85 possible two-consonant clusters (including
ʰetc. analyzed as etc.). All the clusters are shown in the following table, phonetically, i.e. superscript can mark either contrastive or non-contrastive aspiration (see
above).
Slight vowel
epenthesis
In phonology, epenthesis (; Greek ) means the addition of one or more sounds to a word, especially in the first syllable ('' prothesis''), the last syllable ('' paragoge''), or between two syllabic sounds in a word. The opposite process in whi ...
occurs in the clusters consisting of a plosive followed by , in those beginning , and in the cluster .
After the initial consonant or consonant cluster comes the syllabic
nucleus, which is one of the
vowels listed above. This vowel may end the syllable or may be followed by a
coda, which is a single consonant. If the syllable is stressed and the vowel is short, there must be a final consonant. All consonant sounds except and the aspirates can appear as the coda (although final is heard in some dialects, most notably in
Northern Khmer).
A
minor syllable (unstressed syllable preceding the main syllable of a word) has a structure of CV-, CrV-, CVN- or CrVN- (where C is a consonant, V a vowel, and N a nasal consonant). The vowels in such syllables are usually short; in conversation they may be
reduced to , although in careful or formal speech, including on television and radio, they are clearly articulated. An example of such a word is ''mɔnuh, mɔnɨh, mĕəʾnuh'' ('person'), pronounced , or more casually .
Stress
Stress in Khmer falls on the final syllable of a word.
Because of this predictable pattern, stress is non-
phonemic in Khmer (it does not distinguish different meanings).
Primary stress falls on the final syllable, with
secondary stress on every second syllable from the end. Thus in a three-syllable word, the first syllable has secondary stress; in a four-syllable word, the second syllable has secondary stress; in a five-syllable word, the first and third syllables have secondary stress, and so on.
Long polysyllables are not often used in conversation.
Most Khmer words consist of either one or two syllables. In most native disyllabic words, the first syllable is a
minor (fully unstressed) syllable. Such words have been described as ''sesquisyllabic'' (i.e. as having one-and-a-half syllables). There are also some disyllabic words in which the first syllable does not behave as a minor syllable, but takes
secondary stress. Most such words are
compounds, but some are single
morpheme
A morpheme is any of the smallest meaningful constituents within a linguistic expression and particularly within a word. Many words are themselves standalone morphemes, while other words contain multiple morphemes; in linguistic terminology, this ...
s (generally loanwords). An example is ('language'), pronounced .
Words with three or more syllables, if they are not compounds, are mostly loanwords, usually derived from Pali, Sanskrit, or more recently, French. They are nonetheless adapted to Khmer stress patterns.
[Headley, Robert K.; Chhor, Kylin; Lim, Lam Kheng; Kheang, Lim Hak; Chun, Chen. 1977. ''Cambodian-English Dictionary''. Bureau of Special Research in Modern Languages. The Catholic University of America Press. Washington, D.C. ]
Compounds, however, preserve the stress patterns of the constituent words. Thus , the name of a kind of cookie (literally 'bird's nest'), is pronounced , with secondary stress on the second rather than the first syllable, because it is composed of the words ('nest') and ('bird').
Phonation and tone
Khmer once had a
phonation
The term phonation has slightly different meanings depending on the subfield of phonetics. Among some phoneticians, ''phonation'' is the process by which the vocal folds produce certain sounds through quasi-periodic vibration. This is the defi ...
distinction in its vowels, but this now survives only in the most archaic dialect (
Western Khmer).
The distinction arose historically when vowels after Old Khmer voiced consonants became
breathy voice
Breathy voice (also called murmured voice, whispery voice, soughing and susurration) is a phonation in which the vocal folds vibrate, as they do in normal (modal) voicing, but are adjusted to let more air escape which produces a sighing-like s ...
d and diphthongized; for example became . When consonant voicing was lost, the distinction was maintained by the vowel (); later the phonation disappeared as well ().
These processes explain the origin of what are now called a-series and o-series consonants in the
Khmer script.
Although most Cambodian dialects are not
tonal, the colloquial Phnom Penh dialect has developed a tonal contrast (level versus peaking tone) as a by-product of the
elision
In linguistics, an elision or deletion is the omission of one or more sounds (such as a vowel, a consonant, or a whole syllable) in a word or phrase. However, these terms are also used to refer more narrowly to cases where two words are run to ...
of .
Intonation
Intonation often conveys
semantic
Semantics is the study of linguistic Meaning (philosophy), meaning. It examines what meaning is, how words get their meaning, and how the meaning of a complex expression depends on its parts. Part of this process involves the distinction betwee ...
context in Khmer, as in distinguishing
declarative statements, questions and exclamations. The available grammatical means of making such distinctions are not always used, or may be ambiguous; for example, the final
interrogative particle can also serve as an emphasizing (or in some cases negating) particle.
The intonation pattern of a typical Khmer declarative phrase is a steady rise throughout followed by an abrupt drop on the last syllable.
: ('I don't want it')
Other intonation contours signify a different type of phrase such as the "full doubt" interrogative, similar to
yes–no questions in English. Full doubt interrogatives remain fairly even in tone throughout, but rise sharply towards the end.
: ('do you want to go to Siem Reap?')
Exclamatory phrases follow the typical steadily rising pattern, but rise sharply on the last syllable instead of falling.
: ('this book is expensive!')
Grammar
Khmer is primarily an
analytic language
An analytic language is a type of natural language in which a series of root/stem words is accompanied by prepositions, postpositions, particles and modifiers, using affixes very rarely. This is opposed to synthetic languages, which synthesi ...
with no
inflection
In linguistic Morphology (linguistics), morphology, inflection (less commonly, inflexion) is a process of word formation in which a word is modified to express different grammatical category, grammatical categories such as grammatical tense, ...
. Syntactic relations are mainly determined by word order. Old and
Middle Khmer used particles to mark
grammatical categories and many of these have survived in Modern Khmer but are used sparingly, mostly in literary or formal language.
Khmer makes extensive use of
auxiliary verbs, "directionals" and
serial verb construction. Colloquial Khmer is a
zero copula
Zero copula, also known as null copula, is a linguistic phenomenon whereby the subject is joined to the predicate without overt marking of this relationship (like the copula (linguistics), copula ''to be'' in English). One can distinguish languag ...
language, instead preferring predicative adjectives (and even predicative nouns) unless using a copula for emphasis or to avoid ambiguity in more complex sentences. Basic word order is
subject–verb–object (SVO), although subjects are often
dropped;
prepositions are used rather than postpositions.
Topic-Comment constructions are common and the language is generally
head-initial
In linguistics, head directionality is a proposed Principles and parameters, parameter that classifies languages according to whether they are head-initial (the head (linguistics), head of a phrase precedes its Complement (linguistics), complement ...
(modifiers follow the words they modify). Some grammatical processes are still not fully understood by western scholars. For example, it is not clear if certain features of Khmer grammar, such as
actor nominalization, should be treated as a morphological process or a purely syntactic device,
and some derivational morphology seems "purely decorative" and performs no known syntactic work.
Lexical categories have been hard to define in Khmer.
Henri Maspero, an early scholar of Khmer, claimed the language had no parts of speech,
while a later scholar, Judith Jacob, posited four parts of speech and innumerable particles.
John Haiman, on the other hand, identifies "a couple dozen" parts of speech in Khmer with the caveat that Khmer words have the freedom to perform a variety of syntactic functions depending on such factors as word order, relevant particles, location within a clause, intonation and context.
Some of the more important lexical categories and their function are demonstrated in the following example sentence taken from a hospital brochure:
Morphology
Modern Khmer is an
isolating language
Social isolation, Isolation is the near or complete lack of social contact by an individual.
Isolation or isolated may also refer to:
Sociology and psychology
*Social isolation
*Isolation (psychology), a defense mechanism in psychoanalytic theo ...
, which means that it uses little
productive morphology. There is some
derivation by means of
prefixes and
infixes, but this is a remnant of Old Khmer and not always productive in the modern language.
Khmer morphology is evidence of a historical process through which the language was, at some point in the past, changed from being an
agglutinative language to adopting an isolating typology. Affixed forms are
lexicalized and cannot be used productively to form new words.
Below are some of the most common affixes with examples as given by Huffman.
Compounding in Khmer is a common derivational process that takes two forms, coordinate compounds and repetitive compounds. Coordinate compounds join two
unbound morphemes (independent words) of similar meaning to form a compound signifying a concept more general than either word alone.
Coordinate compounds join either two nouns or two verbs. Repetitive compounds, one of the most productive derivational features of Khmer, use
reduplication
In linguistics, reduplication is a Morphology (linguistics), morphological process in which the Root (linguistics), root or Stem (linguistics), stem of a word, part of that, or the whole word is repeated exactly or with a slight change.
The cla ...
of an entire word to derive words whose meaning depends on the class of the reduplicated word.
A repetitive compound of a noun indicates plurality or generality while that of an adjectival verb could mean either an intensification or plurality.
Coordinate compounds:
Repetitive compounds:
Nouns and pronouns
Khmer
noun
In grammar, a noun is a word that represents a concrete or abstract thing, like living creatures, places, actions, qualities, states of existence, and ideas. A noun may serve as an Object (grammar), object or Subject (grammar), subject within a p ...
s do not inflect for
grammatical gender
In linguistics, a grammatical gender system is a specific form of a noun class system, where nouns are assigned to gender categories that are often not related to the real-world qualities of the entities denoted by those nouns. In languages wit ...
or
singular/plural. There are no
articles, but indefiniteness is often expressed by the word for "one" ( ) following the noun as in ( "a dog"). Plurality can be marked by
postnominal particles, numerals, or reduplication of a following adjective, which, although similar to intensification, is usually not ambiguous due to context.
Classifying particles are used after numerals, but are not always obligatory as they are in
Thai or
Chinese, for example, and are often dropped in colloquial speech. Khmer nouns are divided into two groups: mass nouns, which take classifiers; and specific, nouns, which do not. The overwhelming majority are mass nouns.
Possession is colloquially expressed by word order. The possessor is placed after the thing that is possessed.
Alternatively, in more complex sentences or when emphasis is required, a possessive construction using the word (, "property, object") may be employed. In formal and literary contexts, the possessive particle () is used:
Pronoun
In linguistics and grammar, a pronoun (Interlinear gloss, glossed ) is a word or a group of words that one may substitute for a noun or noun phrase.
Pronouns have traditionally been regarded as one of the part of speech, parts of speech, but so ...
s are subject to a complicated system of social register, the choice of pronoun depending on the perceived relationships between speaker, audience and referent (see
Social registers below). Khmer exhibits
pronoun avoidance, so kinship terms, nicknames and proper names are often used instead of pronouns (including for the first person) among intimates. Subject pronouns are frequently
dropped in colloquial conversation.
Adjectives, verbs and verb phrases may be made into nouns by the use of
nominalization
In linguistics, nominalization or nominalisation, also known as nouning, is the use of a word that is not a noun (e.g., a verb, an adjective or an adverb) as a noun, or as the head (linguistics), head of a noun phrase. This change in functional c ...
particles. Three of the more common particles used to create nouns are , , and .
These particles are prefixed most often to verbs to form abstract nouns. The latter, derived from Sanskrit, also occurs as a suffix in fixed forms borrowed from Sanskrit and Pali such as ("health") from ("to be healthy").
Adjectives and adverbs
Adjective
An adjective (abbreviations, abbreviated ) is a word that describes or defines a noun or noun phrase. Its semantic role is to change information given by the noun.
Traditionally, adjectives are considered one of the main part of speech, parts of ...
s,
demonstrative
Demonstratives (list of glossing abbreviations, abbreviated ) are words, such as ''this'' and ''that'', used to indicate which entities are being referred to and to distinguish those entities from others. They are typically deictic, their meaning ...
s and
numerals follow the noun they modify. Adverbs likewise follow the verb. Morphologically, adjectives and adverbs are not distinguished, with many words often serving either function. Adjectives are also employed as verbs as Khmer sentences rarely use a
copula.
Degrees of comparison are constructed syntactically.
Comparative
The degrees of comparison of adjectives and adverbs are the various forms taken by adjectives and adverbs when used to compare two entities (comparative degree), three or more entities (superlative degree), or when not comparing entities (positi ...
s are expressed using the word : "A X
(A is more X
han B. The most common way to express
superlatives is with : "A X " (A is the most X).
[Huffman, F. E., Promchan, C., & Lambert, C.-R. T. (1970). ''Modern spoken Cambodian''. New Haven: Yale University Press. ] Intensity is also expressed syntactically, similar to other languages of the region, by
reduplication
In linguistics, reduplication is a Morphology (linguistics), morphological process in which the Root (linguistics), root or Stem (linguistics), stem of a word, part of that, or the whole word is repeated exactly or with a slight change.
The cla ...
or with the use of
intensifiers.
Verbs
As is typical of most East Asian languages, Khmer verbs do not inflect at all;
tense,
aspect and
mood can be expressed using auxiliary verbs, particles (such as , placed before a verb to express
continuous aspect
The continuous and progressive aspects (list of glossing abbreviations, abbreviated and ) are grammatical aspects that express incomplete action ("to do") or state ("to be") in progress at a specific time: they are non-habitual, imperfective asp ...
) and adverbs (such as "yesterday", "earlier", "tomorrow"), or may be understood from context.
Serial verb construction is quite common.
Khmer verbs are a relatively
open class and can be divided into two types, main verbs and auxiliary verbs.
Huffman defined a Khmer verb as "any word that can be (negated)",
and further divided main verbs into three classes.
Transitive verb
A transitive verb is a verb that entails one or more transitive objects, for example, 'enjoys' in ''Amadeus enjoys music''. This contrasts with intransitive verbs, which do not entail transitive objects, for example, 'arose' in ''Beatrice arose ...
s are verbs that may be followed by a
direct object:
Intransitive verbs are verbs that can not be followed by an object:
Adjectival verbs are a word class that has no equivalent in English. When modifying a noun or verb, they function as adjectives or adverbs, respectively, but they may also be used as main verbs equivalent to English "be + ''adjective''".
;
;
;
Syntax
Syntax
In linguistics, syntax ( ) is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure (constituenc ...
is the rules and processes that describe how sentences are formed in a particular language, how words relate to each other within clauses or phrases and how those phrases relate to each other within a sentence to convey meaning. Khmer syntax is very
analytic. Relationships between words and phrases are signified primarily by word order supplemented with auxiliary verbs and, particularly in formal and literary registers, grammatical marking particles.
Grammatical phenomena such as
negation
In logic, negation, also called the logical not or logical complement, is an operation (mathematics), operation that takes a Proposition (mathematics), proposition P to another proposition "not P", written \neg P, \mathord P, P^\prime or \over ...
and
aspect are marked by particles while
interrogative
An interrogative clause is a clause whose form is typically associated with question-like meanings. For instance, the English sentence (linguistics), sentence "Is Hannah sick?" has interrogative syntax which distinguishes it from its Declarative ...
sentences are marked either by particles or
interrogative word
An interrogative word or question word is a function word used to ask a question, such as ''what, which'', ''when'', ''where'', '' who, whom, whose'', ''why'', ''whether'' and ''how''. They are sometimes called wh-words, because in English most ...
s equivalent to English "wh-words".
A complete Khmer sentence consists of four basic elements—an optional topic, an optional subject, an obligatory predicate, and various adverbials and particles.
The topic and subject are
noun phrase
A noun phrase – or NP or nominal (phrase) – is a phrase that usually has a noun or pronoun as its head, and has the same grammatical functions as a noun. Noun phrases are very common cross-linguistically, and they may be the most frequently ...
s, predicates are
verb phrase
In linguistics, a verb phrase (VP) is a syntax, syntactic unit composed of a verb and its argument (linguistics), arguments except the subject (grammar), subject of an independent clause or coordinate clause. Thus, in the sentence ''A fat man quic ...
s and another noun phrase acting as an
object or verbal attribute often follows the predicate.
Basic constituent order
When combining these noun and verb phrases into a sentence the order is typically SVO:
When both a
direct object and
indirect object
In linguistics, an object is any of several types of arguments. In subject-prominent, nominative-accusative languages such as English, a transitive verb typically distinguishes between its subject and any of its objects, which can include but ...
are present without any grammatical markers, the preferred order is SV(DO)(IO). In such a case, if the direct object phrase contains multiple components, the indirect object immediately follows the noun of the direct object phrase and the direct object's modifiers follow the indirect object:
This ordering of objects can be changed and the meaning clarified with the inclusion of particles. The word , which normally means "to arrive" or "towards", can be used as a preposition meaning "to":
Alternatively, the indirect object could precede the direct object if the object-marking preposition were used:
However, in spoken discourse OSV is possible when emphasizing the object in a
topic–comment-like structure.
Noun phrase
The noun phrase in Khmer typically has the following structure:
: = () () () () ()
The elements in parentheses are optional.
Honorific
An honorific is a title that conveys esteem, courtesy, or respect for position or rank when used in addressing or referring to a person. Sometimes, the term "honorific" is used in a more specific sense to refer to an Honorary title (academic), h ...
s are a class of words that serve to index the social status of the referent. Honorifics can be kinship terms or personal names, both of which are often used as first and second person pronouns, or specialized words such as ('god') before royal and religious objects.
The most common demonstratives are ('this, these') and ('that, those'). The word ('those over there') has a more distal or vague connotation.
If the noun phrase contains a possessive adjective, it follows the noun and precedes the numeral. If a descriptive attribute co-occurs with a possessive, the possessive construction () is expected.
Some examples of typical Khmer noun phrases are:
The Khmer particle marked attributes in Old Khmer noun phrases and is used in formal and literary language to signify that what precedes is the noun and what follows is the attribute. Modern usage may carry the connotation of mild intensity.
Verb phrase
Khmer verbs are completely uninflected, and once a subject or topic has been introduced or is clear from context the noun phrase may be dropped. Thus, the simplest possible sentence in Khmer consists of a single verb. For example, 'to go' on its own can mean "I'm going.", "He went.", "They've gone.", "Let's go.", etc.
This also results in long strings of verbs such as:
Khmer uses three verbs for what translates into English as the copula. The general copula is ; it is used to convey identity with nominal predicates.
For locative predicates, the copula is .
The verb is the "existential" copula meaning "there is" or "there exists".
Negation
In logic, negation, also called the logical not or logical complement, is an operation (mathematics), operation that takes a Proposition (mathematics), proposition P to another proposition "not P", written \neg P, \mathord P, P^\prime or \over ...
is achieved by putting before the verb and the particle at the end of the sentence or clause. In colloquial speech, verbs can also be negated without the need for a final particle, by placing before them.
Past tense can be conveyed by adverbs, such as "yesterday" or by the use of perfective particles such as
Different senses of future action can also be expressed by the use of adverbs like "tomorrow" or by the future tense marker , which is placed immediately before the verb, or both:
Imperatives are often unmarked.
For example, in addition to the meanings given above, the "sentence" can also mean "Go!". Various words and particles may be added to the verb to soften the command to varying degrees, including to the point of politeness (
jussives):
Prohibitives take the form " + " and also are often softened by the addition of the particle to the end of the phrase.
Questions
There are three basic types of questions in Khmer.
Questions requesting specific information use
question words.
Polar questions are indicated with interrogative particles, most commonly , a homonym of the negation particle.
Tag questions are indicated with various particles and rising inflection.
The SVO word order is generally not
inverted for questions.
In more formal contexts and in polite speech, questions are also marked at their beginning by the particle .
Passive voice
Khmer does not have a passive voice,
but there is a construction utilizing the main verb ("to hit", "to be correct", "to affect") as an auxiliary verb meaning "to be subject to" or "to undergo"—which results in sentences that are translated to English using the passive voice.
Clause syntax
Complex sentences are formed in Khmer by the addition of one or more
clause
In language, a clause is a Constituent (linguistics), constituent or Phrase (grammar), phrase that comprises a semantic predicand (expressed or not) and a semantic Predicate (grammar), predicate. A typical clause consists of a subject (grammar), ...
s to the main clause. The various types of clauses in Khmer include the
coordinate clause, the
relative clause
A relative clause is a clause that modifies a noun or noun phrase and uses some grammatical device to indicate that one of the arguments in the relative clause refers to the noun or noun phrase. For example, in the sentence ''I met a man who wasn ...
and the
subordinate clause. Word order in clauses is the same for that of the basic sentences described above.
Coordinate clauses do not necessarily have to be marked; they can simply follow one another. When explicitly marked, they are joined by words similar to English conjunctions such as ("and") and ("and then") or by clause-final conjunction-like adverbs and , both of which can mean "also" or "and also"; disjunction is indicated by ("or").
Relative clauses can be introduced by ("that") but, similar to coordinate clauses, often simply follow the main clause. For example, both phrases below can mean "the hospital bed that has wheels".
Relative clauses are more likely to be introduced with if they do not immediately follow the head noun.
Khmer subordinate conjunctions always precede a subordinate clause.
Subordinate conjunctions include words such as ("because"), ("seems as if") and ("in order to").
Numerals
Counting in Khmer is based on a
biquinary
Bi-quinary coded decimal is a numeral encoding scheme used in many abacuses and in some early computers, notably the Colossus. The term ''bi-quinary'' indicates that the code comprises both a two-state (''bi'') and a five-state (''quin'' ...
system: the numbers from 6 to 9 have the form "five one", "five two", etc. The words for multiples of ten from 30 to 90 are not related to the basic Khmer numbers, but are Chinese in origin, and probably came to Khmer via Thai.
Khmer numerals
Khmer numerals ០ ១ ២ ៣ ៤ ៥ ៦ ៧ ៨ ៩ are the Numerical digit, numerals used in the Khmer language. They have been in use since at least the early 7th century.
Numerals
Having been derived from the Hindu–Arabic numeral sy ...
, which were inherited directly from Indian numerals, are used more widely than
Western numerals, which like Khmer numerals were inherited from Indian, but first passed through the Arabic numerals before reaching the west.
The principal number words are listed in the following table, which gives Western and Khmer digits, Khmer spelling and IPA transcription.
Intermediate numbers are formed by compounding the above elements. Powers of ten are denoted by loan words: (100), (1,000), (10,000), (100,000) and (1,000,000) from Thai and (10,000,000) from Sanskrit.
Ordinal number
In set theory, an ordinal number, or ordinal, is a generalization of ordinal numerals (first, second, th, etc.) aimed to extend enumeration to infinite sets.
A finite set can be enumerated by successively labeling each element with the leas ...
s are formed by placing the particle before the corresponding cardinal number.
Social registers
Khmer employs a system of
registers in which the speaker must always be conscious of the social status of the person spoken to. The different registers, which include those used for common speech, polite speech, speaking to or about royals and speaking to or about monks, employ alternate verbs, names of body parts and pronouns. As an example, the word for "to eat" used between intimates or in reference to animals is . Used in polite reference to commoners, it is . When used of those of higher social status, it is or . For monks the word is and for royals, .
Another result is that the pronominal system is complex and full of honorific variations, just a few of which are shown in the table below.
Writing system

Khmer is written with the Khmer script, an
abugida
An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
developed from the
Pallava script of India before the 7th century when the first known inscription appeared.
Written left-to-right with vowel signs that can be placed after, before, above or below the consonant they follow, the Khmer script is similar in appearance and usage to Thai script, Thai and Lao script, Lao, both of which were based on the Khmer system. The Khmer script is also distantly related to the Mon–Burmese script.
Within Cambodia, literacy in the Khmer alphabet is estimated at 77.6%.
Consonant symbols in Khmer are divided into two groups, or series. The first series carries the inherent vowel while the second series carries the inherent vowel . The Khmer names of the series, ('voiceless') and ('voiced'), respectively, indicate that the second series consonants were used to represent the voiced phonemes of Old Khmer. As the voicing of stops was lost, however, the contrast shifted to the phonation of the attached vowels, which, in turn, evolved into a simple difference of vowel quality, often by Vowel breaking, diphthongization.
This process has resulted in the Khmer alphabet having two symbols for most consonant phonemes and each vowel symbol having two possible readings, depending on the series of the initial consonant:
Examples
The following text is from Article 1 of the Universal Declaration of Human Rights.
See also
* Hem Chieu
* Khmer literature
* Romanization of Khmer
References and notes
Further reading
* Ferlus, Michel. (1992). ''Essai de phonétique historique du khmer'' (Du milieu du premier millénaire de notre ère à l'époque actuelle)", ''
Mon-Khmer Studies'' XXI: 57–89)
* Headley, Robert and others. (1977). ''Cambodian-English Dictionary''. Washington, Catholic University Press.
* Herington, Jennifer and Amy Ryan. (2013)
Sociolinguistic Survey of the Khmer Khe in Cambodia. Chiang Mai: Linguistics Institute, Payap University.
* Huffman, F. E., Promchan, C., & Lambert, C.-R. T. (1970). ''Modern spoken Cambodian''. New Haven: Yale University Press.
* Huffman, F. E., Lambert, C.-R. T., & Im Proum. (1970). ''Cambodian system of writing and beginning reader with drills and glossary''. Yale linguistic series. New Haven: Yale University Press.
* Jacob, Judith. (1966). 'Some features of Khmer versification', in C. E. Bazell, J. C. Catford, M. A. K. Halliday, and R. H. Robins, eds., In Memory of J. R Firth, 227–41. London: Longman. [Includes discussion of the two series of syllables and their places in Khmer shymes]
* Jacob, Judith. (1974). ''A Concise Cambodian-English Dictionary''. London, Oxford University Press.
* Jacob, J. M. (1996). ''The traditional literature of Cambodia: a preliminary guide''. London oriental series, v. 40. New York: Oxford University Press.
* Jacob, J. M., & David A. Smyth, Smyth, D. (1993). ''Cambodian linguistics, literature and history: collected articles''. London: School of Oriental and African Studies, University of London.
* Keesee, A. P. K. (1996). ''An English-spoken Khmer dictionary: with romanized writing system, usage, and idioms, and notes on Khmer speech and grammar''. London: Kegan Paul International.
* Meechan, M. (1992). ''Register in Khmer the laryngeal specification of pharyngeal expansion''. Ottawa: National Library of Canada = Bibliothèque nationale du Canada.
* Sak-Humphry, C. (2002). ''Communicating in Khmer: an interactive intermediate level Khmer course''. Manoa, Hawai'i: Center for Southeast Asian Studies, School of Hawaiian, Asian and Pacific Studies, University of Hawai'i at Manoa. OCLC: 56840636
* David A. Smyth, Smyth, D. (1995). ''Colloquial Cambodian: a complete language course''. London: Routledge.
* Stewart, F., & May, S. (2004). ''In the shadow of Angkor: contemporary writing from Cambodia''. Honolulu: University of Hawai'i Press.
* Tonkin, D. (1991). ''The Cambodian alphabet: how to write the Khmer language''. Bangkok: Trasvin Publications.
External links
Kheng.info��An online audio dictionary for learning Khmer, with thousands of native speaker recordings an
text segmentation software
* wiktionary:Appendix:Khmer Swadesh list, Khmer Swadesh vocabulary list (from Wiktionary'
Swadesh-list appendix
Dictionary and SpellCheckeropen sourced and collaborative project based on
Chuon Nath Khmer Dictionary
How to install Khmer script on a Windows 7 computerHow to install Khmer script on a Windows XP computer
*
Khmer' a
Online Khmer & English dictionaryKhmer Online Dictionaries
NOUN:noun
VERB:verb
OBJ:object
OM:object marker
MARKER:marker
REL:relative
COHORT:cohortative
DIR:directional
COMPL:complement
RESP:respectful
{{Authority control
Khmer language,
Languages attested from the 9th century
Analytic languages
Isolating languages
Languages of Cambodia
Languages of Thailand
Languages of Vietnam
Subject–verb–object languages