Bai (Bai: ; ) is a
Sino-Tibetan language spoken in
China
China, officially the People's Republic of China (PRC), is a country in East Asia. With population of China, a population exceeding 1.4 billion, it is the list of countries by population (United Nations), second-most populous country after ...
, primarily in
Yunnan Province
Yunnan; is an inland Provinces of China, province in Southwestern China. The province spans approximately and has a population of 47.2 million (as of 2020). The capital of the province is Kunming. The province borders the Chinese provinces ...
, by the
Bai people
The Bai or Pai (Bai language, Bai: , ; zh, c=白族, p=Báizú), are an East Asian people, East Asian ethnic group native to the Dali Bai Autonomous Prefecture of Yunnan, Yunnan Province, Bijie area of Guizhou, Guizhou Province, and Sangzhi C ...
. The language has over a million speakers and is divided into three or four main
dialect
A dialect is a Variety (linguistics), variety of language spoken by a particular group of people. This may include dominant and standard language, standardized varieties as well as Vernacular language, vernacular, unwritten, or non-standardize ...
s. Bai syllables are always open, with a rich set of
vowel
A vowel is a speech sound pronounced without any stricture in the vocal tract, forming the nucleus of a syllable. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness a ...
s and eight
tones. The tones are divided into two groups with
modal and non-modal (
tense,
harsh or
breathy) phonation. There is a small amount of traditional literature written with Chinese characters, Bowen (), as well as a number of recent publications printed with a recently standardized system of romanisation using the
Latin alphabet
The Latin alphabet, also known as the Roman alphabet, is the collection of letters originally used by the Ancient Rome, ancient Romans to write the Latin language. Largely unaltered except several letters splitting—i.e. from , and from � ...
.
The origins of Bai have been obscured by intensive Chinese influence of an extended period. Different scholars have proposed that it is an early offshoot or sister language of
Chinese, part of the
Loloish branch, or a separate group within the
Sino-Tibetan family.
Varieties
Xu and Zhao (1984) divided Bai into three dialects, which may actually be distinct languages:
Jianchuan (Central),
Dali (Southern) and Bijiang (Northern). Bijiang County has since been renamed as
Lushui County. Jianchuan and Dali are closely related and speakers are reported to be able to understand one another after living together for a month.
The more divergent Northern dialects are spoken by about 15,000 Laemae (', Lemei, Lama), a clan numbering about 50,000 people who are partly submerged within the
Lisu. They are now designated as two languages by
ISO 639-3
ISO 639-3:2007, ''Codes for the representation of names of languages – Part 3: Alpha-3 code for comprehensive coverage of languages'', is an international standard for language codes in the ISO 639 series. It defines three-letter codes for ...
:
* Panyi, spoken by people called ''Lemo'' () on the Nu River (upper
Salween) in Lushui County.
*
Lama
Lama () is a title bestowed to a realized practitioner of the Dharma in Tibetan Buddhism. Not all monks are lamas, while nuns and female practitioners can be recognized and entitled as lamas. The Tibetan word ''la-ma'' means "high mother", ...
, spoken by people called ''Lama'' () on the Lancang River (upper
Mekong
The Mekong or Mekong River ( , ) is a transboundary river in East Asia and Southeast Asia. It is the world's twelfth-longest river and the third-longest in Asia with an estimated length of and a drainage area of , discharging of wat ...
) in
Lanping County and
Weixi County.
Wang Feng (2012)
provides the following classification for nine Bai dialects:
;Bai
*Western
**Gongxing (),
Lanping County
**(''core branch'')
***Enqi (),
Lanping County; Jinman (金满),
Lushui County
***Tuoluo (),
Weixi County
***Ega (),
Lushui County
*Eastern
**Mazhelong (),
Qiubei County
**(''core branch'')
***Jinxing (),
Jianchuan County
***Dashi (),
Heqing County
Heqing County (; Bai language, Bai: ) is a county in the Dali Bai Autonomous Prefecture located in the northwest of Yunnan Province, China. It borders Yongsheng County to the east, Binchuan County and Dali City to the south, Jianchuan County and E ...
***Zhoucheng (),
Dali City
Wang (2012) also documents a Bai dialect in Xicun, Dacun Village, Shalang Township,
Kunming
Kunming is the capital and largest city of the province of Yunnan in China. The political, economic, communications and cultural centre of the province, Kunming is also the seat of the provincial government. During World War II, Kunming was a Ch ...
City (昆明市沙朗乡大村西村).
Classification
The affiliation of Bai is obscured by over two millennia of influence from
varieties of Chinese
There are hundreds of local Chinese language varieties forming a branch of the Sino-Tibetan languages, Sino-Tibetan language family, many of which are not Mutual intelligibility, mutually intelligible. Variation is particularly strong in the m ...
, leaving most of its lexicon related to Chinese
etyma of various periods. To determine its origin, researchers must first identify and remove from consideration the various layers of
loanword
A loanword (also a loan word, loan-word) is a word at least partly assimilated from one language (the donor language) into another language (the recipient or target language), through the process of borrowing. Borrowing is a metaphorical term t ...
s and then examine the residue. In his survey of the field, Wang (2006) notes that early work was hampered by a lack of data on Bai and uncertainties in the reconstruction of early forms of Chinese. Recent authors have suggested that Bai is an early offshoot from Chinese, a sister language to Chinese, or more distantly related (though usually still
Sino-Tibetan).
There are different tonal correspondences in the various layers. Many words can be identified as later Chinese loans because they display Chinese
sound change
In historical linguistics, a sound change is a change in the pronunciation of a language. A sound change can involve the replacement of one speech sound (or, more generally, one phonetic feature value) by a different one (called phonetic chan ...
s from the last two millennia:
*
labiodental
In phonetics, labiodentals are consonants articulated with the lower lip and the upper teeth, such as and . In English, labiodentalized /s/, /z/ and /r/ are characteristic of some individuals; these may be written .
Labiodental consonants in ...
fricative
A fricative is a consonant produced by forcing air through a narrow channel made by placing two articulators close together. These may be the lower lip against the upper teeth, in the case of ; the back of the tongue against the soft palate in ...
s, which developed from earlier labial stops in certain environments.
*
palatal affricate
An affricate is a consonant that begins as a stop and releases as a fricative, generally with the same place of articulation (most often coronal). It is often difficult to decide if a stop and fricative form a single phoneme or a consonant pai ...
s from earlier
velar stops in palatal environments.
*
aspirated stops from earlier voiced stops in words having the
Middle Chinese
Middle Chinese (formerly known as Ancient Chinese) or the Qieyun system (QYS) is the historical variety of Chinese language, Chinese recorded in the ''Qieyun'', a rime dictionary first published in 601 and followed by several revised and expande ...
level tone.
* the initial , which developed from
Old Chinese
Old Chinese, also called Archaic Chinese in older works, is the oldest attested stage of Chinese language, Chinese, and the ancestor of all modern varieties of Chinese. The earliest examples of Chinese are divinatory inscriptions on oracle bones ...
*r-.
Some of these changes date back to the first centuries AD.
The oldest layer of Bai vocabulary with Chinese cognates, of which Wang lists some 250 words, includes common Bai words that were also common in
Classical Chinese
Classical Chinese is the language in which the classics of Chinese literature were written, from . For millennia thereafter, the written Chinese used in these works was imitated and iterated upon by scholars in a form now called Literary ...
, but are not used in modern
varieties of Chinese
There are hundreds of local Chinese language varieties forming a branch of the Sino-Tibetan languages, Sino-Tibetan language family, many of which are not Mutual intelligibility, mutually intelligible. Variation is particularly strong in the m ...
. Its features have been compared with current ideas on
Old Chinese phonology
Scholars have attempted to reconstruct the phonology of Old Chinese from documentary evidence. Although the writing system does not describe sounds directly, shared phono-semantic, phonetic components of the most ancient Chinese characters are b ...
:
* The voiceless nasals and lateral postulated for Old Chinese are absent, though in some cases the reflexes match those in western dialects of Han Chinese, rather than those of eastern dialects from which
Middle Chinese
Middle Chinese (formerly known as Ancient Chinese) or the Qieyun system (QYS) is the historical variety of Chinese language, Chinese recorded in the ''Qieyun'', a rime dictionary first published in 601 and followed by several revised and expande ...
and most modern varieties are descended.
* Where Middle Chinese has ''l-'', believed to be a reflex of Old Chinese *r, Bai varieties have before , before a nasal final, and elsewhere. However, in words where Middle Chinese ''l-'' corresponds to in inland
Min dialects, Bai often has a stop initial, providing support for Baxter and Sagart's suggestion that such initials derive from clusters.
* Old Chinese *l- generally has similar palatal and dental reflexes in Bai and Middle Chinese, but seems to be preserved in a few Bai words.
* The Old Chinese finals *-aw and *-u merged in Middle Chinese syllables without a palatal medial by the 4th century AD, but are still distinguished in Bai.
* Several words with Old Chinese *-ts, which developed to ''-j'' with the departing tone in Middle Chinese, produce tonal reflexes in Bai corresponding to an original stop coda.
Sergei Starostin
Sergei Anatolyevich Starostin (; March 24, 1953 – September 30, 2005) was a Russian historical linguistics, historical linguist and philology, philologist, perhaps best known for his reconstructions of hypothetical proto-languages, including hi ...
suggests that these facts indicate a split from mainstream Chinese around the 2nd century BC, corresponding to the
Western Han period. Wang argues that a few of the correspondences between his reconstructed Proto-Bai and Old Chinese cannot be explained by the Old Chinese forms, and that Chinese and Bai therefore form a Sino-Bai group. However, Gong suggests that at least some of these cases can be accounted for by refining the Proto-Bai reconstruction to take account of
complementary distribution within Bai.
Starostin and
Zhengzhang Shangfang have separately argued that the oldest Chinese layer accounts for all but an insignificant residue of Bai vocabulary, and that Bai is therefore an early branching from Chinese.
On the other hand, Lee and Sagart (1998) argued that the various layers of Chinese vocabulary are loans, and that when they are removed, a significant non-Chinese residue remains, including 15 entries from the 100-word
Swadesh list
A Swadesh list () is a compilation of cultural universal, tentatively universal concepts for the purposes of lexicostatistics. That is, a Swadesh list is a list of forms and concepts which all languages, without exception, have terms for, such as ...
of basic vocabulary. They suggest that this residue shows similarities with
Proto-Loloish.
James Matisoff
James Alan Matisoff ( zh, , t=馬蒂索夫, s=马蒂索夫, p=Mǎdìsuǒfū or zh, , t=馬提索夫, s=马提索夫, p=Mǎtísuǒfū; born July 14, 1937) is an American linguist. He is a professor emeritus of linguistics at the University of Cal ...
(2001) argued that the comparison with Loloish is less persuasive when considering other Bai varieties than the Jianchuan dialect used by Lee and Sagart, and that it is safer to consider Bai as an independent branch of Sino-Tibetan, though perhaps close to the neighbouring Loloish. Lee and Sagart (2008) refined their analysis, presenting the residue as a non-Chinese form of Sino-Tibetan, though not necessarily Loloish. They also note that this residue includes the Bai vocabulary relating to pig rearing and rice agriculture.
Lee and Sagart's analysis has been further discussed by List (2009). Gong (2015) suggests that the residual layer may be
Qiangic, pointing out that the Bai, like the Qiang, call themselves "white", whereas the
Lolo use "black".
Phonology
The Jianchuan dialect has the following consonants, all of which are restricted to syllable-initial position:
The Gongxing and Tuolou dialects retain an older 3-way distinction for stop and affricate initials between voiceless unaspirated, voiceless aspirated and voiced. In the core eastern group, including the standard form of Dali, the voiced initials have become voiceless unaspirated, while other dialects show partial loss of voicing, conditioned by tone in different ways. Some varieties also have an additional
uvular nasal that contrasts phonemically with .
Jianchuan finals comprise:
*diphthongs:
* triphthong:
All but , and have contrasting
nasalized variants.
Dali Bai lacks nasal vowels. Some other varieties retain nasal codas instead of nasalization, though only the Gongxing and Tuolou dialects have a contrast between and .
Jianchuan has eight tones, divided between those with
modal and non-modal
phonation
The term phonation has slightly different meanings depending on the subfield of phonetics. Among some phoneticians, ''phonation'' is the process by which the vocal folds produce certain sounds through quasi-periodic vibration. This is the defi ...
. Some of the western varieties have fewer tones.
Syntax
Bai has a basic subject–verb–object (
SVO) order. However,
SOV can be found in interrogative and negative sentences.
Writing system
Latin script
The old Bai script used modified
Chinese characters
Chinese characters are logographs used Written Chinese, to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represe ...
, but was not widely used. A new script based on the
Latin alphabet
The Latin alphabet, also known as the Roman alphabet, is the collection of letters originally used by the Ancient Rome, ancient Romans to write the Latin language. Largely unaltered except several letters splitting—i.e. from , and from � ...
was designed in 1958, based on the speech of the urban centre of
Xiaguan, even though it was not a typical Southern dialect. The idea of romanization was controversial among Bai elites and the system saw little use. In a renewed attempt in 1982,
language planners used the Jianchuan dialect as a base, because it represented an area with a significant population, almost all of whom spoke Bai. The new script was popular in the Jianchuan area, but was rejected in the more economically advanced area of Dali, which also had the largest number of speakers, albeit living alongside a large number of speakers of Chinese. The script was revised extensively in 1993 to define two variants, representing Jianchuan and Dali respectively and has since been more widely used.
The retroflex initials ''zh'', ''ch'', ''sh'' and ''r'' are used only in recent loanwords from
Standard Chinese
Standard Chinese ( zh, s=现代标准汉语, t=現代標準漢語, p=Xiàndài biāozhǔn hànyǔ, l=modern standard Han speech) is a modern standard form of Mandarin Chinese that was first codified during the republican era (1912–1949). ...
or for other Bai varieties.
The 1993 revision introduced variants ''ai''/''er'' etc, with the former to be used for Jianchuan Bai and the latter for Dali Bai. In Jianchuan, all vowels but ''ao'', ''iao'', ''uo'', ''ou'' and ''iou'' have nasalized counterparts, denoted by a suffixed ''n''. Dali Bai lacks nasalized vowels.
Suffixed letters indicate tone contours and
modal or non-modal phonation. This was the most radical aspect of the 1993 revision:
Bowen script

Bowen script (), also known as Square Bai Script (),
Hanzi
Chinese characters are logographs used to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represent the only one ...
Bai Script (), Hanzi-style Bai Script (), or Ancient Bai Script (), was a
logographic script
In a written language, a logogram (from Ancient Greek 'word', and 'that which is drawn or written'), also logograph or lexigraph, is a written character that represents a semantic component of a language, such as a word or morpheme. Chinese c ...
formerly used by the
Bai people
The Bai or Pai (Bai language, Bai: , ; zh, c=白族, p=Báizú), are an East Asian people, East Asian ethnic group native to the Dali Bai Autonomous Prefecture of Yunnan, Yunnan Province, Bijie area of Guizhou, Guizhou Province, and Sangzhi C ...
, adapted from
Hanzi
Chinese characters are logographs used to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represent the only one ...
to fit the Bai language. The script was used from the
Nanzhao
Nanzhao ( zh, t=南詔, s=南诏, p=Nánzhào), also spelled Nanchao, , Yi language: ꂷꏂꌅ, ''Mashynzy'') was a dynastic kingdom that flourished in what is now southwestern China and northern Southeast Asia during the 8th and 9th centuri ...
period to the beginning of the
Ming dynasty
The Ming dynasty, officially the Great Ming, was an Dynasties of China, imperial dynasty of China that ruled from 1368 to 1644, following the collapse of the Mongol Empire, Mongol-led Yuan dynasty. The Ming was the last imperial dynasty of ...
.
The Shanhua tablet (), from
Dali Town in
Yunnan
Yunnan; is an inland Provinces of China, province in Southwestern China. The province spans approximately and has a population of 47.2 million (as of 2020). The capital of the province is Kunming. The province borders the Chinese provinces ...
, contains a poem written using Bowen text from the
Ming dynasty
The Ming dynasty, officially the Great Ming, was an Dynasties of China, imperial dynasty of China that ruled from 1368 to 1644, following the collapse of the Mongol Empire, Mongol-led Yuan dynasty. The Ming was the last imperial dynasty of ...
by the Bai poet Yang Fu (),
《詞記山花·詠蒼洱境》.
Examples
Nge, no – I
Ne, no – you
Cai ho – red flower
Gei bo – rooster
A de gei bo – a rooster
Ne mian e ain hain? – What's your name?
Ngo mian e A Lu Gai. – My name is A Lu Gai.
Ngo ze ne san se yin a biu. – I don't recognize you.
Ngo ye can. – I'm eating.
Ne can ye la ma? – Have you eaten?
Ne ze a ma yin? – Who are you?
Ne ze nge mo a bio. – You are not my mother.
Ngo zei pi ne gan. – I'm taller than you.
Ne nge no hha si bei. – You won't let me go.
Notes
References
Works cited
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
Further reading
* Allen, Bryan and Zhang Xia. 2004. ''Bai Dialect Survey''. Yunnan Nationalities Publishing House.
CLDFDataset a
Zenodo .
*
*Wāng, Fēng. 2013. Báiyǔ yǔ báizú de liúbiàn: Duōjiǎodù jiéhé de shìyě 白語與白族的流變:多角度結合的視野. In Fēng Shí and Gāng Péng, editors, Dàjiāng Dōngqù: Wāng Shìyuán Jiàoshòu Bāshísuì Hèshòu Wénjí. 大江東去:王士元教授八十歲賀壽文集. City University of Hong Kong Press.
*Xú, Lín and Yǎnsūn Zhào. 1984. Báiyǔ Jiǎnzhì 白语简志. Mínzú Chūbǎnshè.
*Yuán, Míngjūn. 2006. Hànbáiyǔ diàochá yánjiū 汉白语调查研究. Zhōngguó Wénshǐ Chūbǎnshè.
*Zhào, Yǎnsūn and Lín Xú. 1996. Bái-Hàn Cídiǎn 白汉词典. Sìchuān Mínzú Chūbǎnshè.
*Dali Prefecture Bai Cultural Studies Editorical Committee
��理白族自治洲白族文化研究所编 2008.'' Dali series: Bai language, vol. 3: Vocabulary of the dialects of the Bai people''
��理丛书·白语篇 卷3 白族方言词汇 Kunming: Yunnan People's Press
��南民族出版社 ontains word lists of 33 Bai language datapoints.
External links
Bai basic lexicon at the Global Lexicostatistical Database
{{authority control
Sino-Tibetan languages
Tonal languages
Subject–verb–object languages
Languages of Yunnan
Bai people