Written Cantonese
   HOME

TheInfoList



OR:

Written
Cantonese Cantonese ( zh, t=廣東話, s=广东话, first=t, cy=Gwóngdūng wá) is a language within the Chinese (Sinitic) branch of the Sino-Tibetan languages originating from the city of Guangzhou (historically known as Canton) and its surrounding a ...
is the most complete written form of Chinese after that for
Mandarin Chinese Mandarin (; ) is a group of Chinese (Sinitic) dialects that are natively spoken across most of northern and southwestern China. The group includes the Beijing dialect, the basis of the phonology of Standard Chinese, the official language ...
and
Classical Chinese Classical Chinese, also known as Literary Chinese (古文 ''gǔwén'' "ancient text", or 文言 ''wényán'' "text speak", meaning "literary language/speech"; modern vernacular: 文言文 ''wényánwén'' "text speak text", meaning "literar ...
.
Written Chinese Written Chinese () comprises Chinese characters used to represent the Chinese language. Chinese characters do not constitute an alphabet or a compact syllabary. Rather, the writing system is roughly logosyllabic; that is, a character generally r ...
was originally developed for Classical Chinese, and was the main
literary language A literary language is the form (register) of a language used in written literature, which can be either a nonstandard dialect or a standardized variety of the language. Literary language sometimes is noticeably different from the spoken langua ...
of China until the 19th century. Written vernacular Chinese first appeared in the 17th century, and a written form of Mandarin became standard throughout China in the early 20th century. While the Mandarin form can in principle be read and spoken word for word in other
Chinese varieties Chinese, also known as Sinitic, is a branch of the Sino-Tibetan language family consisting of hundreds of local varieties, many of which are not mutually intelligible. Variation is particularly strong in the more mountainous southeast of main ...
, its intelligibility to non-Mandarin speakers is poor to incomprehensible because of differences in
idiom An idiom is a phrase or expression that typically presents a figurative, non-literal meaning attached to the phrase; but some phrases become figurative idioms while retaining the literal meaning of the phrase. Categorized as formulaic language ...
s,
grammar In linguistics, the grammar of a natural language is its set of structural constraints on speakers' or writers' composition of clauses, phrases, and words. The term can also refer to the study of such constraints, a field that includes doma ...
and
usage The usage of a language is the ways in which its written and spoken variations are routinely employed by its speakers; that is, it refers to "the collective habits of a language's native speakers", as opposed to idealized models of how a languag ...
. Modern Cantonese speakers have therefore developed new characters for words that do not exist and have retained others that have been lost in standard Chinese. With the advent of the computer and standardization of character sets specifically for Cantonese, many printed materials in predominantly Cantonese-speaking areas of the world are written to cater to their population with these written Cantonese characters.


History


Early history

Before the 20th century, the standard written language of
China China, officially the People's Republic of China (PRC), is a country in East Asia. It is the world's List of countries and dependencies by population, most populous country, with a Population of China, population exceeding 1.4 billion, slig ...
was
Classical Chinese Classical Chinese, also known as Literary Chinese (古文 ''gǔwén'' "ancient text", or 文言 ''wényán'' "text speak", meaning "literary language/speech"; modern vernacular: 文言文 ''wényánwén'' "text speak text", meaning "literar ...
, which has grammar and vocabulary based on the Chinese used in ancient China,
Old Chinese Old Chinese, also called Archaic Chinese in older works, is the oldest attested stage of Chinese, and the ancestor of all modern varieties of Chinese. The earliest examples of Chinese are divinatory inscriptions on oracle bones from around 12 ...
. However, while this written standard remained essentially static for over two thousand years, the actual spoken language diverged further and further away. The first Cantonese writings belong to
Canton Canton may refer to: Administrative division terminology * Canton (administrative division), territorial/administrative division in some countries, notably Switzerland * Township (Canada), known as ''canton'' in Canadian French Arts and ente ...
's specific ''mukjyusyu'' () literary form, that supposedly has its roots in
Buddhist Buddhism ( , ), also known as Buddha Dharma and Dharmavinaya (), is an Indian religion or philosophical tradition based on teachings attributed to the Buddha. It originated in northern India as a -movement in the 5th century BCE, and ...
chants accompanied by wooden fish. ''Mukjyu'' texts were popular light reading, their primary audience were women, as female (and overall) literacy was unusually high in that region. The ''mukjyus'' were intended to be sung, similarly to other performing genres such as '' naamyam'', although without musical instruments. The earliest known work with elements of written Cantonese, '' Faazin Gei'' (, "The Flowery Paper"), was composed by an unknown author during the late
Ming dynasty The Ming dynasty (), officially the Great Ming, was an Dynasties in Chinese history, imperial dynasty of China, ruling from 1368 to 1644 following the collapse of the Mongol Empire, Mongol-led Yuan dynasty. The Ming dynasty was the last ort ...
(16-17th century), its oldest extant edition is dated 1713. The ''Faazin Gei'' is an example of the "scholar and beauty" genre popular at the time, with its story set in
Suzhou Suzhou (; ; Suzhounese: ''sou¹ tseu¹'' , Mandarin: ), alternately romanized as Soochow, is a major city in southern Jiangsu province, East China. Suzhou is the largest city in Jiangsu, and a major economic center and focal point of trad ...
. Its text, while still being close to the literary Chinese, contains a lot of specific Cantonese wording and even Cantonese vernacular characters, especially in the dialogue sentences, but also in the narrative text. Other praised early works include ''Ji-Hofaa Si'' (, "The Two Lotus Flowers") and ''Gamso-Jyunjoeng Saanwusin Gei'' (, "Coral Fan and Golden-lock Mandarin-ducks Pendant"). Since the late Ming era, the '' naamyam'' song genre flourished. These ''southern songs'' were frequently sung in Canton's brothels, accompanied by string instruments. Their language was in general very literary, only occasionally throwing in some colloquial Cantonese words. The purpose of such inclusions is debated, they were likely added purely for rhythmic purposes. An example of such practice is ''Haaktou Cauhan'' (, "The Traveler's Autumn Regrets") written in the first decade of 1800s, which is considered one of the most outstanding examples of the ''naamyam'' genre. The Cantonese vocabulary was used much more in the ''lungzau'' (, "Dragon boat") songs performed mainly by beggars on the streets. These songs were considered the least prestigious genre and rarely faced publishing, only after careful editing making them less vernacular in style. An important landmark in the history of written Cantonese was the publication of ''Jyut-au'' (, "Cantonese Love Songs") by Zhao Ziyong () in 1828. It marked the beginning of an extremely popular genre. Being an educated '' juren'', Zhao Ziyong earned some prestige and respect for the "heavy" vernacular literature, which used to be rejected.


Modern times

In the early 20th century, Chinese reformers like
Hu Shih Hu Shih (; 17 December 1891 – 24 February 1962), also known as Hu Suh in early references, was a Chinese diplomat, essayist, literary scholar, philosopher, and politician. Hu is widely recognized today as a key contributor to Chinese libera ...
saw the need for language reform and championed the development of a vernacular that allowed modern Chinese to write the language the same way they speak. The vernacular language movement took hold, and the written language was standardized as
vernacular Chinese Written vernacular Chinese, also known as Baihua () or Huawen (), is the forms of written Chinese based on the varieties of Chinese spoken throughout China, in contrast to Classical Chinese, the written standard used during imperial China up ...
. Mandarin was chosen as the basis for the new standard. The standardization and adoption of written Mandarin preempted the development and standardization of vernaculars based on other varieties of Chinese. No matter which dialect one spoke, he still wrote in standardized Mandarin for everyday writing. However, Cantonese is unique amongst the non-Mandarin varieties in having a widely used written form. Cantonese-speaking
Hong Kong Hong Kong ( (US) or (UK); , ), officially the Hong Kong Special Administrative Region of the People's Republic of China (abbr. Hong Kong SAR or HKSAR), is a List of cities in China, city and Special administrative regions of China, special ...
used to be a British colony isolated from mainland
China China, officially the People's Republic of China (PRC), is a country in East Asia. It is the world's List of countries and dependencies by population, most populous country, with a Population of China, population exceeding 1.4 billion, slig ...
before 1997, so most HK citizens do not speak Mandarin. Written Cantonese has developed as a means of informal communication. Still, Cantonese speakers must use standard written Chinese, or even literary Chinese, in most formal written communications, since written Cantonese may be unintelligible to speakers of other varieties of Chinese. Historically, written Cantonese has been used in Hong Kong for legal proceedings in order to write down the exact spoken testimony of a witness, instead of paraphrasing spoken Cantonese into standard written Chinese. However, its popularity and usage has been rising in the last two decades, the late
Wong Jim James Wong Jim (; 18 March 1941 – 24 November 2004, also known as "霑叔" or "Uncle Jim") was a Cantopop lyricist and songwriter based primarily in Hong Kong. Beginning from the 1960s, he was the lyricist for over 2,000 songs, collaboratin ...
being one of the pioneers of its use as an effective written language. Written Cantonese has become quite popular in certain tabloids, online chat rooms,
instant messaging Instant messaging (IM) technology is a type of online chat allowing real-time text transmission over the Internet or another computer network. Messages are typically transmitted between two or more parties, when each user inputs text and tri ...
, and even
social networking A social network is a social structure made up of a set of social actors (such as individuals or organizations), sets of dyadic ties, and other social interactions between actors. The social network perspective provides a set of methods for ...
websites; this would be even more evident since the rise of
localism in Hong Kong In Hong Kong, localism is a political movement centered on the preservation of the city's autonomy and local culture. The Hong Kong localist movement encompasses a variety of groups with different goals, but all of them oppose the perceived gr ...
from the 2010s, where the articles written by those localist media are written in Cantonese. Although most foreign movies and TV shows are subtitled in Standard Chinese, some, such as ''
The Simpsons ''The Simpsons'' is an American animated sitcom created by Matt Groening for the Fox Broadcasting Company. The series is a satirical depiction of American life, epitomized by the Simpson family, which consists of Homer, Marge, Bart, ...
'', are subtitled using written Cantonese. Newspapers have the news section written in Standard Chinese, but they may have editorials or columns that contain Cantonese discourses, and Cantonese characters are increasing in popularity on advertisements and billboards. It has been stated that written Cantonese remains limited outside Hong Kong, including other Cantonese-speaking areas in
Guangdong Province Guangdong (, ), alternatively romanized as Canton or Kwangtung, is a coastal province in South China on the north shore of the South China Sea. The capital of the province is Guangzhou. With a population of 126.01 million (as of 2020) ...
. However, colloquial Cantonese advertisements are sometimes seen in Guangdong, suggesting that written Cantonese is widely understood and is regarded favourably, at least in some contexts. Some sources will use only colloquial Cantonese forms, resulting in text similar to natural speech. However, it is more common to use a mixture of colloquial forms and standard Chinese forms, some of which are alien to natural speech. Thus the resulting "hybrid" text lies on a continuum between two norms: standard Chinese and colloquial Cantonese as spoken.


Cantonese characters


Early sources

A good source for well documented written Cantonese words can be found in the scripts for
Cantonese opera Cantonese opera is one of the major categories in Chinese opera, originating in southern China's Guangdong Province. It is popular in Guangdong, Guangxi, Hong Kong, Macau and among Chinese communities in Southeast Asia. Like all versions of Ch ...
. ''Readings in Cantonese colloquial: being selections from books in the Cantonese vernacular with free and literal translations of the Chinese character and romanized spelling'' (1894) by James Dyer Ball has a bibliography of printed works available in Cantonese characters in the last decade of the nineteenth century. A few libraries have collections of so-called "wooden fish books" written in Cantonese characters. Facsimiles and plot precis of a few of these have been published in Wolfram Eberhard's ''Cantonese Ballads.'' See also ''Cantonese love-songs, translated with introduction and notes by Cecil Clementi'' (1904) or a newer translation of these by Peter T. Morris in ''Cantonese love songs : an English translation of Jiu Ji-yung's Cantonese songs of the early 19th century'' (1992). Cantonese character versions of the Bible, Pilgrims Progress, and Peep of Day, as well as simple catechisms, were published by mission presses. The special Cantonese characters used in all of these were not standardized and show wide variation.


Characters today

File:Cantonesebillboard.jpg, A Hong Kong billboard in Written Cantonese with a mixture of English words in the typical
code switch ''Code Switch'' is a race and culture outlet and a weekly podcast from American public radio network NPR. It began in 2013 with a blog as well as contributing stories to NPR radio programs. The Code Switch podcast launched in 2016. In the wake ...
style of Hong Kong speech. File:Written_cantonese.jpg, A Hong Kong political advertisement in Written Cantonese File:Cantopoliticalbanner.jpg, Political banner in Written Cantonese
Written Cantonese contains many characters not used in standard written Chinese in order to transcribe words not present in the standard lexicon, and for some words from Old Chinese when their original forms have been forgotten. Despite attempts by the government of Hong Kong in the 1990s to standardize this character set, culminating in the release of the
Hong Kong Supplementary Character Set The Hong Kong Supplementary Character Set (; commonly abbreviated to HKSCS) is a set of Chinese characters – 4,702 in total in the initial release—used in Cantonese, as well as when writing the names of some places in Hong Kong (whether in w ...
(HKSCS) for use in electronic communication, there is still significant disagreement about which characters are ''correct'' in written Cantonese, as many of the Cantonese words existed as descendants of Old Chinese words, but are being replaced by some new invented Cantonese words.


Vocabulary

General estimates of vocabulary differences between Cantonese and Mandarin range from 30 to 50 percent. Donald B. Snow, the author of ''Cantonese as Written Language: The Growth of a Written Chinese Vernacular'', wrote that "It is difficult to quantify precisely how different" the two vocabularies are. Snow wrote that the different vocabulary systems are the main difference between written Mandarin and written Cantonese. Ouyang Shan made a corpus-based estimate concluding that one third of the lexical items used in regular Cantonese speech do not exist in Mandarin, but that between the formal registers the differences were smaller. He analyzed a radio news broadcast and concluded that of its lexical items, 10.6% were distinctly Cantonese. Here are examples of differing lexical items in a sentence: The two Chinese sentences are grammatically identical, using an
A-not-A question In linguistics, an A-not-A question, also known as an A-neg-A question, is a polar question that offers two opposite possibilities for the answer. Predominantly researched in Sinitic languages, the A-not-A question offers a choice between an aff ...
to ask "Is it theirs?" (referring to an aforementioned object). Though the characters correspond 1:1, the actual glyphs used are all different.


Cognates

There are certain words that share a common root with standard written Chinese words. However, because they have diverged in pronunciation, tone, and/or meaning, they are often written using a different character. One example is the doublet loi4 (standard) and lei4 (Cantonese), meaning "to come." Both share the same meaning and usage, but because the colloquial pronunciation differs from the literary pronunciation, they are represented using two different characters. Some people argue that representing the colloquial pronunciation with a different (and often extremely complex) character is superfluous, and would encourage using the same character for both forms since they are
cognate In historical linguistics, cognates or lexical cognates are sets of words in different languages that have been inherited in direct descent from an etymological ancestor in a common parent language. Because language change can have radical ef ...
s (see Derived characters below).


Native words

Some Cantonese words have no equivalents in Mandarin, though equivalents may exist in classical or other varieties of Chinese. Cantonese writers have from time to time reinvented or borrowed a new character if they are not aware of the original one. For example, some suggest that the common word leng3, meaning ''pretty'' in Cantonese but also ''looking into the mirror'' in Mandarin, is in fact the character ling3. Today those characters can mainly be found in ancient rime dictionaries such as ''
Guangyun The ''Guangyun'' (''Kuang-yun''; ) is a Chinese rime dictionary that was compiled from 1007 to 1008 under the patronage of Emperor Zhenzong of Song. Its full name was ''Dà Sòng chóngxiū guǎngyùn'' (, literally "Great Song revised and ex ...
''. Some scholars have made some "archaeological" efforts to find out what the "original characters" are. Often, however, these efforts are of little use to the modern Cantonese writer, since the characters so discovered are not available in the standard character sets provided to computer users, and many have fallen out of usage. In Southeast Asia, Cantonese people may adopt local Malay words into their daily speech, such as using the term 鐳 leoi1 to mean money rather than 錢 cin2, which would be used in Hong Kong.


Particles

Cantonese particles may be added to the end of a sentence or suffixed to verbs to indicate aspect. There are many such particles; here are a few. * – "me1" is placed at the end of a sentence to indicate disbelief, e.g. ? Is your nickname really Raymond Lam? * – "ne1" is placed at the end of a sentence to indicate a question, e.g. What is your name? * – "mei6" is placed at the end of a sentence to ask if an action is done yet, e.g. Are you done yet? * – "haa5" is placed after a verb to indicate a little bit, e.g. Eat a little bit; "haa2" is used singly to show uncertainty or unbelief, e.g. ? What? Is that so? * – "gan2" is placed after a verb to indicate a progressive action, e.g. I'm eating an apple. * – "zo2" placed after a verb to indicate a completed action, e.g. I ate an apple. * – "saai3" placed after a verb to indicate an action to all of the targets, e.g. I ate all the apples. * – "maai4" is placed after a verb to indicate an expansion of the target of action, or that the action is an addition to the one(s) previously mentioned, e.g. I'll go after I finish eating the rest. ("eating the rest" is an expansion of the target of action from the food eaten to the food not yet eaten); You can go first. I'll eat before going. (The action "eating" is an addition to the action "going" which is previously mentioned or mutually known.) * – "waa1 / waa3" interjection of amazement, e.g. Wow! That's amazing! * – "gaa3 laa1" is used when the context seems to be commonplace, e.g., Everyone is like that. * – "ze1 maa3" translates as "just", e.g. I just have two pages of homework left to do.


Loanwords

Some Cantonese loanwords are written in existing Chinese characters.


Cantonese character formation

Cantonese characters, as with regular Chinese characters, are formed in one of several ways:


Borrowings

Some characters already exist in standard Chinese, but are simply reborrowed into Cantonese with new meanings. Most of these tend to be archaic or rarely used characters. An example is the character 子, which means "child". The Cantonese word for child is represented by 仔(jai), which has the original meaning of "young animal".


Compound formation

The majority of characters used in Standard Chinese are phono-semantic compounds - characters formed by placing two radicals, one hinting as its meaning and one hinting its pronunciation. Written Cantonese continues this practice via putting the 'mouth' radical () next to a character pronounced similarly that indicates its pronunciation. As an example, the character uses the mouth radical with a , which means 'down', but the meaning has no relation to the meaning of . (An exception is ''mē'', which is not pronounced like (''yèuhng'', sheep) but was chosen to represent the sound sheep make.) The characters which are commonly used in Cantonese writing include: There is evidence that the mouth radical in such characters can, over time, be replaced by a different one. For instance, (lām, "bud"), written with the determinative ("cover"), is instead written in older dictionaries as , with the mouth radical.


Derived characters

Other common characters are unique to Cantonese or are different from their Mandarin usage, including: etc. The characters which are commonly used in Cantonese writing include: * mou5 ('' v.'' not have). Originally . Standard written Mandarin: * hai6 ('' v.'' be). Standard written Mandarin: * keoi5 ('' pron.'' he/she/it). Originally . Standard written Mandarin: , 她, 它, 牠, 祂 * mat1 ('' pron.'' what) often followed by 嘢 to form 乜嘢. Originally . Standard written Mandarin: * zai2 ('' n.'' son, child, small thing). Originally . * lou2 ('' n.'' guy, dude). Originally . * bei2 ('' v.'' give). Standard written Mandarin: * leng3 (''
adj. In linguistics, an adjective (abbreviated ) is a word that generally modifies a noun or noun phrase or describes its referent. Its semantic role is to change information given by the noun. Traditionally, adjectives were considered one of the mai ...
'' pretty, handsome). Standard written Mandarin: * saai3 ('' adv.'' completely; '' v.'' bask in sun) * fan3 ('' v.'' sleep). Originally . Standard written Mandarin: , * lo2 ('' v.'' take, get). Standard written Mandarin: * ling1 ('' v.'' take, get). Standard written Mandarin: * lei6 ('' n.'' tongue). Standard written Mandarin: * gui6 (''
adj. In linguistics, an adjective (abbreviated ) is a word that generally modifies a noun or noun phrase or describes its referent. Its semantic role is to change information given by the noun. Traditionally, adjectives were considered one of the mai ...
'' tired). Standard written Mandarin: * deng6 ('' n.'' place) often followed by 方 to form 埞方. Standard written Mandarin: The words represented by these characters are sometimes
cognate In historical linguistics, cognates or lexical cognates are sets of words in different languages that have been inherited in direct descent from an etymological ancestor in a common parent language. Because language change can have radical ef ...
s with pre-existing Chinese words. However, their colloquial Cantonese pronunciations have diverged from formal Cantonese pronunciations. For example, ("without") is normally pronounced mou4 in literature. In spoken Cantonese, mou5 has the same usage, meaning, and pronunciation as , except for tone. represents the spoken Cantonese form of the word "without", while represents the word used in
Classical Chinese Classical Chinese, also known as Literary Chinese (古文 ''gǔwén'' "ancient text", or 文言 ''wényán'' "text speak", meaning "literary language/speech"; modern vernacular: 文言文 ''wényánwén'' "text speak text", meaning "literar ...
and Mandarin. However, is still used in some instances in spoken Cantonese, such as ("no matter what happens"). Another example is the doublet , which means "come". loi4 is used in literature; lei4 is the spoken Cantonese form.


Workarounds

Though most Cantonese words can be found in the current encoding system, input workarounds are commonly used both by those unfamiliar with them, and by those whose input methods do not allow for easy input (similar to how some Russian speakers might write in the Latin script if their computing device lacks the ability to input
Cyrillic The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking co ...
). Some Cantonese writers use simple romanization (e.g., use D as 啲), symbols (add a Latin letter "o" in front of another Chinese character; e.g., 㗎 is defined in Unicode but will not display if not installed on the device in use, hence the proxy o架 is often used), homophones (e.g., use 果 as 嗰), and Chinese characters which have different meanings in Mandarin (e.g., 乜, 係, 俾; etc.) For example,


See also

*
Cantonese braille Cantonese Braille (Chinese: 粵語點字) is a braille script used to write Cantonese in Hong Kong and Macau. It is locally referred to as ''tim chi'' (點字, ''dim2zi6'') 'dot characters' or more commonly but ambiguously ''tuk chi'' (凸字, ...
* Hong Kong Sign Language * Written Hokkien * Saam kap dai *
Chữ Nôm Chữ Nôm (, ; ) is a logographic writing system formerly used to write the Vietnamese language. It uses Chinese characters ('' Chữ Hán'') to represent Sino-Vietnamese vocabulary and some native Vietnamese words, with other words represent ...


References

* Snow, Donald B. ''Cantonese as Written Language: The Growth of a Written Chinese Vernacular''.
Hong Kong University Press Hong Kong University Press is the university press of the University of Hong Kong The University of Hong Kong (HKU) (Chinese: 香港大學) is a public research university in Hong Kong Hong Kong ( (US) or (UK); , ), officially t ...
, 2004. , 9789622097094.


Notes


Further reading

* * * * *


External links


Chinese and English phrase book : with the Chinese pronunciation indicated in English
by Benoni Lanctot(published in 1867)
Vocabulary of the Canton Dialect: Chinese words and phrases
by
Robert Morrison (missionary) Robert Morrison, FRS (5 January 1782 – 1 August 1834), was an Anglo-Scottish Protestant missionary to Portuguese Macao, Qing-era Guangdong, and Dutch Malacca, who was also a pioneering sinologist, lexicographer, and translator consider ...
, published in 1828
S. L. Wong's A Chinese Syllabary Pronounced according to the Dialect of Canton, by the CUHKA Chinese Talking Syllabary of the Cantonese Dialect: An Electronic Repository, by the CUHKModern Standard Mandarin and Cantonese conversion, by the CUHKYueyu.netLearn Cantonese!
Cantonese learning, including Cantonese characters {{Authority control Cantonese language Chinese language Chinese scripts