Modern Chinese characters are the Chinese characters used in modern languages, including Chinese, Japanese, Korean and Vietnamese. Chinese characters are composed of components, which are in turn composed of strokes.
The 100 most frequently used characters cover (i.e., have an accumulated frequency of) over 40% of modern Chinese texts. The 1,000 most frequently used characters cover approximately 90% of the texts.
Modern Chinese characters can be studied from several angles, including orthography, phonology, and semantics, as well as collation and organization, statistical analysis, computer processing, and pedagogy.
Background
Historical development
Since maturing as a complete writing system, Chinese characters have had an uninterrupted history of development over more than 3,000 years, with stages including
* oracle bone script,
* bronze script,
* seal script,
* clerical script, and
* regular script,
leading to the modern written forms, as illustrated by the development of character :
In 1980, Zhou Youguang, often considered to be the "father of pinyin", published a paper entitled "Introduction to the Studies of Modern Chinese Characters", in which he detailed aspects of the numbers, orders, forms, sounds, meanings, and pedagogy of modern characters. His paper was followed by Gao Jiaying's "A Brief Discussion on the Establishment of Modern Chinese Character Studies", and other related writings on the subject. At least five textbooks have been published in this area.
Regional varieties
Chinese characters were originally invented for writing the Chinese language, and were later employed for other East Asian languages, developing as part of a shared orthographic tradition. Among other places, for ordinary and historical purposes, simplified characters are primarily used in mainland China, Singapore, and Malaysia, traditional characters are used in Taiwan, Hong Kong, and Macau, along with kanji in Japan, hanja in Korea, and chữ Hán in Vietnam. For example, the traditional character has the simplified form and the kanji form .
Characteristics
In contrast with the Latin alphabet used to write many languages, including English, Chinese characters have a number of distinctive properties, including:
* being arranged in a two-dimensional block structure
* potentially having dozens of strokes
* denoting a morpheme in most cases
* being monosyllabic, meaning one character is read as one syllable
* yielding texts that remain intelligible to readers of different dialects and different dynasties
Sources
Modern Chinese characters include:
* Received characters as standardized for both simplified and traditional Chinese, accounting for about 75% of modern characters, e.g. , , , , , , and ;
* Newly coined characters, about 2.7% of the total number, e.g. , , , , and ;
* Repurposed ancient characters with pronunciations and meanings differing from the ancient ones, e.g. , and (used in );
* Simplified forms, often derived from variants already in common use, about 20%, e.g. , );
* Modern dialect characters, such as the Cantonese characters included in the Hong Kong Supplementary Character Set.
Number and sets
Because languages develop dynamically, there is no definite number of modern Chinese characters. However, a reasonable estimate can be made by surveying the character sets of the relevant standard lists and influential dictionaries in the countries and regions where Chinese characters are used.
Mainland China
The standards in the People's Republic of China include the ''List of Frequently Used Characters in Modern Chinese'', totalling 3,500 characters, and the ''List of Commonly Used Characters in Modern Chinese'' (with 7,000 characters, including the 3,500 characters in the previous list).
The current standard is the ''List of Commonly Used Standard Chinese Characters'', which was released by the State Council in June 2013 to replace the previous two lists and other standards. It includes 8,105 characters of the Simplified Chinese writing system: 3,500 as primary, 3,000 as secondary, and 1,605 as tertiary. In addition, there are 2,574 traditional characters and 1,023 variants.
The character sets of ''Xinhua Zidian'' and ''Xiandai Hanyu Cidian'', respectively the most popular modern Chinese character dictionary and word dictionary, each include over 13,000 characters, counting simplified characters, traditional characters and variants.
Taiwan
In Taiwan, the standards are the ''Chart of Standard Forms of Common National Characters'', with 4,808 characters, and the ''Chart of Standard Forms of Less-Than-Common National Characters'', with 6,341 characters. Both lists were released by the Ministry of Education, with a total of 11,149 characters of the Traditional Chinese writing system.
Hong Kong
In Hong Kong, the standard is the ''List of Graphemes of Commonly-Used Chinese Characters'' for elementary and junior secondary education, totalling 4,762 characters. The list was released by the Education Bureau, and is very influential in educational circles.
Japan
In Japan, the standard is a list of 2,136 frequently used characters designated by the Japanese Ministry of Education, as well as 983 characters for use in personal names.
Korea
In Korea, the standards are the ''Basic Hanja for educational use'' (a subset of 1,800 hanja defined in 1972 by a South Korean educational standard), and the ''Table of Hanja for Personal Name Use'', published by the Supreme Court of Korea in March 1991. The latter list expanded gradually, and by 2015 there were 8,142 ''hanja'' permitted for use in Korean names.
Overall estimates
Taking all the character sets mentioned above into consideration, the total number of modern Chinese characters in the world is over 10,000, probably around 15,000. This estimate is not unreasonably rough, considering that Unicode contains over 90,000 Chinese characters in total (CJK Unified Ideographs), and more would be needed if every Chinese character that has ever appeared were to be included.
A college graduate who is literate in written Chinese knows between three and four thousand characters. Specialists in classical literature or history, who would often encounter characters no longer in use, are estimated to have a working vocabulary of between 5,000 and 6,000 characters.
Frequency
Chinese character frequencies are calculated from corpus data. A corpus is a collection of texts representative of one or more languages. The frequency of a character is the ratio of the number of its occurrences in the corpus to the total number of characters in the corpus. The formula for calculating frequency is
$$F_i = \frac{n_i}{N},$$
where $n_i$ is the number of times a certain ($i$-th) Chinese character appears in the corpus, and $N$ is the total number of (occurrences of) characters in the corpus.
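The ratio described above (occurrences of one character divided by all character occurrences in the corpus) can be sketched in a few lines of Python. This is only an illustration: the helper names and the toy corpus are invented here, and restricting the count to the basic CJK Unified Ideographs block (U+4E00 to U+9FFF) is a simplifying assumption, not part of any survey's method.

```python
from collections import Counter


def is_cjk_ideograph(ch):
    """Rough filter (an assumption): keep only the basic CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def character_frequencies(corpus):
    """Map each character to (its occurrences) / (all character occurrences)."""
    counts = Counter(ch for ch in corpus if is_cjk_ideograph(ch))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {ch: n / total for ch, n in counts.items()}


# Toy corpus invented for this sketch; the punctuation is skipped by the filter.
toy_corpus = "我的书是我的,不是他的。"
freqs = character_frequencies(toy_corpus)
```

In the toy corpus the character 的 occurs 3 times among 10 counted characters, so its frequency is 0.3.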
Origins
The first person to make a statistical study of the frequency of Chinese characters was Chen Heqin. In the 1920s, he and his assistants spent two years manually counting the characters in a corpus of 554,478 characters, and obtained 4,261 different characters with frequency information. They then compiled a book, ''Applied Lexis of Vernacular Chinese''.
The 10 most frequently used characters in their corpus are, by descending frequency, ('of'), ('no', 'not'), ('one', 'an'), (), (the copula), ('I', 'me'), ('on', 'up'), , ('to have'), ('person').
CUHK survey
In 2001, the Chinese University of Hong Kong (CUHK) published a number of frequency lists on its website, entitled "Hong Kong, Mainland China and Taiwan Chinese Frequency: a Trans-regional Diachronic Survey". The frequency data came from a large corpus with a number of sub-corpora representing the Chinese languages in the three regions of Hong Kong, mainland China and Taiwan, in two time periods: the 1960s and the 1980s–90s. Each sub-corpus includes about 5,000 different characters, as shown by the frequency lists.
These frequency lists reveal several notable features of Chinese:
# , and are the three most frequently used characters across the regions and time periods of the corpora. is number one in all the frequency lists.
# The 10 most frequently used characters across the three regions and two time periods are very consistent. That means a frequently used character in one region or period is very likely to be frequently used in another region or period.
# The 100 most frequently used characters in the 80s and 90s cover (i.e., have an accumulated frequency of) 41.00% of the Hong Kong texts of that period, 41.34% of the mainland texts, and 41.88% of the Taiwan texts. That is more than 4 out of every 10 characters for the three regions.
# The 1000 most frequently used characters in the 80s and 90s cover 89.25% of the Hong Kong texts of that period, 90.26% of the mainland texts, and 88.74% of the Taiwan texts.
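Coverage figures such as those in the list above are accumulated frequencies: the share of all character occurrences accounted for by the k most frequent characters. A minimal sketch follows, using an invented toy count table (the function name and the numbers are assumptions for illustration, not survey data).

```python
from collections import Counter


def coverage_of_top(counts, k):
    """Share of all occurrences covered by the k most frequent characters."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum(n for _, n in counts.most_common(k)) / total


# Hypothetical counts, chosen so the arithmetic is easy to follow.
toy_counts = Counter({"的": 40, "一": 25, "是": 20, "不": 10, "鑫": 5})
top2 = coverage_of_top(toy_counts, 2)  # (40 + 25) / 100
```

Here the two most frequent characters cover 65 of the 100 occurrences in the toy table.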
Chinese government survey
Large-scale surveys by the Ministry of Education and the State Language Commission of the PRC over the years have shown that the use of Chinese characters and words follows a strong distribution pattern, with a small number of characters accounting for most usage. The number of characters used in modern Chinese has stayed stable at about 10,000 for a few years. The number of most frequently used characters needed to reach coverage rates of 80%, 90%, and 99% is about 590, 960, and 2,400 respectively.
Chinese character frequency is essential to quantitative research on Chinese characters and has been applied to language teaching, dictionary composition, character list compilation, Chinese character information processing, etc.
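The coverage thresholds quoted in this section answer the inverse question: how many of the most frequent characters are needed to reach a given coverage rate? The sketch below shows one way to compute that from a count table; the function name and the toy counts are assumptions made for illustration.

```python
def characters_needed(counts, target):
    """Smallest number of top-ranked characters whose cumulative share reaches target."""
    total = sum(counts.values())
    running = 0
    for rank, n in enumerate(sorted(counts.values(), reverse=True), start=1):
        running += n
        if running / total >= target:
            return rank
    return len(counts)


# Hypothetical counts; cumulative shares are 0.40, 0.65, 0.85, 0.95, 1.00.
toy_counts = {"的": 40, "一": 25, "是": 20, "不": 10, "鑫": 5}
needed_for_80 = characters_needed(toy_counts, 0.80)
```

With the toy table, three characters are needed to reach 80% coverage, since the first two only reach 65%.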
Orders
The orders or sorting methods of Chinese dictionaries are traditionally divided into three categories: form-based orders, sound-based orders and meaning-based orders. In modern Chinese, people also use frequency orders.
Form-based
In form-based ordering, characters and words are sorted according to various features of the forms or shapes of Chinese characters. Compared to sound-based orders, form-based orders have the advantages of allowing lookup of characters and words without knowing their pronunciations, as well as effective collation of large character sets without support from other sorting methods. There are two subcategories of form-based orders: stroke-based orders and component-based orders, the latter including radical-based orders.
Sound-based
There are two major sound representation systems for Standard Chinese: pinyin and bopomofo. Accordingly, there is a pinyin alphabetical order and a bopomofo-based order.
Meaning-based
Meaning-based orders, also called semantics-based orders, arrange characters and words in a hierarchical structure of semantic categories.
Frequency-based
Frequency-based ordering arranges Chinese characters by their frequency of use, normally in descending order, so that the most frequently used character is at the top of the list.
Orders of words
Chinese words consist of one or more characters. Single-character words can be sorted by a character order, and multi-character words can be sorted character by character in a similar way.
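Sorting multi-character words character by character can be sketched as comparing tuples of per-character ranks. In the example below the character order is a small invented list (it could stand for any of the orders discussed above, such as descending frequency), and `make_sort_key` is a hypothetical helper name.

```python
def make_sort_key(char_order):
    """Build a sort key that compares words character by character."""
    rank = {ch: i for i, ch in enumerate(char_order)}
    unknown = len(rank)  # characters missing from the order sort last

    def key(word):
        return tuple(rank.get(ch, unknown) for ch in word)

    return key


# Toy character order, assumed for illustration only.
char_order = ["的", "一", "人", "大", "中"]
words = ["大人", "人", "中人", "一人", "人大"]
sorted_words = sorted(words, key=make_sort_key(char_order))
```

Because Python compares tuples element by element, a word that is a prefix of another (here 人 and 人大) sorts first, which matches the usual dictionary convention.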
Forms
Modern Chinese characters appear in the form of square blocks. There are three layers or levels of structural units of Chinese characters: strokes, components, and whole characters. For example, (character) has two components, each of which is composed of three strokes:
Strokes
''Strokes'' are the smallest writing units of Chinese characters. When writing a Chinese character, the trace of a dot or a line left on the writing material (such as paper) from pen-down to pen-up is called a stroke. ''Stroke number'' is the number of strokes of a Chinese character. It varies: for example, characters and have only one stroke, while character has 36 strokes, and (composed of three ) consists of 48 strokes.
''Stroke forms'' refer to the shapes of strokes. The stroke forms of a standard Chinese character set can be classified into a table; for instance, the Unicode CJK strokes list has 36 types of strokes.
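For readers who want to inspect such a stroke list, the sketch below enumerates stroke characters from the Unicode CJK Strokes block with Python's standard `unicodedata` module. It assumes that the 36 stroke types correspond to the code points U+31C0 through U+31E3; later Unicode versions may assign further code points in the block, which this range deliberately ignores.

```python
import unicodedata

# Assumed range for the 36 stroke characters (U+31C0..U+31E3 inclusive).
FIRST_STROKE = 0x31C0
LAST_STROKE = 0x31E3


def cjk_stroke_table():
    """Return (character, Unicode name) pairs for the assumed stroke range."""
    return [
        (chr(cp), unicodedata.name(chr(cp), "<unnamed>"))
        for cp in range(FIRST_STROKE, LAST_STROKE + 1)
    ]


stroke_table = cjk_stroke_table()
```

Each entry pairs a stroke glyph with its Unicode name, whose abbreviation letters stand for stroke shapes.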
''Stroke order'' is the order in which strokes are written to form a Chinese character. For example, the stroke order of is .
Components
Chinese characters are composed of components (), which are in turn composed of strokes. In most cases, a component is larger than a stroke (i.e., consists of more than one stroke) and smaller than the whole character (combines with some other components to form a character). For example, in character , there are two components, and , each with more than one stroke (:, : ). In the special cases of one-stroke characters, such as and , a stroke is a component and is a character.
Chinese character component analysis divides a character into its components.
There are two ways of dividing a Chinese character: ''hierarchical dividing'' and ''plane dividing''. Hierarchical dividing separates the character layer by layer, from larger to smaller components, until the primitive components are reached. Plane dividing separates out the primitive components all at once.
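The difference between the two ways of dividing can be sketched with a small tree structure. The `Component` class and the decomposition of 想 into 相 and 心, and of 相 into 木 and 目, are illustrative assumptions made for this example rather than a transcription of any official component standard.

```python
from dataclasses import dataclass, field


@dataclass
class Component:
    glyph: str
    parts: list = field(default_factory=list)  # empty list means primitive

    @property
    def is_primitive(self):
        return not self.parts


def hierarchical_dividing(component):
    """Split layer by layer; return the glyphs present at each successive level."""
    levels = []
    current = [component]
    while any(not c.is_primitive for c in current):
        next_level = []
        for c in current:
            # Primitive components are carried down unchanged.
            next_level.extend(c.parts if c.parts else [c])
        levels.append([c.glyph for c in next_level])
        current = next_level
    return levels


def plane_dividing(component):
    """Return all primitive components at once, in left-to-right tree order."""
    if component.is_primitive:
        return [component.glyph]
    result = []
    for part in component.parts:
        result.extend(plane_dividing(part))
    return result


xiang = Component("想", [
    Component("相", [Component("木"), Component("目")]),
    Component("心"),
])
levels = hierarchical_dividing(xiang)
primitives = plane_dividing(xiang)
```

The last level produced by hierarchical dividing contains the same primitive components that plane dividing yields directly.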
A component that can independently form a character is a ''character component'', or a ''component of independent character formation''. For example, component forms character independently, and is a component in characters , and . A component that cannot independently form a character is a ''non-character component'', or a ''component of dependent character formation''. For example, component in character , and .
A component that cannot be (further) divided into smaller components by the rules is called a ''primitive component'' or ''basic component'' (). Primitive components are the final-level components of hierarchical dividing. For example, components and in character . A component composed of two or more primitive components is a ''compound component'' (). For example, component in character , and .
Whole characters
''Whole characters'' lie at the final level of the stroke–component–character composition.
A ''non-decomposable character'' consists of one primitive component, which is directly formed by strokes and cannot be decomposed into smaller components. A ''decomposable character'' can be broken down into multiple components.
The structure of a Chinese character is the pattern or rule in which the character is formed by its (first level) components. Chinese character structures include:
* Single-component structure (i.e., a non-decomposable character): The character is formed by a single primitive component, such as , and .
* Left–right structure: The character is formed by a component on the left and another one on the right, such as , and .
* Up–down structure: The character is formed by a component above another component, such as , and .
* Surrounding structure: One component is completely or partially surrounded by another component, such as , , , , , and .
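One way to record these structures in software is with Unicode's Ideographic Description Characters, layout operators such as U+2FF0 (left to right), U+2FF1 (above to below) and U+2FF4 to U+2FFA (the surround variants). The sketch below maps the leading operator of a description string to the four structure types listed above. The description strings for 明, 字 and 国 are written by hand for illustration, and operators outside this small mapping are rejected rather than guessed at.

```python
# Map layout operators to the structure names used in the list above.
STRUCTURE_BY_OPERATOR = {
    "\u2ff0": "left-right",  # left to right
    "\u2ff1": "up-down",     # above to below
}
# U+2FF4 (full surround) through U+2FFA (surround from lower left).
for code_point in range(0x2FF4, 0x2FFB):
    STRUCTURE_BY_OPERATOR[chr(code_point)] = "surrounding"


def structure_of(description):
    """Classify a character from its description string (operator first)."""
    if not description:
        raise ValueError("empty description")
    if len(description) == 1:
        return "single-component"
    operator = description[0]
    if operator not in STRUCTURE_BY_OPERATOR:
        raise ValueError("unsupported layout operator: %r" % operator)
    return STRUCTURE_BY_OPERATOR[operator]


# Hand-written descriptions, assumed for this example.
examples = {
    "人": "人",
    "明": "\u2ff0日月",
    "字": "\u2ff1宀子",
    "国": "\u2ff4囗玉",
}
structures = {ch: structure_of(ids) for ch, ids in examples.items()}
```

Only the first-level operator is inspected, which matches the definition of structure as the pattern formed by a character's first-level components.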
Popular typefaces of modern Chinese characters include
* Song () or Ming (),
* Fangsong (),
* Regular script,
* Clerical script (),
* Hei or "black" () and
* Wei ().
In Chinese, in addition to the international point system, a unique 'number' () system is used for character sizes. For example, the Simplified Chinese version of Microsoft Word allows setting font sizes by either points or numbers.
Phonology
The standard pronunciation of Chinese characters is based on the Beijing dialect of Mandarin. Normally, a character is read with one syllable. Some Chinese characters have more than one pronunciation (polyphonic characters). Some syllables correspond to more than one character (homophonic characters).
Polyphonic characters
Polyphonic characters () are characters with two or more pronunciations, as opposed to monophonic characters with only one. A polyphonic monosemous character () has two or more pronunciations of the same meaning. For example, the English word 'ton' is transliterated as , with two pronunciations of and coexisting in some old dictionaries, both sharing the meaning of 'ton'. Since is both a character and a word, it is a polyphonic monosemous character, as well as a polyphonic monosemous word.
In December 1985, the Chinese government published the ''Table of Mandarin Words with Variant Pronunciation'' () to define the standard pronunciations for polyphonic monosemous characters. In Taiwan, there is a similar official standard for Mandarin words with variant sounds, where pronunciations are expressed in bopomofo instead of pinyin.
A polyphonic polysemous character () has two or more pronunciations, and different pronunciations represent different meanings. For example, character is pronounced with the meaning of 'long', or with the meaning of 'grow'. The simplified character is pronounced as from traditional character or as from traditional character . The pronunciation of such characters is determined by the meaning intended.
Because polyphonic polysemous characters can hinder the learning and use of Chinese characters, reducing their number has been proposed. There are two main methods:
* Change pronunciation. A common approach is to change rare and less frequent readings to more frequent ones, and to change ancient pronunciations to modern pronunciations.
* Change form. This means assigning some of the sounds and meanings to other characters.
Homophones
Homophonic characters () are those sharing the same pronunciation, as opposed to heterophonic characters (). Homophonic characters are either narrowly understood as having identical initials, finals, and tones, or more broadly as merely having identical initials and finals, with tones possibly differing. For example, are all pronounced , while , , , , are homophones only in the broader sense. Usually, people understand homophony in characters as referring to the narrow sense.
Homophonic characters are widespread in Mandarin: there are around 1,300 possible syllables when tonal distinctions are included, and only about 400 when tones are ignored. Meanwhile, the written language has more than 10,000 characters, for an average of about 7.5 characters mapped to each tonal syllable.
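The narrow and broad senses of homophony can be illustrated by grouping characters under two different keys: the full syllable including tone, or the syllable with the tone removed. The reading table below is a tiny hand-picked sample in numbered pinyin, assumed for this sketch, and `homophone_groups` is a hypothetical helper.

```python
from collections import defaultdict

# Small sample of single readings in numbered pinyin (illustrative only).
READINGS = {
    "妈": "ma1",
    "麻": "ma2",
    "马": "ma3",
    "码": "ma3",
    "玛": "ma3",
    "骂": "ma4",
}


def homophone_groups(readings, ignore_tone=False):
    """Group characters that share a reading; drop groups with a single member."""
    groups = defaultdict(list)
    for ch, syllable in readings.items():
        key = syllable.rstrip("012345") if ignore_tone else syllable
        groups[key].append(ch)
    return {key: chars for key, chars in groups.items() if len(chars) > 1}


narrow = homophone_groups(READINGS)                   # same initial, final and tone
broad = homophone_groups(READINGS, ignore_tone=True)  # tone may differ
```

In the sample, only the three characters read `ma3` are homophones in the narrow sense, while all six fall into one group once tone is ignored.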
Zhou Youguang described two ways in which homophones have historically been reduced:
* Differentiate character pronunciations without changing the word. For example: was originally pronounced , later changed to due to confusion with );
* Differentiate words and pronunciation. For example: ) was confused with ), later the synonym ) began to be used instead.
Others
There are two systems for phonetic notation of Chinese characters.
* Bopomofo: for example, ,
* Pinyin: for example,
In pinyin, either diacritics (e.g., ) or numbers () may be used to mark tones. The Jyutping system for Cantonese uses numbers, e.g.
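The two ways of marking tones in pinyin carry the same information, so one can be converted into the other. The sketch below turns a numbered syllable into its diacritic form using the commonly taught placement convention: the mark goes on a or e if present, on the o of ou, and otherwise on the last vowel. The function name is invented here, and the code handles single lowercase syllables only.

```python
import unicodedata

# Tone-marked forms for tones 1-4 of each vowel, written as escapes.
TONE_MARKS = {
    "a": "\u0101\u00e1\u01ce\u00e0",
    "e": "\u0113\u00e9\u011b\u00e8",
    "i": "\u012b\u00ed\u01d0\u00ec",
    "o": "\u014d\u00f3\u01d2\u00f2",
    "u": "\u016b\u00fa\u01d4\u00f9",
    "\u00fc": "\u01d6\u01d8\u01da\u01dc",  # u with diaeresis
}


def numbered_to_diacritic(syllable):
    """Convert e.g. 'zhong1' to its diacritic form; tone 5 or 0 means no mark."""
    syllable = unicodedata.normalize("NFC", syllable)
    if not syllable or not syllable[-1].isdigit():
        return syllable
    base, tone = syllable[:-1], int(syllable[-1])
    if tone not in (1, 2, 3, 4):
        return base
    if "a" in base:
        index = base.index("a")
    elif "e" in base:
        index = base.index("e")
    elif "ou" in base:
        index = base.index("o")
    else:
        vowels = [i for i, ch in enumerate(base) if ch in TONE_MARKS]
        if not vowels:
            return base
        index = vowels[-1]
    marked = TONE_MARKS[base[index]][tone - 1]
    return base[:index] + marked + base[index + 1:]


converted = [numbered_to_diacritic(s) for s in ["zhong1", "guo2", "hao3", "gui4"]]
```

Going the other way (diacritics to numbers) would need the reverse lookup table, which is omitted to keep the sketch short.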
''Kun'yomi'' are readings of kanji using native Japanese words mapped to the meanings of borrowed Chinese characters. Characters have also been borrowed together with Sino-Japanese pronunciations. For example, when Chinese character was borrowed into Japanese, people read it with either a native pronunciation of , or with a Sino-Japanese ''on'yomi'' pronunciation of . Comparable phenomena appear in Mandarin and English, such as ''i.e.'' being read aloud as 'that is'. Qiu Xigui called it .
Semantics
In modern Chinese, a character may represent a word, a morpheme in a compound word, or alternatively a meaningless syllable combined with some other syllables or characters to form a morpheme. In a language, ''morphemes'' are the minimal units of meaning. Some characters have only one meaning, some have multiple meanings, and some characters largely share the same meaning.
Monosemous and polysemous characters
A character with only one meaning is a monosemous character, and a character with two or more meanings is a polysemous character. According to statistics from the ''Chinese Character Information Dictionary'', among the 7,785 mainland standard Chinese characters in the dictionary, there are 4,139 monosemous characters, 3,053 polysemous characters and 593 meaningless characters.
The meaning people assigned to a character when it was created is the original meaning () of the character. For example, the original meaning of is 'weapon' () being held with both hands ).
The meaning developed from the original meaning of a character through association is the extended meaning (). For example, is an extended meaning of .
The meaning added through the loan of homonymous sounds is the phonetic-loan meaning (). For example, the original meaning of is 'dustpan': its use as a third-person pronoun is due to a phonetic loan.
Synonyms
Synonym characters are a group of characters that have the same or similar meaning. The characters in a synonym group often differ in frequency of use and word-formation ability, and there are some (subtle) differences in meaning and connotation. Knowledge of synonym characters helps students write Chinese more correctly and express meanings more accurately.
Both and have the meaning of 'face', but there are some differences. Generally, is not used as an independent word in Mandarin, but only in multi-character compounds. For example, , , , . The in these words cannot be replaced by . In contrast, can usually be used alone in Mandarin as its own word, as well as in compounds such as , , , and , . The in these words cannot be replaced by .
Meanings of characters and words
The meaning of a single-character word is its character meaning. The meaning of a multi-character word is generally derived from the meanings of the characters. The relationships between the meaning of a compound word and of its characters are categorized as follows:
# Synonyms (A + B = A = B), such as = = .
# Synthetic meaning (A + B = AB), such as = and
# Expanded meaning (A + B = AB + ε), such as = + + ('for sightseeing')
# Partial meaning (A + B = A or B, but not the other), for example ) = but not , = but not .
# Complementary meaning (A + B = ε), for example ) is not + .
According to sampling statistics, categories 2 and 3 account for 89.7% of the compound words.
Internal structures
In the analysis of internal structures, Chinese characters are decomposed into internal structural components, or ''pianpangs'', according to their relation to the sound and meaning of the character.
Traditional classification
In the ''Shuowen Jiezi'', Xu Shen proposed six categories (''liushu'') of Chinese characters, including
#Pictograms: single-semantic-component characters that are drawings of the objects they represent.
#Simple ideograms: characters that express an abstract idea with an iconic form.
#Compound ideographs: characters that combine two or more semantic components to indicate the meaning of the character.
#Phono-semantic characters: characters that consist of phonetic components and semantic components.
#Derivative cognates: pairs of characters that had similar Old Chinese pronunciations and may have had the same etymological root.
#Rebus (phonetic loan) characters: characters borrowed to write other morphemes with similar pronunciations.
Modern classification
The traditional ''liushu'' presupposed that every internal component represents either the sound or the meaning of the character. After the long evolution of the Chinese writing system, however, quite a few components can no longer play either role effectively and have become pure form components. From the point of view of internal structure, modern Chinese characters are composed of semantic components, phonetic components and pure form components, which combine into seven categories of modern Chinese characters:
''Semantic component characters'' are composed of semantic components and include:
*Pictograms, such as , , .
*Simple ideograms, such as , , .
*Compound ideographs. For example, : together to take; : something with two ; : one follows another person; : from .
*Special methods, such as : turn to the opposite (right) side; : taken away .
''Phonetic component characters'' are composed of phonetic components. For example,
*Phonetic-loan, for example, character is borrowed to mean 'spending'.
*Used in a transliterated foreign word, e.g. the characters in words and .
*Multi-phonetic component characters, for example, was originally a semantic-phonetic character, but its modern meaning of "new" has nothing to do with the original semantic component of , but the sounds are similar. In this way, then has two phonetic components: and .
''Pure form characters'' are composed of form components, which neither represent the sound nor the meaning of the characters.
For example:
*: The character in modern regular script is no longer round like the Sun.
*: The traditional character is .
*: Oracle resembled a deer.
''Semantic-phonetic characters'', also called "phono-semantic characters", consist of semantic components and phonetic components. There are six combinations:
#Left meaning (semantic) and right sound (phonetic), such as , , ;
#Right meaning and left sound, such as , , ;
#Upper meaning and lower sound: , and ;
#Lower meaning and upper sound: , , ;
#Outer meaning and inner sounds: , , , , ;
#Inner meaning and outer sound: , , .
''Semantic-form characters'' are composed of semantic components and pure form components.
Many of these characters were originally semantic-phonetic characters. Due to subsequent changes in the pronunciation of the phonetic components or the characters, the phonetic components could not effectively represent the pronunciation of the character and became pure form. For example:
*: used to have signific and phonetic , the phonetic component is no longer .
*: used to have semantic and phonetic . Now the upper component no longer looks like .
*, is a , but not read as .
''Phonetic-form characters'' are composed of phonetic components and pure form components. They mostly came from ancient semantic-phonetic characters, where the semantic components lost their functions and became pure form. For example,
*: Originally refers to a kind of beautiful jade, with semantic component . Later, it was borrowed to represent a ball, and then extended to any spherical object, and became a pure form component, while remains a phonetic component.
*: Originally refers to the inner white layer of bamboo, with semantic component and phonetic . Later, the character was borrowed by sound to mean 'stupid'.
*: This is a simplified character with phonetic , and pure form component .
''Semantic-phonetic-form characters'' consist of the three kinds of components.
For example,
*: originally had the signific ⿱ and phonetic . In modern Chinese, ⿱ is not a character or radical with a sound or meaning, but can still express meaning, while remains a pure form component.
*: semantic and phonetic . In modern Chinese characters, the right part has become a pure form component.
Semantic–phonetic–form characters are very rare, and the examples above are not fully persuasive. Whether they can be justified as an internal structural category remains to be studied further. If they are not counted as a category, the classification above can be called the "New six writings".
According to Yang, among the 3,500 frequently used Chinese characters examined in the experiment, semantic component characters are the smallest group, accounting for about 5%; pure form characters account for about 18%; semantic–form and phonetic–form characters together account for about 19%. The largest group is semantic–phonetic characters, accounting for about 58%.
Simplification
Sources
There are four main sources of simplified characters:
#Ancient characters, such as: (), (), ()
#Simplified Chinese characters popular in the society, such as: (), (), ().
#Cursive regularized characters, for example: (), (), ().
#Newly coined characters, for example: (, (), ().
Methods
The methods to simplify Chinese characters include
Omitting
Omitting is to omit some components of the character, for example:
*Omit one side, such as → , → , → , → ;
*Omit both sides, such as → , → ;
*Omit a corner, such as → → ;
*Keep a corner, such as → , → ,
*Omit inside, such as → , → ;
*Omit outside, such as → ;
*Omit strokes, such as → , → , → ;
*Others, such as → , → , → , → .
Reshaping
Reshaping is to change forms based on the original characters. For example,
*Change one or both components of a semantic-phonetic character, such as → , → , → , ( → .
*Change to semantic-phonetic characters, such as → , → , → .
*Change components of multi-semantic characters, such as: → , → .
*Change to multi-semantic characters, such as: → , → , → .
*Keep outline (cursive script regularized), such as: → , → , → , → , → , → , → , → , → .
*Symbolize components, such as: → , → , → , → , → .
*Simplify radicals, such as: → (... → ...), → (... → ), → (... → ...).
*Others: → , → , → .
Replacing
Replacing the whole character with a character of similar sound. For example,
* → , → , → , → ;
* → ; → → .
Rationalization
The goal of ''Chinese character rationalization'' or ''Chinese character optimization'' () is, in addition to Chinese character simplification, to optimize the Chinese characters and set up one standard form for each of them.
Processing variant characters
''Variant Chinese characters'' are characters with the same pronunciation and meaning but different forms, such as and . The existence of variant characters results in multiple forms for one character, which increases the burden of learning and using the language. In applying Chinese characters, people need to process variant characters continually and eliminate inappropriate ones.
There are two different principles for processing variant characters. One is to conform to custom and simplicity. The other is to follow the original form and meaning, based on the character creation method and etymology, especially the ''Shuowen Jiezi''.
There are two methods for processing variant characters. The ''selecting'' method is to select one of the variant characters as the standard character and eliminate the rest. The ''splitting'' method is to differentiate a group of variant characters in terms of usage to eliminate the variant relationship.
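The ''selecting'' method can be sketched in Python as a lookup table from eliminated variants to the chosen standard form. This is only a minimal illustration: the table name `STANDARD_FORM`, the function `normalize_variants`, and the two sample pairs (峯 → 峰, 羣 → 群) are assumptions chosen for the example, not data quoted from the text above.

```python
# Hypothetical "selecting" table: each eliminated variant maps to the
# standard character chosen for its group (sample pairs are assumptions).
STANDARD_FORM = {
    "峯": "峰",
    "羣": "群",
}


def normalize_variants(text: str) -> str:
    """Replace every listed variant with its selected standard character."""
    return "".join(STANDARD_FORM.get(ch, ch) for ch in text)


print(normalize_variants("山峯"))  # 山峰
```

Under the ''splitting'' method, by contrast, a pair would simply be left out of such a table, because both characters stay in use with differentiated usages.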
In December 1955, the Ministry of Culture and the Chinese Character Reform Commission of the PRC jointly announced the "First List of Processed Variant Characters" (). After some later adjustments, the list now has 796 groups of variant characters, and 1,027 characters have been eliminated.
Processing printing fonts
In January 1965, the Ministry of Culture and the Chinese Character Reform Commission of the PRC jointly issued the ''General Chinese Character Forms for Printing'' (; ''Font Table'' for short). The ''Font Table'' contains 6,196 commonly used Song-style characters for printing. In accordance with the principles of simplicity and convenience for learning and use, a standard form was specified for each common character, including the number of strokes, structure and stroke order. After the Cultural Revolution, the ''Font Table'' was formally published. The character forms it specifies are now customarily called "new character forms", while the forms used before are called "old character forms". The "New and Old Character Form Comparison Table" () found in many language reference books, including the ''Xinhua Dictionary'' and the ''Xiandai Hanyu Dictionary'', is compiled and printed based on the ''Font Table''.
Current font standards include:
*In Mainland China, the standard is the ''List of Commonly Used Standard Chinese Characters'' (), issued by the State Council on June 5, 2013. The characters are in Song font.
*In Taiwan, the standard is the ''Standard Form of National Characters'' (). The characters are in regular script.
*The standard adopted by the Hong Kong education sector is the ''List of Graphemes of Commonly-Used Chinese Characters'' (). The characters were originally handwritten, then changed to Kai font.
*In Japan, the list of ''jōyō kanji''.
*In Korea, the ''Kangxi Dictionary'' (de facto).
Names of places
In order to make place names easier to use, the Chinese government started to process the uncommon characters used in place names in the 1950s.
The principles for choosing replacement characters are:
#Same pronunciation and clear,
#More commonly used,
#Simple and easy to write,
#A standard character that is popular in the local area,
#Not to be confused with other place names.
From March 1955 to August 1964, 35 place names of county level or above were changed with the approval of the State Council. For example:
*"" (Tieli County) was changed to "" (Tieli County),
*"" (Poyang County) was changed to "" (Boyang County),
*"" (Hedian Prefecture) was changed to "" (Hetian Prefecture).
Later, in order to maintain the stability of place names, this work was suspended.
Measurement words
When the English units of measurement were translated into Chinese, there were inconsistencies in the use of characters. For example:
mile: or .
foot: , .
kilowatt: , .
These inconsistencies increased the burden of language use. In addition, characters such as etc. were specially created and have polysyllabic readings, which does not follow the monosyllabic pattern of Chinese characters. To solve these problems, in July 1977 the Chinese Character Reform Commission and the National Bureau of Standards and Measures of the PRC jointly issued the "Notice on the Uniform Use of Characters in the Names of Some Measurement Units" (), establishing the metric system as the basic measurement system.
Education
''Chinese character education'' is the teaching and learning of Chinese characters. When written Chinese appeared in social communication, Chinese character teaching came into being. From ancient times to the present, the teaching of Chinese characters has always been the focus of Chinese language education.
Ancient education
In ancient times, research on Chinese character teaching focused on the preparation of various centralized literacy textbooks and dictionaries. Among them, the ones with greater impact include:
*Southern and Northern Dynasties: ''Thousand Character Classic'' (, written in regular script, 502–549 AD)
*Song Dynasty: ''Hundred Family Surnames'' (, regular script, 960–1279)
*Song Dynasty: ''Three Character Classic'' (, regular script, 13th century).
These three books later developed into a set of teaching materials, collectively called "Three Hundred Thousand" (, about 2,000 different characters), which was used for over 1,000 years until the end of the Qing Dynasty and still has a certain influence today. "Three Hundred Thousand" is arranged in rhymed form to make it catchy and easy to remember. Another influential literacy textbook is "Wenzi Mengqiu" (), compiled for children by the Qing Dynasty writer Wang Jun (1784–1854), which contains 2,049 characters.
Modern native language education
Modern Chinese character education is an important component of primary education in China, and an important part of literacy teaching and teaching Chinese as a foreign language.
The method is to use high-frequency characters according to frequency statistics. The important character lists include:
*"List of Frequently Used Characters in Modern Chinese" (, State Language Commission, Beijing, 1988), 3,500 characters.
*''List of Commonly Used Standard Chinese Characters'' (, State Council of the PRC, June 2013): the 3,500 primary characters in this list of 8,105 characters of the Simplified Chinese writing system.
*''Chart of Standard Forms of Common National Characters'' (, 1979), including 4,808 commonly used Chinese characters.
The Chinese character literacy movement began in the early 20th century, when the literacy level of ordinary Chinese people was quite low. Intellectuals who cared about the country and its people advocated education to save the country and started a Chinese character literacy campaign. In June 1952, the Ministry of Education of China published a list of commonly used literacy characters, including 2,000 characters for use in literacy textbooks. In 1993, the State Language Commission published the "Character List for Literacy", which includes Tables A and B. Table A contains 1,800 characters that are required for literacy in the country, and Table B contains 200 reference characters for literacy. According to UNESCO, China's illiteracy rate had dropped to only 3.6 percent by 2015.
Foreign language education
In the 3rd century AD, Chinese characters were introduced to Korea, thereafter to Japan, Vietnam and other countries.
By 1989, there were more than 100 colleges and universities teaching Chinese as a foreign language in China.
From 1990 to 1991, the National Leading Group for Teaching Chinese as a Foreign Language and the Chinese Proficiency Test Center of Beijing Language Institute jointly developed the "Outline of the Graded Vocabulary and Characters for HSK". The Chinese character outline contains 2,905 characters, divided into four grades: 800 Grade A characters, 804 Grade B characters, 601 Grade C characters, and 700 Grade D characters. Among them, 2,485 are first-level frequently used characters in the "List of Frequently Used Characters in Modern Chinese". Teaching Chinese characters as a foreign language has received increasing attention, and many textbooks and elective courses in this area have appeared. There are now more than 200 Confucius Institutes teaching Chinese as a foreign language around the world.
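The grade sizes quoted above can be checked with a short Python sketch that accumulates the running total per grade. The names `HSK_GRADES` and `cumulative_totals` are illustrative assumptions; only the four counts come from the text.

```python
# Character counts per grade, as quoted in the outline above.
HSK_GRADES = {"A": 800, "B": 804, "C": 601, "D": 700}


def cumulative_totals(grades: dict) -> dict:
    """Return the running total of characters covered after each grade."""
    totals = {}
    running = 0
    for grade, count in grades.items():
        running += count
        totals[grade] = running
    return totals


print(cumulative_totals(HSK_GRADES))
# {'A': 800, 'B': 1604, 'C': 2205, 'D': 2905}
```

The final running total matches the 2,905 characters stated for the whole outline.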
Information technology
''Chinese character information technology (IT)'' is the technology of computer processing of Chinese characters.
While the English writing system makes use of a few dozen different characters, the Chinese language needs a much larger character set. There are over ten thousand characters in the ''Xinhua Dictionary''. In the Unicode multilingual character set of 149,813 characters, 98,682 (about two-thirds) are Chinese.
Chinese character input
''Sound-based encoding'' is normally based on an existing Latin-letter scheme for Chinese phonetics, such as the pinyin scheme for Mandarin Chinese or Putonghua, and the Jyutping scheme for Cantonese. The input code of a Chinese character is its romanized letter string followed by an optional number representing the tone. For example, the Putonghua pinyin input code of (Hong Kong) is "xianggang" or "xiang1gang3", and the Cantonese Jyutping code is "hoenggong" or "hoeng1gong2", all of which can be easily input via an English keyboard.
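The relation between the toned and untoned input codes can be sketched in Python. This is a minimal illustration rather than the logic of any particular input method; the function names `strip_tones` and `split_toned` are assumptions, and the splitter expects every syllable to carry a tone digit.

```python
import re


def strip_tones(code: str) -> str:
    """Drop tone digits, so 'xiang1gang3' becomes 'xianggang'."""
    return re.sub(r"[0-9]", "", code)


def split_toned(code: str) -> list:
    """Split a fully toned code into (syllable, tone) pairs."""
    return [(syl, int(tone)) for syl, tone in re.findall(r"([a-z]+)([0-9])", code)]


print(strip_tones("xiang1gang3"))   # xianggang
print(split_toned("hoeng1gong2"))   # [('hoeng', 1), ('gong', 2)]
```

The tone digits act as syllable boundaries here; an untoned string such as "xianggang" would need a syllable table to be segmented.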
A Chinese character can alternatively be input by ''form-based encoding''. Most Chinese characters can be divided into a sequence of components in writing order.
There are a few hundred basic components, far fewer than the number of characters. By representing each component with an English letter and arranging the letters in the writing order of the character, the input method creator obtains a letter string that can serve as an input code on an English keyboard. The creator can also design a rule to select representative letters from the string if it is too long. For example, in the Cangjie input method, the character (border) is encoded as "NGMWM", corresponding to the components "弓土一田一", with some components omitted.
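The letter-to-component correspondence in the Cangjie example can be sketched in Python. The table below follows the standard Cangjie key layout for the letters A to Y (the special keys are left out), while the names `CANGJIE_KEYS` and `decode_cangjie` are assumptions made for this illustration.

```python
# Cangjie key layout for A..Y: each Latin letter stands for one basic component.
CANGJIE_KEYS = dict(zip(
    "ABCDEFGHIJKLMNOPQRSTUVWXY",
    "日月金木水火土竹戈十大中一弓人心手口尸廿山女田難卜",
))


def decode_cangjie(code: str) -> str:
    """Translate a Cangjie letter code into its sequence of component names."""
    return "".join(CANGJIE_KEYS[letter] for letter in code.upper())


print(decode_cangjie("NGMWM"))  # 弓土一田一
```

The sketch only maps letters to components; choosing which components of a character to keep, and in what order, is the rule-based part of the method that it does not attempt.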
Popular form-based encoding methods include Wubi (五笔) in the Mainland and Cangjie (倉頡) in Taiwan and Hong Kong.
The most important feature of ''intelligent input'' is the application of contextual constraints to candidate character selection. For example, in Microsoft Pinyin, when the user types the input code "daxuejiaoshou", the result is the phrase meaning "university professor", and when the user types "daxuepiaopiao", the computer suggests the phrase meaning "heavy snow flying". Though the non-toned pinyin letters of 大学 and 大雪 are both "daxue", the computer can make a reasonable selection based on the subsequent words.
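Context-based selection can be sketched in Python with a toy lexicon. Everything here is a hypothetical illustration: the tables `CANDIDATES` and `BIGRAM_SCORE`, the scores, the assumed second words 教授 and 飘飘, and the function `best_phrase` are not taken from Microsoft Pinyin, and the input is assumed to be already segmented into words.

```python
from itertools import product

# Hypothetical toy lexicon: each untoned pinyin string lists its candidates.
CANDIDATES = {
    "daxue": ["大学", "大雪"],
    "jiaoshou": ["教授"],
    "piaopiao": ["飘飘"],
}

# Hypothetical scores for how well one word follows another.
BIGRAM_SCORE = {
    ("大学", "教授"): 10,
    ("大雪", "飘飘"): 10,
}


def best_phrase(codes: list) -> str:
    """Pick the candidate combination whose adjacent word pairs score highest."""
    options = [CANDIDATES[code] for code in codes]

    def score(words):
        return sum(BIGRAM_SCORE.get(pair, 0) for pair in zip(words, words[1:]))

    return "".join(max(product(*options), key=score))


print(best_phrase(["daxue", "jiaoshou"]))  # 大学教授
print(best_phrase(["daxue", "piaopiao"]))  # 大雪飘飘
```

Trying every combination is only workable for short inputs; it is used here because it keeps the role of the following word easy to see.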
Chinese character encoding for information interchange
Inside the computer or mobile phone each character is represented by an internal code. When a character is sent between two computers or other digital devices, it is in information interchange code. Nowadays, information interchange codes, such as ASCII and Unicode, are often directly employed as internal codes.
The first ''GB Chinese character encoding standard'' is GB2312, which was released by the PRC in 1980. It includes 6,763 Chinese characters, with the 3,755 frequently used ones sorted by pinyin and the rest by radicals (indexing components). GB2312 was designed for simplified Chinese characters; traditional characters that have been simplified are not covered. The code of a character is represented by a two-byte hexadecimal number; for instance, the GB codes of the two characters of (Hong Kong) are CFE3 and B8DB respectively. GB2312 is still in use on some computers and on the Web, though newer versions with extended character sets, such as GB13000.1 and GB18030, have been released.
The latest version of GB encoding is GB18030, which supports both simplified and traditional Chinese characters, and is consistent with the Unicode character set.
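The GB codes quoted above can be reproduced with Python's built-in `gb2312` and `gb18030` codecs. This is a small sketch; the helper names `gb_hex` and `in_gb2312` are assumptions made for the illustration.

```python
def gb_hex(text: str, codec: str = "gb2312") -> list:
    """Return the encoded bytes of each character as uppercase hexadecimal."""
    return [ch.encode(codec).hex().upper() for ch in text]


def in_gb2312(ch: str) -> bool:
    """Report whether a character can be encoded in GB2312."""
    try:
        ch.encode("gb2312")
        return True
    except UnicodeEncodeError:
        return False


print(gb_hex("香港"))                           # ['CFE3', 'B8DB']
print(in_gb2312("龙"), in_gb2312("龍"))          # True False
print(len("龍".encode("gb18030")) > 0)          # True
```

The traditional form 龍 fails under GB2312 because only its simplified counterpart is covered, while GB18030 accepts both forms.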
The ''Big5 encoding'' standard was designed by five big IT companies in Taiwan in the early 1980s, and has been the de facto standard for representing traditional Chinese in computers ever since. Big5 is widely used in Taiwan, Hong Kong and Macau.
The original Big5 standard included 13,053 Chinese characters, with no simplified characters of the Mainland. Each character is encoded with a two-byte hexadecimal code, for example, 香 (ADBB) 港 (B4E4) 龍 (C073). Chinese characters in the Big5 character set are arranged in radical order.
Extended versions of Big5 include Big-5E and Big5-2003, which include some simplified characters and Hong Kong Cantonese characters.
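The two-byte Big5 codes given above can be reproduced with Python's built-in `big5` codec. The helper names `big5_hex` and `big5_text` are assumptions made for this sketch.

```python
def big5_hex(text: str) -> list:
    """Return the Big5 code of each character as uppercase hexadecimal."""
    return [ch.encode("big5").hex().upper() for ch in text]


def big5_text(hex_codes: list) -> str:
    """Decode a list of hexadecimal Big5 codes back into characters."""
    return bytes.fromhex("".join(hex_codes)).decode("big5")


print(big5_hex("香港龍"))             # ['ADBB', 'B4E4', 'C073']
print(big5_text(["ADBB", "B4E4"]))   # 香港
```

Round-tripping through both helpers is a quick way to confirm that a given code table entry and its character agree.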
The full version of the ''Unicode standard'' represents a character with a 4-byte digital code, providing a huge encoding space to cover all characters of all languages in the world. The Basic Multilingual Plane (BMP) is a 2-byte kernel version of Unicode with 65,536 code points for important characters of many languages. There are 27,522 characters in the CJKV (China, Japan, Korea and Vietnam) Ideographs Area, including all the simplified and traditional Chinese characters in GB2312 and Big5. In Unicode 15.1, there is a multilingual character set of 149,813 characters, of which 98,682 (about two-thirds) are Chinese characters sorted by Kangxi radicals.
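The difference between BMP and non-BMP characters can be illustrated in Python. This is a sketch under stated assumptions: the function name `describe` is hypothetical, and U+20000 is used as a sample ideograph outside the BMP; UTF-16 and UTF-32 are the encoding forms used to show the 2-byte and 4-byte sizes.

```python
def describe(ch: str) -> dict:
    """Report a character's code point, BMP membership and encoded sizes."""
    cp = ord(ch)
    return {
        "code_point": f"U+{cp:04X}",
        "in_bmp": cp <= 0xFFFF,          # the BMP holds code points 0000..FFFF
        "utf16_bytes": len(ch.encode("utf-16-be")),
        "utf32_bytes": len(ch.encode("utf-32-be")),
    }


print(describe("香"))
# {'code_point': 'U+9999', 'in_bmp': True, 'utf16_bytes': 2, 'utf32_bytes': 4}
print(describe("\U00020000"))
# {'code_point': 'U+20000', 'in_bmp': False, 'utf16_bytes': 4, 'utf32_bytes': 4}
```

A BMP character fits in one 2-byte UTF-16 unit, whereas a character beyond the BMP needs two units, which is why text-processing code that counts 2-byte units can miscount such ideographs.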
Chinese character output
Like English and other languages, Chinese characters are output on printers and screens in different fonts and styles. The most popular Chinese fonts are the Song (), Kai (), Hei () and Fangsong () families.
Fonts appear in different sizes. In addition to the international measurement system of points, Chinese characters are also measured by size numbers (), invented by an American for Chinese printing in 1859.
See also
* Chinese characters
* Chinese character strokes
* Chinese character components
* Chinese character structures
* Chinese character IT
* CJK characters
* Kanji
* List of Commonly Used Standard Chinese Characters
* Variant Chinese characters
* Written Chinese
Notes
References
Citations
Works cited
* (English translation of ''Wénzìxué Gàiyào'' , Shangwu, 1988.)
External links
Chinese Character Strokes
* https://qxk.bnu.edu.cn/#/
* https://www.chineseconverter.com/zh-cn/convert/zhuyin
* https://www.chineseconverter.com/zh-cn/convert/chinese-to-pinyin