Chinese characters
Chinese characters are logographs used Written Chinese, to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represe ...
may have several variant forms—visually distinct
glyphs
A glyph ( ) is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A ...
that represent the same underlying meaning and pronunciation. Variants of a given character are ''
allograph
In graphemics and typography, the term allograph is used of a glyph that is a design variant of a letter or other grapheme, such as a letter, a number, an ideograph, a punctuation mark or other typographic symbol. In graphemics, an obvious exa ...
s'' of one another, and many are directly analogous to allographs present in the
English alphabet
Modern English is written with a Latin-script alphabet consisting of 26 Letter (alphabet), letters, with each having both uppercase and lowercase forms. The word ''alphabet'' is a Compound (linguistics), compound of ''alpha'' and ''beta'', t ...
, such as the double-storey and single-storey variants of the letter A, with the latter more commonly appearing in
handwriting
Handwriting in Italian schools (XXth - XXIst century)
Handwriting is the personal and unique style of writing with a writing instrument, such as a pen or pencil in the hand. Handwriting includes both block and cursive styles and is separa ...
. Some contexts require usage of specific variants.
Nature of variants
Before the 20th century, variation in the shape of characters was ubiquitous, a dynamic which continued after the invention of
woodblock printing
Woodblock printing or block printing is a technique for printing text, images or patterns used widely throughout East Asia and originating in China in antiquity as a method of textile printing, printing on textiles and later on paper. Each page ...
. For example, prior to the
Qin dynasty
The Qin dynasty ( ) was the first Dynasties of China, imperial dynasty of China. It is named for its progenitor state of Qin, a fief of the confederal Zhou dynasty (256 BC). Beginning in 230 BC, the Qin under King Ying Zheng enga ...
(221–206 BC) the character meaning 'bright' was written as either or —with either or on the left, with the
component
Component may refer to:
In engineering, science, and technology Generic systems
*System components, an entity with discrete structure, such as an assembly or software module, within a system considered at a particular level of analysis
* Lumped e ...
on the right.
Li Si
Li Si (; 208 BC) was a Chinese calligrapher, philosopher, and politician of the Qin dynasty. He served as Chancellor from 246 to 208 BC, first under King Zheng of the state of Qin—who later became Qin Shi Huang, the "First Emperor" o ...
(), the
Chancellor
Chancellor () is a title of various official positions in the governments of many countries. The original chancellors were the of Roman courts of justice—ushers, who sat at the (lattice work screens) of a basilica (court hall), which separa ...
of Qin, attempted to universalize the Qin
small seal script
The small seal script is an archaic script style of written Chinese. It developed within the state of Qin during the Eastern Zhou dynasty (771–256 BC), and was then promulgated across China in order to replace script varieties used i ...
across China following
the wars
''The Wars'' is a 1977 novel by Timothy Findley that follows Robert Ross, a nineteen-year-old Canadian who enlists in World War I after the death of his beloved older sister in an attempt to escape both his grief and the social norms of oppressiv ...
that had politically unified the country for the first time. Li prescribed the form of the word for 'bright', but some scribes ignored this and continued to write the character as . However, the increased usage of was followed by proliferation of a third variant: , with on the left—likely derived as a contraction of . Ultimately, became the character's standard form.
New variants also result from larger shifts in the writing system as a whole, such as the process of ''
libian'' and ''
liding'' that resulted in the
clerical script
The clerical script (), sometimes also chancery script, is a style of Chinese writing that evolved from the late Warring States period to the Qin dynasty. It matured and became dominant in the Han dynasty, and remained in active use through t ...
. According to the palaeographer Qiu Xigui, the broadest trend in the evolution of Chinese characters over their history has been simplification, both in graphical ''shape'' (), the "external appearances of individual graphs", and in graphical ''form'' (), "overall changes in the distinguishing features of graphic
lshape and calligraphic style,
..in most cases refer
ing
Ing, ING or ing may refer to:
Art and media
* '' ...ing'', a 2003 Korean film
* i.n.g, a Taiwanese girl group
* The Ing, a race of dark creatures in the 2004 video game '' Metroid Prime 2: Echoes''
* "Ing", the first song on The Roches' 199 ...
to rather obvious and rather substantial changes". ''Libian'' often involved significant omissions, additions, or transmutations of the forms used by Qin small seal script, while ''liding'' is the direct regularization and linearization of shapes to convert them into clerical forms while preserving their original structure. For example, the character for 'year' underwent ''liding'' to the clerical script form , while the same character after undergoing ''libian'' resulted in the orthodox form . Similarly, ''libian'' and ''liding'' created the two distinct characters and for 'tiger'.
There are variants that arise through the use of different radicals to refer to specific definitions of a polysemous character. For instance, the character could mean either 'a type of hawk' or 'carve'. Variants using different radicals to specify thus developed: respectively , with a radical, and , with a radical.
In rare cases, two characters in ancient Chinese with similar meanings were confused and conflated when their modern Chinese readings merged, for example, and , are both read as and mean 'famine', used interchangeably in the modern language, even though initially meant 'insufficient food to satiate' and meant 'famine' in
Old Chinese
Old Chinese, also called Archaic Chinese in older works, is the oldest attested stage of Chinese language, Chinese, and the ancestor of all modern varieties of Chinese. The earliest examples of Chinese are divinatory inscriptions on oracle bones ...
. The two characters formerly belonged to two different Old Chinese
rime groups ( and groups, respectively) which indicates they had different pronunciations back then. A similar situation is responsible for the existence of variants of the particle 'in' which had the ancient form , now used as its simplified form. In each case above, variants were merged into single simplified forms.
Orthodoxy
Character forms that are most orthodox are known as ''orthodox variants'' (), which is sometimes taken as mean the forms present in the ''
Kangxi Dictionary
The ''Kangxi Dictionary'' () is a Chinese dictionary published in 1716 during the High Qing, considered from the time of its publishing until the early 20th century to be the most authoritative reference for written Chinese characters. Wanting ...
'' (), which usually represent the orthodox forms used in late imperial China. Non-orthodox forms are known as ''folk variants'' (;
Revised Romanization
Revised Romanization of Korean () is the official Korean language romanization system in South Korea. It was developed by the National Academy of the Korean Language from 1995 and was released to the public on 7 July 2000 by South Korea's Mi ...
: ;
Hepburn: ). Some folk variants are longstanding abbreviations or calligraphic forms, and later became the basis for the
simplified forms
Simplified Chinese characters are one of two standardized character sets widely used to write the Chinese language, with the other being traditional characters. Their mass standardization during the 20th century was part of an initiative by th ...
adopted on the mainland. For example, is a folk variant corresponding to the orthodox form 'foolish'. These forms differ by their phonetic component, with the folk variant using a character with a "close enough" pronunciation but having much less strokes and thus quicker to write. In mainland China, simplified forms are called ''
xin zixing
The ''xin zixing'' () are a set of Standard language, standardized Chinese character forms. It is based on the 1964 "List of character forms of Common Chinese characters for Publishing" () as compared to ''jiu zixing''. The standard is based on ...
'', typically contrasting with ''
jiu zixing
(), also known as inherited glyph form, or traditional glyph form, not to be confused with Traditional Chinese, is a traditional orthography of Chinese characters which uses the orthodox character forms, especially the character forms used i ...
'', which are usually the ''Kangxi'' form.
Orthodox and vulgar forms may only differ by the length or location of individual strokes, whether certain strokes intersect, or the presence or absence of minor strokes (dots). These are often not considered to amount to being discrete variants. For instance, is the new form of the character with traditional orthography 'recount', 'describe'. As another example, the
surname
In many societies, a surname, family name, or last name is the mostly hereditary portion of one's personal name that indicates one's family. It is typically combined with a given name to form the full name of a person, although several give ...
, also the name of an
ancient state, is the 'new character shape' form of the character traditionally written .
Regional standards

Character variant exist throughout every writing system that uses Chinese characters, including
written Chinese
Written Chinese is a writing system that uses Chinese characters and other symbols to represent the Chinese languages. Chinese characters do not directly represent pronunciation, unlike letters in an alphabet or syllabograms in a syllabary. Rath ...
,
Japanese, and
Korean. Several governments of countries that speak these languages have standardized their writing systems by specifying certain variants as the standard form. The choice of which variants to use has resulted in some bifurcation of written Chinese between
simplified and
traditional forms. The standardization of simplified forms in Japan was distinct from the process in mainland China.
The standard character forms prescribed by the government of each region are described in:
* The ''
List of Commonly Used Standard Chinese Characters
The ''List of Commonly Used Standard Chinese Characters'' is the current standard list of 8,105 Chinese characters published by the government of the People's Republic of China and promulgated in June 2013.
The project began in 2001, origina ...
'' for mainland China
* The ''
List of Graphemes of Commonly-Used Chinese Characters
The ''List of Graphemes of Commonly-Used Chinese Characters'' () is a list of 4762 commonly used Chinese characters and their standardized forms prescribed by the Hong Kong Education Bureau. The list is meant to be taught in primary and middl ...
'' for Hong Kong (educational usage only)
* The ''
Standard Form of National Characters
The ''Standard Form of National Characters'' or the ''Standard Typefaces for Chinese Characters'' () is the standardized form of Chinese characters set by the Ministry of Education of the Republic of China (Taiwan).
Lists
There are three lists ...
'' for Taiwan (educational usage only)
* The list of for Japan
* The ''
Kangxi Dictionary
The ''Kangxi Dictionary'' () is a Chinese dictionary published in 1716 during the High Qing, considered from the time of its publishing until the early 20th century to be the most authoritative reference for written Chinese characters. Wanting ...
'' in Korea
Use in computing
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
deals with variant characters in a complex manner, as a result of the process of
Han unification
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages into a single set of unified characters. Han characters are a featur ...
. In Han unification, some variants that are nearly identical between Chinese-, Japanese-, Korean-speaking regions are encoded in the same
code point
A code point, codepoint or code position is a particular position in a Table (database), table, where the position has been assigned a meaning. The table may be one dimensional (a column), two dimensional (like cells in a spreadsheet), three dime ...
, and can only be distinguished using different
typeface
A typeface (or font family) is a design of Letter (alphabet), letters, Numerical digit, numbers and other symbols, to be used in printing or for electronic display. Most typefaces include variations in size (e.g., 24 point), weight (e.g., light, ...
s. Other variants that are more divergent are encoded in different code points. On
webpage
A web page (or webpage) is a Web document that is accessed in a web browser. A website typically consists of many web pages linked together under a common domain name. The term "web page" is therefore a metaphor of paper pages bound together in ...
s, displaying the correct variants for the intended language is dependent on the typefaces installed on the computer, the configuration of the web browser and the
language tags of web pages. Systems that are ready to display the correct variants are rare because many computer users do not have standard typefaces installed and the most popular web browsers are not configured to display the correct variants by default. The following are some examples of variant forms of Chinese characters with different code points and language tags.
The following examples have the same code points, but different language tags. However language tags rarely work correctly to get the expected forms from text renderers (e.g. in the table below where all rendered glyphs may look the same).
Instead, the Unicode standard allows encoding these variants as
variation sequences, by appending a
variation selector (a glyph-less non-spacing mark) to the standard CJK ''unified'' ideograph (it also works directly inside plain text, without needing to use any
rich text format to select the appropriate language or script, and allows easier and more selective control when the same language/script combination needs several variants). The list of valid variation sequences is standardized by Unicode, defined in the Ideographic Variation Database (IVD), part of the Unicode Characters Database (UCD), and it is expansible without reencoding new code points in the UCS (and since the Unicode versions where variation selectors were encoded and the IVD established, it's no longer needed to encode any new ''compatibility'' ideograph to render them; the two blocks
CJK Compatibility Ideographs in the
BMP and
CJK Compatibility Ideographs Supplement in the
SIP are now frozen since Unicode 4.1, except to fix a few past mistakes that were forgotten during the Han unification process for the review of normative sources).
See also
*
*
*
*
Chinese character rationalization
Notes
References
Citations
Works cited
*
*
*
{{DEFAULTSORT:Variant Chinese character
Chinese characters