Pinyin input method

The pinyin method refers to a family of input methods based on the pinyin system of romanization. In its most basic form, a pinyin input method lets a user enter Chinese characters by typing the pinyin of a character and then choosing from a list of possible characters with that pronunciation. A number of slightly different systems are in use, however, and modern pinyin methods provide many convenience features.


Advantages and disadvantages

The obvious advantage of pinyin-based input methods is the ease of learning for Standard Chinese speakers. Those who are familiar with pinyin and can recognize the resulting characters can input them with almost no training, unlike with other input methods. The method does not require the user to construct the character from scratch, as one would in written Chinese. Since all children in Mainland China are required to learn pinyin in school, pinyin is among the most popular input methods there.

For people who cannot speak Standard Chinese, the main advantage of pinyin becomes a disadvantage, as they need to learn the Standard Chinese pronunciation of characters before they can use this input method. Also, because pinyin and other pronunciation-based input methods do not rely on the written form of the character (as do shape-based input methods such as Cangjie), they may cause language attrition and skill loss in adults, and may be a learning barrier for written Chinese in children.


Elements and features

Pinyin input methods differ in several respects. Most provide convenience features to speed up input, and some of these features can speed up typing considerably.


Conversion length

Conversion length is the size of the buffer that holds user input until it is converted into characters that are otherwise unavailable from the keyboard. In the most basic systems, one character is converted at a time, which makes input very time-consuming. Not only must the user select characters one by one, but the input system also cannot prioritize character choices using phrases, grammatical structure, or context. In addition, since the input method handles only one character at a time, the user usually has to type the full pinyin spelling to narrow down the selection. Such systems still exist in embedded applications such as cell phones. Common pinyin implementations on computers today can hold up to a clause of pinyin before requiring a conversion. They attempt to guess the appropriate characters using phrases from a dictionary, grammatical structure, and context.
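The difference between single-character conversion and clause-level conversion can be illustrated with a minimal sketch. It assumes the input has already been split into syllables, and it uses a greedy longest-match rule over a tiny made-up dictionary; the names `CHAR_CANDIDATES`, `PHRASES`, `convert_single`, and `convert_clause` are hypothetical, and real input methods use far larger dictionaries and statistical models.

```python
# Toy data for illustration only; not taken from any real input method.
CHAR_CANDIDATES = {
    "shang": ["上", "商", "伤"],
    "hai": ["海", "还", "害"],
    "ren": ["人", "任", "认"],
}
PHRASES = {
    ("shang", "hai"): "上海",
    ("shang", "hai", "ren"): "上海人",
}


def convert_single(syllable):
    """Single-character conversion: list every candidate for one syllable."""
    return CHAR_CANDIDATES.get(syllable, [])


def convert_clause(syllables):
    """Convert a buffered clause, preferring the longest dictionary phrase."""
    out = []
    i = 0
    while i < len(syllables):
        # Try the longest span first, then shorter ones.
        for j in range(len(syllables), i, -1):
            key = tuple(syllables[i:j])
            if key in PHRASES:
                out.append(PHRASES[key])
                i = j
                break
        else:
            # No phrase matched: fall back to the first single-character
            # candidate, or keep the raw syllable if it is unknown.
            candidates = convert_single(syllables[i])
            out.append(candidates[0] if candidates else syllables[i])
            i += 1
    return "".join(out)
```

With a one-syllable buffer the user must pick from `convert_single("hai")` every time, whereas `convert_clause(["shang", "hai", "ren"])` can resolve the whole phrase in one step.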


Treatment of tones

Chinese is a tonal language. Tones can be used to further distinguish characters with the same sound. Many early single-character pinyin implementations required tone input to narrow down the character selection. For convenience, tone selection is disabled by default in most modern pinyin systems on computers; depending on the implementation, the user may be able to enable it.
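As a rough sketch of how optional tone input narrows the candidate list, the snippet below assumes tones are typed as a trailing digit from 1 to 5 (a common convention, but an assumption here) and uses a small made-up table. `TONED` and `candidates` are hypothetical names.

```python
import re

# Toy table for illustration: syllable -> tone number -> characters.
TONED = {
    "ma": {1: ["妈"], 2: ["麻"], 3: ["马"], 4: ["骂"], 5: ["吗"]},
}


def candidates(typed):
    """Return candidates for input such as 'ma' (any tone) or 'ma3' (tone 3)."""
    match = re.fullmatch(r"([a-z]+)([1-5])?", typed)
    if not match:
        return []
    syllable, tone = match.group(1), match.group(2)
    by_tone = TONED.get(syllable, {})
    if tone:
        # Tone given: only characters with that tone remain.
        return list(by_tone.get(int(tone), []))
    # Tone omitted: every character with this sound is a candidate.
    return [char for t in sorted(by_tone) for char in by_tone[t]]
```

Without a tone digit the list for "ma" has five entries; with "ma3" it shrinks to one.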


Treatment of extended Latin characters (ü and ê)

Apart from tone marks, pinyin uses two extended Latin vowels: ü (U-umlaut) and ê (E-circumflex). Given that the US keyboard layout is the most common keyboard layout in China, any pinyin method implementation needs to support input of those vowels on a US keyboard. Since the letter "v" is unused in Mandarin pinyin, it is universally used as an alias for ü. For example, typing "nv" into the input method brings up the candidate list for "nü". The handling of ê is less uniform, since only one commonly used character has this pronunciation. It is an interjection roughly equivalent to "eh" in English. Some IMEs, such as Google Pinyin, merge it into "e", while others create an additional letter combination for it, such as "ea" or "eh", or "ei" in iOS. Others simply drop this sound.
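The aliasing described above amounts to a small normalization step before dictionary lookup. The following is a minimal sketch; `normalize` and its `e_aliases` parameter are hypothetical, and the default alias set only contains "ea" and "eh" because "ei" is also an ordinary pinyin syllable and would need implementation-specific handling.

```python
def normalize(typed, e_aliases=frozenset({"ea", "eh"})):
    """Map US-keyboard spellings to pinyin with extended vowels.

    'v' stands in for 'ü'; a whole-syllable alias stands in for 'ê'.
    The alias set is an assumption and differs between input methods.
    """
    if typed in e_aliases:
        return "ê"
    return typed.replace("v", "ü")
```

For example, `normalize("nv")` yields "nü" and `normalize("lve")` yields "lüe", while ordinary syllables such as "he" pass through unchanged.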


Treatment of hm, hng, ng, n

In IBus on GNU/Linux and in the Microsoft input method, the character 嗯 (ng) is entered by typing "en".


Usage statistics and user dictionaries

Most modern input method implementations adjust the positions of word candidates in the candidate list based on prior usage statistics. In addition, they support user-defined phrases through a user dictionary.
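One simple way to picture usage-based ranking is a counter of past selections that reorders the base candidate list. This is only a sketch under that assumption; `CandidateRanker` and its methods are hypothetical, and real implementations may weight recency, context, or other signals.

```python
from collections import Counter


class CandidateRanker:
    """Reorders candidates by how often the user picked them before."""

    def __init__(self, base):
        # Copy the base dictionary so user phrases do not mutate the original.
        self.entries = {pinyin: list(words) for pinyin, words in base.items()}
        self.usage = Counter()

    def pick(self, pinyin, word):
        """Record that the user selected `word` for `pinyin`."""
        self.usage[(pinyin, word)] += 1

    def add_user_phrase(self, pinyin, word):
        """Add a user-defined phrase to the user dictionary."""
        words = self.entries.setdefault(pinyin, [])
        if word not in words:
            words.append(word)

    def candidates(self, pinyin):
        """Most-used first; sorted() is stable, so ties keep base order."""
        words = self.entries.get(pinyin, [])
        return sorted(words, key=lambda w: -self.usage[(pinyin, w)])
```

A short usage example with made-up data:

```python
ranker = CandidateRanker({"shi": ["是", "事", "市"]})
ranker.pick("shi", "市")
ranker.pick("shi", "市")
ranker.pick("shi", "事")
print(ranker.candidates("shi"))
```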


Abbreviation

Abbreviation is a feature that allows the user to omit all but the first letter, or first couple of letters, of each syllable in the pinyin spelling. This feature can speed up the input of long phrases significantly. With it, the user can enter the word for "concert" (音乐会) by typing "yyh" instead of "yinyuehui". In systems that support user-defined phrases, users can even define their own abbreviations that do not follow standard pinyin rules.
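Abbreviation matching can be sketched as checking whether the typed string can be consumed syllable by syllable, where each syllable may be typed in full, as its first letter, or as a two-letter initial (zh, ch, sh). The dictionary and the function names `spellings`, `matches`, and `lookup` below are hypothetical.

```python
# Toy phrase dictionary for illustration: word -> list of syllables.
PHRASES = {
    "音乐会": ["yin", "yue", "hui"],
    "中华人民共和国": ["zhong", "hua", "ren", "min", "gong", "he", "guo"],
}


def spellings(syllable):
    """Accepted spellings of one syllable: full, first letter, or zh/ch/sh."""
    options = {syllable, syllable[0]}
    if syllable[:2] in ("zh", "ch", "sh"):
        options.add(syllable[:2])
    return options


def matches(typed, syllables):
    """True if `typed` spells every syllable in order, fully or abbreviated."""
    if not syllables:
        return typed == ""
    return any(
        typed.startswith(option) and matches(typed[len(option):], syllables[1:])
        for option in spellings(syllables[0])
    )


def lookup(typed):
    """Return all dictionary words whose syllables match the typed string."""
    return [word for word, syllables in PHRASES.items() if matches(typed, syllables)]
```

Under these assumptions both `lookup("yyh")` and `lookup("yinyuehui")` find the same word, and mixed forms such as "yinyh" match as well.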


Fuzzy pinyin

Pinyin was created based on the pronunciation of Standard Chinese, a variety of Mandarin Chinese. Regional accents are prevalent in Mandarin among both native and nonnative speakers. As a result, a significant number of Mandarin speakers have trouble distinguishing similar-sounding pinyin initials and finals, such as c and ch, s and sh, z and zh, n and ng, h or hu and f, or n and l. Fuzzy pinyin or fuzzy input (模糊音) is a feature that allows a user to input those similar-sounding vowels or consonants as if they were the same. The drawback is that the user must choose the correct characters or words from a longer list of "homophones".
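Fuzzy matching can be sketched by mapping each syllable to a canonical "fuzzy key" so that confusable spellings collide in the index. The sketch below covers only some of the pairs listed above (zh/z, ch/c, sh/s, l/n, and final ng/n) and leaves out the h/hu/f pair; `fuzzy_key`, `build_fuzzy_index`, `fuzzy_candidates`, and the toy dictionary are hypothetical.

```python
from collections import defaultdict

# Toy dictionary for illustration only.
DICTIONARY = {"si": ["四"], "shi": ["是"], "nan": ["南"], "lan": ["蓝"]}


def fuzzy_key(syllable):
    """Collapse confusable initials and finals into one canonical form."""
    s = syllable
    for retroflex, plain in (("zh", "z"), ("ch", "c"), ("sh", "s")):
        if s.startswith(retroflex):
            s = plain + s[len(retroflex):]
            break
    if s.startswith("l"):
        s = "n" + s[1:]  # treat initial l and n as the same
    if s.endswith("ng"):
        s = s[:-1]  # treat final ng and n as the same
    return s


def build_fuzzy_index(dictionary):
    """Group characters by fuzzy key instead of exact syllable."""
    index = defaultdict(list)
    for syllable, chars in dictionary.items():
        index[fuzzy_key(syllable)].extend(chars)
    return index


def fuzzy_candidates(typed, index):
    """Look up candidates for a typed syllable through its fuzzy key."""
    return list(index.get(fuzzy_key(typed), []))
```

The longer candidate list is visible directly: with this toy data, typing "si" returns the characters for both "si" and "shi".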


Word prediction

Word prediction is a feature of an input method that attempts to guess the next series of characters that the user is about to enter. The term is often used for two different mechanisms with similar functions. The first is akin to an auto-complete function for user input. While the user is typing pinyin, the input method looks up all phrases that might match the input even though the input is incomplete. For example, when the user enters "shang", the input method shows "上海" (Shanghai) as a word candidate. The second mechanism predicts the user's next input after the user finishes entering a set of words. Continuing the example, after the user selects "上海" (Shanghai) from the candidate list, the input method's pinyin buffer is empty. The input method then displays a list of words that often follow the word Shanghai, such as "人" (people), "市" (city), and "的" (an auxiliary word).
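The two mechanisms can be sketched separately: prefix completion over a lexicon keyed by pinyin, and a follower table consulted once the buffer is empty. The data below is made up for illustration (only the Shanghai entries come from the example above), and `LEXICON`, `FOLLOWERS`, `complete`, and `predict_next` are hypothetical names.

```python
# Toy lexicon: full pinyin spelling -> word.
LEXICON = {"shanghai": "上海", "shangban": "上班", "beijing": "北京"}

# Toy follower table: word -> words that often come next.
FOLLOWERS = {"上海": ["人", "市", "的"]}


def complete(partial):
    """Mechanism 1: words whose pinyin starts with the incomplete input."""
    return [word for pinyin, word in LEXICON.items() if pinyin.startswith(partial)]


def predict_next(previous_word):
    """Mechanism 2: likely next words after a committed word."""
    return list(FOLLOWERS.get(previous_word, []))
```

Here `complete("shang")` offers "上海" before the spelling is finished, and `predict_next("上海")` proposes follow-up words without any further typing.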


Double pinyin

Vowel groups in pinyin can be up to four letters long. Double pinyin (双拼) is a method whereby longer vowel groups are assigned to consonant keys as shortcuts, and zh, ch, and sh are assigned to single keys as shortcuts. Thus, when the input method expects a vowel, the user can use the shortcuts to speed up typing. In the Microsoft Pinyin IME, for example, if a user wants to input "中华人民共和国" (zhōnghuárénmíngònghéguó, "People's Republic of China"), they need to type "zhonghuarenmingongheguo" in full pinyin. In double pinyin, however, they only need to type "vshwrfmngshego" (v=zh, s=ong, h=h, w=ua, r=r, f=en, m=m, n=in, g=g, s=ong, h=h, e=e, g=g, o=uo). With a system that uses abbreviation, the same result can be achieved by typing just "zhhrmghg".
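The key sequence in the example can be decoded mechanically by reading the input two keys at a time: the first key of each pair is an initial and the second is a final. The sketch below contains only the key assignments quoted in the example, not a complete double pinyin scheme, and `INITIAL_KEYS`, `FINAL_KEYS`, and `decode` are hypothetical names.

```python
# Only the mappings given in the example; a real scheme defines many more.
INITIAL_KEYS = {"v": "zh"}  # any other key stands for itself as an initial
FINAL_KEYS = {"s": "ong", "w": "ua", "f": "en", "n": "in", "e": "e", "o": "uo"}


def decode(keys):
    """Expand a double pinyin key sequence into full pinyin syllables."""
    if len(keys) % 2 != 0:
        raise ValueError("double pinyin input must have two keys per syllable")
    syllables = []
    for i in range(0, len(keys), 2):
        initial_key, final_key = keys[i], keys[i + 1]
        if final_key not in FINAL_KEYS:
            raise ValueError(f"no final assigned to key {final_key!r} in this sketch")
        initial = INITIAL_KEYS.get(initial_key, initial_key)
        syllables.append(initial + FINAL_KEYS[final_key])
    return syllables
```

Note that the same key means different things by position: "s" is the final "ong" in "vs" and "gs", which is why the decoder must track whether it expects an initial or a final.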


Typo correction

Similar to automatic typo correction for English in word processors, pinyin method implementations can recognize possible typos and show appropriate word candidates. Google Pinyin, for example, when it encounters a suspected typo, shows both the word candidates that assume the input is correct and the word candidates that assume it is a typo.
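The idea of showing both interpretations can be sketched with a very simple typo model. The snippet below assumes the only typo considered is a swap of two adjacent letters, which is an illustrative choice and not a description of how Google Pinyin or any other product detects typos; `LEXICON`, `adjacent_swaps`, and `candidates` are hypothetical names.

```python
# Toy lexicon for illustration: pinyin -> characters.
LEXICON = {"gou": ["狗"], "guo": ["国"], "zhong": ["中"]}


def adjacent_swaps(typed):
    """All strings obtained by swapping one pair of neighbouring letters."""
    return {
        typed[:i] + typed[i + 1] + typed[i] + typed[i + 2:]
        for i in range(len(typed) - 1)
    }


def candidates(typed):
    """Return candidates for the input as typed and for suspected typos."""
    as_typed = list(LEXICON.get(typed, []))
    corrected = []
    for variant in sorted(adjacent_swaps(typed)):
        if variant != typed:
            corrected.extend(LEXICON.get(variant, []))
    return {"as_typed": as_typed, "corrected": corrected}
```

For the input "gou" this returns candidates for "gou" itself and for the swapped spelling "guo"; for "zhogn", which is not a valid syllable, only the corrected list is non-empty.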


Language mixing

Most advanced pinyin method implementations allow English to be mixed into the input stream without requiring the user to change the language mode. However, this often comes with limitations, such as requiring the English input to be uppercase. The following examples show the difference when a user wishes to enter "这个SQL漏洞可以瘫痪整个系统。" (This SQL vulnerability could paralyze the entire system.):
* "zhe ge [switch to English] SQL [switch to Chinese] loudong keyi tanhuan zhengge xitong." (Unsupported)
* "zhe ge SQL loudong keyi tanhuan zhengge xitong." (Supported)
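The uppercase limitation suggests a simple rule: treat any all-uppercase token as English and pass it through, and convert everything else as pinyin. The following is a minimal sketch of that rule using a toy word table for the example sentence; `WORDS` and `convert_mixed` are hypothetical, and the space-separated tokenization is a simplification.

```python
# Toy word table covering only the example sentence.
WORDS = {
    "zhe": "这", "ge": "个", "loudong": "漏洞", "keyi": "可以",
    "tanhuan": "瘫痪", "zhengge": "整个", "xitong": "系统",
}


def convert_mixed(text):
    """Convert space-separated pinyin, passing uppercase tokens through as English."""
    out = []
    for token in text.split():
        ending = ""
        if token.endswith("."):
            # Map a trailing ASCII period to the Chinese full stop.
            token, ending = token[:-1], "。"
        if token.isupper():
            out.append(token)  # treated as English, no mode switch needed
        else:
            out.append(WORDS.get(token, token))
        out.append(ending)
    return "".join(out)
```

Under these assumptions, `convert_mixed("zhe ge SQL loudong keyi tanhuan zhengge xitong.")` produces the target sentence without any explicit language switch.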


Implementations

The following are the most popular pinyin input method editors used in Mainland China. They are free to download from their official websites.


Cross platform

* Rime input method engine (中州韻), an open source input method engine for pinyin and others, which supports Windows, macOS, and Linux.


Windows

* Microsoft Pinyin IME (微软拼音输入法), bundled with Windows 2000 or higher and with all Simplified Chinese editions of Windows, developed by Microsoft and Harbin Institute of Technology.
* ZNABC (智能ABC输入法), bundled with the Simplified Chinese edition of Windows XP, developed by Peking University.
* Sogou Pinyin (搜狗拼音输入法).
* Google Pinyin (谷歌拼音输入法), Google's implementation for Windows and Android.
* Ziguang Pinyin
* QQ Pinyin
* Pinyin Jiajia


Linux/Unix

* Fcitx, a general input method that supports pinyin with fcitx-pinyin and fcitx-rime, among many other schemes.
* Smart Pinyin (scim-pinyin), a pinyin implementation for the SCIM input platform on Linux, BSD, and other Unices.
* Bimspinyin, a pinyin implementation for the xcin input platform on Linux, BSD, and other Unices.
* OpenVanilla, a cross-platform framework for Chinese and more.
* Ibus-Pinyin (ibus-pinyin), a pinyin implementation for the IBus input platform on Linux, BSD, and other Unices.
* Ibus-sunpinyin, a pinyin input method for IBus based on a statistical language model.


macOS

* Pinyin input is part of the standard installation of macOS. In version 10.5.8 and earlier it was called ITABC; the name was changed to "Pinyin - Simplified" in Mac OS X 10.6.
* Fit smart Pinyin, an alternative to the standard OS X Chinese input method.


Web


* Type in Chinese Online (IME), a web-based IME with cross-browser support.
* Google web-based IME
* Online Pinyin Input Method, a web-based IME used through browsers.
* Pinyin Editor, an editor for creating pinyin with tones.


See also

* Chinese input methods for computers
* Keyboard layout

