Chinese Word-segmented Writing
Chinese word-segmented writing, or Chinese word-separated writing, is a style of written Chinese in which texts are written with spaces between words, as in written English. Chinese sentences are traditionally written as strings of characters with no marks between words, so the reader must segment the text into words according to the context, whether consciously or unconsciously. Several reasons have been given for word-segmented writing. An important one is the existence of ambiguous texts for which only the author knows the intended meaning and the correct segmentation. For example, "美國會不同意。" (simplified: "美国会不同意。") may mean "美國 會 不同意。" ("美国 会 不同意。", The US will not agree.) or "美 國會 不同意。" ("美 国会 不同意。", The US Congress does not agree.). History: In ancient China, texts were written without punctuation marks, which led to the reader needing to spend a considerable amount of time finding the boundary of a s ...
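The ambiguity in the example above can be reproduced mechanically. The following is a minimal sketch, not a real segmenter: it assumes a hypothetical five-word lexicon (`LEXICON`) chosen only to cover the simplified-character example, and it enumerates every way of splitting the sentence into words from that lexicon.

```python
from typing import List, Set

# Toy lexicon, assumed for illustration only; a real segmenter
# would draw on a full dictionary and a statistical model.
LEXICON: Set[str] = {"美", "美国", "国会", "会", "不同意"}


def segmentations(text: str, lexicon: Set[str] = LEXICON) -> List[List[str]]:
    """Return every way to split `text` into words found in `lexicon`."""
    if not text:
        return [[]]  # one way to segment the empty string: no words
    results: List[List[str]] = []
    for end in range(1, len(text) + 1):
        word = text[:end]
        if word in lexicon:
            # Keep this word and segment the remainder recursively.
            for rest in segmentations(text[end:], lexicon):
                results.append([word] + rest)
    return results


for words in segmentations("美国会不同意"):
    print(" ".join(words))
```

With this lexicon the sentence has exactly two readings, "美 国会 不同意" and "美国 会 不同意", which is why a reader or program needs context to choose between them, while an author writing with spaces removes the choice.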


Written Chinese
Written Chinese is a writing system that uses Chinese characters and other symbols to represent the Chinese languages. Chinese characters do not directly represent pronunciation, unlike letters in an alphabet or syllabograms in a syllabary. Rather, the writing system is morphosyllabic: characters are one spoken syllable in length, but generally correspond to morphemes in the language, which may either be independent words or parts of polysyllabic words. Most characters are constructed from smaller components that may reflect the character's meaning or pronunciation. Literacy requires the memorization of thousands of characters; college-educated Chinese speakers know approximately 4,000. This has led in part to the adoption of complementary transliteration systems (generally Pinyin) as a means of representing the pronunciation of Chinese. Chinese writing is first attested during the late Shang dynasty, but the process of creating characters is thought to have begun centur ...


Written English
English orthography comprises the set of rules used when writing the English language, allowing readers and writers to associate written graphemes with the sounds of spoken English, as well as other features of the language. English's orthography includes norms for spelling, hyphenation, capitalisation, word breaks, emphasis, and punctuation. As with the orthographies of most other world languages, written English is broadly standardised. This standardisation began to develop when movable type spread to England in the late 15th century. However, unlike with most languages, there are multiple ways to spell every phoneme, and most letters also represent multiple pronunciations depending on their position in a word and the context. This is partly due to the large number of words that have been loaned from a large number of other languages throughout the history of English, without successful attempts at complete spelling reforms, and partly due to accidents of history, such ...


Journal Of Chinese Information Processing
Journal of Chinese Information Processing is the journal of the Chinese Information Processing Society of China. It was founded in 1986 and has focused on publishing academic papers on the basic theory and applied technology of Chinese information processing, as well as related overviews, research results, technical reports, book reviews, special discussions, domestic and foreign academic trends, etc. It aims to reflect the development and academic trends in the field of Chinese information processing in a timely manner. Journal of Chinese Information Processing has long been included in many important domestic and foreign databases such as the Chinese Science Citation Database (CSCD), Chinese Core Journals, and Chinese Science and Technology Core Journals. Its contents represent the advanced level of Chinese information processing in China.
History:
* In 1986, Journal of Chinese Information Processing was founded.
* In 1987, the publication period was changed f ...


Word Segmentation
A word is a basic element of language that carries meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguists on its definition and numerous attempts to find specific criteria of the concept remain controversial. Different standards have been proposed, depending on the theoretical background and descriptive context; these do not converge on a single definition. Some specific definitions of the term "word" are employed to convey its different meanings at different levels of description, for example based on phonological, grammatical or orthographic basis. Others suggest that the concept is simply a convention used in everyday situations. The concept of "word" is distinguished from that of a morpheme, which is the smallest unit of language that has a meaning, even if it cannot stand on its own. Words are made out of at least one morpheme. Morphemes can a ...


Ancient China
The history of China spans several millennia across a wide geographical area. Each region now considered part of the Chinese world has experienced periods of unity, fracture, prosperity, and strife. Chinese civilization first emerged in the Yellow River valley, which along with the Yangtze basin constitutes the geographic core of the Chinese cultural sphere. China maintains a rich diversity of ethnic and linguistic groups. The traditional lens for viewing Chinese history is the dynastic cycle: imperial dynasties rise and fall, and are ascribed certain achievements. This lens also tends to assume Chinese civilization can be traced as an unbroken thread many thousands of years into the past, making it one of the cradles of civilization. At various times, states representative of a dominant Chinese culture have directly controlled areas ...




Xiandai Hanyu Cidian
Xiandai Hanyu Cidian (simplified Chinese: 现代汉语词典; traditional Chinese: 現代漢語詞典; pinyin: Xiàndài Hànyǔ Cídiǎn; literally 'Modern Han Language Word Dictionary'), also known as A Dictionary of Current Chinese or Contemporary Chinese Dictionary, is an important one-volume dictionary of Standard Mandarin Chinese published by the Commercial Press, now in its 7th (2016) edition. It was originally edited by Lü Shuxiang and Ding Shengshu as a reference work on modern Standard Mandarin Chinese. Compilation started in 1958 and trial editions were issued in 1960 and 1965, with a number of copies printed in 1973 for internal circulation and comments, but due to the Cultural Revolution the final draft was not completed until the end of 1977, and the first formal edition was not published until December 1978. It was the first People's Republic of China dictionary to be arranged according to Hanyu Pinyin, the phonetic standard for Standard Mandarin Chinese, with explanatory notes in s ...


CEDICT
The CEDICT project was started by Paul Denisowski in 1997 and is maintained by a team on mdbg.net under the name CC-CEDICT, with the aim of providing a complete Chinese-to-English dictionary with pronunciation in pinyin for the Chinese characters.
Content: CEDICT is a text file; other programs (or simply Notepad, egrep or an equivalent) are needed to search and display it. Its data is used by several other Chinese-English projects. The Unihan Database uses CEDICT data for most of its information about character compounds, but this is auxiliary and is explicitly not a part of the main Unicode database.
Features:
* Traditional Chinese and Simplified Chinese
* Pinyin (several pronunciations)
* American English equivalents (several)
* It had 122,444 entries in UTF-8 at the time of counting.
The basic format of a CEDICT entry is:
Traditional Simplified [pin1 yin1] /American English equivalent 1/equivalent 2/
漢字 汉字 [han4 zi4] /Chinese character/CL:個|个/
Example of a simple egrep search:
$ egrep -i 有 ...
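The entry format described above lends itself to line-by-line parsing. The sketch below assumes the usual CC-CEDICT layout, `Traditional Simplified [pin1 yin1] /equivalent 1/equivalent 2/`, with comment lines starting with `#`; the names `Entry` and `parse_line` are hypothetical helpers for illustration, not part of any CEDICT tool.

```python
import re
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    traditional: str
    simplified: str
    pinyin: str
    definitions: List[str]


# Two whitespace-separated headwords, pinyin in square brackets,
# then slash-delimited English equivalents.
_LINE = re.compile(r"^(\S+)\s+(\S+)\s+\[([^\]]*)\]\s*/(.*)/\s*$")


def parse_line(line: str) -> Optional[Entry]:
    """Parse one dictionary line; return None for comments, blanks or malformed lines."""
    line = line.strip()
    if not line or line.startswith("#"):
        return None
    match = _LINE.match(line)
    if match is None:
        return None
    traditional, simplified, pinyin, glosses = match.groups()
    return Entry(traditional, simplified, pinyin, glosses.split("/"))


entry = parse_line("漢字 汉字 [han4 zi4] /Chinese character/CL:個|个/")
print(entry)
```

Returning `None` for lines that do not match keeps the parser tolerant of header comments and of any entries that deviate from the assumed layout.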


Word
A word is a basic element of language that carries meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguists on its definition and numerous attempts to find specific criteria of the concept remain controversial. Different standards have been proposed, depending on the theoretical background and descriptive context; these do not converge on a single definition. Some specific definitions of the term "word" are employed to convey its different meanings at different levels of description, for example on a phonological, grammatical or orthographic basis. Others suggest that the concept is simply a convention used in everyday situations. The concept of "word" is distinguished from that of a morpheme, which is the smallest unit of language that has a meaning, even if it cannot stand on its own. Words a ...


Pinyin
Hanyu Pinyin, or simply pinyin, officially the Chinese Phonetic Alphabet, is the most common romanization system for Standard Chinese. Hanyu literally means 'Han language' (that is, the Chinese language), while pinyin literally means 'spelled sounds'. Pinyin is the official romanization system used in China, Singapore, Taiwan, and by the United Nations. Its use has become common when transliterating Standard Chinese mostly regardless of region, though it is less ubiquitous in Taiwan. It is used to teach Standard Chinese, normally written with Chinese characters, to students in mainland China and Singapore. Pinyin is also used by various input methods on computers and to categorize entries in some Chinese dictionaries. In pinyin, each Chinese syllable is spelled in terms of an optional initial and a final, each of which is represented by one or more letters. Initi ...
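The split of a syllable into an optional initial and a final can be sketched in a few lines. This is an illustration under stated assumptions: the input is a single toneless, lowercase-able syllable; `INITIALS` lists the 21 consonant initials commonly given for pinyin; and syllables spelled with y or w are treated as having no initial, which is one convention among several. The function name `split_syllable` is hypothetical.

```python
from typing import Tuple

# Two-letter initials come first so that "zh", "ch", "sh" are
# matched before the single letters "z", "c", "s".
INITIALS = (
    "zh", "ch", "sh",
    "b", "p", "m", "f", "d", "t", "n", "l",
    "g", "k", "h", "j", "q", "x", "r", "z", "c", "s",
)


def split_syllable(syllable: str) -> Tuple[str, str]:
    """Split a toneless pinyin syllable into (initial, final).

    The initial is the empty string when the syllable has none.
    """
    s = syllable.lower()
    for initial in INITIALS:
        if s.startswith(initial):
            return initial, s[len(initial):]
    return "", s


for syl in ("zhong", "pin", "an"):
    print(syl, split_syllable(syl))
```

Here "zhong" splits into "zh" and "ong", "pin" into "p" and "in", and "an" has an empty initial. The sketch does not validate that the remainder is a legal final.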


Simplified Chinese Characters
Simplified Chinese characters are one of two standardized character sets widely used to write the Chinese language, with the other being traditional characters. Their mass standardization during the 20th century was part of an initiative by the People's Republic of China (PRC) to promote literacy, and their use in ordinary circumstances on the mainland has been encouraged by the Chinese government since the 1950s. They are the official forms used in mainland China, Malaysia, and Singapore, while traditional characters are officially used in Hong Kong, Macau, and Taiwan. Simplification of a component (either a character or a sub-component called a radical) usually involves either a reduction in its total number of strokes, or an apparent streamlining of which strokes are chosen in what places; for example, the radical used in the traditional character is simplified to to form the simplified charac ...


Chinese Orthography
Written Chinese is a writing system that uses Chinese characters and other symbols to represent the Chinese languages. Chinese characters do not directly represent pronunciation, unlike letters in an alphabet or syllabograms in a syllabary. Rather, the writing system is morphosyllabic: characters are one spoken syllable in length, but generally correspond to morphemes in the language, which may either be independent words or parts of polysyllabic words. Most characters are constructed from smaller components that may reflect the character's meaning or pronunciation. Literacy requires the memorization of thousands of characters; college-educated Chinese speakers know approximately 4,000. This has led in part to the adoption of complementary transliteration systems (generally Pinyin) as a means of representing the pronunciation of Chinese. Chinese writing is first attested during the late Shang dynasty, but the process of creating characters is thought to have begun centuri ...