KS C 5657
KS X 1002 (formerly KS C 5657) is a South Korean character set standard that was established to supplement KS X 1001. It consists of a total of 7,649 characters. Unlike KS X 1001, KS X 1002 is not encoded in any legacy encoding. Even in 1994, it was known as "a standard that no one implemented."

Characters
Characters in KS X 1002 are arranged in a 94×94 grid (as in ISO/IEC 2022), and the two-byte code point of each character is expressed in the ''haeng''-''yeol'' form, which specifies a row (''haeng'' 행) and the position of the character within the row (cell, ''yeol'' 열). The rows (numbered from 1 to 94) contain characters as follows:
* 01–07: Latin letters with diacritics (613 characters)
* 08–10: Greek letters with diacritics (273 characters)
* 11–13: miscellaneous symbols (275 characters)
* 14: compound ''jamo'' and hangul syllables without an initial consonant (27 characters)
* 16–36: modern hangul syllables (1,930 characters)
* 37–54: archaic hangul s ...
KS X 1001
KS X 1001, "''Code for Information Interchange (Hangul and Hanja)''", formerly called KS C 5601, is a South Korean coded character set standard to represent hangul and hanja characters on a computer. KS X 1001 is encoded by the most common legacy (pre-Unicode) character encodings for Korean, including EUC-KR and Microsoft's Unified Hangul Code (UHC). It contains Korean Hangul syllables, CJK ideographs (Hanja), Greek, Cyrillic, Japanese (Hiragana and Katakana) and some other characters. KS X 1001 is arranged as a 94×94 table, following the structure of 2-byte code words in ISO 2022 and EUC. Therefore, its code points are pairs of integers 1–94. However, some encodings (UHC and Johab), in addition to providing codes for every code point, provide additional codes for characters otherwise representable only as code point sequences.

History
This standard was previously known as KS C 5601. There have been several revisions of this standard. For example, there were revisi ...
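The row/cell addressing described above can be illustrated with a minimal Python sketch. It assumes the usual EUC convention that a 94×94 code point (row, cell) is stored as the byte pair (0xA0 + row, 0xA0 + cell); the function names `kuten_to_euc_kr` and `euc_kr_to_kuten` are hypothetical helpers introduced here for illustration, not part of any standard library.

```python
from typing import Tuple


def kuten_to_euc_kr(row: int, cell: int) -> bytes:
    """Map a KS X 1001 (row, cell) pair, each 1-94, to its two EUC-KR bytes.

    Assumption: standard EUC layout, i.e. each byte is 0xA0 plus the index.
    """
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError("row and cell must both be in 1..94")
    return bytes([0xA0 + row, 0xA0 + cell])


def euc_kr_to_kuten(data: bytes) -> Tuple[int, int]:
    """Inverse mapping: two EUC-KR bytes back to a (row, cell) pair."""
    if len(data) != 2 or not all(0xA1 <= b <= 0xFE for b in data):
        raise ValueError("expected two bytes in the range 0xA1..0xFE")
    return data[0] - 0xA0, data[1] - 0xA0


# Row 16, cell 1 is the first hangul syllable in KS X 1001.
first_hangul = kuten_to_euc_kr(16, 1)
print(first_hangul.hex(), first_hangul.decode("euc_kr"))
```

Python's built-in `euc_kr` codec is used only to show the resulting character; the arithmetic itself does not depend on it.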
Greek Alphabet
The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BCE. It is derived from the earlier Phoenician alphabet, and was the earliest known alphabetic script to have distinct letters for vowels as well as consonants. In Archaic and early Classical times, the Greek alphabet existed in many local variants, but by the end of the 4th century BCE the Euclidean alphabet, with 24 letters ordered from alpha to omega, had become standard, and it is this version that is still used for Greek writing today. The uppercase and lowercase forms of the 24 letters are: Α α, Β β, Γ γ, Δ δ, Ε ε, Ζ ζ, Η η, Θ θ, Ι ι, Κ κ, Λ λ, Μ μ, Ν ν, Ξ ξ, Ο ο, Π π, Ρ ρ, Σ σ/ς, Τ τ, Υ υ, Φ φ, Χ χ, Ψ ψ, Ω ω. The Greek alphabet is the ancestor of the Latin and Cyrillic scripts. Like Latin and Cyrillic, Greek originally had only a single form of each letter; it developed the letter case distinction between uppercase and lowercase in parallel with Latin during the modern era. Sound values and conventional transcriptions for some ...
Encodings Of Asian Languages
In communications and information processing, code is a system of rules to convert information (such as a letter, word, sound, image, or gesture) into another form, sometimes shortened or secret, for communication through a communication channel or storage in a storage medium. An early example is the invention of language, which enabled a person, through speech, to communicate what they thought, saw, heard, or felt to others. But speech limits the range of communication to the distance a voice can carry and limits the audience to those present when the speech is uttered. The invention of writing, which converted spoken language into visual symbols, extended the range of communication across space and time. The process of encoding converts information from a source into symbols for communication or storage. Decoding is the reverse process, converting code symbols back into a form that the recipient understands, such as English and/or Spanish. One reason for coding is to ena ...
List Of Modern Hangul Characters In ISO/IEC 2022–compliant National Character Set Standards
This is the list of modern Hangul characters in ISO/IEC 2022–compliant national character set standards.
* South Korea (KR): KS X 1001 (formerly KS C 5601) and KS X 1002 (formerly KS C 5657)
* North Korea (KP): KPS 9566
* China (CN): GB 12052
Note: In the tables below, the "KPS 9566" column excludes Hangul characters that are not in the EUC range ([\xA1-\xFE][\xA1-\xFE]) and the duplicate syllables for the names of the former and current leaders of North Korea.

''Jamo'' (consonants and vowels)
51 characters in total. Note: KS X 1002 is a supplementary character set for KS X 1001. As all modern Hangul ''jamo'' characters are already in KS X 1001, none are in KS X 1002.

Syllables
* KS X 1001: 2,350 characters
* KS X 1002: 1,930 characters
* KPS 9566: 2,679 characters
* GB 12052: 3,373 characters (2,017 in rows 16–37 and 1,356 in rows 38–52)
* Union of the above: 4,300 characters
Notes:
* For the "Count " and "Count (KP & CN)" columns, see Hangul#Alphabetic order.
* ...
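The EUC range mentioned in the note above can be checked mechanically. The following is a minimal Python sketch, assuming the range means exactly two bytes, each between 0xA1 and 0xFE; the helper name `in_euc_range` is hypothetical and introduced only for illustration.

```python
import re

# Two bytes, each in 0xA1..0xFE: the 94x94 area addressed by EUC encodings.
EUC_PAIR = re.compile(rb"[\xA1-\xFE][\xA1-\xFE]")


def in_euc_range(code: bytes) -> bool:
    """Return True if `code` is a two-byte sequence inside the EUC range."""
    return EUC_PAIR.fullmatch(code) is not None


print(in_euc_range(b"\xb0\xa1"))  # both bytes inside 0xA1..0xFE
print(in_euc_range(b"\x81\x41"))  # lead and trail byte both outside the range
```

A check of this kind only tests the byte range; it says nothing about whether a character is actually assigned at that position in a given standard.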
Hangul Syllables
Hangul Syllables is a Unicode block containing precomposed Hangul syllable blocks for modern Korean. The syllables can be directly mapped by algorithm to sequences of two or three characters in the Hangul Jamo Unicode block:
* one of U+1100–U+1112: the 19 modern Hangul leading consonant jamos;
* one of U+1161–U+1175: the 21 modern Hangul vowel jamos;
* none, or one of U+11A8–U+11C2: the 27 modern Hangul trailing consonant jamos.
This block is encoded according to the canonically equivalent order of these (two or three) jamos (one in each subrange of jamos above) composing each syllable. Note that a full Hangul syllable may include one of these characters but may be preceded by one or more leading consonant jamos, and followed by one or more trailing jamos (possibly preceded by one or more vowel jamos if the encoded syllable is composed of two jamos and does not include any trailing consonant jamo). Also, some Hangul syllables may not include any one of these precomposed char ...
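The algorithmic mapping between precomposed syllables and jamo sequences described above can be sketched in Python as follows. This is an illustrative sketch, assuming the Hangul Syllables block starts at U+AC00 and is ordered by leading consonant, then vowel, then trailing consonant; the function names `decompose` and `compose` are hypothetical.

```python
import unicodedata
from typing import Optional, Tuple

S_BASE, L_BASE, V_BASE = 0xAC00, 0x1100, 0x1161
T_BASE = 0x11A7                  # one below U+11A8, so index 0 means "no trailing jamo"
L_COUNT, V_COUNT, T_COUNT = 19, 21, 28   # 27 trailing jamos plus the "none" case


def decompose(syllable: str) -> Tuple[str, str, Optional[str]]:
    """Split one precomposed syllable into (leading, vowel, trailing-or-None)."""
    index = ord(syllable) - S_BASE
    if not 0 <= index < L_COUNT * V_COUNT * T_COUNT:
        raise ValueError("not a precomposed modern Hangul syllable")
    lead, rest = divmod(index, V_COUNT * T_COUNT)
    vowel, trail = divmod(rest, T_COUNT)
    return (chr(L_BASE + lead), chr(V_BASE + vowel),
            chr(T_BASE + trail) if trail else None)


def compose(lead: str, vowel: str, trail: Optional[str] = None) -> str:
    """Inverse of decompose: build the precomposed syllable from its jamos."""
    l, v = ord(lead) - L_BASE, ord(vowel) - V_BASE
    t = ord(trail) - T_BASE if trail else 0
    return chr(S_BASE + (l * V_COUNT + v) * T_COUNT + t)


parts = decompose("\ud55c")      # U+D55C
print([f"U+{ord(p):04X}" for p in parts if p])
```

For a cross-check, `unicodedata.normalize("NFD", ...)` from the Python standard library produces the same jamo sequence for a precomposed syllable.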
Hangul (obsolete Unicode Block)
Hangul, Hangul Supplementary-A, and Hangul Supplementary-B were character blocks that existed in Unicode 1.0 and 1.1, and ISO/IEC 10646-1:1993. These blocks encoded precomposed modern Hangul syllables. These three Unicode 1.x blocks were deleted and superseded by the new Hangul Syllables block (U+AC00–U+D7AF) in Unicode 2.0 (July 1996) and ISO/IEC 10646-1:1993 Amd. 5 (1998), and are now occupied by CJK Unified Ideographs Extension A and Yijing Hexagram Symbols. Moving or removing existing characters has been prohibited by the Unicode Stability Policy for all versions following Unicode 2.0, so the Hangul Syllables block introduced in Unicode 2.0 is immutable.

Documentation
The Unicode 1.0.0 code chart is still available online, including the Korean Hangul Syllables block, but not the supplements added in Unicode 1.1. Full code charts for Unicode 1.1 were "never created", since Unicode 1.1 was published only as a report amending Unicode 1.0 due to the urgency of releasing it; ho ...
Unicode
Unicode, formally The Unicode Standard, is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, which is maintained by the Unicode Consortium, defines as of the current version (15.0) 149,186 characters covering 161 modern and historic scripts, as well as symbols, emoji (including in colors), and non-visual control and formatting codes. Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including modern operating systems, XML, and most modern programming languages. The Unicode character repertoire is synchronized with the Universal Coded Character Set, ISO/IEC 10646, each being code-for-code identical with the other. ''The Unicode Standard'', however, includes more th ...
CJK Unified Ideographs (Unicode Block)
CJK Unified Ideographs is a Unicode block containing the most common CJK ideographs used in modern Chinese, Japanese, Korean and Vietnamese characters. When compared with other blocks containing CJK Unified Ideographs, it is also referred to as the ''Unified Repertoire and Ordering'' (URO). The block has hundreds of variation sequences defined for standardized variants. It also has tens of thousands of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character.

History
The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs block: ...
Hanja
Hanja (Hangul: 한자; Hanja: 漢字), alternatively known as Hancha, are Chinese characters used in the writing of Korean. Hanja was used as early as the Gojoseon period, the first ever Korean kingdom. Hanja-eo (한자어, 漢字語) refers to Sino-Korean vocabulary, which can be written with Hanja, and hanmun (한문, 漢文) refers to Classical Chinese writing, although "Hanja" is also sometimes used to encompass both concepts. Because Hanja never underwent any major reforms, they mostly resemble ''kyūjitai'' and traditional Chinese characters, although the stroke orders for some characters are slightly different. For example, the characters 教 and 敎 as well as 研 and 硏. Only a small number of Hanja characters were modified or are unique to Korean, with the rest being identical to the traditional Chinese characters. By contrast, many of the Chinese characters currently in use in mainland China, Malaysia and Singapore have been simplified, and contain fewer strokes than the corresponding Hanja characters. In J ...
Hangul
The Korean alphabet, known as Hangul (also written Hangeul, following South Korea's standard Romanization) in South Korea and Chosŏn'gŭl in North Korea, is the modern official writing system for the Korean language. The letters for the five basic consonants reflect the shape of the speech organs used to pronounce them, and they are systematically modified to indicate phonetic features; similarly, the vowel letters are systematically modified for related sounds, making Hangul a featural writing system. It has been described as a syllabic alphabet as it combines the features of alphabetic and syllabic writing systems, although it is not necessarily an abugida. Hangul was created in 1443 CE by King Sejong the Great in an attempt to increase literacy by serving as a complement (or alternative) to the logographic Sino-Korean ''Hanja'', which had been used by Koreans as their primary script to write the Korean language since as early as the Gojoseon p ...
Diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek διακριτικός (''diakritikós'', "distinguishing"), from διακρίνω (''diakrínō'', "to distinguish"). The word ''diacritic'' is a noun, though it is sometimes used in an attributive sense, whereas ''diacritical'' is only an adjective. Some diacritics, such as the acute ( ◌́ ) and grave ( ◌̀ ), are often called ''accents''. Diacritics may appear above or below a letter or in some other position such as within the letter or between two letters. The main use of diacritics in Latin script is to change the sound-values of the letters to which they are added. Historically, English has used the diaeresis diacritic to indicate the correct pronunciation of ambiguous words, such as "coöperate", without which the letter sequence could be misinterpreted. Other examples are the acute and grave accents, which can ...
Korean Language
Korean (South Korean: 한국어, ''hangugeo''; North Korean: 조선말, ''chosŏnmal'') is the native language for about 80 million people, mostly of Korean descent. It is the official and national language of both North Korea and South Korea (geographically Korea), but over the past years of political division, the two Koreas have developed some noticeable vocabulary differences. Beyond Korea, the language is recognised as a minority language in parts of China, namely Jilin Province, and specifically Yanbian Prefecture and Changbai County. It is also spoken by Sakhalin Koreans in parts of Sakhalin, the Russian island just north of Japan, and by the Koryo-saram in parts of Central Asia. The language has a few extinct relatives which, along with the Jeju language (Jejuan) of Jeju Island and Korean itself, form the compact Koreanic l ...