GB 12052
   HOME

TheInfoList



OR:

GB 12052-89, entitled ''Korean character coded character set for information interchange'' ( zh, s=信息交换用朝鲜文字编码字符集), is a
character set Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical values that make up a c ...
standard established by
China China, officially the People's Republic of China (PRC), is a country in East Asia. With population of China, a population exceeding 1.4 billion, it is the list of countries by population (United Nations), second-most populous country after ...
for the Korean language in China. It consists of a total of 5,979 characters, and has no relationship nor compatibility with
South Korea South Korea, officially the Republic of Korea (ROK), is a country in East Asia. It constitutes the southern half of the Korea, Korean Peninsula and borders North Korea along the Korean Demilitarized Zone, with the Yellow Sea to the west and t ...
's
KS X 1001 KS X 1001, "''Code for Information Interchange (Hangul and Hanja)''", formerly called KS C 5601, is a South Korean coded character set standard to represent Hangul and Hanja characters on a computer. KS X 1001 is encoded by the most common leg ...
and
North Korea North Korea, officially the Democratic People's Republic of Korea (DPRK), is a country in East Asia. It constitutes the northern half of the Korea, Korean Peninsula and borders China and Russia to the north at the Yalu River, Yalu (Amnok) an ...
's
KPS 9566 KPS 9566 ("''DPRK Standard Korean Graphic Character Set for Information Interchange''") is a North Korean standard specifying a character encoding for the Chosŏn'gŭl (Hangul) writing system used for the Korean language. The edition of 1997 spec ...
.


Characters

Characters in GB 12052 are arranged in a 94×94 grid (as in
ISO/IEC 2022 ISO/IEC 2022 ''Information technology—Character code structure and extension techniques'', is an ISO/ IEC standard in the field of character encoding. It is equivalent to the ECMA standard ECMA-35, the ANSI standard ANSI X3.41 and the Japane ...
), and the two-byte code point of each character is expressed in the ''qu''-''wei'' form, which specifies a row (''qu'' ) and the position of the character within the row (cell, ''wei'' ). The rows (numbered from 1 to 94) contain characters as follows: * 01–09: identical to
GB 2312 is a key official character set of the People's Republic of China, used for Simplified Chinese characters. GB2312 is the registered internet name for EUC-CN, which is its usual encoded form. ''GB'' refers to the Guobiao standards (国家标准), ...
, except 03-04 ( in GB 2312, in GB 12052) * 16–37: modern
Hangul The Korean alphabet is the modern writing system for the Korean language. In North Korea, the alphabet is known as (), and in South Korea, it is known as (). The letters for the five basic consonants reflect the shape of the speech organs ...
syllables and ''jamo'', level 1 (2,017 syllables and 51 ''jamo'') * 38–52: modern Hangul syllables, level 2 (1,356 characters) * 53–72: archaic Hangul syllables and ''jamo'' (1,683 syllables and 96 ''jamo''), and 94
Chinese characters Chinese characters are logographs used Written Chinese, to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represe ...
The rows 10–15 and 73–94 are unassigned.


Errors

There are some errors in the standard: * 41-64: 믃 in the fold-out table, 믌 in the standard proper – should be 믃 * 46-65: 틘 in the fold-out table, 퇸 in the standard proper – should be 틘 * 49-37: 뗸 in the fold-out table, 뎬 in the standard proper – should be 뗸 * 51-82: 윹 in the fold-out table, 율 in the standard proper – should be 윹 * 53-67: ᄀᆈ in the fold-out table, missing in the standard proper – should be ᄀᆈ * 72-88: missing in the fold-out table, 夞 in the standard proper – should be 夞


Precomposed modern Hangul sets

Unlike KS X 1001 and KPS 9566, GB 12052 * does not encode ''jamo'' in a separate section (see row 4 of KS X 1001 and row 4 of KPS 9566). * has two levels of precomposed modern Hangul syllables. However, like KS X 1001, GB 12052 lacks the initial+vowel counterparts for some initial+vowel+final syllables: * 땩, 땬, 땽: missing 땨 * 뗙, 뗜, 뗨, 뗩, 뗭: missing 뗘 * 뗸, 똉: missing 뗴 * 뚁: missing 뚀 * 뚼: missing 뚸 * 뼹: missing 뼤 * 쪤, 쪵: missing 쪠


Level 1 (rows number 16 through 37)


Level 2 (rows number 38 through 52)


Statistics by jamo


Footnotes


References


External links

* * {{Hangul Jamo Encodings of Asian languages Korean-language computing 12052 Hangul