KOI8-U
KOI8-U (RFC 2319) is an 8-bit character encoding, designed to cover Ukrainian, which uses a Cyrillic alphabet. It is based on KOI8-R, which covers Russian and Bulgarian, but replaces eight box drawing characters with four Ukrainian letters Ґ, Є, І, and Ї in both upper case and lower case. KOI8-RU is closely related, but adds Ў for Belarusian. In both, the letter allocations match those in KOI8-E, except for Ґ which is added to KOI8-F. In Microsoft Windows, KOI8-U is assigned the code page number 21866. In IBM, KOI8-U is assigned code page/CCSID 1168. KOI8 remains much more commonly used than ISO 8859-5, which never really caught on. Another common Cyrillic character encoding is Windows-1251. In the future, both may eventually give way to Unicode. KOI8 stands for ''Kod Obmena Informatsiey, 8 bit'' (russian: Код Обмена Информацией, 8 бит) which means "Code for Information Exchange, 8 bit". The KOI8 character sets have the property that the Russi ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
KOI8-F
KOI8-F or KOI8 Unified is an 8-bit character set. It was designed by Peter Cassetta of Fingertip Software (now defunct) as an attempt to support all the encoded letters from both KOI8-E (ISO-IR-111) and KOI8-RU (and hence also, KOI8-U and KOI8-R), along with some of the pseudographics from KOI8-R, with some additional punctuation in the remaining space, sourced partly from Windows-1251. This encoding was only used in the software of that company. Character set The following table shows the KOI8-F encoding. Each character is shown with its equivalent Unicode code point. Differences from ISO-IR-111 are boxed; other relevant encodings which are matched, if any, are noted in footnotes. See also *KOI character encodings KOI (''КОИ'') is a family of several code pages for the Cyrillic script. The name stands for ''Kod obmena informatsiey'' (russian: Код обмена информацией) which means "Code for Information Interchange". A particular feature ... References ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
KOI Character Encodings
KOI (''КОИ'') is a family of several code pages for the Cyrillic script. The name stands for ''Kod obmena informatsiey'' (russian: Код обмена информацией) which means "Code for Information Interchange". A particular feature of the KOI code pages is that the text remains human-readable when the leftmost bit is stripped, should it inadvertently pass through equipment or software that can only deal with 7 bit wide characters. This is due to characters being placed in a special order (128 codepoints apart from the Latin letter they sound most similar to), which, however, does not correspond to the alphabetic order in any language that is written in Cyrillic and necessitates the use of lookup tables to perform sorting. These encodings are derived from ASCII on the base of some correspondence between Latin and Cyrillic (nearly phonetical), which was already used in Russian dialect of Morse code and in MTK-2 telegraph code. The first 26 characters from А (0xE1) i ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
KOI8-E
ISO-IR-111 or KOI8-E is an 8-bit character set. It is a multinational extension of KOI-8 for Belarusian, Macedonian, Serbian, and Ukrainian (except Ґґ which is added to KOI8-F). The name "ISO-IR-111" refers to its registration number in the ISO-IR registry, and denotes it as a set usable with ISO/IEC 2022. It was defined by the first (1986) edition of ECMA-113, which is the Ecma International standard corresponding to , and as such also corresponds to a 1987 draft version of ISO-8859-5. The published editions of instead correspond to subsequent editions of ECMA-113, which defines a different encoding. Naming confusion ISO-IR-111, the 1985 edition of ECMA-113 (also called "ECMA-Cyrillic" or "KOI8-E"), was based on the 1974 edition of GOST 19768 (i.e. KOI-8). In 1987 ECMA-113 was redesigned. These newer editions of ECMA-113 are equivalent to ISO-8859-5, and do not follow the KOI layout. This confusion has led to a common misconception that ISO-8859-5 was defined in or based ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
KOI8-R
KOI8-R (RFC 1489) is an 8-bit character encoding, derived from the KOI-8 encoding by the programmer Andrei Chernov in 1993 and designed to cover Russian, which uses a Cyrillic alphabet. KOI8-R was based on Russian Morse code, which was created from a phonetic version of Latin Morse code. As a result, Russian Cyrillic letters are in pseudo-Roman order rather than the normal Cyrillic alphabetical order. Although this may seem unnatural, if the 8th bit is stripped, the text is partially readable in ASCII and may convert to syntactically correct KOI-7. For example, "Русский Текст" in KOI8-R becomes ''rUSSKIJ tEKST'' ("Russian Text"). KOI8 stands for ''Kod Obmena Informatsiey, 8 bit'' (russian: Код Обмена Информацией, 8 бит) which means "Code for Information Exchange, 8 bit". In Microsoft Windows, KOI8-R is assigned the code page number 20866. In IBM, KOI8-R is assigned code page 878. KOI8-R also happens to cover Bulgarian, but has not be ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
KOI8-B
KOI8-B is the informal name for an 8-bit Roman / Cyrillic character set constituting the common subset of the major KOI-8 variants (KOI8-R, KOI8-U, KOI8-RU, KOI8-E, KOI8-F). Accordingly, it is closely related to KOI8-R, but defines only the letter subset in the upper half. As such it was implemented by some font vendors for PC Unixes like Xenix in the late 1980s. Character set The following table shows the KOI8-B encoding. Each character is shown with its equivalent Unicode code point. See also * KOI character encodings KOI (''КОИ'') is a family of several code pages for the Cyrillic script. The name stands for ''Kod obmena informatsiey'' (russian: Код обмена информацией) which means "Code for Information Interchange". A particular feature ... References External links *http://czyborra.com/charsets/koi8-b.txt.gz *http://czyborra.com/charsets/koi8-b.bdf.gz {{Character encoding Character sets Computing in the Soviet Union ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
KOI8-RU
KOI8-RU is an 8-bit character encoding, designed to cover Russian, Ukrainian, and Belarusian which use a Cyrillic alphabet. It is closely related to KOI8-R, which covers Russian and Bulgarian, but replaces ten box drawing characters with five Ukrainian and Belarusian letters Ґ, Є, І, Ї, and Ў in both upper case and lower case. It is even more closely related to KOI8-U, which does not include Ў but otherwise makes the same replacements. The additional letter allocations are matched by KOI8-E, except for Ґ which is added to KOI8-F. In IBM, KOI8-RU is assigned code page/CCSID 1167. KOI8 remains much more commonly used than ISO 8859-5, which never really caught on. Another common Cyrillic character encoding is Windows-1251. In the future, both may eventually give way to Unicode. KOI8 stands for ''Kod obmena informatsiey, 8 bit'' (russian: Код обмена информацией, 8 бит) which means "Code for Information Exchange, 8 bit". The KOI8 character set ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Ukrainian Ye
Ukrainian Ye (Є є; italics: ) is a character of the Cyrillic script. It is a separate letter in the Ukrainian alphabet (8th position since 1992, 7th position before then), the Pannonian Rusyn alphabet, and both the Carpathian Rusyn alphabets; in all of these, it comes directly after Е. In modern Church Slavonic, it is considered a variant form of Ye (Е е) (there, the selection of Є and Е is driven by orthography rules). Until the mid-19th century, Є/є was also used in Romanian and Serbian (the letter was eliminated in Vuk Karadžić's alphabet and replaced by digraph је). Other modern Slavonic languages may use Є/є shapes instead of Е/е for decorative purposes. Then, the letter is usually referred to by the older name Yest (which also refers to the conventional Ye). If the two need to be distinguished, the descriptive name Broad E is sometimes used (in contrast with "Narrow E"). In Ukrainian, Є/є commonly represents the sound or like the pr ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Windows-1251
Windows-1251 is an 8-bit character encoding, designed to cover languages that use the Cyrillic script such as Russian, Ukrainian, Belarusian, Bulgarian, Serbian Cyrillic, Macedonian and other languages. On the web, it is the second most-used single-byte character encoding (or third most-used character encoding overall), and most used of the single-byte encodings supporting Cyrillic. , 0.4% of all websites use Windows-1251. It's by far mostly used for Russian, while a small minority of Russian websites use it, with 93.7% of Russian (.ru) websites using UTF-8, and the legacy 8-bit encoding is distant second. In Linux, the encoding is known as cp1251. IBM uses code page 1251 (CCSID 1251 and euro sign extended CCSID 5347) for Windows-1251. Windows-1251 and KOI8-R (or its Ukrainian variant KOI8-U) are much more commonly used than ISO 8859-5 (which is used by less than 0.0004% of websites). In contrast to Windows-1252 and ISO 8859-1, Windows-1251 is not closely related to ISO 885 ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
ISO 8859-5
ISO/IEC 8859-5:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 5: Latin/Cyrillic alphabet'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1988. It is informally referred to as Latin/Cyrillic. It was designed to cover languages using a Cyrillic alphabet such as Bulgarian, Belarusian, Russian, Serbian and Macedonian but was never widely used. It would also have been usable for Ukrainian in the Soviet Union from 1933 to 1990, but it is missing the Ukrainian letter ''ge'', ґ, which is required in Ukrainian orthography before and since, and during that period outside Soviet Ukraine. As a result, IBM created Code page 1124. ISO-8859-5 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. The 8-bit encodings KOI8-R and KOI8-U, CP866, and also Windows-1251 are far more commonly used. In contrast to Windows-1252 ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Yi (Cyrillic)
Yi (Ї ї; italics: ) is a letter of the Cyrillic script. Yi is derived from the Greek letter iota with diaeresis. It was the initial variant of the Cyrillic letter Іі, which saw change from two dots to one in 18th century, possibly inspired by similar Latin letter i. Later two variants of the letter separated to become distinct letters in the Ukrainian alphabet. It is used in the Ukrainian alphabet, the Pannonian Rusyn alphabet, and the Prešov Rusyn alphabet of Slovakia, where it represents the iotated vowel sound , like the pronunciation of in "yeast". As the historical variant of the Cyrillic Іі it represented either /i/ (as i in ''pizza'') or /j/ (as y in ''yen''). In various romanization systems of Ukrainian, ''ї'' is represented by Latin letters ''i'' or ''yi'' (word-initially), ''yi'', ''ji'', or even '' ï''. It was formerly also used in the Serbian Cyrillic alphabet in the late 1700s and early 1800s, where it represented the sound ; in this capacity, ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Soft-dotted I (Cyrillic)
The dotted i (І і; italics: ''''), also called decimal i (и десятеричное, after its former numeric value), is a letter of the Cyrillic script. It commonly represents the close front unrounded vowel , like the pronunciation of ⟨i⟩ in English "machine". It is used in the orthographies of Belarusian, Kazakh, Khakas, Komi, Carpathian Rusyn and Ukrainian and quite often, but not always, is the equivalent of the Cyrillic letter i (И и) as used in Russian and other languages. The letter was also used in Russian before 1918. In Ukrainian, І is the twelfth letter of the alphabet and represents the sound Close_front_unrounded_vowel.html" ;"title="nowiki/> iin writing. Ukrainian uses и to represent the sound [Near-close near-front unrounded vowel">ɪ">Close front unrounded vowel">iin writing. Ukrainian uses и to represent the sound [Near-close near-front unrounded vowel">ɪ In Belarusian, I is the tenth letter of the alphabet. It represents Close f ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Ghe With Upturn
Ghe with upturn (Ґ ґ; italics: ) is a letter of the Cyrillic script. It is part of the Ukrainian alphabet, the Pannonian Rusyn alphabet and both the Carpathian Rusyn alphabets, and also some variants of the Urum and Belarusian (i.e. Belarusian Classical Orthography) alphabets. In these alphabets it is usually called "Ge", while the letter it follows — ⟨ Г г⟩ (which in its turn is also called "Ge" in Russian and many other languages) — is called "He". In Unicode, this letter is called "Ghe with upturn". The letterform of this letter is based on the letterform of the letter ⟨ Г г⟩, but its handwritten and italic lowercase forms do not follow the italic modification of ⟨г⟩ (i.e. ''г''). It represents the voiced velar plosive , like the pronunciation of ⟨g⟩ in "go". Ghe with upturn is romanized using the Latin letter G (but with an additional grave accent in ISO 9). History The common Slavic voiced velar plosive is represented in ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |