KOI8-RU
KOI8-RU is an 8-bit character encoding designed to cover Russian, Ukrainian, and Belarusian, all of which use a Cyrillic alphabet. It is closely related to KOI8-R, which covers Russian and Bulgarian, but replaces ten box-drawing characters with the five Ukrainian and Belarusian letters Ґ, Є, І, Ї, and Ў in both upper and lower case. It is even more closely related to KOI8-U, which does not include Ў but otherwise makes the same letter replacements. The additional letter allocations are matched by KOI8-E, except for Ґ, which is added to KOI8-F. IBM assigns KOI8-RU code page/CCSID 1167. KOI8 remains much more commonly used than ISO 8859-5, which never really caught on. Another common Cyrillic character encoding is Windows-1251. Both may eventually give way to Unicode. KOI8 stands for ''Kod obmena informatsiey, 8 bit'', which means "Code for Information Exchange, 8 bit". The KOI8 character sets have the property that the Russian Cyrillic lette ...


KOI Character Encodings
KOI (''КОИ'') is a family of several code pages for the Cyrillic script. The name stands for ''Kod obmena informatsiey'', which means "Code for Information Interchange". A particular feature of the KOI code pages is that text remains human-readable when the leftmost (most significant) bit is stripped, should it inadvertently pass through equipment or software that can only handle 7-bit characters. This works because each Cyrillic character is placed 128 code points away from the Latin letter it sounds most similar to. That order, however, does not correspond to the alphabetic order of any language written in Cyrillic, so sorting requires lookup tables. These encodings are derived from ASCII on the basis of a nearly phonetic correspondence between Latin and Cyrillic letters, which was already used in the Russian dialect of Morse code and in the MTK-2 telegraph code. The first 26 characters from А (0xE1) in KOI8-R are ...
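The sorting consequence described above can be illustrated with a minimal sketch. It assumes Python's built-in `koi8_r` codec and uses the Unicode code point of each decoded byte as a stand-in lookup table; the sample words and the names `RANK` and `sort_key` are illustrative only. Real collation would also need to place ё correctly, which this simplification ignores.

```python
# Three Russian words; their alphabetical order is: арбуз, вода, яма.
words = ["вода", "яма", "арбуз"]
encoded = [w.encode("koi8_r") for w in words]

# Sorting the raw KOI8-R bytes follows the pseudo-Latin layout,
# so "я" (0xD1) sorts before "в" (0xD7).
byte_sorted = [b.decode("koi8_r") for b in sorted(encoded)]

# Lookup table: for every byte value, a rank taken from the Unicode
# code point of the character that byte represents in KOI8-R.
RANK = [ord(bytes([b]).decode("koi8_r")) for b in range(256)]


def sort_key(raw: bytes) -> list:
    """Translate each KOI8-R byte to its rank before comparing."""
    return [RANK[b] for b in raw]


table_sorted = [b.decode("koi8_r") for b in sorted(encoded, key=sort_key)]

print(byte_sorted)   # order produced by comparing raw bytes
print(table_sorted)  # order produced by the lookup table
```

The table costs 256 entries and one indexing step per byte, which is the kind of overhead the pseudo-Latin layout trades for 7-bit readability.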


KOI8-F
KOI8-F or KOI8 Unified is an 8-bit character set. It was designed by Peter Cassetta of Fingertip Software (now defunct) as an attempt to support all the encoded letters from both KOI8-E (ISO-IR-111) and KOI8-RU (and hence also KOI8-U and KOI8-R), along with some of the pseudographics from KOI8-R and, in the remaining space, some additional punctuation sourced partly from Windows-1251. This encoding was used only in that company's software. FreeDOS calls it code page 60270. Character set: The following table shows the KOI8-F encoding. Each character is shown with its equivalent Unicode code point. Differences from ISO-IR-111 are boxed; other relevant encodings which are matched, if any, are noted in footnotes. KOI8-C/KOI8-CA: A variant, KOI8-C, also known as KOI8-CA, is an 8-bit character set. It is a modification of KOI8-F to support Caucasian languages while retaining support for the same languages as KOI8-F. FreeDOS calls it code page 61294. It has hardly ever been ...


KOI8-U
KOI8-U (RFC 2319) is an 8-bit character encoding designed to cover Ukrainian, which uses a Cyrillic alphabet. It is based on KOI8-R, which covers Russian and Bulgarian, but replaces eight box-drawing characters with the four Ukrainian letters Ґ, Є, І, and Ї in both upper and lower case. KOI8-RU is closely related, but adds Ў for Belarusian. In both, the letter allocations match those in KOI8-E, except for Ґ, which is added to KOI8-F. Microsoft Windows assigns KOI8-U the code page number 21866, and IBM assigns it code page/CCSID 1168. KOI8 remains much more commonly used than ISO 8859-5, which never really caught on. Another common Cyrillic character encoding is Windows-1251. Both may eventually give way to Unicode. KOI8 stands for ''Kod Obmena Informatsiey, 8 bit'', which means "Code for Information Exchange, 8 bit". The KOI8 character sets have the property that the Cyrillic letters are in pseudo-Latin alphabetic order rath ...
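The relationship between KOI8-R and KOI8-U can be checked mechanically. The following is a minimal sketch that assumes Python's built-in `koi8_r` and `koi8_u` codecs faithfully reflect the two encodings; the helper name `differing_positions` is illustrative. It decodes every byte in the upper half under both codecs and collects the positions where the results differ, which should be the eight positions holding Ґ, Є, І, and Ї in both cases.

```python
def differing_positions(enc_a: str, enc_b: str) -> dict:
    """Map each upper-half byte that decodes differently to (char_a, char_b)."""
    diffs = {}
    for byte in range(0x80, 0x100):
        a = bytes([byte]).decode(enc_a)
        b = bytes([byte]).decode(enc_b)
        if a != b:
            diffs[byte] = (a, b)
    return diffs


diffs = differing_positions("koi8_r", "koi8_u")
for byte, (in_r, in_u) in sorted(diffs.items()):
    # Left: the box-drawing character in KOI8-R; right: the letter in KOI8-U.
    print(f"0x{byte:02X}: KOI8-R {in_r!r} -> KOI8-U {in_u!r}")
```

The same comparison would apply to KOI8-RU given a codec or mapping table for it; none is assumed here.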




KOI8-B
KOI8-B is the informal name for an 8-bit Roman/Cyrillic character set constituting the common subset of the major KOI-8 variants (KOI8-R, KOI8-U, KOI8-RU, KOI8-E, KOI8-F). Accordingly, it is closely related to KOI8-R, but defines only the letter subset in the upper half. As such, it was implemented by some font vendors for PC Unixes like Xenix in the late 1980s. Character set: The following table shows the KOI8-B encoding. Each character is shown with its equivalent Unicode code point. See also: KOI character encodings. External links: http://czyborra.com/charsets/koi8-b.txt.gz and http://czyborra.com/charsets/koi8-b.bdf.gz


KOI8-R
KOI8-R (RFC 1489) is an 8-bit character encoding derived from the KOI-8 encoding by the programmer Andrei Chernov in 1993 and designed to cover Russian, which uses the Russian subset of the Cyrillic script. KOI-8, in turn, is an 8-bit extension of the KOI-7 encoding, which inherited a phonetic correspondence between Russian and Latin letters from the MTK-2 teletype code. As a result, Russian Cyrillic letters in KOI8-R are in pseudo-Latin alphabetical order rather than the normal Cyrillic order used in ISO 8859-5. Although this may seem unnatural, it has the useful effect that if the 8th bit is stripped, the text remains partially readable in any ASCII-based encoding (including KOI8-R itself) as a case-reversed transliteration. For example, "Код для обмена и обработки информации" (the Russian meaning of the "KOI" acronym) becomes ''kOD DLQ OBMENA I OBRABOTKI INFORMACII''. KOI-8 stands for ''8-bitnyy kod dlya obmena i obrabotki informatsii'' ( ...
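The bit-stripping behaviour described above can be reproduced in a few lines. This is a minimal sketch that assumes Python's built-in `koi8_r` codec; the function name `strip_high_bit` is illustrative. It encodes the text as KOI8-R, clears the 8th bit of every byte, and reads the result as ASCII.

```python
def strip_high_bit(text: str, encoding: str = "koi8_r") -> str:
    """Encode text, clear bit 7 of every byte, and decode the result as ASCII."""
    raw = text.encode(encoding)
    stripped = bytes(b & 0x7F for b in raw)  # simulates a 7-bit-only channel
    return stripped.decode("ascii")


phrase = "Код для обмена и обработки информации"
print(strip_high_bit(phrase))  # kOD DLQ OBMENA I OBRABOTKI INFORMACII
```

The case reversal arises because lowercase Cyrillic letters occupy 0xC0 to 0xDF, which map onto uppercase Latin at 0x40 to 0x5F, while uppercase Cyrillic at 0xE0 to 0xFF maps onto lowercase Latin at 0x60 to 0x7F.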


Belarusian Language
Belarusian is an East Slavic language. It is one of the two official languages in Belarus, the other being Russian. It is also spoken in parts of Russia, Lithuania, Latvia, Poland, Ukraine, and the United States by the Belarusian diaspora. Before Belarus gained independence in 1991, the language was known in English as ''Byelorussian'' or ''Belorussian'', or alternatively as ''White Russian''. Following independence, it became known as ''Belarusian'', or alternatively as ''Belarusan''. As one of the East Slavic languages, Belarusian shares many grammatical and lexical features with other members of the group. To some extent, Russian, Ukrainian, and Belarusian retain a degree of mutual intelligibility. Belarusian descends from a language generally referred to as Ruthenian (13th to 18th centuries), which had, in turn, descend ...


Number Sign
The symbol # is known as the number sign, hash, or (in North America) the pound sign. The symbol has historically been used for a wide range of purposes, including the designation of an ordinal number and as a ligatured abbreviation for pounds avoirdupois, having been derived from the now-rare ℔. Since 2007, widespread usage of the symbol to introduce metadata tags on social media platforms has led to such tags being known as "hashtags", and from that, the symbol itself is sometimes called a hashtag. The symbol is distinguished from similar symbols by its combination of level horizontal strokes and right-tilting vertical strokes. History: It is believed that the symbol traces its origins to the symbol ℔, an abbreviation of the Roman term ''libra pondo'', which translates as "pound weight". The abbreviation "lb" was printed as a dedicated ligature including a horizontal line across (which indicated abbreviation ...


Unicode
Unicode (also ''The Unicode Standard'' or TUS) is a character encoding standard maintained by the Unicode Consortium and designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 characters and 168 scripts used in various ordinary, literary, academic, and technical contexts. Unicode has largely supplanted the previous environment of a myriad of incompatible character sets used within different locales and on different computer architectures. The entire repertoire of these sets, plus many additional characters, was merged into the single Unicode set. Unicode is used to encode the vast majority of text on the Internet, including most web pages, and relevant Unicode support has become a common consideration in contemporary software development. Unicode is ultimately capable of encoding more than 1.1 million characters. The Unicode character repertoire is synchronized with Univers ...


Space Character
A whitespace character is a character data element that represents white space when text is rendered for display by a computer. For example, a ''space'' character (ASCII 32) represents blank space such as a word divider in a Western script. A printable character results in output when rendered, but a whitespace character does not. Instead, whitespace characters define the layout of text to a limited degree, interrupting the normal sequence of rendering characters next to each other. The output of subsequent characters is typically shifted to the right (or to the left for a right-to-left script) or to the start of the next line. The effect of multiple sequential whitespace characters is cumulative, such that the next printable character is rendered at a location based on the accumulated effect of the preceding whitespace characters. The term ''whitespace'' originates in the common practice of rendering text on white paper. Normally, a whitespace character is ''n ...


Exclamation Mark
The exclamation mark (also known as exclamation point in American English) is a punctuation mark usually used after an interjection or exclamation to indicate strong feelings or to show emphasis. The exclamation mark often marks the end of a sentence, for example: "Watch out!". Similarly, a bare exclamation mark (with nothing before or after) is frequently used in warning signs. Additionally, the exclamation mark is commonly used in writing to make a character seem as though they are shouting, excited, or surprised. Other uses include:
* In mathematics, it denotes the factorial operation.
* Several computer languages use ! at the beginning of an expression to denote logical negation. For example, !A means "the logical negation of A", also called "not A". This usage has spread to ordinary language (e.g., "!clue" means no-clue or clueless).
* Some languages use ǃ, a symbol that looks like an exclamation mark, to denote a click consonant. ...


Quotation Mark
Quotation marks are punctuation marks used in pairs in various writing systems to identify direct speech, a quotation, or a phrase. The pair consists of an opening quotation mark and a closing quotation mark, which may or may not be the same glyph. Quotation marks have a variety of forms in different languages and in different media. History: The single quotation mark is traced to Ancient Greek practice, adopted and adapted by monastic copyists. Isidore of Seville, in his seventh-century encyclopedia, described their use of the Greek ''diplé'' (a chevron). The double quotation mark derives from a marginal notation used in fifteenth-century manuscript annotations to indicate a passage of particular importance (not necessarily a quotation); the notation was placed in the outside margin of the page and was repeated alongside each line of the passage. In his edition of the works of Aristotle, which appeared in 1483 or 1484, the Milanese Renaissance humanis ...


Percent Sign
The percent sign % (sometimes per cent sign in British English) is the symbol used to indicate a percentage, a number or ratio expressed as a fraction of 100. Related signs include the permille (per thousand) sign and the permyriad (per ten thousand) sign (also known as a basis point), which indicate that a number is divided by one thousand or ten thousand, respectively. Higher proportions use parts-per notation. Correct style, form and spacing: English style guides prescribe writing the percent sign following the number without any space between (e.g. 50%). However, the International System of Units and the ISO 31-0 standard prescribe a space between the number and the percent sign, in line with the general practice of using a non-breaking space between a numerical value and its corresponding unit of measurement. Other languages have other rules for spacing in front of the percent sign: * In Czech and in Slovak ...