YUSCII
   HOME

TheInfoList



OR:

YUSCII is an informal name for several JUS standards for 7-
bit The bit is the most basic unit of information in computing and digital communication. The name is a portmanteau of binary digit. The bit represents a logical state with one of two possible values. These values are most commonly represented as ...
character encoding Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
. These include: * JUS I.B1.002 (ISO-IR-141, ISO 646-YU), which encodes
Gaj's Latin alphabet Gaj's Latin alphabet ( sh-Latn-Cyrl, Gajeva latinica, separator=" / ", Гајева латиница}, ), also known as ( sr-Cyrl, абецеда, ) or ( sr-Cyrl, гајица, link=no, ), is the form of the Latin script used for writing all ...
, used for
Serbo-Croatian Serbo-Croatian ( / ), also known as Bosnian-Croatian-Montenegrin-Serbian (BCMS), is a South Slavic language and the primary language of Serbia, Croatia, Bosnia and Herzegovina, and Montenegro. It is a pluricentric language with four mutually i ...
and
Slovenian language Slovene ( or ) or Slovenian ( ; ) is a South Slavic language of the Balto-Slavic branch of the Indo-European language family. Most of its 2.5 million speakers are the inhabitants of Slovenia, the majority of them ethnic Slovenes. As Slo ...
* JUS I.B1.003 (ISO-IR-146), which encodes
Serbian Cyrillic alphabet The Serbian Cyrillic alphabet (, ), also known as the Serbian script, (, ), is a standardized variation of the Cyrillic script used to write the Serbian language. It originated in medieval Serbia and was significantly reformed in the 19th cen ...
, and * JUS I.B1.004 (ISO-IR-147), which encodes Macedonian Cyrillic alphabet. The encodings are based on
ISO 646 ISO/IEC 646 ''Information technology — ISO 7-bit coded character set for information interchange'', is an International Organization for Standardization, ISO/International Electrotechnical Commission, IEC standard in the ...
, 7-
bit The bit is the most basic unit of information in computing and digital communication. The name is a portmanteau of binary digit. The bit represents a logical state with one of two possible values. These values are most commonly represented as ...
Latinic
character encoding Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
standard, and were used in
Yugoslavia , common_name = Yugoslavia , life_span = 1918–19921941–1945: World War II in Yugoslavia#Axis invasion and dismemberment of Yugoslavia, Axis occupation , p1 = Kingdom of SerbiaSerbia , flag_p ...
before widespread use of later CP 852,
ISO-8859-2 ISO/IEC 8859-2:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 2: Latin alphabet No. 2'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987. I ...
/ 8859-5,
Windows-1250 Windows-1250 is a code page used under Microsoft Windows to represent texts in Central European and Eastern European languages that use the Latin script. It is primarily used by Czech. It is also used for Polish (as can Windows-1257), Slovak, H ...
/ 1251 and
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
standards. It was named after
ASCII ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
, having the first word "American" replaced with "Yugoslav": "Yugoslav Standard Code for Information Interchange". Specific standards are also sometimes called by a local name: SLOSCII, CROSCII or SRPSCII for JUS I.B1.002, SRPSCII for JUS I.B1.003, MAKSCII for JUS I.B1.004. JUS I.B1.002 is a national ISO 646 variant, i.e. equal to basic
ASCII ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
with less frequently used symbols replaced with specific letters of Gaj's alphabet. Cyrillic standards further replace Latin alphabet letters with corresponding Cyrillic letters. Љ (lj), Њ (nj), Џ (dž) and ѕ (dz) correspond to Latin digraphs, and are mapped over Latin letters which are not used in Serbian or Macedonian (q, w, x, y). YUSCII was originally developed for teleprinters but it also spread for
computer A computer is a machine that can be Computer programming, programmed to automatically Execution (computing), carry out sequences of arithmetic or logical operations (''computation''). Modern digital electronic computers can perform generic set ...
use. This was widely considered a bad idea among
software developer Software development is the process of designing and Implementation, implementing a software solution to Computer user satisfaction, satisfy a User (computing), user. The process is more encompassing than Computer programming, programming, wri ...
s who needed the original ASCII such as , ], ^, ~, , , \ in their
source code In computing, source code, or simply code or source, is a plain text computer program written in a programming language. A programmer writes the human readable source code to control the behavior of a computer. Since a computer, at base, only ...
(an issue partly addressed by trigraphs in C). On the other hand, an advantage of YUSCII is that it remains comparatively readable even when support for it is not available, similarly to the Russian
KOI-7 KOI-7 (КОИ-7) is a 7-bit character encoding, designed to cover Russian, which uses the Cyrillic alphabet. In Russian, KOI-7 stands for ''Kod Obmena Informatsiey, 7 bit'' (Код Обмена Информацией, 7 бит) which means "Co ...
. Numerous attempts to replace it with something better kept failing due to limited support. Eventually,
Microsoft Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
's introduction of
code page In computing, a code page is a character encoding and as such it is a specific association of a set of printable character (computing), characters and control characters with unique numbers. Typically each number represents the binary value in a s ...
s, appearance of
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
and availability of fonts finally spelled sure (but nevertheless still slow) end of YUSCII.


Codepage layout

Code points remained largely the same as in
ASCII ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
to maintain maximum compatibility. Following table shows allocation of character codes in YUSCII. Both
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
and
Cyrillic The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
glyphs are shown:


World System Teletext

YUSCII should not be confused with the G0 Latin set for Serbian, Croatian and Slovene, or the G0 Cyrillic set for Serbian, defined by
World System Teletext World System Teletext (WST) is the name of a standard for encoding and displaying teletext information, which is used as the standard for teletext throughout Europe today. It was adopted into the international standard ITU-R, CCIR 653 (now ITU-R ...
. Like YUSCII, these are based on ASCII and are where possible homologous with each other for Serbian letters. However, they make different decisions and consequently are not compatible with YUSCII. Macedonian letters Ќ and Ѓ are also assigned unique positions rather than the same as their Serbian equivalents, whereas the lowercase form of Џ and the Macedonian letter Ѕ are not supported. The WST G0 sets are detailed below for reference.


See also

*
KOI-7 KOI-7 (КОИ-7) is a 7-bit character encoding, designed to cover Russian, which uses the Cyrillic alphabet. In Russian, KOI-7 stands for ''Kod Obmena Informatsiey, 7 bit'' (Код Обмена Информацией, 7 бит) which means "Co ...
, Russian equivalent. *
Cyrillic script The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic languages, Slavic, Turkic languages, Turkic, Mongolic languages, Mongolic, Uralic languages, Uralic, C ...
*
Scientific transliteration Scientific transliteration, variously called ''academic'', ''linguistic'', ''international'', or ''scholarly transliteration'', is an international system for transliteration of text from the Cyrillic script to the Latin script (romanization). Th ...
* Iskra Delta Partner, a computer with built-in YUSCII * , yet another scheme replacing ASCII characters with otherwise-missing Serbian letters


External links


Tabela standarda za zapisivanje (ex-)YU slova


Footnotes


References

{{DEFAULTSORT:Yuscii Character sets