As of Unicode version 15.0
Cyrillic script
The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking cou ...
is encoded across several
blocks:
*
CyrillicU+0400–U+04FF 256 characters
*
Cyrillic SupplementU+0500–U+052F 48 characters
*
Cyrillic Extended-A
Cyrillic Extended-A is a Unicode block containing combining Cyrillic
, bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця
, fam1 = Egyptian hieroglyphs
, fam2 ...
U+2DE0–U+2DFF 32 characters
*
Cyrillic Extended-B
Cyrillic Extended-B is a Unicode block
A Unicode block is one of several contiguous ranges of numeric character codes ( code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation pu ...
U+A640–U+A69F 96 characters
*
Cyrillic Extended-C
Cyrillic Extended-C is a Unicode block containing Cyrillic characters for facsimile reprinting Old Believer
Old Believers or Old Ritualists, ''starovery'' or ''staroobryadtsy'' are Eastern Orthodox Christians who maintain the liturgical and r ...
U+1C80–U+1C8F 9 characters
*
Cyrillic Extended-DU+1E030–U+1E08F 63 characters
*
Phonetic Extensions
Phonetic Extensions is a Unicode block containing phonetic characters used in the Uralic Phonetic Alphabet, Old Irish phonetic notation, the Oxford English dictionary and American dictionaries, and Americanist and Russianist phonetic notation ...
U+1D2B, U+1D78 2 Cyrillic characters
*
Combining Half Marks
Combining Half Marks is a Unicode block containing diacritical combining characters for spanning multiple characters.
Block
History
The following Unicode-related documents record the purpose and process of defining specific characters in the C ...
U+FE2E–U+FE2F 2 Cyrillic characters
The characters in the range U+0400–U+045F are basically the characters from
ISO 8859-5
ISO/IEC 8859-5:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 5: Latin/Cyrillic alphabet'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 198 ...
moved upward by 864 positions. The next characters in the Cyrillic block, range U+0460–U+0489, are historical letters, some being still used for
Church Slavonic. The characters in the range U+048A–U+04FF and the complete Cyrillic Supplement block (U+0500-U+052F) are additional letters for various languages that are written with
Cyrillic script
The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking cou ...
. Two characters in the block Phonetic Extensions block complete the
Uralic Phonetic Alphabet: and .
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
includes few
precomposed accented Cyrillic letters; the others can be
combined
Combined may refer to:
* Alpine combined (skiing), the combination of slalom and downhill skiing as a single event
** Super combined (skiing)
* Nordic combined (skiing), the combination of cross country skiing and ski jumping as a single event
* T ...
by adding U+0301 ("combining acute accent") after the accented vowel (e.g., е́ у́ э́) (see below).
The following two diacritical marks not specific to Cyrillic can be used with Cyrillic text:
* (= Cyrillic stress mark), in
Combining Diacritical Marks
Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character " Combining Grapheme Joiner", which prevents canonical reordering of combining characters, and despite the name, act ...
bloc
U+0300–U+036F To input an accented letter (with acute accent): for the letter R (for example), digit R0301 (without space between letter and number), than select only and press + = Ŕ.
* (= Cyrillic ten thousands sign), in
Combining Diacritical Marks for Symbols bloc
U+20D0–U+20F0
In the table below, small letters are ordered according to their Unicode numbers; capital letters are placed immediately before the corresponding small letters. Standard Unicode names and
canonical decomposition
Unicode equivalence is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same character. This feature was introduced in the standard to allow compatibility with preexisting st ...
s are included.
Table of characters
Blocks
The Cyrillic block (U+0400 – U+04FF) was added to the Unicode Standard in October, 1991 with the release of version 1.0:
The Cyrillic Supplement block (U+0500 – U+052F) was added to the Unicode Standard in March, 2002 with the release of version 3.2:
The Cyrillic Extended-A (U+2DE0 – U+2DFF) and Cyrillic Extended-B (U+A640 – U+A69F) blocks were added to the Unicode Standard in April, 2008 with the release of version 5.1:
The Cyrillic Extended-C block (U+1C80 – U+1C8F) was added to the Unicode Standard in June, 2016 with the release of version 9.0:
The Cyrillic Extended-D block (U+1E030 – U+1E08F) was added to the Unicode Standard in September, 2022 with the release of version 15.0:
See also
*
List of Cyrillic letters
*
Cyrillic script
The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking cou ...
*
Cyrillic alphabets
Numerous Cyrillic alphabets are based on the Cyrillic script. The early Cyrillic alphabet was developed in the 9th century AD and replaced the earlier Glagolitic script developed by the Byzantine theologians Cyril and Methodius. It is the b ...
References
*
{{Slavic languages
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
*
Russian-language computing
Internet in Russian language