As of Unicode version 15.0
Cyrillic script
The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking cou ...
is encoded across several
blocks:
*
CyrillicU+0400–U+04FF 256 characters
*
Cyrillic SupplementU+0500–U+052F 48 characters
*
Cyrillic Extended-AU+2DE0–U+2DFF 32 characters
*
Cyrillic Extended-BU+A640–U+A69F 96 characters
*
Cyrillic Extended-CU+1C80–U+1C8F 9 characters
*
Cyrillic Extended-D
Cyrillic Extended-D is a Unicode block containing superscript and subscript Cyrillic characters used in Cyrillic-based phonetic transcription. The block contains the first Cyrillic characters defined outside of the Basic Multilingual Plane
In ...
U+1E030–U+1E08F 63 characters
*
Phonetic Extensions
Phonetic Extensions is a Unicode block containing phonetic characters used in the Uralic Phonetic Alphabet, Old Irish phonetic notation, the Oxford English dictionary and American dictionaries, and Americanist and Russianist phonetic notation ...
U+1D2B, U+1D78 2 Cyrillic characters
*
Combining Half MarksU+FE2E–U+FE2F 2 Cyrillic characters
The characters in the range U+0400–U+045F are basically the characters from
ISO 8859-5
ISO/IEC 8859-5:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 5: Latin/Cyrillic alphabet'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 198 ...
moved upward by 864 positions. The next characters in the Cyrillic block, range U+0460–U+0489, are historical letters, some being still used for
Church Slavonic. The characters in the range U+048A–U+04FF and the complete Cyrillic Supplement block (U+0500-U+052F) are additional letters for various languages that are written with
Cyrillic script
The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking cou ...
. Two characters in the block Phonetic Extensions block complete the
Uralic Phonetic Alphabet
The Uralic Phonetic Alphabet (UPA) or Finno-Ugric transcription system is a phonetic transcription or notational system used predominantly for the transcription and reconstruction of Uralic languages. It was first published in 1901 by Eemil Ne ...
: and .
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
includes few
precomposed accented Cyrillic letters; the others can be
combined
Combined may refer to:
* Alpine combined (skiing), the combination of slalom and downhill skiing as a single event
** Super combined (skiing)
* Nordic combined (skiing), the combination of cross country skiing and ski jumping as a single event
* T ...
by adding U+0301 ("combining acute accent") after the accented vowel (e.g., е́ у́ э́) (see below).
The following two diacritical marks not specific to Cyrillic can be used with Cyrillic text:
* (= Cyrillic stress mark), in
Combining Diacritical Marks
Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character " Combining Grapheme Joiner", which prevents canonical reordering of combining characters, and despite the name, act ...
bloc
U+0300–U+036F To input an accented letter (with acute accent): for the letter R (for example), digit R0301 (without space between letter and number), than select only and press + = Ŕ.
* (= Cyrillic ten thousands sign), in
Combining Diacritical Marks for Symbols bloc
U+20D0–U+20F0
In the table below, small letters are ordered according to their Unicode numbers; capital letters are placed immediately before the corresponding small letters. Standard Unicode names and
canonical decompositions are included.
Table of characters
Blocks
The Cyrillic block (U+0400 – U+04FF) was added to the Unicode Standard in October, 1991 with the release of version 1.0:
The Cyrillic Supplement block (U+0500 – U+052F) was added to the Unicode Standard in March, 2002 with the release of version 3.2:
The Cyrillic Extended-A (U+2DE0 – U+2DFF) and Cyrillic Extended-B (U+A640 – U+A69F) blocks were added to the Unicode Standard in April, 2008 with the release of version 5.1:
The Cyrillic Extended-C block (U+1C80 – U+1C8F) was added to the Unicode Standard in June, 2016 with the release of version 9.0:
The Cyrillic Extended-D block (U+1E030 – U+1E08F) was added to the Unicode Standard in September, 2022 with the release of version 15.0:
See also
*
List of Cyrillic letters
*
Cyrillic script
The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking cou ...
*
Cyrillic alphabets
Numerous Cyrillic alphabets are based on the Cyrillic script. The early Cyrillic alphabet was developed in the 9th century AD and replaced the earlier Glagolitic script developed by the Byzantine theologians Cyril and Methodius. It is the b ...
References
*
{{Slavic languages
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
*
Russian-language computing
Internet in Russian language