Cuneiform (Unicode block)
   HOME

TheInfoList



OR:

In
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
, the Sumero-Akkadian
Cuneiform script Cuneiform is a logo- syllabic script that was used to write several languages of the Ancient Middle East. The script was in active use from the early Bronze Age until the beginning of the Common Era. It is named for the characteristic wedge-s ...
is covered in three blocks in the
Supplementary Multilingual Plane In the Unicode standard, a plane is a continuous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position hexadecimal ...
(SMP): * U+12000–U+123FF Cuneiform * U+12400–U+1247F Cuneiform Numbers and Punctuation * U+12480–U+1254F
Early Dynastic Cuneiform Early Dynastic Cuneiform is the name of a Unicode block of the Supplementary Multilingual Plane (SMP), at U+12480–U+1254F, introduced in version 8.0 (June 2015). It is a supplement to the earlier encoding of the cuneiform script in the t ...
The sample glyphs in the chart file published by the Unicode Consortium show the characters in their Classical Sumerian form ( Early Dynastic period, mid 3rd millennium BCE). The characters as written during the 2nd and 1st millennia BCE, the era during which the vast majority of cuneiform texts were written, are considered font variants of the same characters.


Character inventory and ordering

The final proposal for Unicode encoding of the script was submitted by two cuneiform scholars working with an experienced Unicode proposal writer in June 2004. The base character inventory is derived from the list of
Ur III The Third Dynasty of Ur, also called the Neo-Sumerian Empire, refers to a 22nd to 21st century BC ( middle chronology) Sumerian ruling dynasty based in the city of Ur and a short-lived territorial-political state which some historians consider t ...
signs compiled by the
Cuneiform Digital Library Initiative The Cuneiform Digital Library Initiative (CDLI) is an international digital library project aimed at putting text and images of an estimated 500,000 recovered cuneiform tablets created from between roughly 3350 BC and the end of the pre-Christian er ...
of
UCLA The University of California, Los Angeles (UCLA) is a public land-grant research university in Los Angeles, California. UCLA's academic roots were established in 1881 as a teachers college then known as the southern branch of the California ...
based on the inventories of Miguel Civil, Rykle Borger (2003), and Robert Englund. Rather than opting for a direct ordering by glyph shape and complexity, according to the numbering of an existing catalogue, the Unicode order of glyphs was based on the Latin alphabetic order of their 'main' Sumerian transliteration as a practical approximation. Of the 907 signs listed by Borger (2003), some 200 have no encoding at a single code point. Conversely, a number of combinations considered reducible by Borger were assigned unique code points. These differences are due to the difficulty of establishing what represents a single character in cuneiform, and indeed most of Borger's items not encoded have straightforward etymological decomposition. There are still quite a number of universally recognized signs missing, and criticism has been voiced to the effect that the encoding "disregards an important part of the accumulated knowledge of generations of assyriologists about what actually function as single signs in normal texts, and are reflected in the traditional sign lists, most recently and comprehensively Borger's ''Mesopotamische Zeichenliste''". For example, there are signs written as ligatures of varying constituent signs, such as KURUM7 (Borger 2003 no. 729) that was written IGI.NÍG in early times, but later IGI.ERIM. Since there is no code point for KURUM7, the sign must be expressed as either IGI.NÍG (U+12146 U+1243C, ) or IGI.ERIM (U+12146 U+1209F, ) depending on the shape of the glyph, in violation of the basic principle of Unicode to encode
characters Character or Characters may refer to: Arts, entertainment, and media Literature * ''Character'' (novel), a 1936 Dutch novel by Ferdinand Bordewijk * ''Characters'' (Theophrastus), a classical Greek set of character sketches attributed to The ...
, not
glyph A glyph () is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A g ...
s. While those signs can in principle still be added by a "Cuneiform Extended" range in the future, as has been done for a number of other scripts (" Latin Extended" etc.), their absence as of Unicode 7.0 means that the standard's usability for the encoding of actual texts is limited. Rather than opting for an ordering by glyph shape and complexity, the Unicode order of characters is the Latin alphabet order of their "main" Sumerian transliteration (placing signs on Š-, transliterated as SH-, between SAR and SI). In most (but not all) cases, the "etymological" decomposition of originally complex signs ("ligatures") has been chosen, even if the sign's most familiar value is another. For example, is better known as AMAŠ, is better known as ÁG, and is better known as ḪAR or ḪUR.


List of signs

The following table allows matching of Borger's 1981 and 2003 numbering with Unicode characters (after Anderson'
sign list
)
The "primary" transliteration column has the glyphs' Sumerian values as given by the official glyph name, slightly modified here for legibility by including traditional assyriological symbols such as "x" rather than "TIMES". The exact Unicode names can be unambiguously recovered by prefixing, "CUNEIFORM UMERICSIGN", replacing "TIMES" for "x", "PLUS" for "+" and "OVER" for "/", "ASTERISK" for "*", "H" for "Ḫ", "SH" for "Š", and switching to uppercase.


Sumero-Akkadian Cuneiform


Code chart

Sumero-Akkadian Cuneiform script was added to the
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
Standard in July, 2006 with the release of version 5.0. The Unicode block for Sumero-Akkadian Cuneiform is U+12000–U+123FF:


History

The following Unicode-related documents record the purpose and process of defining specific characters in the Cuneiform block:


See also

* List of cuneiform signs


References


Citations


Bibliography

*Rykle Borger, ''Assyrisch-Babylonische Zeichenliste'', 2nd ed., Neukirchen-Vluyn (1981) *Rykle Borger
''Mesopotamisches Zeichenlexikon''
Münster (2003). *
Michael Everson Michael Everson (born January 9, 1963) is an American and Irish linguist, script encoder, typesetter, type designer and publisher. He runs a publishing company called Evertype, through which he has published over a hundred books since 2006. H ...
, Karljürgen Feuerherm, Steve Tinney
"Final proposal to encode the Cuneiform script in the SMP of the UCS"
ISO/IEC JTC1/SC2/WG2 N2786 (2004).


External links


cuneiformsigns.org
by Lloyd Anderson
Cuneiform Unicode.org chart (PDF)

Cuneiform Numbers and Punctuation Unicode.org chart (PDF)


Font packages


Akkadian
(reproduces the Sumerian (3rd millennium BC) glyphs given in the Unicode
reference chart
, b
George Douros


(branched off
FreeSerif GNU FreeFont (also known as Free UCS Outline Fonts) is a family of free OpenType, TrueType and WOFF vector fonts, implementing as much of the Universal Character Set (UCS) as possible, aside from the very large CJK Asian character set. The p ...
), encodes some 390 Old Assyrian (2nd millennium BC) glyphs used in Hittite cuneiform.
Noto Sans Cuneiform
(Encodes all three Cuneiform blocks. Distributed under
SIL Open Font License The SIL Open Font License (or OFL in short) is one of the major open font licenses, which allows embedding, or "bundling", of the font in commercially sold products. OFL is a free and open source license. It was created by SIL Internationa ...
) *
Segoe UI Historic Segoe ( ) is a typeface, or family of fonts, that is best known for its use by Microsoft. The company uses Segoe in its online and printed marketing materials, including recent logos for a number of products. Additionally, the Segoe UI font su ...
(Comes pre-installed on Windows 10 and later) {{Unicode navigation Cuneiform
Cuneiform Cuneiform is a logo- syllabic script that was used to write several languages of the Ancient Middle East. The script was in active use from the early Bronze Age until the beginning of the Common Era. It is named for the characteristic wedge- ...