ISO 8859-14
   HOME

TheInfoList



OR:

ISO/IEC 8859-14:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 14: Latin alphabet No. 8 ( Celtic)'', is part of the
ISO/IEC 8859 ISO/IEC 8859 is a joint International Organization for Standardization, ISO and International Electrotechnical Commission, IEC series of standards for 8-bit character encodings. The series of standards consists of numbered parts, such as ISO/IEC ...
series of ASCII-based standard
character encoding Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
s, first edition published in 1998. It is informally referred to as Latin-8 or ''Celtic''. It was designed to cover the
Celtic languages The Celtic languages ( ) are a branch of the Indo-European language family, descended from the hypothetical Proto-Celtic language. The term "Celtic" was first used to describe this language group by Edward Lhuyd in 1707, following Paul-Yve ...
, such as Irish, Manx,
Scottish Gaelic Scottish Gaelic (, ; Endonym and exonym, endonym: ), also known as Scots Gaelic or simply Gaelic, is a Celtic language native to the Gaels of Scotland. As a member of the Goidelic language, Goidelic branch of Celtic, Scottish Gaelic, alongs ...
, Welsh, Cornish, and Breton. ISO-8859-14 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. CeltScript made an extension for Windows called Extended Latin-8. Microsoft has assigned code page 28604 a.k.a. Windows-28604 to ISO-8859-14. FreeDOS assigned code page 58163 to ISO-8859-14.


History

ISO-8859-14 was originally proposed for the
Sami languages Acronyms * SAMI, ''Synchronized Accessible Media Interchange'', a closed-captioning format developed by Microsoft * Saudi Arabian Military Industries, a government-owned defence company * South African Malaria Initiative, a virtual expertise ...
. ISO 8859-12 was proposed for Celtic. Later, ISO 8859-12 was proposed for
Devanagari Devanagari ( ; in script: , , ) is an Indic script used in the Indian subcontinent. It is a left-to-right abugida (a type of segmental Writing systems#Segmental systems: alphabets, writing system), based on the ancient ''Brāhmī script, Brā ...
, so the Celtic proposal was changed to ISO 8859-14. The Sami proposal was changed to
ISO 8859-15 ISO/IEC 8859-15:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 15: Latin alphabet No. 9'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1999. ...
, but it got rejected as an ISO/IEC 8859 part, although it was registered as ISO-IR-197. The original proposal used a different arrangement of points 0xA1–BF. At the committee draft stage of the specification, a dotless i was included at 0xAE, which was changed to a registered trademark sign (matching
ISO-8859-1 ISO/IEC 8859-1:1998, ''Information technology—8-bit computing, 8-bit single-byte coded graphic character (computing), character sets—Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character enc ...
) in the final publication. ISO-IR-182, an earlier (registered in 1994) modification of
ISO-8859-1 ISO/IEC 8859-1:1998, ''Information technology—8-bit computing, 8-bit single-byte coded graphic character (computing), character sets—Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character enc ...
, had added the letters Ẁ, Ẃ, Ẅ, Ỳ, Ÿ, Ŵ, Ŷ and their lowercase forms (except for ÿ, which was already included) for
Welsh language Welsh ( or ) is a Celtic languages, Celtic language of the Brittonic languages, Brittonic subgroup that is native to the Welsh people. Welsh is spoken natively in Wales by about 18% of the population, by some in England, and in (the Welsh c ...
use. The final published version of ISO-8859-14 includes these letters in the same positions which they appear at in ISO-IR-182.


Codepage layout

Differences from
ISO-8859-1 ISO/IEC 8859-1:1998, ''Information technology—8-bit computing, 8-bit single-byte coded graphic character (computing), character sets—Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character enc ...
have the Unicode code point number below the character.


Draft layout

The first draft had positions A0-BF different. It did not include the pilcrow sign, but included the cent sign instead at its Latin-1 position. Later, it was ruled that the pilcrow sign was more common, so the pilcrow sign remains at its Latin-1 position, and the cent sign was removed instead. Differences from ISO-8859-14 have the Unicode code point below them.


References


External links


ISO/IEC 8859-14:1998ISO-IR 199
Celtic Supplementary Latin Set ''(May 1, 1998, submitted by Irish body NSAI/AGITS/WG6)'' {{DEFAULTSORT:ISO IEC 8859-14 ISO/IEC 8859 Computer-related introductions in 1998 Celtic languages