ISO-IR-197
   HOME

TheInfoList



OR:

ISO-IR-197 (known by the ISO-IR registration number of its GR set) is an
8-bit In computer architecture, 8-bit integers or other data units are those that are 8 bits wide (1 octet). Also, 8-bit central processing unit (CPU) and arithmetic logic unit (ALU) architectures are those that are based on registers or data bu ...
, single-byte
character encoding Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
which was designed for the
Sámi languages The Sámi languages ( ), also rendered in English language, English as Sami and Saami, are a group of Uralic languages spoken by the Indigenous Sámi peoples in Northern Europe (in parts of northern Finland, Norway, Sweden, and extreme northwest ...
. It is a modification of
ISO 8859-1 ISO/IEC 8859-1:1998, ''Information technology— 8-bit single-byte coded graphic character sets—Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 19 ...
, replacing certain punctuation and symbol characters with additional letters used in certain Sámi orthographies. FreeDOS calls it code page 59187. ISO-IR-197 was proposed for establishment as a part of
ISO/IEC 8859 ISO/IEC 8859 is a joint International Organization for Standardization, ISO and International Electrotechnical Commission, IEC series of standards for 8-bit character encodings. The series of standards consists of numbered parts, such as ISO/IEC ...
in 1996 (as part 14 and, later,
part 15 Code of Federal Regulations, 'Title 47, Part 15(47 CFR 15) is an oft-quoted part of Federal Communications Commission (FCC) rules and regulations regarding unlicensed transmissions. It is a part of Title 47 of the Code of Federal Regulatio ...
), but was not accepted for this. However, ISO-IR-197 is referenced in an informative ISO/IEC 8859 annex, which lists it as an encoding which provides a more adequate coverage of the orthography of certain Sámi languages such as
Skolt Sámi Skolt Sámi (, , ; or , , ) is a Sámi languages, Sámi language that is spoken by the Skolts, with approximately 300 speakers in Finland, mainly in Sevettijärvi and approximately 20–30 speakers of the (Notozero) dialect in an area surround ...
than
ISO-8859-4 ISO/IEC 8859-4:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 4: Latin alphabet No. 4'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1988. ...
or ISO-8859-10, unless the latter is combined with ISO-IR-158.


Code page layout

Differences from
ISO 8859-1 ISO/IEC 8859-1:1998, ''Information technology— 8-bit single-byte coded graphic character sets—Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 19 ...
have their Unicode code point.


Windows extension

As documented by
Evertype Michael Everson (born January 1963) is an American and Irish linguist, script encoder, typesetter, type designer and publisher. He runs a publishing company called Evertype, through which he has published over one hundred books since 2006. His ...
, some Windows implementations use a variant which adds graphical characters to the C1 area ( 0x80-9F), including some of the other characters from the Mac OS Sámi repertoire. This was intended to be analogous to the Windows version of
Latin-1 ISO/IEC 8859-1:1998, ''Information technology—8-bit single-byte coded graphic character sets—Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987 ...
(i.e.
Windows-1252 Windows-1252 or CP-1252 ( Windows code page 1252) is a legacy single-byte character encoding that is used by default (as the "ANSI code page") in Microsoft Windows throughout the Americas, Western Europe, Oceania, and much of Africa. Initially ...
), and follows its layout where possible. Differences from Windows-1252 have their Unicode code point:


ISO-IR-209

ISO-IR-209 is an update that replaced the guillemets at 0xAB and 0xBB with the letter H with caron to add Finnish Romani support. FreeDOS calls it Code page 60211.


References

{{character encoding ISO/IEC 8859 Sámi languages