HOME

TheInfoList



OR:

ISO/IEC 8859-9:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 9: Latin alphabet No. 5'', is part of the
ISO/IEC 8859 ISO/IEC 8859 is a joint International Organization for Standardization, ISO and International Electrotechnical Commission, IEC series of standards for 8-bit character encodings. The series of standards consists of numbered parts, such as ISO/IEC ...
series of ASCII-based standard
character encoding Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
s, first edition published in 1989. It is designated ECMA-128 by
Ecma International Ecma International () is a Nonprofit organization, nonprofit standards organization for information and communication systems. It acquired its current name in 1994, when the European Computer Manufacturers Association (ECMA) changed its name to ...
and TS 5881 as a Turkish standard. It is informally referred to as Latin-5 or ''Turkish''. It was designed to cover the
Turkish language Turkish ( , , also known as 'Turkish of Turkey') is the most widely spoken of the Turkic languages, a member of Oghuz languages, Oghuz branch with around 90 million speakers. It is the national language of Turkey and one of two official languag ...
(and the vast majority of users use it for that language, even though it can also be used for some other languages), designed as being of more use than the ISO/IEC 8859-3 encoding. It is identical to ISO/IEC 8859-1 except for the replacement of six Icelandic characters ( Ðð, Ýý, Þþ) with characters unique to the Turkish alphabet ( Ğğ, İ, ı, Şş). And the uppercase of i is İ; the lowercase of I is ı. ISO-8859-9 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. In modern applications Unicode and
UTF-8 UTF-8 is a character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode Transformation Format 8-bit''. Almost every webpage is transmitted as UTF-8. UTF-8 supports all 1,112,0 ...
are preferred; authors of new web pages and the designers of new protocols are instructed to use
UTF-8 UTF-8 is a character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode Transformation Format 8-bit''. Almost every webpage is transmitted as UTF-8. UTF-8 supports all 1,112,0 ...
instead. Since 2023, less than 0.05% of all web pages use ISO-8859-9, while 2.1% of web pages located in Turkey declare use of ISO-8859-9. However, the
WHATWG The Web Hypertext Application Technology Working Group (WHATWG) is a community of people interested in evolving HTML and related technologies. The WHATWG was founded by individuals from Apple Inc., the Mozilla Foundation and Opera Software, ...
Encoding Standard, which specifies the character encodings which are permitted in
HTML5 HTML5 (Hypertext Markup Language 5) is a markup language used for structuring and presenting hypertext documents on the World Wide Web. It was the fifth and final major HTML version that is now a retired World Wide Web Consortium (W3C) recommend ...
and which compliant browsers must support, requires that web pages marked as ISO-8859-9 be handled as Windows-1254, which differs from ISO-8859-9 by using the CR range which ISO-8859-9 reserves for C1 control codes for additional graphical characters instead (analogous to the relationship between
ISO-8859-1 ISO/IEC 8859-1:1998, ''Information technology—8-bit computing, 8-bit single-byte coded graphic character (computing), character sets—Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character enc ...
and Windows-1252). Microsoft has assigned code page 28599 a.k.a. Windows-28599 to ISO-8859-9 in Windows. IBM has assigned code page 920 ( CCSID 920) to ISO-8859-9. It is published by
Ecma International Ecma International () is a Nonprofit organization, nonprofit standards organization for information and communication systems. It acquired its current name in 1994, when the European Computer Manufacturers Association (ECMA) changed its name to ...
as ECMA-128.


Codepage layout

Differences from
ISO-8859-1 ISO/IEC 8859-1:1998, ''Information technology—8-bit computing, 8-bit single-byte coded graphic character (computing), character sets—Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character enc ...
have the Unicode code point number below the character.


See also

* Latin script in Unicode *
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
*
Universal Character Set The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/ IEC 10646, ''Information technology — Universal Coded Character Set (UCS)'' (plus amendments to that standard), w ...
** European Unicode subset (DIN 91379) *
UTF-8 UTF-8 is a character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode Transformation Format 8-bit''. Almost every webpage is transmitted as UTF-8. UTF-8 supports all 1,112,0 ...


References


External links


ISO/IEC 8859-9:1999

Standard ECMA-128
8-Bit Single-Byte Coded Graphic Character Sets - Latin Alphabet No. 5 ''2nd edition (December 1999)''
ISO-IR 148
Right-Hand Part of Latin Alphabet No.5 ''(October 1, 1988)'' {{DEFAULTSORT:ISO IEC 8859-9 ISO/IEC 8859 Computer-related introductions in 1989 Turkish language