Windows-1250 is a code page used under Microsoft Windows to represent texts in Central European and Eastern European languages that use the Latin script. It is primarily used for Czech. It is also used for Polish (which Windows-1257 also supports), Slovak, Hungarian, Slovene (also supported by Windows-1257), Serbo-Croatian (Latin script), Romanian (before a 1993 spelling reform) and Albanian (also supported by Windows-1252). It can also be used for German, though it lacks the uppercase ẞ. German-language texts encoded in Windows-1250 and in Windows-1252 are byte-for-byte identical.
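The claim about German text can be checked with a short sketch. It assumes Python's built-in `cp1250` and `cp1252` codecs faithfully represent the two code pages; the sample string and the helper names `same_bytes` and `can_encode` are illustrative and not part of any standard.

```python
# Illustrative sample (not from the article) covering ä, ö, ü, ß, Ä, Ö, Ü.
SAMPLE = "Größe, Äpfel, Öl, Übung, süß, Käse"


def same_bytes(text):
    """Return True if text encodes to identical bytes in both code pages."""
    return text.encode("cp1250") == text.encode("cp1252")


def can_encode(text, codec="cp1250"):
    """Return True if every character of text exists in the given codec."""
    try:
        text.encode(codec)
    except UnicodeEncodeError:
        return False
    return True


print(same_bytes(SAMPLE))      # German letters share positions in both code pages
print(can_encode("ß"))         # lowercase sharp s is present
print(can_encode("\u1e9e"))    # uppercase sharp s (U+1E9E) is absent
```

The comparison only says something about text that both code pages can represent; a string containing a character missing from either codec raises `UnicodeEncodeError` in `same_bytes`.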
UTF-8 has replaced Windows-1250 to a far greater extent than it has replaced Windows-1252. Less than 0.05% of all web pages use Windows-1250.
Windows-1250 is similar to ISO-8859-2 and contains all of its printable characters, plus others. However, several of them sit at different code points (unlike Windows-1252, which keeps every printable character of ISO-8859-1 in the same place). Most of the rearrangements appear to keep characters shared with Windows-1252 at the same positions, but three of the moved characters (Ą, Ľ, ź) cannot be explained this way: they do not occur in Windows-1252, and they could have kept their ISO-8859-2 positions if ˇ had been placed elsewhere, e.g. at 9F.
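The rearrangement can be explored with a rough sketch built on Python's `iso8859_2`, `cp1250` and `cp1252` codecs. The function names are hypothetical, and `explained_by_cp1252` is a heuristic assumed for this illustration, not an account of how the code page was actually designed: it treats a move as explained if the character lands on the byte where Windows-1252 has the same character, or if its old byte now holds a character that Windows-1252 has at that same byte.

```python
import unicodedata


def decode_byte(byte, codec):
    """Decode one byte; return None if the codec leaves that byte undefined."""
    try:
        return bytes([byte]).decode(codec)
    except UnicodeDecodeError:
        return None


def relocated(src="iso8859_2", dst="cp1250"):
    """Map each src character at 0xA0-0xFF to (src byte, dst byte) if it moved."""
    moved = {}
    for byte in range(0xA0, 0x100):
        char = decode_byte(byte, src)
        new_byte = char.encode(dst)[0]  # would raise if dst lacked the character
        if new_byte != byte:
            moved[char] = (byte, new_byte)
    return moved


def explained_by_cp1252(char, old, new):
    """Heuristic (assumption of this sketch) for Windows-1252 compatibility."""
    lands_on_cp1252_slot = decode_byte(new, "cp1252") == char
    old_slot = decode_byte(old, "cp1250")
    displaced_by_shared = old_slot is not None and old_slot == decode_byte(old, "cp1252")
    return lands_on_cp1252_slot or displaced_by_shared


moved = relocated()
unexplained = sorted(c for c, (old, new) in moved.items()
                     if not explained_by_cp1252(c, old, new))

for char, (old, new) in moved.items():
    print(f"0x{old:02X} -> 0x{new:02X}  U+{ord(char):04X} {unicodedata.name(char)}")
print("Not explained by the heuristic:",
      [f"U+{ord(c):04X}" for c in unexplained])
```

Because `relocated` encodes every ISO-8859-2 character in the 0xA0-0xFF range into Windows-1250, it also serves as a check that none of those characters is missing from Windows-1250.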
IBM uses code page 1250 (CCSID 1250, and the euro sign extended CCSID 5346) for Windows-1250.
Character set
The following table shows Windows-1250. Each character is shown with its Unicode equivalent.
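As a minimal sketch, the byte-to-Unicode mapping can also be listed programmatically. This assumes Python's built-in `cp1250` codec as the source of the mapping; the function name `cp1250_table` is illustrative. Python's codec leaves a handful of bytes unassigned, and other implementations may treat those bytes differently.

```python
import unicodedata


def cp1250_table():
    """Return {byte: character or None} for all 256 byte values."""
    table = {}
    for byte in range(256):
        try:
            table[byte] = bytes([byte]).decode("cp1250")
        except UnicodeDecodeError:
            table[byte] = None  # byte has no assignment in Python's codec
    return table


table = cp1250_table()

# Print only the upper half; bytes 0x00-0x7F coincide with ASCII.
for byte in range(0x80, 0x100):
    char = table[byte]
    if char is None:
        print(f"0x{byte:02X}  (unassigned)")
    else:
        name = unicodedata.name(char, "<unnamed>")
        print(f"0x{byte:02X}  U+{ord(char):04X}  {name}")
```

Printing code points and character names instead of the characters themselves keeps the output readable on terminals that are not set to a Unicode encoding.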
See also
* Latin script in Unicode
* Unicode
* Universal Character Set
** European Unicode subset (DIN 91379)
* UTF-8
* Kodowanie polskich znaków (Polish: encoding of Polish characters)
External links
* Windows 1250 reference chart
* IANA Charset Name Registration
* Unicode mappings of windows 1250 with "best fit"