Windows-1250 is a
code page
In computing, a code page is a character encoding and as such it is a specific association of a set of printable characters and control characters with unique numbers. Typically each number represents the binary value in a single byte. (In some c ...
used under
Microsoft Windows to represent texts in
Central Europe
Central Europe is an area of Europe between Western Europe and Eastern Europe, based on a common historical, social and cultural identity. The Thirty Years' War (1618β1648) between Catholicism and Protestantism significantly shaped the ...
an and
Eastern Europe
Eastern Europe is a subregion of the European continent. As a largely ambiguous term, it has a wide range of geopolitical, geographical, ethnic, cultural, and socio-economic connotations. The vast majority of the region is covered by Russia, wh ...
an languages that use
Latin script
The Latin script, also known as Roman script, is an alphabetic writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greece, Greek city of Cumae, in southe ...
, such as
Czech (which is its main user with half its use, though Czech has 96.6% use of
UTF-8
UTF-8 is a variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode'' (or ''Universal Coded Character Set'') ''Transformation Format 8-bit''.
UTF-8 is capable of ...
, and mostly abandoned (this) legacy encoding),
Polish
Polish may refer to:
* Anything from or related to Poland, a country in Europe
* Polish language
* Poles, people from Poland or of Polish descent
* Polish chicken
*Polish brothers (Mark Polish and Michael Polish, born 1970), American twin screenwr ...
,
Slovak,
Hungarian,
Slovene,
Serbo-Croatian
Serbo-Croatian () β also called Serbo-Croat (), Serbo-Croat-Bosnian (SCB), Bosnian-Croatian-Serbian (BCS), and Bosnian-Croatian-Montenegrin-Serbian (BCMS) β is a South Slavic language and the primary language of Serbia, Croatia, Bosnia an ...
(Latin script),
Romanian
Romanian may refer to:
*anything of, from, or related to the country and nation of Romania
**Romanians, an ethnic group
**Romanian language, a Romance language
*** Romanian dialects, variants of the Romanian language
**Romanian cuisine, traditiona ...
(before 1993
spelling reform
A spelling reform is a deliberate, often authoritatively sanctioned or mandated change to spelling rules. Proposals for such reform are fairly common, and over the years, many languages have undergone such reforms. Recent high-profile examples ar ...
),
Rotokas
Rotokas is a North Bougainville language spoken by about 4,320 people on the island of Bougainville, an island located to the east of New Guinea which is part of Papua New Guinea. According to Allen and Hurd (1963), there are three identif ...
and
Albanian. It may also be used with the
German language
German ( ) is a West Germanic language mainly spoken in Central Europe. It is the most widely spoken and official or co-official language in Germany, Austria, Switzerland, Liechtenstein, and the Italian province of South Tyrol. It is als ...
(though it's missing uppercase
αΊ); German-language texts encoded with Windows-1250 and
Windows-1252
Windows-1252 or CP-1252 ( code page 1252) is a single-byte character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows for English and many European languages including Spanish, French, and German.
It ...
are identical.
This has been replaced by Unicode (such as
UTF-8
UTF-8 is a variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode'' (or ''Universal Coded Character Set'') ''Transformation Format 8-bit''.
UTF-8 is capable of ...
) far more than Windows-1252. As of October 2022, less than 0.04% of all web pages use Windows-1250.
Windows-1250 is similar to
ISO-8859-2 and has all the printable characters it has and more. However a few of them are rearranged (unlike
Windows-1252
Windows-1252 or CP-1252 ( code page 1252) is a single-byte character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows for English and many European languages including Spanish, French, and German.
It ...
, which keeps all printable characters from
ISO-8859-1
ISO/IEC 8859-1:1998, ''Information technology β 8-bit single-byte coded graphic character sets β Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in ...
in the same place). Most of the rearrangements seem to have been done to keep characters shared with Windows-1252 in the same place but three of the characters moved (Δ, Δ½, ΕΊ) cannot be explained this way, since those do not occur in Windows-1252 and could have been put in the same positions as in ISO-8859-2 if Λ had been put e.g. at 9F.
IBM uses code page 1250 (
CCSID
A CCSID (coded character set identifier) is a 16-bit number that represents a particular character encoding, encoding of a specific code page. For example, Unicode is a code page that has several encoding (so called "transformation") forms, like UT ...
1250 and
euro sign
The euro sign () is the currency sign used for the euro, the official currency of the eurozone and unilaterally adopted by Kosovo and Montenegro. The design was presented to the public by the European Commission on 12 December 1996. It consi ...
extended CCSID 5346) for Windows-1250.
Character set
The following table shows Windows-1250. Each character is shown with its
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
equivalent.
See also
*
Latin script in Unicode
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with c ...
*
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
*
Universal Character Set
The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/ IEC 10646, ''Information technology β Universal Coded Character Set (UCS)'' (plus amendments to that standard), ...
**
European Unicode subset (DIN 91379)
*
UTF-8
UTF-8 is a variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode'' (or ''Universal Coded Character Set'') ''Transformation Format 8-bit''.
UTF-8 is capable of ...
Notes
References
External links
Windows 1250 reference chartIANA Charset Name RegistrationUnicode mappings of windows 1250 with "best fit"
{{character encoding
Windows code pages