The Polish alphabet (
Polish: , ) is the
script of the
Polish language
Polish (, , or simply , ) is a West Slavic languages, West Slavic language of the Lechitic languages, Lechitic subgroup, within the Indo-European languages, Indo-European language family, and is written in the Latin script. It is primarily spo ...
, the basis for the
Polish system of orthography. It is based on the
Latin alphabet
The Latin alphabet, also known as the Roman alphabet, is the collection of letters originally used by the Ancient Rome, ancient Romans to write the Latin language. Largely unaltered except several letters splitting—i.e. from , and from � ...
but includes certain letters (9) with
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
s: the stroke (acute accent or
bar) – : ; the
overdot – : ; and the tail or ''
ogonek
The tail or ( ; Polish: , "little tail", diminutive of ) is a diacritic hook placed under the lower right corner of a vowel in the Latin alphabet used in several European languages, and directly under a vowel in several Native American langu ...
'' – . The letters , , and , which are used only in foreign words, are usually absent from the Polish alphabet. Additionally, before the standardization of Polish spelling, was sometimes used in place of , and in place of .
Modified variations of the Polish alphabet are used for writing
Silesian and
Kashubian, whereas the
Sorbian languages
The Sorbian languages (, ) are the Upper Sorbian language and Lower Sorbian language, two closely related and partially mutually intelligible languages spoken by the Sorbs, a West Slavs, West Slavic ethno-cultural minority in the Lusatia region ...
use a mixture of Polish and
Czech orthography.
Letters: aspect, name, value
There are 32 letters in the Polish alphabet: 9
vowel
A vowel is a speech sound pronounced without any stricture in the vocal tract, forming the nucleus of a syllable. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness a ...
s and 23
consonant
In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract, except for the h sound, which is pronounced without any stricture in the vocal tract. Examples are and pronou ...
s.
, , and are not used in any native Polish words and are mostly found in foreign words (such as place names) and commercial names. In
loanword
A loanword (also a loan word, loan-word) is a word at least partly assimilated from one language (the donor language) into another language (the recipient or target language), through the process of borrowing. Borrowing is a metaphorical term t ...
s they are usually replaced by , , and , respectively (as in 'niqab', 'quark', 'veranda', 'savanna', 'extra', 'oxymoron'), although some loanwords retain their original spelling (e.g., , ), and in a few cases both spellings are accepted (such as or , or ). In addition, they can occasionally be found in common abbreviations (e.g., 'priest' can be abbreviated as either or ). As a result, they are sometimes included in the Polish alphabet (bringing the total number of letters in the alphabet to 35); when included, they take their usual positions from the
Latin alphabet
The Latin alphabet, also known as the Roman alphabet, is the collection of letters originally used by the Ancient Rome, ancient Romans to write the Latin language. Largely unaltered except several letters splitting—i.e. from , and from � ...
( after ; and either side of ).
The following table lists the letters of the alphabet, their Polish names (see also
Names of letters below), the Polish spelling alphabet name, the
Polish phonemes which they usually represent (and rough equivalents for them), other possible pronunciations, and letter frequencies. Diacritics are shown for the sake of clarity. For more information about the sounds, see
Polish phonology.
: For English speakers who end the word with a nasal vowel and not a consonant.
: Sequences may be pronounced as
geminates .
: is sometimes
transcribed phonetically as , though it is phonetically .
was historically used in native words prior to the 1891 spelling reform by the
Academy of Learning, e.g., , (now 'four', 'pope'). Now it is used in some loanwords, e.g., , , .
For
digraphs and other rules about spelling and the corresponding pronunciations, see
Polish orthography.
Names of letters
The spoken Polish names of the letters are given in the table under
Letters above.
The names of the letters are not normally written out in the way shown above, except as part of certain lexicalized abbreviations, such as
Pekao (or PeKaO), the name of a bank, which represents the spoken form of the abbreviation P.K.O. (for ).
Some letters may be referred to in alternative ways, often consisting of just the sound of the letter. For example, may be called as it is pronounced: rather than (from '
Greek i').
When giving the spelling of words, certain letters may be said in more emphatic ways to distinguish them from other identically pronounced characters. For example, may be referred to as ('h alone') to distinguish it from (). may be called or ('z with an overdot') to distinguish it from (). may be called ('open u', a reference to its graphical form) or ('normal u') to distinguish it from , which is sometimes called ('closed u') or , , ('dashed ó', 'dashed o', 'o with a dash').
Alphabetical order
Polish
alphabetical ordering uses the order of letters as in the table under
Letters above.
Note that (unlike in languages such as
French,
Spanish, and
German) Polish letters with
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
s are treated as fully independent letters in alphabetical ordering. For example, comes after . The accented letters also have their own sections in dictionaries (words beginning with are not usually listed under ).
Digraphs are not given any special treatment in alphabetical ordering. For example, is treated simply as followed by and not as a single letter as in
Czech.
Computer encoding
There are several systems for
encoding
In communications and Data processing, information processing, code is a system of rules to convert information—such as a letter (alphabet), letter, word, sound, image, or gesture—into another form, sometimes data compression, shortened or ...
the Polish alphabet for computers. All letters of the Polish alphabet are included in
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
(blocks
Basic Latin,
Latin-1 Supplement and
Latin Extended-A
Latin Extended-A is a Unicode block and is the third block of the Unicode standard. It encodes Latin letters from the Latin ISO character sets other than Latin-1 (which is already encoded in the Latin-1 Supplement block) and also legacy characte ...
), and thus Unicode-based encodings such as
UTF-8
UTF-8 is a character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode Transformation Format 8-bit''. Almost every webpage is transmitted as UTF-8.
UTF-8 supports all 1,112,0 ...
and
UTF-16 can be used. The Polish alphabet is completely included in the
Basic Multilingual Plane of Unicode. The standard 8-bit character encoding for the Polish alphabet is
ISO 8859-2
ISO/IEC 8859-2:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 2: Latin alphabet No. 2'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987. I ...
(Latin-2), although both
ISO 8859-13 (Latin-7) and
ISO 8859-16 (Latin-10) encodings include glyphs of the Polish alphabet. Microsoft's format for encoding the Polish alphabet is
Windows-1250.
The Polish letters which are not present in the
English alphabet
Modern English is written with a Latin-script alphabet consisting of 26 Letter (alphabet), letters, with each having both uppercase and lowercase forms. The word ''alphabet'' is a Compound (linguistics), compound of ''alpha'' and ''beta'', t ...
have the following
HTML
Hypertext Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets ( ...
codes and
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
codepoints:
For other encodings, see
Polish code pages, but also
Combining Diacritical Marks
Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character " Combining Grapheme Joiner", which prevents canonical reordering of combining characters, and despite the name, actua ...
Unicode block.
A common test sentence containing all the Polish diacritic letters is the nonsensical ('Yellow the ego with/of a
gusle').
See also
*
Polish orthography
*
Polish Braille
*
Cyrillization of Polish under the Russian Empire
*
Cyrillic transcriptions of Polish
*
Polish manual alphabet
Notes
References
Further reading
*
External links
Polish Alphabet & PronunciationOnline editor for typing Polish charactersA Foreigner's Guide to the Polish Alphabet, interactive listen-along guide from Culture.pl
{{Language orthographies
Latin alphabets
Alphabet
An alphabet is a standard set of letter (alphabet), letters written to represent particular sounds in a spoken language. Specifically, letters largely correspond to phonemes as the smallest sound segments that can distinguish one word from a ...