An alphabet is a standardized set of written
letters that represent particular spoken sounds in a language. Specifically, letters correspond to
phonemes, the categories of sounds that can distinguish one
word from another in a given language. Not all
writing systems represent language in this way: a
syllabary
In the linguistic study of written languages, a syllabary is a set of written symbols that represent the syllables or (more frequently) moras which make up words.
A symbol in a syllabary, called a syllabogram, typically represents an (optiona ...
assigns symbols to spoken
syllable
A syllable is a unit of organization for a sequence of speech sounds typically made up of a syllable nucleus (most often a vowel) with optional initial and final margins (typically, consonants). Syllables are often considered the phonological "bu ...
s, while
logographic systems assign symbols to spoken words,
morphemes, or other semantic units.
The first letters were invented in Ancient Egypt to aid writers already using
Egyptian hieroglyph
Egyptian hieroglyphs (, ) were the formal writing system used in Ancient Egypt, used for writing the Egyptian language. Hieroglyphs combined logographic, syllabic and alphabetic elements, with some 1,000 distinct characters.There were about 1,00 ...
s, now referred to by lexicographers as the
Egyptian uniliteral signs. This system was used until the 5th century AD,
and fundamentally differed by adding pronunciation hints to existing hieroglyphs that had previously carried no pronunciation information. Later on, these
phonemic symbols also became used to transcribe foreign words.
The first fully phonemic script was the
Proto-Sinaitic script, also descending from Egyptian hieroglyphics, which was later modified to create the
Phoenician alphabet. The Phoenician system is considered the first true alphabet and is the ultimate ancestor of many modern scripts, including
Arabic,
Cyrillic
, bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця
, fam1 = Egyptian hieroglyphs
, fam2 = Proto-Sinaitic
, fam3 = Phoenician
, fam4 = G ...
,
Greek,
Hebrew,
Latin, and possibly
Brahmic.
Peter T. Daniels distinguishes true alphabets, which have letters representing both consonants and vowels, from both ''abugidas'' and ''abjads'', which only have letters for consonants. Broadly, abjads lack vowel indicators altogether, while abugidas represent them with
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s added to letters. In this narrower sense, the
Greek alphabet was the first true alphabet,
while the Phoenician alphabet it derived from was an abjad.
Alphabets are usually associated with a standard ordering of letters. This makes them useful for purposes of
collation, which allows words to be sorted in a specific order, commonly known as the
alphabetical order. It also means that their letters can be used as an alternative method of "numbering" ordered items, in such contexts as
numbered list
Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office filin ...
s and number placements. There are also names for letters in some languages. This is known as
acrophony
Acrophony (; Greek: ἄκρος ''akros'' uppermost + φωνή ''phone'' sound) is the naming of letters of an alphabetic writing system so that a letter's name begins with the letter itself. For example, Greek letter names are acrophonic: the name ...
; It is present in some modern scripts, such as
Greek, and many Semitic scripts, such as
Arabic,
Hebrew, and
Syriac. It was used in some ancient alphabets, such as in
Phoenician. However, this system is not present in all languages, such as the
Latin alphabet, which adds a vowel after a character for each letter. Some systems also used to have this system but later on abandoned it for a system similar to Latin, such as
Cyrillic
, bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця
, fam1 = Egyptian hieroglyphs
, fam2 = Proto-Sinaitic
, fam3 = Phoenician
, fam4 = G ...
.
Etymology
The English word ''alphabet'' came into
Middle English from the
Late Latin word , which in turn originated in the Greek, ἀλφάβητος (''alphábētos''); it was made from the first two letters of the Greek alphabet, ''
alpha
Alpha (uppercase , lowercase ; grc, ἄλφα, ''álpha'', or ell, άλφα, álfa) is the first letter of the Greek alphabet. In the system of Greek numerals, it has a value of one. Alpha is derived from the Phoenician letter aleph , whic ...
'' (α) and ''
beta
Beta (, ; uppercase , lowercase , or cursive ; grc, βῆτα, bē̂ta or ell, βήτα, víta) is the second letter of the Greek alphabet. In the system of Greek numerals, it has a value of 2. In Modern Greek, it represents the voiced labiod ...
'' (β). The names for the Greek letters, in turn, came from the first two letters of the Phoenician alphabet: ''
aleph
Aleph (or alef or alif, transliterated ʾ) is the first letter of the Semitic abjads, including Phoenician , Hebrew , Aramaic , Syriac , Arabic ʾ and North Arabian 𐪑. It also appears as South Arabian 𐩱 and Ge'ez .
These letter ...
'', the word for ''ox'', and ''
bet'', the word for ''house''.
History
Alphabets related to Phoenician
Ancient Near Eastern alphabets
The
Ancient Egyptian writing system had a set of some
24 hieroglyphs that are called uniliterals, which are glyphs that provide one sound. These glyphs were used as pronunciation guides for
logograms, to write grammatical inflections, and, later, to transcribe loan words and foreign names.
The script was used a fair amount in the 4th century CE. However, after pagan temples were closed down, it was forgotten in the 5th century until the discovery of the
Rosetta Stone.
There was also the
Cuneiform script. The script was used to write several ancient languages. However, it was primarily used to write
Sumerian
Sumerian or Sumerians may refer to:
*Sumer, an ancient civilization
**Sumerian language
**Sumerian art
**Sumerian architecture
**Sumerian literature
**Cuneiform script, used in Sumerian writing
*Sumerian Records, an American record label based in ...
. The last known use of the Cuneiform script was in 75 CE, after which the script fell out of use.
In the
Middle Bronze Age
The Bronze Age is a historic period, lasting approximately from 3300 BC to 1200 BC, characterized by the use of bronze, the presence of writing in some areas, and other early features of urban civilization. The Bronze Age is the second pri ...
, an apparently "alphabetic" system known as the
Proto-Sinaitic script appeared in Egyptian turquoise mines in the
Sinai peninsula dated 1840 BCE, apparently left by Canaanite workers.
Orly Goldwasser
Orly Goldwasser is an Israeli Egyptologist, professor of Egyptology at the Hebrew University of Jerusalem.
Orly Goldwasser received her B.A. at Tel Aviv University, continued studying at the Hebrew University in Jerusalem, where she was awarded he ...
has connected the illiterate turquoise miner graffiti theory to the origin of the alphabet.
In 1999,
John and
Deborah Darnell, American
Egyptologists, discovered an earlier version of this first alphabet at the
Wadi el-Hol valley in Egypt. The script dated to 1800 BCE and shows evidence of having been adapted from specific forms of Egyptian hieroglyphs that could be dated to 2000 BCE, strongly suggesting that the first alphabet had developed about that time. The script was based on letter appearances and names, believed to be based on Egyptian hieroglyphs.
This script had no characters representing vowels. Originally, it probably was a syllabary—a script where syllables are represented with characters—with symbols that were not needed being removed. The best-attested Bronze Age alphabet is
Ugaritic, invented in
Ugarit (
Syria
Syria ( ar, سُورِيَا or سُورِيَة, translit=Sūriyā), officially the Syrian Arab Republic ( ar, الجمهورية العربية السورية, al-Jumhūrīyah al-ʻArabīyah as-Sūrīyah), is a Western Asian country loc ...
) before the 15th century BCE. This was an alphabetic
cuneiform script with 30 signs, including three that indicate the following vowel. This script was not used after the destruction of Ugarit in 1178 BCE.

The Proto-Sinaitic script eventually developed into the Phoenician alphabet, conventionally called "
Proto-Canaanite" before 1050 BCE.
The oldest text in Phoenician script is an inscription on the sarcophagus of King
Ahiram 1000 BCE. This script is the parent script of all western alphabets. By the tenth century BCE, two other forms distinguish themselves,
Canaanite and
Aramaic. The Aramaic gave rise to the
Hebrew script.
The
South Arabian alphabet, a sister script to the Phoenician alphabet, is the script from which the
Ge'ez alphabet, an
abugida, a writing system where consonant-vowel sequences are written as units, which was used around the horn of Africa, descended. Vowel-less alphabets are called
abjads, currently exemplified in others such as Arabic, Hebrew, and
Syriac. The omission of vowels was not always a satisfactory solution due to the need of preserving sacred texts. "Weak"
consonants are used to indicate vowels. These letters have a dual function since they can also be used as pure consonants.
The Proto-Sinaitic script and the Ugaritic script were the first scripts with a limited number of signs instead of using many different signs for words, in contrast to the other widely used writing systems at the time,
Cuneiform,
Egyptian hieroglyphs, and
Linear B
Linear B was a syllabic script used for writing in Mycenaean Greek, the earliest attested form of Greek. The script predates the Greek alphabet by several centuries. The oldest Mycenaean writing dates to about 1400 BC. It is descended from ...
. The Phoenician script was probably the first phonemic script,
and it contained only about two dozen distinct letters, making it a script simple enough for traders to learn. Another advantage of the Phoenician alphabet was that it could write different languages since it recorded words phonemically.
The Phoenician script was spread across the Mediterranean by the Phoenicians.
The Greek Alphabet was the first alphabet in which vowels have independent letter forms separate from those of consonants. The Greeks chose letters representing sounds that did not exist in Phoenician to represent vowels. The
syllabical Linear B, a script that was used by the
Mycenaean Greeks from the 16th century BCE, had 87 symbols, including five vowels. In its early years, there were many variants of the Greek alphabet, causing many different alphabets to evolve from it.
European alphabets
The Greek alphabet, in
Euboean form, was carried over by Greek colonists to the Italian peninsula -600 BCE giving rise to many different alphabets used to write the
Italic languages
The Italic languages form a branch of the Indo-European language family, whose earliest known members were spoken on the Italian Peninsula in the first millennium BC. The most important of the ancient languages was Latin, the official languag ...
, like the
Etruscan alphabet. One of these became the Latin alphabet, which spread across Europe as the Romans expanded their republic. After the fall of the
Western Roman Empire, the alphabet survived in intellectual and religious works. It came to be used for the descendant languages of Latin (the
Romance languages) and most of the other languages of western and central Europe. Today, it is the most widely used script in the world.
The Etruscan alphabet remained nearly unchanged for several hundred years. Only evolving once the
Etruscan language changed itself. The letters used for non-existent phonemes were dropped. Afterwards, however, the alphabet went through many different changes. The final classical form of Etruscan contained 20 letters. Four of them are vowels (a, e, i, and u). Six fewer letters than the earlier forms. The script in its classical form was used until the 1st century CE. The Etruscan language itself was not used in
imperial Rome, but the script was used for religious texts.
Some adaptations of the Latin alphabet have
ligatures
Ligature may refer to:
* Ligature (medicine), a piece of suture used to shut off a blood vessel or other anatomical structure
** Ligature (orthodontic), used in dentistry
* Ligature (music), an element of musical notation used especially in the me ...
, a combination of two letters make one, such as
æ in
Danish and
Icelandic and
Ȣ in
Algonquian; borrowings from other alphabets, such as the
thorn þ in
Old English
Old English (, ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the early Middle Ages. It was brought to Great Britain by Anglo-Saxon settlement of Britain, Anglo ...
and
Icelandic, which came from the
Futhark runes; and modified existing letters, such as the
eth ð of Old English and Icelandic, which is a modified ''d''. Other alphabets only use a subset of the Latin alphabet, such as Hawaiian and
Italian, which uses the letters ''j, k, x, y,'' and ''w'' only in foreign words.
Another notable script is
Elder Futhark
The Elder Futhark (or Fuþark), also known as the Older Futhark, Old Futhark, or Germanic Futhark, is the oldest form of the runic alphabets. It was a writing system used by Germanic peoples for Northwest Germanic dialects in the Migration Peri ...
, believed to have evolved out of one of the
Old Italic alphabet
The Old Italic scripts are a family of similar ancient writing systems used in the Italian Peninsula between about 700 and 100 BC, for various languages spoken in that time and place. The most notable member is the Etruscan alphabet, which ...
s. Elder Futhark gave rise to other alphabets known collectively as the
Runic alphabet
Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialised ...
s. The Runic alphabets were used for Germanic languages from 100 CE to the late Middle Ages, being engraved on stone and jewelry, although inscriptions found on bone and wood occasionally appear. These alphabets have since been replaced with the Latin alphabet. The exception was for decorative use, where the runes remained in use until the 20th century.
The
Old Hungarian script was the writing system of the Hungarians. It was in use during the entire history of Hungary, albeit not as an official writing system. From the 19th century, it once again became more and more popular.
The
Glagolitic alphabet was the initial script of the liturgical language
Old Church Slavonic
Old Church Slavonic or Old Slavonic () was the first Slavic languages, Slavic literary language.
Historians credit the 9th-century Byzantine Empire, Byzantine missionaries Saints Cyril and Methodius with Standard language, standardizing the lan ...
and became, together with the Greek uncial script, the basis of the
Cyrillic script. Cyrillic is one of the most widely used modern alphabetic scripts and is notable for its use in Slavic languages and also for other languages within the former
Soviet Union.
Cyrillic alphabets include
Serbian
Serbian may refer to:
* someone or something related to Serbia, a country in Southeastern Europe
* someone or something related to the Serbs, a South Slavic people
* Serbian language
* Serbian names
See also
*
*
* Old Serbian (disambiguat ...
,
Macedonian
Macedonian most often refers to someone or something from or related to Macedonia.
Macedonian(s) may specifically refer to:
People Modern
* Macedonians (ethnic group), a nation and a South Slavic ethnic group primarily associated with North M ...
,
Bulgarian,
Russian,
Belarusian
Belarusian may refer to:
* Something of, or related to Belarus
* Belarusians, people from Belarus, or of Belarusian descent
* A citizen of Belarus, see Demographics of Belarus
* Belarusian language
* Belarusian culture
* Belarusian cuisine
* Byelor ...
, and
Ukrainian. The Glagolitic alphabet is believed to have been created by
Saints Cyril and Methodius, while the Cyrillic alphabet was created by
Clement of Ohrid, their disciple. They feature many letters that appear to have been borrowed from or influenced by Greek and Hebrew.
Asian alphabets
Many phonetic scripts exist in Asia. The
Arabic alphabet,
Hebrew alphabet,
Syriac alphabet
The Syriac alphabet ( ) is a writing system primarily used to write the Syriac language since the 1st century AD. It is one of the Semitic abjads descending from the Aramaic alphabet through the Palmyrene alphabet, and shares similarities with ...
, and other
abjads of the Middle East are developments of the
Aramaic alphabet
The ancient Aramaic alphabet was adapted by Arameans from the Phoenician alphabet and became a distinct script by the 8th century BC. It was used to write the Aramaic languages spoken by ancient Aramean pre-Christian tribes throughout the Fertil ...
.
Most alphabetic scripts of India and Eastern Asia descend from the
Brahmi script, believed to be a descendant of Aramaic.
European alphabets, especially Latin and Cyrillic, have been adapted for many languages of Asia. Arabic is also widely used, sometimes as an abjad, as with
Urdu and
Persian, and sometimes as a complete alphabet, as with
Kurdish and
Uyghur.
Other alphabets
Hangul
In
Korea,
Sejong the Great
Sejong of Joseon (15 May 1397 – 8 April 1450), personal name Yi Do (Korean: 이도; Hanja: 李祹), widely known as Sejong the Great (Korean: 세종대왕; Hanja: 世宗大王), was the fourth ruler of the Joseon dynasty of Korea. Initial ...
created the
Hangul alphabet in 1443 CE. Hangul is a unique alphabet: it is a
featural alphabet, where the design of many of the letters comes from a sound's place of articulation, like P looking like the widened mouth and L looking like the tongue pulled in. The creation of Hangul was planned by the government of the day, and it places individual letters in syllable clusters with equal dimensions, in the same way as
Chinese characters. This change allows for mixed-script writing, where one syllable always takes up one type space no matter how many letters get stacked into building that one sound-block.
Zhuyin
Zhuyin, sometimes referred to as ''Bopomofo,'' is a
semi-syllabary
A semi-syllabary is a writing system that behaves partly as an alphabet and partly as a syllabary. The main group of semi-syllabic writing are the Paleohispanic scripts of ancient Spain, a group of semi-syllabaries that transform redundant plosive ...
. It transcribes
Mandarin
Mandarin or The Mandarin may refer to:
Language
* Mandarin Chinese, branch of Chinese originally spoken in northern parts of the country
** Standard Chinese or Modern Standard Mandarin, the official language of China
** Taiwanese Mandarin, Stand ...
phonetically in the
Republic of China
Taiwan, officially the Republic of China (ROC), is a country in East Asia, at the junction of the East and South China Seas in the northwestern Pacific Ocean, with the People's Republic of China (PRC) to the northwest, Japan to the northeast ...
. After the later establishment of the
People's Republic of China and its adoption of
Hanyu Pinyin
Hanyu Pinyin (), often shortened to just pinyin, is the official romanization system for Standard Mandarin Chinese in China, and to some extent, in Singapore and Malaysia. It is often used to teach Mandarin, normally written in Chinese for ...
, the use of Zhuyin today is limited. However, it is still widely used in Taiwan. Zhuyin developed from a form of Chinese shorthand based on Chinese characters in the early 1900s and has elements of both an alphabet and a syllabary. Like an alphabet, the phonemes of
syllable initials are represented by individual symbols, but like a syllabary, the phonemes of the
syllable finals are not; each possible final (excluding the
medial glide) has its own character, an example being ''luan'' written as ㄌㄨㄢ (''l-u-an''). The last symbol ㄢ takes place as the entire final ''-an''. While Zhuyin is not a mainstream writing system, it is still often used in ways similar to a
romanization system, for aiding pronunciation and as an input method for Chinese characters on computers and cellphones.
Types

The term "alphabet" is used by
linguists
Linguistics is the scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure. Linguis ...
and
paleographers in both a wide and a narrow sense. In a broader sense, an alphabet is a ''segmental'' script at the
phoneme level—that is, it has separate glyphs for individual sounds and not for larger units such as syllables or words. In the narrower sense, some scholars distinguish "true" alphabets from two other types of segmental script,
abjads, and
abugidas. These three differ in how they treat vowels. Abjads have letters for consonants and leave most vowels unexpressed. Abugidas are also consonant-based but indicate vowels with
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s, a systematic graphic modification of the consonants. The earliest known alphabet using this sense is the
Wadi el-Hol script, believed to be an abjad. Its successor,
Phoenician, is the ancestor of modern alphabets, including
Arabic,
Greek,
Latin (via the
Old Italic alphabet
The Old Italic scripts are a family of similar ancient writing systems used in the Italian Peninsula between about 700 and 100 BC, for various languages spoken in that time and place. The most notable member is the Etruscan alphabet, which ...
),
Cyrillic
, bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця
, fam1 = Egyptian hieroglyphs
, fam2 = Proto-Sinaitic
, fam3 = Phoenician
, fam4 = G ...
(via the Greek alphabet), and
Hebrew (via
Aramaic).

Examples of present-day abjads are the
Arabic and
Hebrew scripts; true alphabets include
Latin, Cyrillic, and Korean
Hangul; and abugidas, used to write
Tigrinya,
Amharic
Amharic ( or ; (Amharic: ), ', ) is an Ethiopian Semitic language, which is a subgrouping within the Semitic branch of the Afroasiatic languages. It is spoken as a first language by the Amharas, and also serves as a lingua franca for all oth ...
,
Hindi, and
Thai. The
Canadian Aboriginal syllabics
Canadian syllabic writing, or simply syllabics, is a family of writing systems used in a number of Indigenous Canadian languages of the Algonquian, Inuit, and (formerly) Athabaskan language families. These languages had no formal writing s ...
are also an abugida, rather than a syllabary, as their name would imply, because each glyph stands for a consonant and is modified by rotation to represent the following vowel. In a true syllabary, each consonant-vowel combination gets represented by a separate glyph.
All three types may be augmented with syllabic glyphs.
Ugaritic, for example, is essentially an abjad but has syllabic letters for These are the only times that vowels are indicated.
Coptic has a letter for .
Devanagari is typically an abugida augmented with dedicated letters for initial vowels, though some traditions use अ as a
zero consonant as the graphic base for such vowels.
The boundaries between the three types of segmental scripts are not always clear-cut. For example,
Sorani Kurdish is written in the
Arabic script
The Arabic script is the writing system used for Arabic and several other languages of Asia and Africa. It is the second-most widely used writing system in the world by number of countries using it or a script directly derived from it, and the ...
, which, when used for other languages, is an abjad. In
Kurdish, writing the vowels is mandatory, and whole letters are used, so the script is a true alphabet. Other languages may use a Semitic abjad with forced vowel diacritics, effectively making them abugidas. On the other hand, the
Phagspa script of the
Mongol Empire
The Mongol Empire of the 13th and 14th centuries was the largest contiguous land empire in history. Originating in present-day Mongolia in East Asia, the Mongol Empire at its height stretched from the Sea of Japan to parts of Eastern Europe, ...
was based closely on the
Tibetan abugida, but vowel marks are written after the preceding consonant rather than as diacritic marks. Although short ''a'' is not written, as in the Indic abugidas, The source of the term "abugida", namely the
Ge'ez abugida now used for
Amharic
Amharic ( or ; (Amharic: ), ', ) is an Ethiopian Semitic language, which is a subgrouping within the Semitic branch of the Afroasiatic languages. It is spoken as a first language by the Amharas, and also serves as a lingua franca for all oth ...
and
Tigrinya, has assimilated into their consonant modifications. It is no longer systematic and must be learned as a syllabary rather than as a segmental script. Even more extreme, the Pahlavi abjad eventually became
logographic.

Thus the primary
categorisation of alphabets reflects how they treat vowels. For
tonal languages, further classification can be based on their treatment of tone. Though names do not yet exist to distinguish the various types. Some alphabets disregard tone entirely, especially when it does not carry a heavy functional load, as in
Somali and many other languages of Africa and the Americas. Most commonly, tones are indicated by diacritics, which is how vowels are treated in abugidas, which is the case for
Vietnamese (a true alphabet) and
Thai (an abugida). In Thai, the tone is determined primarily by a consonant, with diacritics for disambiguation. In the
Pollard script, an abugida, vowels are indicated by diacritics. The placing of the diacritic relative to the consonant is modified to indicate the tone.
More rarely, a script may have separate letters for tones, as is the case for
Hmong and
Zhuang. For many, regardless of whether letters or diacritics get used, the most common tone is not marked, just as the most common vowel is not marked in Indic abugidas. In
Zhuyin, not only is one of the tones unmarked; but there is a diacritic to indicate a lack of tone, like the
virama of Indic.
Alphabetical order
Alphabets often come to be associated with a standard ordering of their letters; this is for
collation—namely, for listing words and other items in ''
alphabetical order''.
Latin alphabets
The basic ordering of the
Latin alphabet (
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z), which derives from the Northwest Semitic "Abgad" order, is already well established. Although, languages using this alphabet have different conventions for their treatment of modified letters (such as the
French
French (french: français(e), link=no) may refer to:
* Something of, from, or related to France
** French language, which originated in France, and its various dialects and accents
** French people, a nation and ethnic group identified with Franc ...
''é'', ''à'', and ''ô'') and certain combinations of letters (
multigraphs). In French, these are not considered to be additional letters for collation. However, in
Icelandic, the accented letters such as ''á'', ''í'', and ''ö'' are considered distinct letters representing different vowel sounds from sounds represented by their unaccented counterparts. In Spanish, ''ñ'' is considered a separate letter, but accented vowels such as ''á'' and ''é'' are not. The ''ll'' and ''ch'' were also formerly considered single letters and sorted separately after ''l'' and ''c'', but in 1994, the tenth congress of the
Association of Spanish Language Academies changed the collating order so that ''ll'' came to be sorted between ''lk'' and ''lm'' in the dictionary and ''ch'' came to be sorted between ''cg'' and ''ci''; those digraphs were still formally designated as letters, but in 2010 the changed it, so they are no longer considered letters at all.
In German, words starting with ''sch-'' (which spells the German phoneme ) are inserted between words with initial ''sca-'' and ''sci-'' (all incidentally loanwords) instead of appearing after the initial ''sz'', as though it were a single letter, which contrasts several languages such as
Albanian
Albanian may refer to:
*Pertaining to Albania in Southeast Europe; in particular:
**Albanians, an ethnic group native to the Balkans
**Albanian language
**Albanian culture
**Demographics of Albania, includes other ethnic groups within the country ...
, in which ''dh-'', ''ë-'', ''gj-'', ''ll-'', ''rr-'', ''th-'', ''xh-,'' and ''zh-,'' which all represent phonemes and considered separate single letters, would follow the letters ''d'', ''e'', ''g'', ''l'', ''n'', ''r'', ''t'', ''x,'' and ''z,'' respectively, as well as Hungarian and Welsh. Further, German words with an
umlaut get collated ignoring the umlaut as—contrary to
Turkish
Turkish may refer to:
*a Turkic language spoken by the Turks
* of or about Turkey
** Turkish language
*** Turkish alphabet
** Turkish people, a Turkic ethnic group and nation
*** Turkish citizen, a citizen of Turkey
*** Turkish communities and mi ...
, which adopted the
graphemes ö and ü, and where a word like ''tüfek'' would come after ''tuz'', in the dictionary. An exception is the German telephone directory, where umlauts are sorted like ''ä''=''ae'' since names such as ''Jäger'' also appear with the spelling ''Jaeger'' and are not distinguished in the spoken language.
The
Danish and
Norwegian alphabets end with ''æ''—''ø''—''å'', whereas the Swedish conventionally put ''å''—''ä''—''ö'' at the end. However, æ phonetically corresponds with ä, as does ø and ''ö.''
Early alphabets
It is unknown whether the earliest alphabets had a defined sequence. Some alphabets today, such as the
Hanuno'o script
Hanunoo (), also rendered Hanunó'o, is one of the scripts indigenous to the Philippines and is used by the Mangyan peoples of southern Mindoro to write the Hanunó'o language.
It is an abugida descended from the Brahmic scripts, closely r ...
, are learned one letter at a time, in no particular order, and are not used for
collation where a definite order is required. However, a dozen
Ugaritic tablets from the fourteenth century BCE preserve the alphabet in two sequences. One, the ''ABCDE'' order later used in Phoenician, has continued with minor changes in
Hebrew,
Greek,
Armenian,
Gothic
Gothic or Gothics may refer to:
People and languages
*Goths or Gothic people, the ethnonym of a group of East Germanic tribes
**Gothic language, an extinct East Germanic language spoken by the Goths
**Crimean Gothic, the Gothic language spoken b ...
,
Cyrillic
, bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця
, fam1 = Egyptian hieroglyphs
, fam2 = Proto-Sinaitic
, fam3 = Phoenician
, fam4 = G ...
, and
Latin; the other, ''HMĦLQ,'' was used in southern Arabia and is preserved today in
Ethiopic. Both orders have therefore been stable for at least 3000 years.
Runic used an unrelated
Futhark sequence, which got
simplified later on.
Arabic uses usually uses its sequence, although Arabic retains the traditional
abjadi order, which is used for numbers.
The
Brahmic family of alphabets used in India uses a unique order based on
phonology: The letters are arranged according to how and where the sounds get produced in the mouth. This organization is present in Southeast Asia, Tibet, Korean
hangul, and even Japanese
kana, which is not an alphabet.
Acrophony
In Phoenician, each letter got associated with a word that begins with that sound. This is called
acrophony
Acrophony (; Greek: ἄκρος ''akros'' uppermost + φωνή ''phone'' sound) is the naming of letters of an alphabetic writing system so that a letter's name begins with the letter itself. For example, Greek letter names are acrophonic: the name ...
and is continuously used to varying degrees in
Samaritan,
Aramaic,
Syriac,
Hebrew,
Greek, and
Arabic.
Acrophony got abandoned in
Latin. It referred to the letters by adding a vowel (usually "e", sometimes "a", or "u") before or after the consonant. Two exceptions were
Y and
Z, which were borrowed from the Greek alphabet rather than Etruscan. They were known as ''Y Graeca'' "Greek Y" and ''zeta'' (from Greek)—this discrepancy was inherited by many European languages, as in the term ''zed'' for Z in all forms of English, other than American English. Over time names sometimes shifted or were added, as in ''double U'' for
W, or "double V" in French, the English name for Y, and the American ''zee'' for Z. Comparing them in English and French gives a clear reflection of the
Great Vowel Shift: A, B, C, and D are pronounced in today's English, but in contemporary French they are . The French names (from which the English names got derived) preserve the qualities of the English vowels before the Great Vowel Shift. By contrast, the names of F, L, M, N, and S () remain the same in both languages because "short" vowels were largely unaffected by the Shift.
In Cyrillic, originally, acrophony was present using Slavic words. The first three words going, azŭ, buky, vědě, with the Cyrillic collation order being, А, Б, В. However, this was later abandoned in favor of a system similar to Latin.
Orthography and pronunciation
When an alphabet is adopted or developed to represent a given language, an
orthography generally comes into being, providing rules for
spelling words, following the principle on which alphabets get based. These rules will map letters of the alphabet to the
phonemes of the spoken language. In a perfectly
phonemic orthography, there would be a consistent one-to-one correspondence between the letters and the phonemes so that a writer could predict the spelling of a word given its pronunciation, and a speaker would always know the pronunciation of a word given its spelling, and vice versa. However, this ideal is usually never achieved in practice. Languages can come close to it, such as Spanish and
Finnish. others, such as English, deviate from it to a much larger degree.
The pronunciation of a language often evolves independently of its writing system. Writing systems have been borrowed for languages the orthography was not initially made to use. The degree to which letters of an alphabet correspond to phonemes of a language varies.
Languages may fail to achieve a one-to-one correspondence between letters and sounds in any of several ways:
* A language may represent a given phoneme by combinations of letters rather than just a single letter. Two-letter combinations are called
digraphs, and three-letter groups are called
trigraphs.
German uses the
tetragraphs (four letters) "tsch" for the phoneme and (in a few borrowed words) "dsch" for .
Kabardian also uses a tetragraph for one of its phonemes, namely "кхъу." Two letters representing one sound occur in several instances in Hungarian as well (where, for instance, ''cs'' stands for
ʃ ''sz'' for
''zs'' for
� ''dzs'' for
ʒ.
* A language may represent the same phoneme with two or more different letters or combinations of letters. An example is
modern Greek which may write the phoneme in six different ways: , , , , , and .
* A language may spell some words with unpronounced letters that exist for historical or other reasons. For example, the spelling of the Thai word for "beer"
��บียร์retains a letter for the final consonant "r" present in the English word it borrows, but silences it.
* Pronunciation of individual words may change according to the presence of surrounding words in a sentence, for example, in
Sandhi.
* Different dialects of a language may use different phonemes for the same word.
* A language may use different sets of symbols or rules for distinct vocabulary items, typically for foreign words, such as in the Japanese
katakana syllabary is used for foreign words, and there are rules in English for using loanwords from other languages.
National languages sometimes elect to address the problem of dialects by associating the alphabet with the national standard. Some national languages like
Finnish,
Armenian,
Turkish
Turkish may refer to:
*a Turkic language spoken by the Turks
* of or about Turkey
** Turkish language
*** Turkish alphabet
** Turkish people, a Turkic ethnic group and nation
*** Turkish citizen, a citizen of Turkey
*** Turkish communities and mi ...
,
Russian,
Serbo-Croatian (
Serbian
Serbian may refer to:
* someone or something related to Serbia, a country in Southeastern Europe
* someone or something related to the Serbs, a South Slavic people
* Serbian language
* Serbian names
See also
*
*
* Old Serbian (disambiguat ...
,
Croatian
Croatian may refer to:
* Croatia
*Croatian language
*Croatian people
*Croatians (demonym)
See also
*
*
* Croatan (disambiguation)
* Croatia (disambiguation)
* Croatoan (disambiguation)
* Hrvatski (disambiguation)
* Hrvatsko (disambiguation)
* S ...
, and
Bosnian), and
Bulgarian have a very regular spelling system with nearly one-to-one correspondence between letters and phonemes. Similarly, the
Italian verb corresponding to 'spell (out),' ''compitare'', is unknown to many Italians because spelling is usually trivial, as Italian spelling is highly phonemic. In standard Spanish, one can tell the pronunciation of a word from its spelling, but not vice versa, as phonemes sometimes can be represented in more than one way, but a given letter is consistently pronounced.
French
French (french: français(e), link=no) may refer to:
* Something of, from, or related to France
** French language, which originated in France, and its various dialects and accents
** French people, a nation and ethnic group identified with Franc ...
using
silent letters,
nasal vowel
A nasal vowel is a vowel that is produced with a lowering of the soft palate (or velum) so that the air flow escapes through the nose and the mouth simultaneously, as in the French vowel or Amoy []. By contrast, oral vowels are produced wit ...
s, and elision, may seem to lack much correspondence between the spelling and pronunciation. However, its rules on pronunciation, though complex, are consistent and predictable with a fair degree of accuracy.
At the other extreme are languages such as English, where pronunciations mostly have to be memorized as they do not correspond to the spelling consistently. For English, this is because the
Great Vowel Shift occurred after the orthography got established and because English has acquired a large number of loanwords at different times, retaining their original spelling at varying levels. However, even English has general, albeit complex, rules that predict pronunciation from spelling. Rules like this are usually successful. However, rules to predict spelling from pronunciation have a higher failure rate.
Sometimes, countries have the written language undergo a
spelling reform to realign the writing with the contemporary spoken language. These can range from simple spelling changes and word forms to switching the entire writing system. For example, Turkey switched from the Arabic alphabet to a Latin-based
Turkish alphabet, and when
Kazakh
Kazakh, Qazaq or Kazakhstani may refer to:
* Someone or something related to Kazakhstan
*Kazakhs, an ethnic group
*Kazakh language
*The Kazakh Khanate
* Kazakh cuisine
* Qazakh Rayon, Azerbaijan
*Qazax, Azerbaijan
*Kazakh Uyezd, administrative dis ...
changed from an Arabic script to a Cyrillic script due to the Soviet Union's influence, and in 2021, it made a transition to the Latin alphabet, similar to Turkish. The Cyrillic script used to be official in Uzbekistan and Turkmenistan before they all switched to the Latin alphabet, including Uzbekistan that is having a reform of the alphabet to use diacritics on the letters that are marked by apostrophes and the letters that are digraphs.
The standard system of symbols used by
linguists to represent sounds in any language, independently of orthography, is called the
International Phonetic Alphabet
The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic transcription, phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standa ...
.
See also
*
Ugaritic alphabet
*
Abecedarium
An abecedarium (also known as an abecedary or ABCs or simply an ABC) is an inscription consisting of the letters of an alphabet, almost always listed in order. Typically, abecedaria (or abecedaries) are practice exercises.
Non-Latin alphabe ...
*
Acrophony
Acrophony (; Greek: ἄκρος ''akros'' uppermost + φωνή ''phone'' sound) is the naming of letters of an alphabetic writing system so that a letter's name begins with the letter itself. For example, Greek letter names are acrophonic: the name ...
*
Akshara
*
Alphabet book
*
Alphabet effect
*
Alphabet song
*
Alphabetical order
*
Butterfly Alphabet
*
Character encoding
*
Constructed script
*
Fingerspelling
*
NATO phonetic alphabet
*
Lipogram
*
List of writing systems
*
Pangram
*
Thoth
*
Transliteration
*
Unicode
References
Bibliography
*
* Overview of modern and some ancient writing systems.
*
*
* Chapter 3 traces and summarizes the invention of alphabetic writing.
*
*
*
*
*
*
*
* Chapter 4 traces the invention of writing
Further reading
*
Josephine Quinn, "Alphabet Politics" (review of
Silvia Ferrara, ''The Greatest Invention: A History of the World in Nine Mysterious Scripts'', translated from the Italian by
Todd Portnowitz, Farrar, Straus and Giroux, 2022, 289 pp.; and
Johanna Drucker, ''Inventing the Alphabet: The Origins of Letters from Antiquity to the Present'', University of Chicago Press, 2022, 380 pp.), ''
The New York Review of Books'', vol. LXX, no. 1 (19 January 2023), pp. 6, 8, 10.
External links
The Origins of abc"Language, Writing and Alphabet: An Interview with Christophe Rico" ''Damqātum 3'' (2007)
*
Michael Everson'
Alphabets of Europe animation by Prof. Robert Fradkin at the
University of Maryland
How the Alphabet Was Born from Hieroglyphs��Biblical Archaeology Review
An Early Hellenic AlphabetThe Alphabet BBC Radio 4 discussion with Eleanor Robson, Alan Millard and Rosalind Thomas (''In Our Time'', 18 December 2003)
{{Authority control
Orthography