
The Tajik alphabet (, , ) has been written in three
alphabet
An alphabet is a standard set of letter (alphabet), letters written to represent particular sounds in a spoken language. Specifically, letters largely correspond to phonemes as the smallest sound segments that can distinguish one word from a ...
s over the course of its history, with those being the
Perso-Arabic
The Persian alphabet (), also known as the Perso-Arabic script, is the right-to-left script, right-to-left alphabet used for the Persian language. It is a variation of the Arabic script with four additional letters: (the sounds 'g', 'zh', ' ...
,
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
and nowadays
Cyrillic
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
script.
The use of a specific alphabet generally corresponds with
stages in history, with Arabic being used first for most of the time, followed by Latin, as a result of the Soviet takeover, for a short period and then Cyrillic, which remains the most widely used alphabet in
Tajikistan
Tajikistan, officially the Republic of Tajikistan, is a landlocked country in Central Asia. Dushanbe is the capital city, capital and most populous city. Tajikistan borders Afghanistan to the Afghanistan–Tajikistan border, south, Uzbekistan to ...
. The
Bukhori dialect spoken by
Bukharan Jews traditionally used the
Hebrew alphabet
The Hebrew alphabet (, ), known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is a unicase, unicameral abjad script used in the writing of the Hebrew language and other Jewish languages, most notably ...
, but today is written using the Cyrillic variant.
Political context
As with many
post-Soviet states
The post-Soviet states, also referred to as the former Soviet Union or the former Soviet republics, are the independent sovereign states that emerged/re-emerged from the dissolution of the Soviet Union in 1991. Prior to their independence, they ...
, the change in
writing system
A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
and the debates surrounding it is closely intertwined with political themes. Although not having been used since the adoption of Cyrillic, the Latin script is supported by those who wish to bring the country closer to
Uzbekistan
, image_flag = Flag of Uzbekistan.svg
, image_coat = Emblem of Uzbekistan.svg
, symbol_type = Emblem of Uzbekistan, Emblem
, national_anthem = "State Anthem of Uzbekistan, State Anthem of the Republ ...
, which has adopted the Latin-based
Uzbek alphabet.
The Persian alphabet is supported by the devoutly religious,
Islamists, and by those who wish to bring the country closer to
Iran
Iran, officially the Islamic Republic of Iran (IRI) and also known as Persia, is a country in West Asia. It borders Iraq to the west, Turkey, Azerbaijan, and Armenia to the northwest, the Caspian Sea to the north, Turkmenistan to the nort ...
,
Afghanistan
Afghanistan, officially the Islamic Emirate of Afghanistan, is a landlocked country located at the crossroads of Central Asia and South Asia. It is bordered by Pakistan to the Durand Line, east and south, Iran to the Afghanistan–Iran borde ...
, and their
Persian heritage. As the ''de facto'' standard, the Cyrillic alphabet is generally supported by those who wish to maintain the ''status quo'', and not distance the country from
Russia
Russia, or the Russian Federation, is a country spanning Eastern Europe and North Asia. It is the list of countries and dependencies by area, largest country in the world, and extends across Time in Russia, eleven time zones, sharing Borders ...
.
History
As a result of the influence of
Islam
Islam is an Abrahamic religions, Abrahamic monotheistic religion based on the Quran, and the teachings of Muhammad. Adherents of Islam are called Muslims, who are estimated to number Islam by country, 2 billion worldwide and are the world ...
in the region, Tajik was written in the
Persian alphabet up to the 1920s. Until this time, the language was not thought of as separate and simply considered a dialect of the
Persian language
Persian ( ), also known by its endonym and exonym, endonym Farsi (, Fārsī ), is a Western Iranian languages, Western Iranian language belonging to the Iranian languages, Iranian branch of the Indo-Iranian languages, Indo-Iranian subdivision ...
. The
Soviets began by simplifying the Persian alphabet in 1923, before moving to a Latin-based system in 1927.
[ The Latin script was introduced by the ]Soviet Union
The Union of Soviet Socialist Republics. (USSR), commonly known as the Soviet Union, was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 until Dissolution of the Soviet ...
as part of an effort to increase literacy and distance the, at that time, largely illiterate population, from the Islamic Central Asia
Central Asia is a region of Asia consisting of Kazakhstan, Kyrgyzstan, Tajikistan, Turkmenistan, and Uzbekistan. The countries as a group are also colloquially referred to as the "-stans" as all have names ending with the Persian language, Pers ...
. There were also practical considerations. The regular Persian alphabet, being an abjad
An abjad ( or abgad) is a writing system in which only consonants are represented, leaving the vowel sounds to be inferred by the reader. This contrasts with alphabets, which provide graphemes for both consonants and vowels. The term was introd ...
, does not provide sufficient letters for representing the vowel system of Tajik. In addition, the abjad is more difficult to learn, each letter having different forms depending on the position in the word.[
The ''Decree on Romanisation'' made this law in April 1928.][ The Latin variant for Tajik was based on the work by Turcophone scholars who aimed to produce a unified Turkic alphabet,][ despite Tajik not being a Turkic language. The literacy campaign was successful, with near-universal ]literacy
Literacy is the ability to read and write, while illiteracy refers to an inability to read and write. Some researchers suggest that the study of "literacy" as a concept can be divided into two periods: the period before 1950, when literacy was ...
being achieved by the 1950s.
As part of the " russification" of Central Asia
Central Asia is a region of Asia consisting of Kazakhstan, Kyrgyzstan, Tajikistan, Turkmenistan, and Uzbekistan. The countries as a group are also colloquially referred to as the "-stans" as all have names ending with the Persian language, Pers ...
, the Cyrillic script was introduced in the late 1930s. The alphabet remained Cyrillic until the end of the 1980s with the disintegration of the Soviet Union
The Union of Soviet Socialist Republics. (USSR), commonly known as the Soviet Union, was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 until Dissolution of the Soviet ...
. In 1989, with the growth in Tajik nationalism, a law was enacted declaring Tajik the state language. In addition, the law officially equated Tajik with Persian, placing the word ''Farsi'' (the endonym for the Persian language) after Tajik. The law also called for a gradual reintroduction of the Perso-Arabic alphabet.
The Persian alphabet was introduced into education
Education is the transmission of knowledge and skills and the development of character traits. Formal education occurs within a structured institutional framework, such as public schools, following a curriculum. Non-formal education als ...
and public life, although the banning of the Islamic Renaissance Party in 1993 slowed down the adoption. In 1999, the word ''Farsi'' was removed from the state-language law.[ the ''de facto'' standard in use was the Cyrillic alphabet][ and , only a very small part of the population could read the Persian alphabet.][
]
Variants
The letters of the major versions of the Tajik alphabet are presented below, along with their phonetic values. There is also a comparative table below.
Persian alphabet
A variant of the Persian alphabet (technically an abjad
An abjad ( or abgad) is a writing system in which only consonants are represented, leaving the vowel sounds to be inferred by the reader. This contrasts with alphabets, which provide graphemes for both consonants and vowels. The term was introd ...
) is used to write Tajik. In the Tajik version, as with all other versions of the Arabic script, with the exception of (''alef''), vowels are not given unique letters, but rather optionally indicated with diacritic marks.
Latin
The Latin script
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
was introduced after the Russian Revolution
The Russian Revolution was a period of Political revolution (Trotskyism), political and social revolution, social change in Russian Empire, Russia, starting in 1917. This period saw Russia Dissolution of the Russian Empire, abolish its mona ...
of 1917 in order to facilitate an increase in literacy and distance the language from Islamic influence. Only lowercase
Letter case is the distinction between the letters that are in larger uppercase or capitals (more formally ''majuscule'') and smaller lowercase (more formally '' minuscule'') in the written representation of certain languages. The writing system ...
letters were found in the first versions of the Latin variant, between 1926 and 1929. A slightly different version used by Jews
Jews (, , ), or the Jewish people, are an ethnoreligious group and nation, originating from the Israelites of History of ancient Israel and Judah, ancient Israel and Judah. They also traditionally adhere to Judaism. Jewish ethnicity, rel ...
speaking the Bukhori dialect included three extra characters for phonemes not found in the other dialects: , , and .[ in particular represented the ]voiceless pharyngeal fricative
The voiceless pharyngeal fricative is a type of consonantal sound, used in some Speech communication, spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is an h with stroke, h-bar, , and the equivalent ...
/ħ/, a feature of the Bukhori dialect.
The unusual character is called '' Gha'' and represents the phoneme . The character is found in Yañalif
The New Turkic Alphabet or Yañalif ( Tatar: jaꞑa əlifba/yaña älifba → jaꞑalif/yañalif, , Cyrillic: Яңалиф, "new alphabet"), is the first Latin alphabet used during the latinisation in the Soviet Union in the 1930s for the Turkic ...
in which most non-Slavic languages of the Soviet Union
The Union of Soviet Socialist Republics. (USSR), commonly known as the Soviet Union, was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 until Dissolution of the Soviet ...
were written until the late 1930s. The Latin alphabet is not widely used today, although its adoption is advocated by certain groups.[
]
Cyrillic
The Cyrillic script
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic languages, Slavic, Turkic languages, Turkic, Mongolic languages, Mongolic, Uralic languages, Uralic, C ...
was introduced in Tajik Soviet Socialist Republic
The Tajik Soviet Socialist Republic, also commonly known as Soviet Tajikistan, the Tajik SSR, TaSSR, or simply Tajikistan, was one of the constituent republics of the Soviet Union which existed from 1929 to 1991 in Central Asia.
The Tajik Rep ...
in the late 1930s, replacing the Latin script
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
that had been used since the October Revolution
The October Revolution, also known as the Great October Socialist Revolution (in Historiography in the Soviet Union, Soviet historiography), October coup, Bolshevik coup, or Bolshevik revolution, was the second of Russian Revolution, two r ...
. After 1939, materials published in Persian in the Persian alphabet were banned from the country.[ The alphabet below was supplemented by the letters Щ and Ы in 1952.
]
Before 1998, the Tajik Cyrillic alphabet contained 39 letters in the following order: (the 33 letters of the Russian alphabet and 6 additional letters as distinct letters at the end). The letters , and were used only in loanword
A loanword (also a loan word, loan-word) is a word at least partly assimilated from one language (the donor language) into another language (the recipient or target language), through the process of borrowing. Borrowing is a metaphorical term t ...
s; the letter was used in the combinations , , , (for after consonants) and in loanwords. The letters , , , and were officially dropped from the alphabet in the 1998 reform. Loanwords are now respelled using native Tajik letters: after vowels, otherwise for ; for ; for ; is replaced by in (also in loanwords), dropped otherwise (including , , ). Along with the deprecation of these letters, the 1998 reform also changed the order of the alphabet, which now has the characters with diacritics following their unaltered partners, e.g. , and , , etc.[ leading to the present order (35 letters): . In 2010, it was suggested that the letters might be dropped as well.][ The letters and represent the same sound, except that э is used at the beginning of a word (ex. , "]Iran
Iran, officially the Islamic Republic of Iran (IRI) and also known as Persia, is a country in West Asia. It borders Iraq to the west, Turkey, Azerbaijan, and Armenia to the northwest, the Caspian Sea to the north, Turkmenistan to the nort ...
"). The sound combination is represented by at the beginning of words, otherwise by .
The alphabet includes a number of letters not found in the Russian alphabet
The Russian alphabet (, or , more traditionally) is the script used to write the Russian language.
The modern Russian alphabet consists of 33 letters: twenty consonants (, , , , , , , , , , , , , , , , , , , ), ten vowels (, , , , , , , , , ) ...
:
:
During the period when the Cyrillicization took place, Ӷ ӷ also appeared a few times in the table of the Tajik Cyrillic alphabet.[
]
Transliteration standards
The transliteration standards for the Tajik alphabet in Cyrillic into the Latin alphabet are as follows:
Notes to the table above:
# ISO 9 — The International Organization for Standardization
The International Organization for Standardization (ISO ; ; ) is an independent, non-governmental, international standard development organization composed of representatives from the national standards organizations of member countries.
M ...
ISO 9 specification.
# KNAB — From the placenames database of the Institute of the Estonian Language.
# WWS — From ''World's Writing Systems'', Bernard Comrie (ed.)
# ALA-LC — The standard of the Library of Congress
The Library of Congress (LOC) is a research library in Washington, D.C., serving as the library and research service for the United States Congress and the ''de facto'' national library of the United States. It also administers Copyright law o ...
and the American Library Association
The American Library Association (ALA) is a nonprofit organization based in the United States that promotes libraries and library education internationally. It is the oldest and largest library association in the world.
History 19th century ...
.
# Edward Allworth, ed. Nationalities of the Soviet East. Publications and Writing Systems (NY: Columbia University Press, 1971)
# BGN/PCGN — The standard of the United States Board on Geographic Names
The United States Board on Geographic Names (BGN) is a Federal government of the United States, federal body operating under the United States Secretary of the Interior. The purpose of the board is to establish and maintain uniform usage of geogr ...
and the Permanent Committee on Geographical Names for British Official Use.
# KSNG – The standard of the Commission on Standardization of Geographical Names Outside the Republic of Poland (Komisja Standaryzacji Nazw Geograficznych poza Granicami Rzeczypospolitej Polskiej)
Hebrew
The Hebrew alphabet
The Hebrew alphabet (, ), known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is a unicase, unicameral abjad script used in the writing of the Hebrew language and other Jewish languages, most notably ...
(an abjad
An abjad ( or abgad) is a writing system in which only consonants are represented, leaving the vowel sounds to be inferred by the reader. This contrasts with alphabets, which provide graphemes for both consonants and vowels. The term was introd ...
like the Persian alphabet) is used for the Jewish Bukhori dialect primarily in Samarkand
Samarkand ( ; Uzbek language, Uzbek and Tajik language, Tajik: Самарқанд / Samarqand, ) is a city in southeastern Uzbekistan and among the List of oldest continuously inhabited cities, oldest continuously inhabited cities in Central As ...
and Bukhara
Bukhara ( ) is the List of cities in Uzbekistan, seventh-largest city in Uzbekistan by population, with 280,187 residents . It is the capital of Bukhara Region.
People have inhabited the region around Bukhara for at least five millennia, and t ...
. Additionally, since 1940, when Jewish schools were closed in Central Asia, the use of the Hebrew Alphabet outside Hebrew liturgy fell into disuse and Bukharian Jewish publications such as books and newspapers began to appear using the Tajik Cyrillic Alphabet. Today, many older Bukharian Jews who speak Bukharian and went to Tajik or Russian schools in Central Asia only know the Tajik Cyrillic Alphabet when reading and writing Bukharian and Tajik.
Samples
Tajik Cyrillic, Tajik Latin and Persian alphabet
For reference, the Persian script variant transliterated letter-for-letter into the Latin script
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
appears as follows:
And the BGN/PCGN transliteration of the Cyrillic text:
Tajik Cyrillic and Persian alphabet
Vowel-pointed Persian includes the vowels that are not usually written.
Comparative table
A table comparing the different writing system
A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
s used for the Tajik alphabet. The Latin here is based on the 1929 standard, the Cyrillic on the revised 1998 standard, and Persian letters are given in their stand-alone forms.
See also
*
*
*
Notes
References
*
*
*
*
*
*
*
*
* Goodman, E. R. (1956) "The Soviet Design for a World Language." in '' Russian Review'' 15 (2): 85–99.
*
*
*
*
*
*
*
*
*
*
External links
*
Omniglot – Tajik (Тоҷики / Toçikī / تاجیكی)
View Cyrillic-script Tajik websites transliterated into the 1920s Latin orthography
{{DEFAULTSORT:Tajik Alphabet
Arabic alphabets
Cyrillic alphabets
Latin alphabets
Persian orthography
Tajik language
Persian scripts