HOME

TheInfoList



OR:

The Persian alphabet (), also known as the Perso-Arabic script, is the
right-to-left A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
alphabet An alphabet is a standard set of letter (alphabet), letters written to represent particular sounds in a spoken language. Specifically, letters largely correspond to phonemes as the smallest sound segments that can distinguish one word from a ...
used for the
Persian language Persian ( ), also known by its endonym and exonym, endonym Farsi (, Fārsī ), is a Western Iranian languages, Western Iranian language belonging to the Iranian languages, Iranian branch of the Indo-Iranian languages, Indo-Iranian subdivision ...
. It is a variation of the
Arabic script The Arabic script is the writing system used for Arabic (Arabic alphabet) and several other languages of Asia and Africa. It is the second-most widely used alphabetic writing system in the world (after the Latin script), the second-most widel ...
with four additional letters: (the sounds 'g', 'zh', 'ch', and 'p', respectively), in addition to the obsolete that was used for the sound . This letter is no longer used in Persian, as the -sound changed to , e.g. archaic > 'language'. It was the basis of many Arabic-based scripts used in Central and South Asia. It is used for both
Iranian Iranian () may refer to: * Something of, from, or related to Iran ** Iranian diaspora, Iranians living outside Iran ** Iranian architecture, architecture of Iran and parts of the rest of West Asia ** Iranian cuisine, cooking traditions and practic ...
and
Dari Dari (; endonym: ), Dari Persian (, , or , ), or Eastern Persian is the variety of the Persian language spoken in Afghanistan. Dari is the Afghan government's official term for the Persian language;Lazard, G.Darī – The New Persian ...
: standard varieties of Persian; and is one of two
official An official is someone who holds an office (function or Mandate (politics), mandate, regardless of whether it carries an actual Office, working space with it) in an organization or government and participates in the exercise of authority (eithe ...
writing system A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
s for the Persian language, alongside the
Cyrillic The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
-based
Tajik alphabet The Tajik alphabet (, , ) has been written in three alphabets over the course of its history, with those being the Perso-Arabic, Latin and nowadays Cyrillic script. The use of a specific alphabet generally corresponds with stages in history, ...
. The script is mostly but not exclusively
right-to-left A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
; mathematical expressions, numeric dates and numbers bearing units are embedded from left to right. The script is
cursive Cursive (also known as joined-up writing) is any style of penmanship in which characters are written joined in a flowing manner, generally for the purpose of making writing faster, in contrast to block letters. It varies in functionality and m ...
, meaning most letters in a word connect to each other; when they are typed, contemporary
word processor A word processor (WP) is a device or computer program that provides for input, editing, formatting, and output of text, often with some additional features. Early word processors were stand-alone devices dedicated to the function, but current word ...
s automatically join adjacent letter forms. Persian is unusual among Arabic scripts because a zero-width non-joiner is sometimes entered in a word, causing a letter to become disconnected from others in the same word.


History

The Persian alphabet is directly derived and developed from the
Arabic alphabet The Arabic alphabet, or the Arabic abjad, is the Arabic script as specifically codified for writing the Arabic language. It is a unicase, unicameral script written from right-to-left in a cursive style, and includes 28 letters, of which most ...
. The Arabic alphabet was introduced to the Persian-speaking world after the
Muslim conquest of Persia As part of the early Muslim conquests, which were initiated by Muhammad in 622, the Rashidun Caliphate conquered the Sasanian Empire between 632 and 654. This event led to the decline of Zoroastrianism, which had been the official religion of ...
and the fall of the
Sasanian Empire The Sasanian Empire (), officially Eranshahr ( , "Empire of the Iranian peoples, Iranians"), was an List of monarchs of Iran, Iranian empire that was founded and ruled by the House of Sasan from 224 to 651. Enduring for over four centuries, th ...
in the 7th century. Following this, the
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
language became the principal language of government and religious institutions in
Persia Iran, officially the Islamic Republic of Iran (IRI) and also known as Persia, is a country in West Asia. It borders Iraq to the west, Turkey, Azerbaijan, and Armenia to the northwest, the Caspian Sea to the north, Turkmenistan to the nort ...
, which led to the widespread usage of the Arabic script. Classical
Persian literature Persian literature comprises oral compositions and written texts in the Persian language and is one of the world's oldest literatures. It spans over two-and-a-half millennia. Its sources have been within Greater Iran including present-day ...
and poetry were affected by this simultaneous usage of
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
and Persian. A new influx of
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
vocabulary soon entered the Persian language. In the 8th century, the Tahirid dynasty and Samanid dynasty officially adopted the Arabic script for writing Persian, followed by the Saffarid dynasty in the 9th century, gradually displacing the various
Pahlavi scripts Pahlavi may refer to: Iranian royalty * Seven Parthian clans, ruling Parthian families during the Sasanian Empire * Pahlavi dynasty, the ruling house of Imperial State of Persia/Iran from 1925 until 1979 ** Reza Shah Pahlavi (1878–1944), Shah ...
used for the Persian language earlier. By the 9th-century, the Perso-Arabic alphabet became the dominant form of writing in
Greater Khorasan KhorasanDabeersiaghi, Commentary on Safarnâma-e Nâsir Khusraw, 6th Ed. Tehran, Zavvâr: 1375 (Solar Hijri Calendar) 235–236 (; , ) is a historical eastern region in the Iranian Plateau in West Asia, West and Central Asia that encompasses wes ...
. Under the influence of various Persian Empires, many languages in Central and South Asia that adopted the Arabic script use the Persian Alphabet as the basis of their writing systems. Today, extended versions of the Persian alphabet are used to write a wide variety of
Indo-Iranian languages The Indo-Iranian languages (also known as Indo-Iranic languages or collectively the Aryan languages) constitute the largest branch of the Indo-European language family. They include over 300 languages, spoken by around 1.7 billion speakers ...
, including Kurdish, Balochi,
Pashto Pashto ( , ; , ) is an eastern Iranian language in the Indo-European language family, natively spoken in northwestern Pakistan and southern and eastern Afghanistan. It has official status in Afghanistan and the Pakistani province of Khyb ...
,
Urdu Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
(from Classical Hindustani), Saraiki, Panjabi, Sindhi and Kashmiri. In the past the use of the Persian alphabet was common amongst
Turkic languages The Turkic languages are a language family of more than 35 documented languages, spoken by the Turkic peoples of Eurasia from Eastern Europe and Southern Europe to Central Asia, East Asia, North Asia (Siberia), and West Asia. The Turkic langua ...
, but today is relegated to those spoken within Iran, such as Azerbaijani, Turkmen, Qashqai, Chaharmahali and Khalaj. The
Uyghur language Uyghur or Uighur (; , , or , , ), formerly known as Turki or Eastern Turki, is a Turkic languages, Turkic language with 8 to 13 million speakers (), spoken primarily by the Uyghur people in the Xinjiang Uyghur Autonomous Region of Western ...
in western China is the most notable exception to this. During the
Soviet The Union of Soviet Socialist Republics. (USSR), commonly known as the Soviet Union, was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 until Dissolution of the Soviet ...
period many languages in Central Asia, including Persian, were reformed by the government. This ultimately resulted in the Cyrillic-based alphabet used in Tajikistan today. See: .


Letters

Below are the 32 letters of the modern Persian alphabet. Since the script is cursive, the appearance of a letter changes depending on its position: isolated, initial (joined on the left), medial (joined on both sides) and final (joined on the right) of a word. These include 28 letters of the
Arabic alphabet The Arabic alphabet, or the Arabic abjad, is the Arabic script as specifically codified for writing the Arabic language. It is a unicase, unicameral script written from right-to-left in a cursive style, and includes 28 letters, of which most ...
, in addition to 4 other letters. The names of the letters are mostly the ones used in Arabic except for the Persian pronunciation. The only ambiguous name is , which is used for both and . For clarification, they are often called (literally "-like " after , the name for the letter that uses the same base form) and (literally "two-eyed ", after the contextual middle letterform ), respectively. There are eight Persian letters that are mainly used in Arabic or foreign loanwords and not in native words: , , , , , , , and . These eight letters are also common used in proper names only. Unlike Arabic, the Persian language absolutely does not have
pharyngealization Pharyngealization is a secondary articulation of consonants or vowels by which the pharynx or epiglottis is constricted during the articulation of the sound. IPA symbols In the International Phonetic Alphabet, pharyngealization can be indicate ...
at all. Although the letter is mainly used in Arabic loanwords, there are some native Persian words with this letter: , , etc. The pronunciation of these letters in Persian can differ from their pronunciation in Arabic. For example, the letter ث is pronounced as /s/ in Persian, while it is pronounced as /θ/ in Arabic. *


Overview table

Historically, in
Early New Persian Persian ( ), also known by its endonym Farsi (, Fārsī ), is a Western Iranian language belonging to the Iranian branch of the Indo-Iranian subdivision of the Indo-European languages. Persian is a pluricentric language predominantly spoke ...
, there was a special letter for the sound . This letter is no longer used, as the -sound changed to , e.g. archaic /zaβān/ > 'language'. Another obsolete variant of the twenty-sixth letter is which used to appear in old manuscripts. Another obsolete variant of the twenty-fifth letter is which used to appear in old manuscripts. The archaic letter was also used as a substitute for the twenty-sixth letter of the Persian alphabet, , which was used to appear in the older manuscripts of Persian in the late 18th century to the early 19th century.


Variants


Letter construction

The i'jam diacritic characters are illustrative only; in most typesetting the combined characters in the middle of the table are used. Persian ''yē'' has 2 dots below in the initial and middle positions only. The
standard Arabic Modern Standard Arabic (MSA) or Modern Written Arabic (MWA) is the variety of standardized, literary Arabic that developed in the Arab world in the late 19th and early 20th centuries, and in some usages also the variety of spoken Arabic that ...
version always has 2 dots below.


Letters that do not link to a following letter

Seven letters (, , , , , , ) do not connect to the following letter, unlike the rest of the letters of the alphabet. The seven letters have the same form in isolated and initial position and a second form in medial and final position. For example, when the letter is at the beginning of a word such as ("here"), the same form is used as in an isolated . In the case of ("today"), the letter takes the final form and the letter takes the isolated form, but they are in the middle of the word, and also has its isolated form, but it occurs at the end of the word.


Diacritics

Persian script has adopted a subset of
Arabic diacritics The Arabic script has numerous diacritics, which include consonant pointing known as (, ), and supplementary diacritics known as (, ). The latter include the vowel marks termed (, ; , ', ). The Arabic script is a modified abjad, where all ...
: ( in Arabic), ( in Arabic), and or ( in Arabic, pronounced in
Western Persian Iranian Persian (), Western Persian or Western Farsi, natively simply known as Persian (), refers to the varieties of the Persian language spoken in Iran and by others in neighboring countries, as well as by Iranian communities throughout the ...
), and (
gemination In phonetics and phonology, gemination (; from Latin 'doubling', itself from '' gemini'' 'twins'), or consonant lengthening, is an articulation of a consonant for a longer period of time than that of a singleton consonant. It is distinct from ...
). Other Arabic diacritics may be seen in Arabic loanwords in Persian.


Short vowels

Of the four Arabic diacritics, the Persian language has adopted the following three for short vowels. The last one,
sukūn The Arabic script has numerous diacritics, which include consonant pointing known as (, ), and supplementary diacritics known as (, ). The latter include the vowel marks termed (, ; , ', ). The Arabic script is a modified abjad, where all ...
, which indicates the lack of a vowel, has not been adopted. There is no standard transliteration for Persian. The letters 'i' and 'u' are only ever used as short vowels when transliterating Dari or Tajik Persian. See Persian Phonology Diacritics differ by dialect, due to Dari having 8 distinct vowels compared to the 6 vowels of Farsi. See Persian Phonology In Farsi, none of these short vowels may be the initial or final grapheme in an isolated word, although they may appear in the final position as an
inflection In linguistic Morphology (linguistics), morphology, inflection (less commonly, inflexion) is a process of word formation in which a word is modified to express different grammatical category, grammatical categories such as grammatical tense, ...
, when the word is part of a noun group. In a word that starts with a vowel, the first grapheme is a silent which carries the short vowel, e.g. (, meaning "hope"). In a word that ends with a vowel, letters , and respectively become the proxy letters for , and , e.g. (, meaning "new") or (, meaning "package").


Tanvin (nunation)

Nunation (, ) is the addition of one of three vowel diacritics to a noun or adjective to indicate that the word ends in an alveolar nasal sound without the addition of the letter nun.


Tašdid


Other characters

The following are not actual letters but different orthographical shapes for letters, a ligature in the case of the . As to (''
hamza The hamza ( ') () is an Arabic script character that, in the Arabic alphabet, denotes a glottal stop and, in non-Arabic languages, indicates a diphthong, vowel, or other features, depending on the language. Derived from the letter '' ʿayn'' ( ...
''), it has only one graphical form since it is never tied to a preceding or following letter. However, it is sometimes 'seated' on a , or , and in that case, the seat behaves like an ordinary , or respectively. Technically, ''hamza'' is not a letter but a diacritic. Although at first glance, they may seem similar, there are many differences in the way the different languages use the alphabets. For example, similar words are written differently in Persian and Arabic, as they are used differently. Unicode has accepted in the
Miscellaneous Symbols Miscellaneous Symbols is a Unicode block (U+2600–U+26FF) containing glyphs representing concepts from a variety of categories: astrological, astronomical, chess, dice, musical notation, political symbols, recycling, religious symbols, trig ...
range. In Unicode 1.0 this symbol was known as . It is a stylization of () used as the
emblem of Iran The national emblem of the Islamic Republic of Iran features four curves and a sword, surmounted by a shadda. The emblem was designed by Hamid Nadimi, and was officially approved by Ayatollah Ruhollah Khomeini, the first Supreme Leader of Iran, on ...
. It is also a part of the
flag of Iran The national flag of the Iran, Islamic Republic of Iran, also known as the ''Tricolour Flag of Iran'' (), is a tricolour (flag), tricolour featuring the Pan-Iranian colors comprising equal horizontal bands of green, white and red with the embl ...
. The Unicode Standard has a compatibility character defined that can represent , the Persian name of the currency of Iran.


Novel letters

The Persian alphabet has four extra letters that are not in the Arabic alphabet: , (''ch'' in ''chair''), (''s'' in ''measure''), . An additional fifth letter was used for (''v in'' Spanish ) but it is no longer used.


Deviations from the Arabic script

Persian uses the
Eastern Arabic numerals The Eastern Arabic numerals, also called Indo-Arabic numerals or Arabic-Indic numerals as known by Unicode, are the symbols used to represent numerical digits in conjunction with the Arabic alphabet in the countries of the Mashriq (the east o ...
, but the shapes of the digits 'four' (), 'five' (), and 'six' () are different from the shapes used in Arabic. All the digits also have different codepoints in
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
:


Comparison of different numerals


Word boundaries

Typically, words are separated from each other by a space. Certain morphemes (such as the plural ending '-hâ'), however, are written without a space. On a computer, they are separated from the word using the
zero-width non-joiner The zero-width non-joiner (ZWNJ, ; rendered: ; HTML entity: or ) is a non-printing character used in the computerization of writing systems that make use of Typographic ligature, ligatures. For example, in writing systems that feature initial, ...
.


Cyrillic Persian alphabet in Tajikistan

As part of the
russification Russification (), Russianisation or Russianization, is a form of cultural assimilation in which non-Russians adopt Russian culture and Russian language either voluntarily or as a result of a deliberate state policy. Russification was at times ...
of
Central Asia Central Asia is a region of Asia consisting of Kazakhstan, Kyrgyzstan, Tajikistan, Turkmenistan, and Uzbekistan. The countries as a group are also colloquially referred to as the "-stans" as all have names ending with the Persian language, Pers ...
, the Cyrillic script was introduced in the late 1930s. The alphabet has remained Cyrillic since then. In 1989, with the growth in Tajik nationalism, a law was enacted declaring Tajik the
state language An official language is defined by the Cambridge English Dictionary as, "the language or one of the languages that is accepted by a country's government, is taught in schools, used in the courts of law, etc." Depending on the decree, establishmen ...
. In addition, the law officially equated Tajik with Persian, placing the word ''Farsi'' (the endonym for the Persian language) after Tajik. The law also called for a gradual reintroduction of the Perso-Arabic alphabet. The Persian alphabet was introduced into
education Education is the transmission of knowledge and skills and the development of character traits. Formal education occurs within a structured institutional framework, such as public schools, following a curriculum. Non-formal education als ...
and public life, although the banning of the Islamic Renaissance Party in 1993 slowed adoption. In 1999, the word ''Farsi'' was removed from the state-language law, reverting the name to simply ''Tajik''. the ''de facto'' standard in use is the Tajik Cyrillic alphabet, and only a very small part of the population can read the Persian alphabet.


See also

* Scripts used for Persian *
Romanization of Persian Romanization or Latinization of Persian (, ) is the representation of the Persian language (Iranian Persian, Dari language, Dari and Tajik language, Tajik) with the Latin script. Several different romanization schemes exist, each with its own set ...
* Persian braille * Persian phonology *
Abjad numerals The Abjad numerals, also called Hisab al-Jummal (, ), are a decimal alphabetic numeral system/alphanumeric code, in which the 28 letters of the Arabic alphabet are assigned numerical values. They have been used in the Arab world, Arabic-speaking ...
*
Nastaʿlīq ''Nastaliq'' (; ; ), also romanized as ''Nastaʿlīq'' or ''Nastaleeq'' (), is one of the main calligraphic hands used to write Arabic script and is used for some Indo-Iranian languages, predominantly Classical Persian, Kashmiri, Punjabi a ...
, the calligraphy used to write Persian before the 20th century


References


External links


Dastoore khat
– The Official document in Persian by
Academy of Persian Language and Literature The Academy of Persian Language and Literature (APLL) (, ''Farhangestân-e Zabân-o Adab-e Fârsi'') is the regulatory body for the Persian language, headquartered in Tehran, Iran. Formerly known as the Academy of Iran (, ''Farhangestân-e Iran'' ...
{{DEFAULTSORT:Perso-Arabic Script Arabic alphabets Persian orthography Alphabets Persian scripts