Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus ''trans-'' + ''liter-'') in predictable ways, such as Greek ⟨α⟩ → ⟨a⟩ and ⟨χ⟩ → the digraph ⟨ch⟩, Cyrillic ⟨д⟩ → ⟨d⟩, Armenian ⟨ն⟩ → ⟨n⟩ or Latin ⟨æ⟩ → ⟨ae⟩.
For instance, for the Greek term ⟨Ελληνική Δημοκρατία⟩, which is usually translated as 'Hellenic Republic', the usual transliteration into the Latin script (romanization) is ⟨Ellēnikḗ Dēmokratía⟩; and the Russian term ⟨Российская Республика⟩, which is usually translated as 'Russian Republic', can be transliterated either as ⟨Rossiyskaya Respublika⟩ or alternatively as ⟨Rossijskaja Respublika⟩.
Transliteration is the process of representing or intending to represent a word, phrase, or text in a different script or writing system. Transliterations are designed to convey the pronunciation of the original word in a different script, allowing readers or speakers of that script to approximate the sounds and pronunciation of the original word. Transliterations do not change the pronunciation of the word. Thus, in the Greek example above, ⟨λλ⟩ is transliterated ⟨ll⟩, though it is pronounced exactly the same way as a single ⟨λ⟩, [l]. ⟨Δ⟩ is transliterated ⟨D⟩ though pronounced as [ð], and ⟨η⟩ is transliterated ⟨ē⟩, though it is pronounced [i] (exactly like ⟨ι⟩) and is not long.
Transcription, conversely, seeks to capture sound, approximating it phonetically in the new script; ⟨Ελληνική Δημοκρατία⟩ corresponds to [elinicí ðimokratía] in the International Phonetic Alphabet. While differentiation is lost in the case of [i], note the allophonic realization of /k/ as a palatalized [c] when preceding front vowels /e/ and /i/.
Angle brackets may be used to set off transliteration, as opposed to slashes for phonemic transcription and square brackets for phonetic transcription. Angle brackets may also be used to set off characters in the original script. Conventions and author preferences vary.
Definitions
Systematic transliteration is a mapping from one system of writing into another, typically grapheme to grapheme. Most transliteration systems are one-to-one, so a reader who knows the system can reconstruct the original spelling.
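The reversibility of a one-to-one system can be sketched in a few lines of Python. The table here is a hypothetical fragment chosen only for illustration (it is not any published romanization standard), and the names `GREEK_TO_LATIN`, `LATIN_TO_GREEK` and `transliterate` are likewise assumptions of this sketch. It covers a handful of unaccented lowercase letters only.

```python
# Illustrative one-to-one table (an assumption of this sketch, not a standard).
GREEK_TO_LATIN = {
    "α": "a", "δ": "d", "ε": "e", "η": "ē", "ι": "i", "κ": "k",
    "λ": "l", "μ": "m", "ν": "n", "ο": "o", "ρ": "r", "τ": "t",
}

# No two source letters share a target, so the table can simply be inverted.
LATIN_TO_GREEK = {latin: greek for greek, latin in GREEK_TO_LATIN.items()}
assert len(LATIN_TO_GREEK) == len(GREEK_TO_LATIN)


def transliterate(text: str, table: dict[str, str]) -> str:
    """Replace each grapheme found in the table; pass everything else through."""
    return "".join(table.get(ch, ch) for ch in text)


romanized = transliterate("δημοκρατια", GREEK_TO_LATIN)
restored = transliterate(romanized, LATIN_TO_GREEK)
print(romanized)
print(restored == "δημοκρατια")
```

A table that maps a single letter to a digraph would need longest-match handling when inverted; the sketch deliberately avoids that case.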
Transliteration, which adapts the written form without altering the pronunciation when spoken out, is opposed to letter transcription, which is a ''letter by letter conversion'' of one language into ''another writing system''. Still, most systems of transliteration map the letters of the source script to letters pronounced similarly in the target script, for some specific pair of source and target language. Transliteration may be very close to letter-by-letter transcription if the relations between letters and sounds are similar in both languages.
For many script pairs, there are one or more standard transliteration systems. However, unsystematic transliteration is common, as is the case for Burmese.
Difference from transcription
In Modern Greek, the letters ⟨η, ι, υ⟩ and the letter combinations ⟨ει, οι, υι⟩ are pronounced [i] (except when pronounced as semivowels), and a modern transcription renders them as ⟨i⟩. However, a transliteration distinguishes them; for example, by transliterating them as ⟨ē, i, y⟩ and ⟨ei, oi, yi⟩. (As the ancient pronunciation of ⟨η⟩ was [ɛː], it is often transliterated as ⟨ē⟩.) On the other hand, ⟨αυ, ευ, ηυ⟩ are pronounced /af, ef, if/, and are voiced to [av, ev, iv] when followed by a voiced consonant, a shift from Ancient Greek /au̯, eu̯, iu̯/. A transliteration would render them all as ⟨au, eu, iu⟩ no matter the environment these sounds are in, reflecting the traditional orthography of Ancient Greek, yet a transcription would distinguish them, based on their phonemic and allophonic pronunciations in Modern Greek. Furthermore, the initial letter ⟨h⟩ reflecting the historical rough breathing ⟨ ̔⟩ in words such as ⟨Hellēnikḗ⟩ would intuitively be omitted in transcription for Modern Greek, as Modern Greek no longer has the /h/ sound.
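The contrast between the two approaches for the Modern Greek vowel spellings can be sketched as two lookup tables over the same keys. This is a minimal illustration in Python, assuming the spellings listed in the paragraph above; the names `TRANSLITERATION`, `TRANSCRIPTION` and `convert` are hypothetical, and the sketch ignores the semivowel exception, accents, and the ⟨αυ, ευ, ηυ⟩ cases.

```python
# Spellings taken from the paragraph above; all other characters pass through.
TRANSLITERATION = {"ει": "ei", "οι": "oi", "υι": "yi", "η": "ē", "ι": "i", "υ": "y"}
# A transcription collapses every one of these spellings to the same sound.
TRANSCRIPTION = {key: "i" for key in TRANSLITERATION}


def convert(text: str, table: dict[str, str]) -> str:
    """Greedy left-to-right conversion that tries digraphs before single letters."""
    out = []
    pos = 0
    while pos < len(text):
        for size in (2, 1):
            chunk = text[pos:pos + size]
            if chunk in table:
                out.append(table[chunk])
                pos += len(chunk)
                break
        else:
            # Not a known spelling: copy the character unchanged.
            out.append(text[pos])
            pos += 1
    return "".join(out)


samples = ["η", "ι", "υ", "ει", "οι", "υι"]
print([convert(s, TRANSLITERATION) for s in samples])  # six distinct spellings
print([convert(s, TRANSCRIPTION) for s in samples])    # one and the same vowel
```

Trying two-character chunks first matters: without it, ⟨ει⟩ would be split into its component letters instead of being treated as one unit.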
Challenges
A simple example of difficulties in transliteration is the Arabic letter qāf. It is pronounced, in literary Arabic, approximately like English [k], except that the tongue makes contact not on the soft palate but on the uvula; the pronunciation varies, however, between different dialects of Arabic. The letter is sometimes transliterated as "g", sometimes as "q" or "'" (for in Egypt it is silent) and rarely even as "k" in English.
Another example is the Russian letter "Х" (kha). It is pronounced as the voiceless velar fricative /x/, like the Scottish pronunciation of ⟨ch⟩ in "loch". This sound is not present in most forms of English and is often transliterated as "kh", as in Nikita Khrushchev. Many languages have phonemic sounds, such as click consonants, which are quite unlike any phoneme in the language into which they are being transliterated.
Some languages and scripts present particular difficulties to transcribers. These are discussed on separate pages. Examples of languages, writing systems, and methods of transliterating them include:
* Ancient Near East
** Transliterating cuneiform languages
** Transliteration of Ancient Egyptian (''see also'' Egyptian hieroglyphs)
** Hieroglyphic Luwian
* Armenian language
** Armenian alphabet
*** Romanization of Armenian
* Avestan
* Brahmic family
** Bengali–Assamese script
*** Romanisation of Assamese
*** Romanisation of Bengali
** Devanagari
*** Devanagari transliteration
** Kannada script
** Malayalam script
*** Romanization of Malayalam
** Meitei script
** Mon–Burmese script
*** Romanization of Burmese
** Pali
** Tamil script
** Tibetan script
*** Wylie transliteration
*** Tibetan pinyin
*** Romanization of Dzongkha
** Tocharian
* Celtic languages
* Chinese language
** Bopomofo
** Chinese characters
*** Transcription into Chinese characters
*** Romanization of Chinese
**** Pinyin (official)
*** Cyrillization of Chinese
* Click languages of Africa
** Khoisan languages
** Bantu languages
* English language
** English alphabet
*** Hebraization of English
* French language
** French alphabet
*** Cyrillization of French
* Georgian language
** Georgian scripts
*** Romanization of Georgian
* Greek language
** Linear B
** Greek alphabet
*** Romanization of Greek
*** Greeklish
* Hmong language
** Pahawh Hmong
** Nyiakeng Puachue Hmong
* Japanese language
** Japanese writing system
*** Romanization of Japanese
**** Hepburn romanization
*** Cyrillization of Japanese
* Khmer language
** Khmer script
*** Romanization of Khmer
* Korean language
** Hangul / Chosŏn'gŭl
*** Romanization of Korean
*** Cyrillization of Korean
* Mongolian language
** Mongolian Cyrillic alphabet
** Mongolian script
*** SASM/GNC romanization
* Northwest Caucasian languages
** Abkhaz language
** Circassian languages
*** Adyghe language
*** Kabardian language
* Pashto
** Pashto alphabet
* Persian language
** Persian alphabet
*** Romanization of Persian
*** Cyrillization of Persian
*** Persian chat alphabet
* Semitic languages
** Amharic
*** Geʽez script
** Arabic
*** Arabic alphabet
**** Romanization of Arabic
**** Arabic chat alphabet
** Hebrew
*** Hebrew alphabet
**** Romanization of Hebrew
** Ugaritic
*** Ugaritic alphabet
*
Slavic languages
The Slavic languages, also known as the Slavonic languages, are Indo-European languages spoken primarily by the Slavs, Slavic peoples and their descendants. They are thought to descend from a proto-language called Proto-Slavic language, Proto- ...
written in the
Cyrillic
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
or
Glagolitic alphabet
The Glagolitic script ( , , ''glagolitsa'') is the oldest known Slavic alphabet. It is generally agreed that it was created in the 9th century for the purpose of translating liturgical texts into Old Church Slavonic by Saints Cyril and Methodi ...
s
**
Romanization of Belarusian
Romanization or Latinization of Belarusian is any system for transliterating written Belarusian from Cyrillic to the Latin alphabet.
Standard systems for romanizing Belarusian
Standard systems for romanizing Belarusian include:
*BGN/PCGN rom ...
**
Romanization of Bulgarian
Romanization of Bulgarian is the practice of transliteration of text in Bulgarian from its conventional Cyrillic orthography into the Latin alphabet. Romanization can be used for various purposes, such as rendering of proper names and place nam ...
**
Romanization of Russian
The romanization of the Russian language (the transliteration of Russian text from the Cyrillic script into the Latin script), aside from its primary use for including Russian names and words in text written in a Latin alphabet, is also essentia ...
**
Romanization of Macedonian
The romanization of Macedonian is the transliteration of text in Macedonian from the Macedonian Cyrillic alphabet into the Latin alphabet. Romanization can be used for various purposes, such as rendering of proper names in foreign contexts, or fo ...
**
Romanization of Serbian
The romanization or Latinization of Serbian is the representation of the Serbian language using Latin letters. Serbian is written in two alphabets, Serbian Cyrillic, a variation of the Cyrillic alphabet, and Gaj's Latin, or ''latinica'', a variat ...
**
Romanization of Ukrainian
The romanization of Ukrainian, or Latinization of Ukrainian, is the representation of the Ukrainian language in Latin letters. Ukrainian is written in its own Ukrainian alphabet, which is based on the Cyrillic script. Romanization may be employ ...
*
Tai languages
The Tai, Zhuang–Tai, or Daic languages (Ahom: 𑜁𑜪𑜨 𑜄𑜩 or 𑜁𑜨𑜉𑜫 𑜄𑜩) are a branch of the Kra–Dai language family. The Tai languages include the most widely spo ...
**
Lao language
Lao, sometimes referred to as Laotian, is the official language of Laos and a significant language in the Isan region of northeastern Thailand, where it is usually referred to as the Isan language. Spoken by over 3 million people in ...
***
Lao script
Lao script or Akson Lao is the primary script used to write the Lao language and other languages in Laos. Its earlier form, the Tai Noi script, was also used to write the Isan language, but was replaced by the Thai script. It has 27 co ...
****
Romanization of Lao
**
Thai language
Thai, or Central Thai (historically Siamese; although "Thai" and "Central Thai" have become more common, the older term, "Siamese", is still used by linguists, especially when it is being distinguished from other Tai languages (Diller 2008:6 ...
***
Thai script
The Thai script is the abugida used to write Thai, Southern Thai and many other languages spoken in Thailand. The Thai script itself (as used to write Thai) has 44 consonant symbols, 16 vowel s ...
****
Romanization of Thai
*
Turkic language
The Turkic languages are a language family of more than 35 documented languages, spoken by the Turkic peoples of Eurasia from Eastern Europe and Southern Europe to Central Asia, East Asia, North Asia (Siberia), and West Asia. The Turkic langua ...
**
Old Turkic
Old Siberian Turkic, generally known as East Old Turkic and often shortened to Old Turkic, was a Siberian Turkic language spoken around East Turkistan and Mongolia. It was first discovered in inscriptions originating from the Second Turkic Kh ...
***
Old Turkic script
The Old Turkic script (also known variously as Göktürk script, Orkhon script, Orkhon-Yenisey script, Turkic runes) was the alphabet used by the Göktürks and other early Turkic khanates from the 8th to 10th centuries to recor ...
**
Azerbaijani language
Azerbaijani or Azeri, also referred to as Azerbaijani Turkic or Azerbaijani Turkish, is a Turkic language from the Oghuz sub-branch. It is spoken primarily by the Azerbaij ...
***
Azerbaijani alphabets
**
Kazakh language
Kazakh is a Turkic language of the Kipchak branch spoken in Central Asia by Kazakhs. It is closely related to Nogai, Kyrgyz and Karakalpak. It is the official language of Kazakhstan, and has official status in the Altai Republic of Russia ...
***
Kazakh alphabets
The Kazakh language was written mainly in four scripts at various points of time (Old Turkic, Cyrillic, Latin, and Arabic), each having a distinct alphabet. The Arabic script i ...
**
Kyrgyz language
Kyrgyz is a Turkic language of the Kipchak branch spoken in Central Asia. Kyrgyz is the official language of Kyrgyzstan and a significant minority language in the Kizilsu Kyrgyz Autonomous Prefecture in Xinjiang, China and in the Gorno-Badak ...
***
Kyrgyz alphabets
The Kyrgyz alphabets are the alphabets used to write the Kyrgyz language. Kyrgyz uses the following alphabets:
*The Cyrillic script is officially used in the Kyrgyz Republic (Kyrgyzstan)
*The Perso-Arabic script is officially used in Afghanistan ...
**
Turkmen language
Turkmen is a Turkic language of the Oghuz branch spoken by the Turkmens of Central Asia. It has an estimated 4.7 million native speakers in Turkmenistan (where it is the official language), and a further 359,000 speakers i ...
***
Turkmen alphabet
The Turkmen alphabet refers to variants of the Latin alphabet, Cyrillic alphabet, or Arabic alphabet used for writing the Turkmen language.
The modified variant of the Latin alphabe ...
**
Uyghur language
Uyghur or Uighur, formerly known as Turki or Eastern Turki, is a Turkic language with 8 to 13 million speakers, spoken primarily by the Uyghur people in the Xinjiang Uyghur Autonomous Region of Western ...
***
Uyghur alphabets
Uyghur is a Turkic language with a long literary tradition spoken in Xinjiang, China by the Uyghurs. Today, the Uyghur Arabic alphabet is the official writing system used for Uyghur in Xinjiang, whereas other alphabets like the Uyghur Cyrilli ...
**
Uzbek language
Uzbek is a Karluk Turkic language spoken by Uzbeks. It is the official and national language of Uzbekistan and formally succeeded Chagatai, an earlier Karluk language, as the literary language of Uzbekistan in the 19 ...
***
Uzbek alphabet
*
Urdu language
Urdu is an Indo-Aryan language spoken chiefly in South Asia. It is the national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
**
Urdu alphabet
The Urdu alphabet is the right-to-left alphabet used for writing Urdu. It is a modification of the Persian alphabet, which itself is derived from the Arabic script. It has co-official status in the republics of Pakistan, India and South Afri ...
(
Nastaliq
''Nastaliq'', also romanized as ''Nastaʿlīq'' or ''Nastaleeq'', is one of the main calligraphic hands used to write Arabic script and is used for some Indo-Iranian languages, predominantly Persi ...
)
***
Romanization of Urdu
Roman Urdu is the name used for the Urdu language written with the Latin script, also known as Roman script.
According to the Urdu scholar Habib R. Sulemani: "Roman Urdu is strongly opposed by the traditional Arabic script lovers. Despite thi ...
Adopted
*
Buckwalter transliteration
*
Devanagari transliteration
Devanagari is an Indic script used for many Indo-Aryan languages of North India and Nepal, including Hindi, Marathi and Nepali, which was the script used to write Classical Sanskrit. There are several somewhat similar methods of translite ...
*
Hans Wehr transliteration
*
International Alphabet of Sanskrit Transliteration
The International Alphabet of Sanskrit Transliteration (IAST) is a transliteration scheme that allows the lossless romanisation of Indic scripts as employed by Sanskrit and related Indic languages. It is based on a scheme that emerged during ...
*
Scientific transliteration of Cyrillic
Scientific transliteration, variously called ''academic'', ''linguistic'', ''international'', or ''scholarly transliteration'', is an international system for transliteration of text from the Cyrillic script to the Latin script (romanization). Th ...
*
Transliteration of Ancient Egyptian
As used for Egyptology, transliteration of Ancient Egyptian is the process of converting (or mapping) texts written as Egyptian language symbols to alphabetic symbols representing uniliteral hieroglyphs or their hieratic and D ...
*
Transliterations of Manchu
*
Wylie transliteration
Wylie transliteration is a method for transliterating Tibetan script using only the letters available on a typical English-language typewriter. The system is named for the American scholar Turrell V. Wylie, who created the system ...
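The systems listed above all rest on the same idea: a fixed table that maps each source letter to one or more target letters. As a rough illustration only, the following Python sketch applies a small, deliberately incomplete table in the spirit of scientific transliteration of Cyrillic. The names `SCIENTIFIC_CYRILLIC` and `transliterate` are hypothetical, and the table is an assumption that covers just a handful of letters; it is not a full implementation of any standard.

```python
# Hypothetical, partial letter table for illustration; a real scheme
# covers the whole alphabet and may need context-dependent rules.
SCIENTIFIC_CYRILLIC = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "е": "e",
    "ж": "ž", "з": "z", "и": "i", "к": "k", "л": "l", "м": "m",
    "н": "n", "о": "o", "п": "p", "р": "r", "с": "s", "т": "t",
    "у": "u", "ч": "č", "ш": "š",
}


def transliterate(text, table):
    """Replace each character found in `table`; pass everything else through."""
    out = []
    for ch in text:
        latin = table.get(ch.lower())
        if latin is None:
            # Character not covered by the table: keep it unchanged.
            out.append(ch)
        elif ch.isupper():
            # Preserve capitalisation of the source letter.
            out.append(latin.capitalize())
        else:
            out.append(latin)
    return "".join(out)


print(transliterate("Москва", SCIENTIFIC_CYRILLIC))
```

Because the lookup is purely letter by letter, the sketch cannot express rules that depend on neighbouring letters or word position, which some romanization systems require.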
See also
*
Cyrillization
*
International Components for Unicode
International Components for Unicode (ICU) is an open-source project of mature C/C++ and Java libraries for Unicode support, software internationalization, and software globalization. ICU is widely portable to many operating systems and envir ...
*
ISO 15924
*
Latin script
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
*
List of ISO transliterations
A list is a set of discrete items of information collected and set forth in some format for utility, entertainment, or other purposes. A list may be memorialized in any number of ways, including existing only in the mind of the list-maker, but ...
*
Orthographic transcription
Orthographic transcription is a transcription method that employs the standard spelling system of each target language. Hayes, Bruce (2011), ''Introductory Phonology'', John Wiley & Sons, ISBN 9781444360134: "The term orthographic transcription simply means ...
*
Phonemic orthography
A phonemic orthography is an orthography (system for writing a language) in which the graphemes (written symbols) correspond consistently to the language's phonemes (the smallest units of speech that can differentiate words), or more generally ...
*
Phonetic transcription
Phonetic transcription (also known as Phonetic script or Phonetic notation) is the visual representation of speech sounds (or ''phonetics'') by means of symbols. The most common type of phonetic transcription uses a phonetic alphabet, such as the ...
*
Romanization
In linguistics, romanization is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and tra ...
*
Spread of the Latin script
*
Substitution cipher
In cryptography, a substitution cipher is a method of encrypting in which units of plaintext are replaced with the ciphertext, in a defined manner, with the help of a key; the "units" may be single letters (the most common), pairs of letters, t ...
*
Transcription (linguistics)
In linguistics, transcription is the systematic representation of spoken language in written form. The source can either be utterances (''speech'' or ''sign language'') or preexisting text in another writing system.
External links
International Components for Unicode transliteration services – history of the transliteration of Slavic languages into Latin alphabets.
Transliteration of Non-Latin scripts– Collection of transliteration tables for many non-Latin scripts maintained by Thomas T. Pedersen.
Unicode Transliteration Guidelines
United Nations Group of Experts on Geographical Names (UNGEGN) – working group on Romanization Systems.
Library of Congress: Romanization Tables
Localtyping.com implements the Google transliteration library and also allows creating to-do lists in English and transliterated languages.
24x7offshoring.com Transliteration English.
Usage of Transliterations– condensed description of the definition of transliteration and its usage.
* G. Gerych, Transliteration of Cyrillic Alphabets. Ottawa University, April 1965. 126 pp. – historical overview of the concept of transliteration and its evolution and application.