Romani language
   HOME

TheInfoList



OR:

Romani (; also Romany, Romanes , Roma; rom, rromani ćhib, links=no) is an Indo-Aryan macrolanguage of the Romani communities. According to '' Ethnologue'', seven varieties of Romani are divergent enough to be considered languages of their own. The largest of these are Vlax Romani (about 500,000 speakers), Balkan Romani (600,000), and Sinte Romani (300,000). Some Romani communities speak
mixed language A mixed language is a language that arises among a bilingual group combining aspects of two or more languages but not clearly deriving primarily from any single language. It differs from a creole or pidgin language in that, whereas creoles/pidgin ...
s based on the surrounding language with retained Romani-derived vocabulary – these are known by linguists as
Para-Romani Para-Romani are various mixed languages of non- Indo-Aryan linguistic classification containing considerable admixture from the Romani language. They are spoken as the traditional vernacular of Romani communities, Matras, Y. ''Romani: A Lingu ...
varieties, rather than dialects of the Romani language itself. The differences between the various varieties can be as large as, for example, the differences between the
Slavic languages The Slavic languages, also known as the Slavonic languages, are Indo-European languages spoken primarily by the Slavic peoples and their descendants. They are thought to descend from a proto-language called Proto-Slavic, spoken during the ...
.


Name

Speakers of the Romani language usually refer to the language as ' "the Romani language" or '' (adverb)'' "in a Rom way". This derives from the Romani word ', meaning either "a member of the (Romani) group" or "husband". This is also the origin of the term "Roma" in English, although some Roma groups refer to themselves using other demonyms (e.g. 'Kaale', 'Sinti').


Classification

In the 18th century, it was shown by comparative studies that Romani belongs to the Indo-European language family.Šebková, Hana; Žlnayová, Edita (1998)
''Nástin mluvnice slovenské romštiny (pro pedagogické účely)''
. Ústí nad Labem: Pedagogická fakulta Univerzity J. E. Purkyně v Ústí nad Labem: p. 4. . "V 18. století bylo na základě komparatistických výzkumů jednoznačně prokázáno, že romština patří do indoevropské jazykové rodiny a že je jazykem novoindickým" In the 18th century, it was conclusively proved on the basis of comparative studie that Romani belongs to the Indo-European language family and is a New-Indian language"/ref> In 1763 Vályi István, a Calvinist pastor from Satu Mare in
Transylvania Transylvania ( ro, Ardeal or ; hu, Erdély; german: Siebenbürgen) is a historical and cultural region in Central Europe, encompassing central Romania. To the east and south its natural border is the Carpathian Mountains, and to the west the Ap ...
, was the first to notice the similarity between Romani and Indo-Aryan by comparing the Romani dialect of
Győr Győr ( , ; german: Raab, links=no; names in other languages) is the main city of northwest Hungary, the capital of Győr-Moson-Sopron County and Western Transdanubia region, and – halfway between Budapest and Vienna – situated on one of ...
with the language (perhaps Sinhala) spoken by three Sri Lankan students he met in the Netherlands. This was followed by the linguist Johann Christian Christoph Rüdiger (1751–1822) whose book ' (1782) posited Romani was descended from
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
. This prompted the philosopher
Christian Jakob Kraus Christian Jakob Kraus (; 27 July 1753 – 25 August 1807) was a German comparative and historical linguist. Biography A native of Osterode (East Prussia), Kraus studied at the universities of Königsberg and Göttingen. In 1782 he became a pr ...
to collect linguistic evidence by systematically interviewing the Roma in Königsberg prison. Kraus's findings were never published, but they may have influenced or laid the groundwork for later linguists, especially August Pott and his pioneering ' (1844–45). Research into the way the Romani dialects branched out was started in 1872 by the Slavicist
Franz Miklosich Franz Miklosich (german: Franz Ritter von Miklosich, also known in Slovene as ; 20 November 1813 – 7 March 1891) was a Slovene philologist. Early life Miklosich was born in the small village of Radomerščak near the Lower Styrian town of Lju ...
in a series of essays. However, it was the philologist Ralph Turner's 1927 article “The Position of Romani in Indo-Aryan” that served as the basis for the integrating of Romani into the history of Indian languages. Romani is an
Indo-Aryan language The Indo-Aryan languages (or sometimes Indic languages) are a branch of the Indo-Iranian languages in the Indo-European language family. As of the early 21st century, they have more than 800 million speakers, primarily concentrated in India, Pa ...
that is part of the Balkan sprachbund. It is the only New Indo-Aryan spoken exclusively outside the
Indian subcontinent The Indian subcontinent is a list of the physiographic regions of the world, physiographical region in United Nations geoscheme for Asia#Southern Asia, Southern Asia. It is situated on the Indian Plate, projecting southwards into the Indian O ...
.Schrammel, Barbara; Halwachs, Dieter W. (2005). "Introduction". ''General and Applied Romani Linguistics - Proceeding from the 6th International Conference on Romani Linguistics'' (München: LINCOM): p. 1. . Romani is sometimes classified in the Central Zone or Northwestern Zone Indo-Aryan languages, and sometimes treated as a group of its own. Romani shares a number of features with the Central Zone languages. The most significant isoglosses are the shift of Old Indo-Aryan ''r̥'' to ''u'' or ''i'' (
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
', Romani ' 'to hear') and ''kṣ-'' to ''kh'' (Sanskrit ', Romani ' 'eye'). However, unlike other Central Zone languages, Romani preserves many dental clusters (Romani ' 'three', ' 'brother', compare
Hindi Hindi ( Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been ...
', '). This implies that Romani split from the Central Zone languages before the Middle Indo-Aryan period. However, Romani shows some features of New Indo-Aryan, such as erosion of the original nominal case system towards a nominative/oblique dichotomy, with new grammaticalized case suffixes added on. This means that the Romani exodus from India could not have happened until late in the first millennium. Many words are similar to the Marwari and
Lambadi Lambadi, Gor Boli, Banjara, Labanki or Banjari is a language spoken by the once nomadic Banjara people across India,Ancient Pastoral Nomadic Community of India Ancient Warrior Community/Raajputs Medieval Traders/Grain Carriers Modern Grain Tra ...
languages spoken in large parts of India. Romani also shows some similarity to the Northwestern Zone languages. In particular, the grammaticalization of enclitic pronouns as person markers on verbs (' 'done' + ' 'me' → ' 'I did') is also found in languages such as Kashmiri and Shina. This evidences a northwest migration during the split from the Central Zone languages consistent with a later migration to Europe. Based on these data, Yaron Matras views Romani as "kind of Indian hybrid: a central Indic dialect that had undergone partial convergence with northern Indic languages." In terms of its grammatical structures, Romani is conservative in maintaining almost intact the Middle Indo-Aryan present-tense person concord markers, and in maintaining consonantal endings for nominal case – both features that have been eroded in most other modern Indo-Aryan languages. Romani shows a number of phonetic changes that distinguish it from other Indo-Aryan languages – in particular, the devoicing of voiced aspirates (''bh dh gh'' > ''ph th kh''), shift of medial ''t d'' to ''l'', of short ''a'' to ''e'', initial ''kh'' to ''x'', rhoticization of retroflex ''ḍ, ṭ, ḍḍ, ṭṭ, ḍh'' etc. to ''r'' and ''ř'', and shift of inflectional ''-a'' to ''-o''. After leaving the Indian subcontinent, Romani was heavily affected by contact with European languages. The most significant of these was Medieval Greek, which contributed lexically, phonemically, and grammatically to Early Romani (10th–13th centuries). This includes inflectional affixes for nouns, and verbs that are still productive with borrowed vocabulary, the shift to VO word order, and the adoption of a preposed definite article. Early Romani also borrowed from
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian Diaspora, Armenian communities across the ...
and
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
. Romani and Domari share some similarities: agglutination of postpositions of the second layer (or case marking clitics) to the nominal stem, concord markers for the past tense, the neutralisation of gender marking in the plural, and the use of the oblique case as an accusative. This has prompted much discussion about the relationships between these two languages. Domari was once thought to be the "sister language" of Romani, the two languages having split after the departure from the Indian subcontinent, but more recent research suggests that the differences between them are significant enough to treat them as two separate languages within the Central Zone ( Hindustani) group of languages. The Dom and the Rom therefore likely descend from two different migration waves out of India, separated by several centuries.


History

The first attestation of Romani is from 1542 AD in western Europe. The earlier history of the Romani language is completely undocumented, and is understood primarily through comparative linguistic evidence. Linguistic evaluation carried out in the nineteenth century by Pott (1845) and Miklosich (1882–1888) showed the Romani language to be a New Indo-Aryan language (NIA), not a Middle Indo-Aryan (MIA), establishing that the ancestors of the Romani could not have left India significantly earlier than AD 1000. The principal argument favouring a migration during or after the transition period to NIA is the loss of the old system of nominal case, and its reduction to just a two-way case system, nominative vs. oblique. A secondary argument concerns the system of gender differentiation. Romani has only two genders (masculine and feminine). Middle Indo-Aryan languages (named MIA) generally had three genders (masculine, feminine and neuter), and some modern Indo-Aryan languages retain this old system even today. It is argued that loss of the neuter gender did not occur until the transition to NIA. Most of the neuter nouns became masculine while a few feminine, like the neuter (') in the
Prakrit The Prakrits (; sa, prākṛta; psu, 𑀧𑀸𑀉𑀤, ; pka, ) are a group of vernacular Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 3rd century BCE to the 8th century CE. The term Prakrit is usu ...
became the feminine (') in
Hindi Hindi ( Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been ...
and ' in Romani. The parallels in grammatical gender evolution between Romani and other NIA languages have been cited as evidence that the forerunner of Romani remained on the Indian subcontinent until a later period, perhaps even as late as the tenth century. There is no historical proof to clarify who the ancestors of the Romani were or what motivated them to emigrate from the
Indian subcontinent The Indian subcontinent is a list of the physiographic regions of the world, physiographical region in United Nations geoscheme for Asia#Southern Asia, Southern Asia. It is situated on the Indian Plate, projecting southwards into the Indian O ...
, but there are various theories. The influence of
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
, and to a lesser extent of
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian Diaspora, Armenian communities across the ...
and the
Iranian languages The Iranian languages or Iranic languages are a branch of the Indo-Iranian languages in the Indo-European language family that are spoken natively by the Iranian peoples, predominantly in the Iranian Plateau. The Iranian languages are groupe ...
(like
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
and
Kurdish Kurdish may refer to: *Kurds or Kurdish people *Kurdish languages *Kurdish alphabets *Kurdistan, the land of the Kurdish people which includes: **Southern Kurdistan **Eastern Kurdistan **Northern Kurdistan **Western Kurdistan See also * Kurd (dis ...
) points to a prolonged stay in
Anatolia Anatolia, tr, Anadolu Yarımadası), and the Anatolian plateau, also known as Asia Minor, is a large peninsula in Western Asia and the westernmost protrusion of the Asian continent. It constitutes the major part of modern-day Turkey. The ...
, Armenian highlands/Caucasus after the departure from South Asia. The latest territory where Romani is thought to have been spoken as a mostly unitary linguistic variety is the
Byzantine Empire The Byzantine Empire, also referred to as the Eastern Roman Empire or Byzantium, was the continuation of the Roman Empire primarily in its eastern provinces during Late Antiquity and the Middle Ages, when its capital city was Constantinopl ...
, between the 10th and the 13th centuries. The language of this period, which can be reconstructed on the basis of modern-day dialects, is referred to as ''
Early Romani Early Romani (sometimes referred to as Late Proto-Romani) is the latest common predecessor of all forms of the Romani language. It was spoken before the Roma people dispersed throughout Europe. It is not directly attested, but rather linguistic rec ...
'' or ''Late Proto-Romani''. The
Mongol invasion of Europe From the 1220s into the 1240s, the Mongols conquered the Turkic states of Volga Bulgaria, Cumania, Alania, and the Kievan Rus' federation. Following this, they began their invasion into heartland Europe by launching a two-pronged invasion of ...
beginning in the first half of the thirteenth century triggered another westward migration. The Romani arrived in
Europe Europe is a large peninsula conventionally considered a continent in its own right because of its great physical size and the weight of its history and traditions. Europe is also considered a subcontinent of Eurasia and it is located entirel ...
and afterwards spread to the other continents. The great distances between the scattered Romani groups led to the development of local community distinctions. The differing local influences have greatly affected the modern language, splitting it into a number of different (originally exclusively regional) dialects. Today, Romani is spoken by small groups in 42 European countries. A project at
Manchester University , mottoeng = Knowledge, Wisdom, Humanity , established = 2004 – University of Manchester Predecessor institutions: 1956 – UMIST (as university college; university 1994) 1904 – Victoria University of Manchester 1880 – Victoria Univer ...
in England is transcribing Romani dialects, many of which are on the brink of extinction, for the first time.


Dialects

Today's dialects of Romani are differentiated by the vocabulary accumulated since their departure from
Anatolia Anatolia, tr, Anadolu Yarımadası), and the Anatolian plateau, also known as Asia Minor, is a large peninsula in Western Asia and the westernmost protrusion of the Asian continent. It constitutes the major part of modern-day Turkey. The ...
, as well as through divergent phonemic evolution and grammatical features. Many Roma no longer speak the language or speak various new
contact language Language contact occurs when speakers of two or more languages or varieties interact and influence each other. The study of language contact is called contact linguistics. When speakers of different languages interact closely, it is typical for the ...
s from the local language with the addition of Romani vocabulary. Dialect differentiation began with the dispersal of the Romani from the Balkans around the 14th century and on, and with their settlement in areas across Europe in the 16th and 17th centuries. The two most significant areas of divergence are the southeast (with epicenter of the northern Balkans) and west-central Europe (with epicenter Germany). The central dialects replace ' in grammatical paradigms with '. The northwestern dialects append ', simplify ' to ', retain ' in the nominalizer ' / ', and lose adjectival past-tense in intransitives (', ' → ' 'he/she went'). Other isoglosses (esp. demonstratives, 2/3pl perfective concord markers, loan verb markers) motivate the division into Balkan, Vlax, Central, Northeast, and Northwest dialects. Matras (2002, 2005) has argued for a theory of geographical classification of Romani dialects, which is based on the diffusion in space of innovations. According to this theory,
Early Romani Early Romani (sometimes referred to as Late Proto-Romani) is the latest common predecessor of all forms of the Romani language. It was spoken before the Roma people dispersed throughout Europe. It is not directly attested, but rather linguistic rec ...
(as spoken in the Byzantine Empire) was brought to western and other parts of Europe through population migrations of Rom in the 14th–15th centuries. These groups settled in the various European regions during the 16th and 17th centuries, acquiring fluency in a variety of contact languages. Changes emerged then, which spread in wave-like patterns, creating the dialect differences attested today. According to Matras, there were two major centres of innovations: some changes emerged in western Europe (Germany and vicinity), spreading eastwards; other emerged in the Wallachian area, spreading to the west and south. In addition, many regional and local isoglosses formed, creating a complex wave of language boundaries. Matras points to the prothesis of ' in ' > ' 'egg' and ' > ' 'he' as typical examples of west-to-east diffusion, and of addition of prothetic ' in ' > ' as a typical east-to-west spread. His conclusion is that dialect differences formed in situ, and not as a result of different waves of migration. According to this classification, the dialects are split as follows: * Northern Romani dialects in western and northern Europe, southern Italy and the Iberian peninsula * Central Romani dialects from southern
Poland Poland, officially the Republic of Poland, is a country in Central Europe. It is divided into 16 administrative provinces called voivodeships, covering an area of . Poland has a population of over 38 million and is the fifth-most populou ...
,
Slovakia Slovakia (; sk, Slovensko ), officially the Slovak Republic ( sk, Slovenská republika, links=no ), is a landlocked country in Central Europe. It is bordered by Poland to the north, Ukraine to the east, Hungary to the south, Austria to the s ...
,
Hungary Hungary ( hu, Magyarország ) is a landlocked country in Central Europe. Spanning of the Pannonian Basin, Carpathian Basin, it is bordered by Slovakia to the north, Ukraine to the northeast, Romania to the east and southeast, Serbia to the ...
,
Carpathian Ruthenia Carpathian Ruthenia ( rue, Карпатьска Русь, Karpat'ska Rus'; uk, Закарпаття, Zakarpattia; sk, Podkarpatská Rus; hu, Kárpátalja; ro, Transcarpatia; pl, Zakarpacie); cz, Podkarpatská Rus; german: Karpatenukrai ...
and southeastern
Austria Austria, , bar, Östareich officially the Republic of Austria, is a country in the southern part of Central Europe, lying in the Eastern Alps. It is a federation of nine states, one of which is the capital, Vienna, the most populous ...
* Balkan Romani dialects, including the Black Sea coast dialects * Vlax Romani dialects, chiefly associated with the historical Wallachian and Transylvanian regions, with outmigrants in various regions throughout Europe and beyond SIL Ethnologue has the following classification: * Balkan Romani **Arlija ** Dzambazi ** Tinners Romani * Northern Romani ** Baltic Romani ***Estonian Romani ***Latvian Romani (Lettish Romani) *** North Russian Romani *** Polish Romani ***White Russian Romani **
Carpathian Romani Carpathian Romani, also known as Central Romani or Romungro Romani, is a group of dialects of the Romani language spoken from southern Poland to Hungary, and from eastern Austria to Ukraine. North Central Romani is one of a dozen major dialect g ...
(Central Romani) ***East Slovak Romani ***Moravian Romani ***West Slovak Romani ** Finnish Kalo Romani ** Sinte Romani ***Abbruzzesi ***Serbian Romani ***Slovenian-Croatian Romani ** Welsh Romani * Vlax Romani **Churari (Churarícko, Sievemakers) **Eastern Vlax Romani (Bisa) **Ghagar **Grekurja (Greco) ** Kalderash (Coppersmith, Kelderashícko) **Lovari (Lovarícko) **Machvano (Machvanmcko) **North Albanian Romani **Sedentary Bulgaria Romani **Sedentary Romania Romani **Serbo-Bosnian Romani **South Albanian Romani **Ukraine-Moldavia Romani **Zagundzi In a series of articles (beginning in 1982) Marcel Courthiade proposed a different kind of classification. He concentrates on the dialectal diversity of Romani in three successive strata of expansion, using the criteria of phonological and grammatical changes. Finding the common linguistic features of the dialects, he presents the historical evolution from the first stratum (the dialects closest to the Anatolian Romani of the 13th century) to the second and third strata. He also names as "pogadialects" (after the ' dialect of
Great Britain Great Britain is an island in the North Atlantic Ocean off the northwest coast of continental Europe. With an area of , it is the largest of the British Isles, the largest European island and the ninth-largest island in the world. It i ...
) those with only a Romani vocabulary grafted into a non-Romani language (normally referred to as
Para-Romani Para-Romani are various mixed languages of non- Indo-Aryan linguistic classification containing considerable admixture from the Romani language. They are spoken as the traditional vernacular of Romani communities, Matras, Y. ''Romani: A Lingu ...
). A table of some dialectal differences: The first stratum includes the oldest dialects: ' (of
Tirana Tirana ( , ; aln, Tirona) is the capital and largest city of Albania. It is located in the centre of the country, enclosed by mountains and hills with Dajti rising to the east and a slight valley to the northwest overlooking the Adriatic Sea ...
), ' (of Korça), ', ', ', ', ', ' (of Pristina), ' ('), ' ('), ', ', ' (from
Finland Finland ( fi, Suomi ; sv, Finland ), officially the Republic of Finland (; ), is a Nordic country in Northern Europe. It shares land borders with Sweden to the northwest, Norway to the north, and Russia to the east, with the Gulf of B ...
), ', and the so-called '' Baltic dialects''. In the second there are ' (of
Podgorica Podgorica (Cyrillic: Подгорица, ; lit. 'under the hill') is the capital and largest city of Montenegro. The city was formerly known as Titograd (Cyrillic: Титоград, ) between 1946 and 1992—in the period that Montenegro form ...
), ', ', ', ' (of Agia Varvara) The third comprises the rest of the Romani dialects, including ', ', '.


Mixed languages

Some Romanies have developed
mixed language A mixed language is a language that arises among a bilingual group combining aspects of two or more languages but not clearly deriving primarily from any single language. It differs from a creole or pidgin language in that, whereas creoles/pidgin ...
s (chiefly by retaining Romani lexical items and adopting second language grammatical structures), including: * in Northern Europe **
Angloromani Angloromani or Anglo-Romani (literally "English Romani"; also known as Angloromany, Rummaness, or Pogadi Chib) is a mixed language of Indo European origin involving the presence of Romani vocabulary and syntax in the English used by descendants of ...
(in England) ** Scottish Cant (in Lowland Scotland) ** Scandoromani (in Norway & Sweden) * on the Latin Countries and
France France (), officially the French Republic ( ), is a country primarily located in Western Europe. It also comprises of overseas regions and territories in the Americas and the Atlantic, Pacific and Indian Oceans. Its metropolitan area ...
: ** Erromintxela (in the Basque Country) ** Caló (in
Portugal Portugal, officially the Portuguese Republic ( pt, República Portuguesa, links=yes ), is a country whose mainland is located on the Iberian Peninsula of Southwestern Europe, and whose territory also includes the Atlantic archipelagos of ...
,
Brazil Brazil ( pt, Brasil; ), officially the Federative Republic of Brazil (Portuguese: ), is the largest country in both South America and Latin America. At and with over 217 million people, Brazil is the world's fifth-largest country by area ...
and
Spain , image_flag = Bandera de España.svg , image_coat = Escudo de España (mazonado).svg , national_motto = ''Plus ultra'' (Latin)(English: "Further Beyond") , national_anthem = (English: "Royal March") , i ...
). ** Manouche (a variant of Sinte Romani in France and its Mediterranean borders from Spain to Italy) * in
Southeast Europe Southeast Europe or Southeastern Europe (SEE) is a geographical subregion of Europe, consisting primarily of the Balkans. Sovereign states and territories that are included in the region are Albania, Bosnia and Herzegovina, Bulgaria, Croatia (a ...
** Romano-Greek ** Romano-Serbian * in the Caucasus (
Armenia Armenia (), , group=pron officially the Republic of Armenia,, is a landlocked country in the Armenian Highlands of Western Asia.The UNbr>classification of world regions places Armenia in Western Asia; the CIA World Factbook , , and ' ...
) ** Lomavren


Geographic distribution

Romani is the only Indo-Aryan language spoken almost exclusively in Europe. The most concentrated areas of Romani speakers are found in the
Balkans The Balkans ( ), also known as the Balkan Peninsula, is a geographical area in southeastern Europe with various geographical and historical definitions. The region takes its name from the Balkan Mountains that stretch throughout the who ...
and central Europe, particularly in Romania, Bulgaria, North Macedonia and Slovakia. Although there are no reliable figures for the exact number of Romani speakers, the estimated amount of Romani speakers in the European Union is around 3.5 million, this makes it the largest spoken minority language in the European Union.


Status

The language is recognized as a minority language in many countries. At present the only places in the world where Romani is employed as an official language are the Republic of Kosovo (only regionally, not nationally) and the
Šuto Orizari Municipality Šuto Orizari ( mk, ; Balkan Romani: ''Shuto Orizari''; sq, Shutkë), often shortened as ''Šutka'' (Шутка), is one of the ten municipalities that make up the City of Skopje, the capital of the Republic of North Macedonia. ''Šuto Orizari' ...
within the administrative borders of Skopje,
North Macedonia North Macedonia, ; sq, Maqedonia e Veriut, (Macedonia before February 2019), officially the Republic of North Macedonia,, is a country in Southeast Europe. It gained independence in 1991 as one of the successor states of Yugoslavia. It ...
's capital. The first efforts to publish in Romani were undertaken in the interwar
Soviet Union The Soviet Union,. officially the Union of Soviet Socialist Republics. (USSR),. was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 to 1991. A flagship communist state, ...
(using the
Cyrillic script The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking co ...
) and in socialist
Yugoslavia Yugoslavia (; sh-Latn-Cyrl, separator=" / ", Jugoslavija, Југославија ; sl, Jugoslavija ; mk, Југославија ;; rup, Iugoslavia; hu, Jugoszlávia; rue, label=Pannonian Rusyn, Югославия, translit=Juhoslavija ...
. Portions and selections of the Bible have been translated to many different forms of the Romani language. The entire Bible has been translated to Kalderash Romani. Some traditional communities have expressed opposition to codifying Romani or having it used in public functions. However, the mainstream trend has been towards standardization. Different variants of the language are now in the process of being codified in those countries with high Romani populations (for example,
Slovakia Slovakia (; sk, Slovensko ), officially the Slovak Republic ( sk, Slovenská republika, links=no ), is a landlocked country in Central Europe. It is bordered by Poland to the north, Ukraine to the east, Hungary to the south, Austria to the s ...
). There are also some attempts currently aimed at the creation of a unified standard language. A standardized form of Romani is used in Serbia, and in Serbia's autonomous province of Vojvodina, Romani is one of the officially recognized languages of minorities having its own radio stations and news broadcasts. In Romania, a country with a sizable Romani minority (3.3% of the total population), there is a unified teaching system of the Romani language for all dialects spoken in the country. This is primarily a result of the work of Gheorghe Sarău, who made Romani textbooks for teaching Romani children in the Romani language. He teaches a purified, mildly
prescriptive Linguistic prescription, or prescriptive grammar, is the establishment of rules defining preferred usage of language. These rules may address such linguistic aspects as spelling, pronunciation, vocabulary, syntax, and semantics. Sometimes infor ...
language, choosing the original Indo-Aryan words and grammatical elements from various dialects. The pronunciation is mostly like that of the dialects from the first stratum. When there are more variants in the dialects, the variant that most closely resembles the oldest forms is chosen, like ', instead of ', ', ' instead of ', ' instead of ' or ', etc. An effort is also made to derive new words from the vocabulary already in use, ''i.e.'', ' (airplane), ' (slide rule), ' (retrospectively), ' (adjective). There is an ever-changing set of borrowings from Romanian as well, including such terms as ' (weather, time), ' (town hall), ' (cream), ' (saint, holy).
Hindi Hindi ( Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been ...
-based
neologism A neologism Greek νέο- ''néo''(="new") and λόγος /''lógos'' meaning "speech, utterance"] is a relatively recent or isolated term, word, or phrase that may be in the process of entering common use, but that has not been fully accepted int ...
s include ' (bulb, electricity), ' (example), ' (drawing, design), ' (writing), while there are also English language, English-based neologisms, like ' < "to print". Romani is now used on the internet, in some local media, and in some countries as a medium of instruction.


Orthography

Historically, Romani was an exclusively unwritten language; for example, Slovak Romani's orthography was codified only in 1971. The overwhelming majority of academic and non-academic literature produced currently in Romani is written using a Latin-based orthography. The proposals to form a unified Romani alphabet and one standard Romani language by either choosing one dialect as a standard, or by merging more dialects together, have not been successful - instead, the trend is towards a model where each dialect has its own writing system. Among native speakers, the most common pattern for individual authors to use an orthography based on the writing system of the dominant contact language: thus Romanian in
Romania Romania ( ; ro, România ) is a country located at the crossroads of Central, Eastern, and Southeastern Europe. It borders Bulgaria to the south, Ukraine to the north, Hungary to the west, Serbia to the southwest, Moldova to the east, and ...
, Hungarian in
Hungary Hungary ( hu, Magyarország ) is a landlocked country in Central Europe. Spanning of the Pannonian Basin, Carpathian Basin, it is bordered by Slovakia to the north, Ukraine to the northeast, Romania to the east and southeast, Serbia to the ...
and so on. To demonstrate the differences, the phrase /romani tʃʰib/, which means "Romani language" in all the dialects, can be written as ', ', ', ', ', ', ', ', ', ', ' and so on. A currently observable trend, however, appears to be the adoption of a loosely English and Czech-oriented orthography, developed spontaneously by native speakers for use online and through email.


Phonology

The Romani sound system is not highly unusual among European languages. Its most marked features are a three-way contrast between unvoiced, voiced, and aspirated stops: /p, t, k, t͡ʃ/, /b, d, ɡ, d͡ʒ/, and /pʰ, tʰ, kʰ, t͡ʃʰ/ and the presence in some dialects of a second rhotic ⟨ř⟩, realized as retroflex or a long trill ː or uvular The following is the core sound inventory of Romani. Phonemes in parentheses are only found in some dialects: Eastern and Southeastern European Romani dialects commonly have palatalized consonants, either distinctive or allophonic. Some dialects add the central vowel . Vowel length is often distinctive in Western European Romani dialects. Loans from contact languages often allow other non-native phonemes. Conservative dialects of Romani have final stress, with the exception of some unstressed affixes (e.g. the vocative ending, the case endings added on to the accusative noun, and the remoteness tense marker). Central and Western European dialects often have shifted stress earlier in the word. In some varieties such as Slovak Romani, at the end of a word, voiced consonants become voiceless and aspirated ones lose aspiration. Some examples:


Lexicon


Morphology


Nominals

Nominals in Romani are nouns, adjectives, pronouns and numerals. Some sources describe articles as nominals. The indefinite article is often borrowed from the local contact language.


Types

General Romani is an unusual language, in having two classes of nominals, based on the historic origin of the word, that have a completely different morphology. The two classes can be called ''inherited'' and ''borrowed'', but this article uses names from Matras (2006), ''ikeoclitic'' and ''xenoclitic''. The class to which a word belongs is obvious from its ending.


= Ikeoclitic

= The first class is the old, Indian vocabulary (and to some extent
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
, Armenian and Greek loanwords). The ikeoclitic class can also be divided into two sub-classes, based on the ending.


Nominals ending in o/i

The ending of words in this sub-class is -o with masculines, -i with feminines, with the latter ending triggering palatalisation of preceding ''d, t, n, l'' to ''ď, ť, ň, ľ''. Examples: *masculine ** - the son ** - the little ** - our (m.) *feminine ** - non-romani girl ** - small (note the change n > ň) ** - ours (f.)


Nominals without ending

All words in this sub-class have no endings, regardless of gender. Examples: *masculine ** - the brother ** - the nice (m.) ** - the father *feminine ** - the sister ** - the nice (f.) - same as m. ** - the mother


= Xenoclitic

= The second class is loanwords from
European languages Most languages of Europe belong to the Indo-European language family. Out of a total European population of 744 million as of 2018, some 94% are native speakers of an Indo-European language. Within Indo-European, the three largest phyla are Ro ...
. (Matras adds that the morphology of the new loanwords might be borrowed from Greek.) The ending of borrowed masculine is -os, -is, -as, -us, and the borrowed feminine ends in -a. Examples from Slovak Romani: *masculine ** - shoemaker ** - bus ** - teacher (m.) *feminine ** - shirt ** - window ** - teacher (f.) (from ' in Slovak)


Basics of morphology

Romani has two
grammatical gender In linguistics, grammatical gender system is a specific form of noun class system, where nouns are assigned with gender categories that are often not related to their real-world qualities. In languages with grammatical gender, most or all noun ...
s (masculine / feminine) and two numbers (singular / plural). All nominals can be singular or plural.


Cases

Nouns are marked for any of eight cases; nominative,
vocative In grammar, the vocative case (abbreviated ) is a grammatical case which is used for a noun that identifies a person (animal, object, etc.) being addressed, or occasionally for the noun modifiers (determiners, adjectives, participles, and numer ...
, accusative, genitive, dative, locative,
ablative In grammar, the ablative case (pronounced ; sometimes abbreviated ) is a grammatical case for nouns, pronouns, and adjectives in the grammars of various languages; it is sometimes used to express motion away from something, among other uses. ...
, and
instrumental An instrumental is a recording normally without any vocals, although it might include some inarticulate vocals, such as shouted backup vocals in a big band setting. Through semantic widening, a broader sense of the word song may refer to inst ...
. The former three are formed by inflections on the noun itself, but the latter five are marked by adding postpositions to the accusative, used as an "indirect root." The vocative and nominative are a bit "outside" of the case systemŠebková, Žlnayová 1998, p. 52–54 as they are produced only by adding a suffix to the root. Example: the suffix for singular masculine vocative of ikeoclitic types is . * - you, boy (or son)! * - you, little one! * - brother! The oblique cases disregard gender or type: ' / ' (locative), ' / ' (dative), ' (ablative), ' (instrumental and
comitative In grammar, the comitative case (; abbreviated ) is a grammatical case that denotes accompaniment. In English, the preposition "with", in the sense of "in company with" or "together with", plays a substantially similar role (other uses of "with", l ...
), and ' / ' (genitive). Example: The endings for o/i ending nominals are as follows: Example: the suffix for indirect root for masculine plural for all inherited words is ', the dative suffix is '. * - mushroom * - the indirect root (also used as accusative) * – In the summer we go on mushrooms (meaning picking mushrooms) There are many
declension class In linguistics, declension (verb: ''to decline'') is the changing of the form of a word, generally to express its syntactic function in the sentence, by way of some inflection. Declensions may apply to nouns, pronouns, adjectives, adverbs, and ar ...
es of nouns that decline differently, and show dialectal variation. Parts of speech such as adjectives and the article, when they function as attributes before a word, distinguish only between a nominative and an indirect/oblique case form. In the Early Romani system that most varieties preserve, declinable adjectives had nominative endings similar to the nouns ending in ''-o'' (masculine ''-o'', feminine ''-i'') but the oblique endings ''-e'' in the masculine, ''-a'' in the feminine. The ending ''-e'' was the same regardless of gender. So-called athematic adjectives had the nominative forms ''-o'' in the masculine ''and'' the feminine and ''-a'' in the plural; the oblique has the same endings as the previous group, but the preceding stem changes by adding the element ''-on-''.


Agreement

Romani shows the typically Indo-Aryan pattern of the genitive agreeing with its head noun. Example: * ' - 'the boy's brother' * ' - 'the boy's sister'. Adjectives and the definite article show agreement with the noun they modify. Example: * ' - 'my father' * ' - 'my mother'.


Verbs

Romani derivations are highly synthetic and partly agglutinative. However, they are also sensitive to recent development - for example, in general, Romani in Slavic countries show an adoption of productive aktionsart morphology. The core of the verb is the lexical root, verb morphology is suffixed. The verb stem (including derivation markers) by itself has non-perfective aspect and is present or subjunctive.


Types

Similarly to nominals, verbs in Romani belong to several classes, but unlike nominals, these are not based on historical origin. However, the loaned verbs can be recognized, again, by specific endings, which are Greek in origin.


= Irregular verbs

= Some words are irregular, like ' - to be.


= Class I

= The next three classes are recognizable by suffix in 3rd person singular. The first class, called I., has a suffix ' in 3rd person singular. Examples, in 3 ps. sg: *' - to do *' - to hear *' - to see


= Class II

= Words in the second category, called II., have a suffix ' in 3rd person singular. Examples, in 3 ps. sg: *' - to go *' - to be ashamed, shy away. *' - to laugh *' - to believe *' - to eat


= Class III

= All the words in the third class are semantically causative passive. Examples: *' - to learn *' - to burn *' - to be beaten *' - to lie


= Borrowed verbs

= Borrowed verbs from other languages are marked with affixes taken from Greek tense/aspect suffixes, including ', ', and '.


Morphology

The Romani verb has three persons and two numbers, singular and plural. There is no verbal distinction between masculine and feminine. Romani tenses are, not exclusively, present tense, future tense, two past tenses (perfect and imperfect), present or past conditional and present imperative. Depending on the dialect, the suffix ' marks the present, future, or conditional. There are many perfective suffixes, which are determined by root phonology, valency, and semantics: e.g. ' 'did'. There are two sets of personal conjugation suffixes, one for non-perfective verbs, and another for perfective verbs. The non-perfective personal suffixes, continued from Middle Indo-Aryan, are as follows: These are slightly different for consonant- and vowel-final roots (e.g. ' 'you eat', ' 'you want'). The perfective suffixes, deriving from late Middle Indo-Aryan enclitic pronouns, are as follows: Verbs may also take a further remoteness suffix whose original form must have been ' and which is preserved in different varieties as ', ', ' or '. With non-perfective verbs this marks the imperfect, habitual, or conditional. With the perfective, this marks the
pluperfect The pluperfect (shortening of plusquamperfect), usually called past perfect in English, is a type of verb form, generally treated as a grammatical tense in certain languages, relating to an action that occurred prior to an aforementioned time i ...
or counterfactual.


= Class I

= All the persons and numbers of present tense of the word ' in East Slovak Romani.Šebková, Žlnayová 1998, p. 38 Various tenses of the same word, all in 2nd person singular. *present - ' *future - ' (many other dialects use a future particle such as ''ka'' preceding the imperfective form : '') *past imperfect = present conditional - ' *past perfect - ' (' + ' + ') *past conditional - ' (' + ' + ' + ') *present imperative - '


= Class II

= All the persons and numbers of present tense of the word ' in East Slovak Romani. Various tenses of the word ', all in 2nd person singular. *present - ' *future - ' *past imperfect = present conditional - ' *past perfect - ' (irregular - regular form of ' is ') *past conditional - ' *present imperative - '


= Class III

= All the persons and numbers of present tense of the word ' in East Slovak Romani. Note the added ', which is typical for this group. Various tenses of the same word, all in 2nd person singular again. *present - ' *future - ' *past imperfect = present conditional - ' *past perfect - ' (' + ' + ') *past conditional - ' (' + ' + ' + ') *present imperative - '


Valency

Valency markers are affixed to the verb root either to increase or decrease valency. There is dialectal variation as to which markers are most used; common valency-increasing markers are ', ', and ', and common valency-decreasing markers are ' and '. These may also be used to derive verbs from nouns and adjectives.


Syntax

Romani syntax is quite different from most Indo-Aryan languages, and shows more similarity to the
Balkan languages This is a list of languages spoken in regions ruled by Balkan countries. With the exception of several Turkic languages, all of them belong to the Indo-European family. A subset of these languages is notable for forming a well-studied '' sprachb ...
. Šebková and Žlnayová, while describing Slovak Romani, argues that Romani is a free word order language and that it allows for theme-rheme structure, similarly to Czech, and that in some Romani dialects in East Slovakia, there is a tendency to put a verb at the end of a sentence. However, Matras describes it further. According to Matras, in most dialects of Romani, Romani is a
VO language Vo or VO may refer to: Businesses and brands * Austrian Arrows (2003-2015, IATA airline code VO) * VLM Airlines Slovenia (2016-2018, IATA airline code VO) * Seagram's VO Whiskey Language * Volapük language (ISO 639-1 code vo) * VO language, a ...
, with SVO order in contrastive sentences and VSO order in thetic sentences. The tendency of some dialects to put the verb in final position may be due to Slavic influence. Examples, from Slovak Romani: * ' - This cup is cold. * ' - This is a cold cup. Clauses are usually
finite Finite is the opposite of infinite. It may refer to: * Finite number (disambiguation) * Finite set, a set whose cardinality (number of elements) is some natural number * Finite verb, a verb form that has a subject, usually being inflected or marke ...
. Relative clauses, introduced by the
relativizer In linguistics, a relativizer (abbreviated ) is a type of conjunction that introduces a relative clause. For example, in English, the conjunction ''that'' may be considered a relativizer in a sentence such as "I have one that you can use."Fox, Bar ...
''kaj'', are postponed. Factual and non-factual complex clauses are distinguished.


Romani in modern times

Romani has lent several words to English such as ''pal'' (ultimately from Sanskrit ' "brother"Hoad, TF (ed.) ''Oxford Concise Dictionary of Etymology'' (1996) Oxford University Press ) and ''nark'' "informant" (from Romani ' "nose"). Other Romani words in general British slang are ''gadgie'' (man), ''shiv'' or ''chiv'' (knife). Urban British slang shows an increasing level of Romani influence, with some words becoming accepted into the lexicon of standard English (for example, '' chav'' from an assumed Anglo-Romani word, meaning "small boy", in the majority of dialects).the Romany origin of the British ‘chav’
/ref> There are efforts to teach and familiarise Vlax-Romani to a new generation of Romani so that Romani spoken in different parts of the world are connected through a single dialect of Romani. The Indian Institute of Romani Studies,
Chandigarh Chandigarh () is a planned city in India. Chandigarh is bordered by the state of Punjab to the west and the south, and by the state of Haryana to the east. It constitutes the bulk of the Chandigarh Capital Region or Greater Chandigarh, which a ...
published several Romani language lessons through its journal ''Roma'' during the 1970s. Occasionally loanwords from other Indo-Iranian languages, such as
Hindi Hindi ( Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been ...
, are mistakenly labelled as Romani due to surface similarities (due to a shared root), such as ''cushy'', which is from Urdu (itself a loan from
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
') meaning "excellent, healthy, happy".


See also

* Balkan Romani * Bohemian Romani *
Carpathian Romani Carpathian Romani, also known as Central Romani or Romungro Romani, is a group of dialects of the Romani language spoken from southern Poland to Hungary, and from eastern Austria to Ukraine. North Central Romani is one of a dozen major dialect g ...
* Finnish Kalo language * Laiuse Romani language * Lotegorisch *
Romani alphabets The Romani language has for most of its history been an entirely oral language, with no written form in common use. Although the first example of written Romani dates from 1542, it is not until the twentieth century that vernacular writing by native ...
* Romani language standardization * Zargari Romani


Explanatory notes


References


Citations


General and cited sources

* * * * *


Further reading


Iliev, Iv. I. Armak. The System of the Personal Pronouns in the Romani Dialect in and around Kardzhali, Bulgaria (In print)
* * * Walter Simson. ''A History of the Gipsies: with specimens of the Gipsy language. Edited, with preface, introduction, and notes, and a disquisition on the past, present and future of Gipsydom'', by James Simson. London: Sampson Low & Marston, 1865
A History of the Gipsies with Specimens of the Gipsy Language by Walter Simson
* Peter Bakker, Milena Hübschmannová. ''What Is the Romani Language?''. Hatfield: University Of Hertfordshire Press, 2000.
The Zincali : or, An account of the Gypsies of Spain; with an original collection of their songs and poetry
by
George Borrow George Henry Borrow (5 July 1803 – 26 July 1881) was an English writer of novels and of travel based on personal experiences in Europe. His travels gave him a close affinity with the Romani people of Europe, who figure strongly in his work. Hi ...
(1842)
The Zincali, an account of the Gypsies of Spain (1907)

El gitanismo : historia, costumbres, y dialecto de los gitanos

Embéo e Majaró Lucas
* John Sampson. ''The dialect of the gypsies of Wales : being the older form of British Romani preserved in the speech of the clan of Abram Wood.'' Oxford : Clarendon Press, 1926. xxiii, 230 p
The Dialect of the Gypsies of Wales: Being the Older Form of British Romani Preserved in the Speech of the Clan of Abram Wood


External links


Romani project at Manchester University
with a collection of downloadable papers about the Romani language and a collection of links to Romani media
Outline of Romani Grammar
Victor A. Friedman
Partial Romani/English Dictionary
Compiled by Angela Ba'Tal Libal and Will Strain
ROMLEX Lexical Database
of different dialects of Romani
"Romani language in Macedonia in the Third Millennium: Progress and Problems"
, Victor Friedman.
"The Romani Language in the Republic of Macedonia: Status, Usage and Sociolinguistic Perspectives
Victor Friedman. * Romani Wikipedia (head page) {{DEFAULTSORT:Romani Language Languages of Albania Languages of Argentina Languages of Australia Languages of Bosnia and Herzegovina Languages of Brazil Languages of Bulgaria Languages of Canada Languages of Colombia Languages of Finland Languages of France Languages of Hungary Languages of India Languages of Kosovo Languages of Mexico Languages of Moldova Languages of Montenegro Languages of North Macedonia Languages of Norway Languages of Poland Languages of Portugal Languages of Romania Languages of Serbia Languages of Slovakia Languages of South Africa Languages of Spain Languages of Sweden Languages of the Netherlands Languages of the United Kingdom Languages of the United States Languages of Turkey Languages of Ukraine Languages of Vojvodina Vulnerable languages