Sora (Sora pronunciations or ) is a south
Munda language of the
Austroasiatic language of the
Sora people, an ethnic group of eastern India, mainly in the states of
Odisha
Odisha (), formerly Orissa (List of renamed places in India, the official name until 2011), is a States and union territories of India, state located in East India, Eastern India. It is the List of states and union territories of India by ar ...
and
Andhra Pradesh
Andhra Pradesh (ISO 15919, ISO: , , AP) is a States and union territories of India, state on the East Coast of India, east coast of southern India. It is the List of states and union territories of India by area, seventh-largest state and th ...
. Sora contains very little formal literature but has an abundance of folk tales and traditions. Most of the knowledge passed down from generation to generation is transmitted orally. Like many languages in eastern India, Sora is listed as 'vulnerable to extinction' by
UNESCO
The United Nations Educational, Scientific and Cultural Organization (UNESCO ) is a List of specialized agencies of the United Nations, specialized agency of the United Nations (UN) with the aim of promoting world peace and International secur ...
.
Sora speakers are concentrated in Odisha and Andhra Pradesh. The language is endangered according to the
International Mother Language Institute (IMLI).
Distribution
Speakers are concentrated mainly in
Ganjam District
Ganjam is a district in the Indian state of Odisha. Ganjam's total area is 8,206 km² (3,168 mi²).
The district headquarters is Chhatrapur. Ganjam is divided into three sub-divisions: Chhatrapur, Berhampur, and Bhanjanagar.
As of ...
,
Gajapati District (including the central Gumma Hills region (Gumma Block),
[Anderson, Gregory D.S (ed). 2008. ''The Munda languages''. Routledge Language Family Series 3.New York: Routledge. .] and
Rayagada District
Rayagada district is a district in the southern Odisha States and union territories of India, state in India. Rayagada became a separate district in October 1992. Its population consists mainly of Adivasi, tribes, primarily the Khonds and t ...
, and are also found in adjacent areas such as
Koraput and
Phulbani districts; other communities exist in northern
Andhra Pradesh
Andhra Pradesh (ISO 15919, ISO: , , AP) is a States and union territories of India, state on the East Coast of India, east coast of southern India. It is the List of states and union territories of India by area, seventh-largest state and th ...
(
Vizianagaram District
Vizianagaram district is one of the six districts in the Uttarandhra region of the States and union territories of India, Indian state of Andhra Pradesh with its headquarters located at Vizianagaram. The district was once the part of ancient Ka ...
, Parvatipuram Manyam District and
Srikakulam District
Srikakulam district is a district in the Indian state of Andhra Pradesh, located in the Uttarandhra region of the state, with its headquarters located at Srikakulam. It is one of the six districts, located in the extreme northeastern direction ...
).
History
The Sora language has faced a wavelike pattern of usage—that is, the number of people who speak Sora climbed steadily for decades before crashing down. In fact, the number of people who spoke Sora went from 157 thousand in 1901 to 166 thousand in 1911.
In 1921, this number marginally rose to 168 thousand and kept climbing.
[ In 1931, speaker numbers jumped to 194 thousand but in 1951, a period of exponential growth occurred, with speaker numbers jumping to 256 thousand.][ in 1961, numbers topped at 265 thousand speakers before crashing down in 1971 when speaker numbers dropped back down to 221 thousand.][
]
Culture
Sora is spoken by the Sora people, who are a part of the Adivasi
The Adivasi (also transliterated as Adibasi) are heterogeneous tribal groups across the Indian subcontinent. The term is a recent invention from the 20th century and is now widely used as a self-designation by groups classified as Scheduled Tr ...
, or tribal people, in India, making Sora an Adivasi language. Sora is found in close proximity to Odia and Telugu speaking peoples so that many Sora people are bilingual.[ Sora had little literature except for a few songs and folk tales which are usually transmitted orally.][
]
Phonology
On a similar note, our understanding of Sora phonology is limited at best but there are some generalizations that can be made. Most syllables are of the Consonant, Vowel, Consonant form and morphemes usually contain one to three syllables. There are 18 identifiable consonants and they fall into most of the established origins of sound. Five consonants originate from the palate while only one consonant originates from the glottis. An interesting facet of Sora consonant
In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract, except for the h sound, which is pronounced without any stricture in the vocal tract. Examples are and pronou ...
s is that they contain an inherent ɘ vowel.[Sora Sompeng. (n.d.). Retrieved April 15, 2017, from http://scriptsource.org/cms/scripts/page.php?item_id=script_detail&key=Sora] Although vowels may be pronounced differently, there exist only six vowels in Sora. There are no diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
s and aspiration varies depending on the speaker. It is likely that the influence of English, Odia, and Telugu has also affected vowel pronunciation over the course of Sora's use. Pronunciations also change in prevocalic (occurring before a vowel) and non prevocalic environments.
Consonants
Vowels
Except schwa, all Sora vowels have long counterparts. They may be stressed or unstressed. According to Ramamurti (1986), vowels can be short, half-long, or long. Vowel length may denote expressive formations for certain stems, eg. ''sura'' ('big') and ''suːra'' ('really big'), but these require further studies.
Horo & Sarmah (2015) reported for the Sora Assam
Assam (, , ) is a state in Northeast India, northeastern India, south of the eastern Himalayas along the Brahmaputra Valley, Brahmaputra and Barak River valleys. Assam covers an area of . It is the second largest state in Northeast India, nor ...
dialect's vowel inventory.
Morphophonology
Sora consonants and vowels can undergo a process of sound alternation at prosodic level based on stress-shifts and morphosyntactic conditioned during which the consonants and vowels assimilate to match with the sound of preceding or following stem, or the final nasal with the initial obstruent of the following word. By doing this, some suffixes will merge with its verb phonotactically and a word can have several allomorphs depending on morphological properties of morpheme initials and codas produced during a casual or rapid speech.
Grammar
Overview
Sora is polysynthetic
In linguistic typology, polysynthetic languages, formerly holophrastic languages, are highly synthetic languages, i.e., languages in which words are composed of many morphemes (word parts that have independent meaning but may or may not be able t ...
and noun-incorporating. A single Sora word can convey the meaning of a whole sentence. However, while researchers consider Sora sentence-words to be single individual words, native Sora speakers perceive them as phrases and break them into sequences of iambic words with a rising contour.
For example:
The grammatically correct form in Sora however requires a subject:
A full sentence in Sora:
Sora uses grammatical devices, including subject and object agreement, word order, and noun compounding to show case. It is seen as a predominantly nominative-accusative language and once again differs from most other languages with its lack of a passive structure. However, just because Sora lacks a passive case does not mean other established forms of grammatical case
Case or CASE may refer to:
Instances
* Instantiation (disambiguation), a realization of a concept, theme, or design
* Special case, an instance that differs in a certain way from others of the type
Containers
* Case (goods), a package of relate ...
are also missing. Rather, Sora has some complex grammatical cases (In Nominal morphology).[
In addition, Sora, like many other Munda languages, uses relator nouns to link nouns with the other parts of the sentence in order to provide a more specific meaning, called compounding.] These monosyllabic nouns that enhance meaning are called Semantic relator nouns and are used widely in Sora.[ Sora also has a combining form for every noun in addition to the full form of the noun.] The combining form allows the noun to be attached to a verb root to create a more semantically complex word, similar to compounding in other languages.[ Sora contains prefixes, infixes, and suffixes to form its affixation but only uses its suffixes to change the possession of nouns.] The combining form is the form seen when the noun is being used with a verb or another full formed noun.[ The full form is the form seen when the noun is standing alone or functioning not in tandem with other parts of speech.][ Some templates of Sora combinations between nouns and verbs are as follows:][
Verb + Combined Form
Verb + Combined Form + Combined Form
Full Form + Combined Form
Full Form + Combined Form + Combined Form
An example of a Full Form noun shortened into the Combined Form is as follows: ''mənra,'' the Full Form of man, transform into the combined form word --''mər'' . The two—indicate that a Noun (Full or Combined) or Verb has to precede the Combined Form noun; that is the Combined Form Noun can not stand on its own.][ Although by no means conclusive, a few general guidelines about the Combined Form is that it depends on where the combination with the verb or other noun is to take place.][ If the combined form is to an infix, then its resulting form will be different from if it were to be combined as a prefix. Some examples of Full Form Nouns and their Combined Forms are as follows:][
Full Form Combined Form English Translation
ədɘ'ŋ --dɘ'ŋ honeycomb
ərɘ'ŋ --rɘ'ŋ sour
bɘ'nra'j --bɘn flour
ba'ra' --bal gun barrel
kəṛíŋ—diŋ drum
]
Nominal morphology
Number
Plural in nouns and verbs is marked by enclitic/suffix ''-ji'' positions just right behind noun-suffix ''-ɘn.'' Animate nouns generally can attach (not obligatorily) the plural suffix, but inanimate nouns may often not.
The plural suffix is not attached after countable numerals and finite numbers but it may trigger plural-verb agreement. Numerals can form compounds with nouns; in those cases, they are nominalized and become plural marking by themselves.
Pronouns
Third person pronouns can be used as definite markers in noun phrases. A reflexive pronoun can be formed with the reflexive enclitic ''=dəm.'' For example, anɪnji + =dəm → ''anɪnji=dəm'' 'themselves'.
Demonstratives
Demonstratives in Sora listed by Starosta (1967).
Cases
Case marking in Sora is generally murky. There is no role marker to show syntactic alignments between subjects and objects, i.e. nominative-accusative nor ergative-absolutive. A number of grammatical constructions that may or may not be expressed morphologically into an animate primary object argument of the verb, eg. the oblique-dative marker -''dɔ �ɔ�''- can manifest in standalone morpheme as ''adɔŋ''.
* Absolutive
In grammar, the absolutive case (abbreviated ) is the case of nouns in ergative–absolutive languages that would generally be the subjects of intransitive verbs or the objects of transitive verbs in the translational equivalents of nominative� ...
?–Nominal ''-n''
* Dative
In grammar, the dative case (abbreviated , or sometimes when it is a core argument) is a grammatical case used in some languages to indicate the recipient or beneficiary of an action, as in "", Latin for "Maria gave Jacob a drink". In this exampl ...
– Oblique ''-dɔ �ɔ�-''
* Adessive ''maŋ-''
* Possessive
A possessive or ktetic form (Glossing abbreviation, abbreviated or ; from ; ) is a word or grammatical construction indicating a relationship of possession (linguistics), possession in a broad sense. This can include strict ownership, or a numbe ...
''a-'' and ''-a''
* Allative ''-ban''
* Locative
In grammar, the locative case ( ; abbreviated ) is a grammatical case which indicates a location. In languages using it, the locative case may perform a function which in English would be expressed with such prepositions as "in", "on", "at", and " ...
– Inessive– Illative ''-leŋ-ən'' and ''-leŋ''
Gender & Class
Grammatical gender or class is not deeply encoded in Sora morphosyntax. To signal something masculine/feminine, Sora speakers utilize indigenous compound endings =mar (male, person) and =boj (female) while also use, albeit rarely, gender suffixes borrowed from Indo-Aryan like ''-a'' and ''-i''.
Adjectives
Many Sora adjectives are nominal compounds, i.e. phrase-words, and pre-nominal with modificational function.
Adpositions
Adverbs
Adverbs are uninflected. Some Sora adverbs are: ''tiki'' 'after', ''tikki'' 'afterwards', ''mailen'' 'together', ''dɔ'' 'so', ''əntɚpsɛlɛ'' 'therefore', ''biɲdɔ'' 'but', ''bɔibɔi'' 'very', ''annɚŋ'' 'during', ''nam'' 'now', ''aŋaːnʼaŋaːn'' 'sometimes', ''moyed'' 'previous', ''moyedʼmoyed'' 'recently', etc.
Verbal morphology
Sora verbal morphology makes use of prefixes, infixes, and suffixes per grammatical categories. In typical Munda synthetic structure, the verb phrase in Sora is head-final subject-object-verb SOV. However, Sora has developed an elaborate and productive noun incorporation system which appears to have originated from an earlier offshoot of proto-Munda. Its noun incorporation clearly distinguishes free form and incorporated forms of lexical nouns. In polysynthetic morphosyntax, Sora verb phrases display a strict head-first SVO order like those typically seen in non-Munda Austroasiatic languages
The Austroasiatic languages ( ) are a large language family spoken throughout Mainland Southeast Asia, South Asia and East Asia. These languages are natively spoken by the majority of the population in Vietnam and Cambodia, and by minority popu ...
. The most intriguing aspect of Sora syntactic noun incorporation is ''transitive subject incorporation'', describes that the language allows transitive verb
A transitive verb is a verb that entails one or more transitive objects, for example, 'enjoys' in ''Amadeus enjoys music''. This contrasts with intransitive verbs, which do not entail transitive objects, for example, 'arose' in ''Beatrice arose ...
to incorporate (absorb) its transitive subject/agent with the verb stems remaining transitive and object indexing stays active even after being incorporated.
Verb serialization and clause-chaining can be realized by forming compound of verb1-verb2-nonpast. It also works with pairs of incorporated nouns.
Agreement markers
Object indexing is not obligatory in Sora. In some cases, the object may be omitted or suppressed.
Possessor of an incorporated direct object is marked by pronominal object markers, therefore Sora incorporation is not entirely a valency-reducing process like in many languages.
Syntax
In Sora, the basic clausal constituent order is SOV.
Vocabulary
Compared to other Munda languages such as Kharia whose vocabulary is reported as having 40 percent of words borrowed from Indo-Aryan, Sora has very few, if not negligible, number of foreign loan words. Sora also has zero foreign phonemes. (Donegan & Stampe, 2004) Sora borrows words from surrounding languages like Telugu and Oriya.[ An example of a word borrowed from Oriya is ''kɘ'ra'ñja''' which is a tree name.][ From Telugu ''mu'nu, which means black gram, is borrowed.][ Ramamurti (1931) identified three Sora words that apparently were borrowed from ]Prakrit
Prakrit ( ) is a group of vernacular classical Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 5th century BCE to the 12th century CE. The term Prakrit is usually applied to the middle period of Middle Ind ...
during ancient time: ''siŋger'' (green ginger, cf. Pali
Pāli (, IAST: pāl̤i) is a Classical languages of India, classical Middle Indo-Aryan languages, Middle Indo-Aryan language of the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pali Canon, Pāli Can ...
singi vera and Sanskrit
Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
sr͎inga veram), ''kaːrella'' (bitter melon, cf. Sanskrit kāravella), and ''keda'' (fragrant screwpine, cf. Sanskrit keta-ki). Moreover, within the Munda family itself most words appear to be mutually intelligible owing to minor differences in pronunciations and phonology. Kharia and Korku, two other Munda languages, share mutually intelligible words with Sora.[ For example, the number 11 in Kharia is ''ghol moŋ,'' in Korku it is ''gel ḑo miya,'' and in Sora it is ''gelmuy.''][ Each 11 in each language looks and sounds remarkably similar to the other 11's. This phenomenon is not just contained in numbers but rather a great deal of vocabulary is mutually intelligible among the Munda languages. Within the Austroasiatic language family more knowledge about Sora vocabulary can be found. The ]Mon-Khmer
The Austroasiatic languages ( ) are a large language family spoken throughout Mainland Southeast Asia, South Asia and East Asia. These languages are natively spoken by the majority of the population in Vietnam and Cambodia, and by minority popu ...
language family which encompasses the languages primarily spoken in Southeast Asia has lexical cognate
In historical linguistics, cognates or lexical cognates are sets of words that have been inherited in direct descent from an etymological ancestor in a common parent language.
Because language change can have radical effects on both the s ...
s with the Munda family. That means that some words found in Sora are of direct proto-Austroasiatic origin and share similarities with other derived Austroasiatic language families. Words that relate to the body, family, home, field, as well as pronouns, demonstratives, and numerals are the ones with the most cognates.
Numerals
The Sora numeral system
A numeral system is a writing system for expressing numbers; that is, a mathematical notation for representing numbers of a given set, using digits or other symbols in a consistent manner.
The same sequence of symbols may represent differe ...
uses a base 12, which only a few other languages in the world do. Ekari, for example, uses a base 60 system. For example, 39 in Sora arithmetic
Arithmetic is an elementary branch of mathematics that deals with numerical operations like addition, subtraction, multiplication, and division. In a wider sense, it also includes exponentiation, extraction of roots, and taking logarithms.
...
would be thought of as (1 * 20) + 12 + 7. Here are the first 12 numerals in the Sora language :[
English: one two three four five six seven eight nine ten eleven twelve
''Sora: aboy bago yagi unji monloy tudru gulji thamji tinji gelji gelmuy migel''
Similar to how English uses the suffix from the numeral ''ten'' after ''twelve'' (such as ''thirteen, fourteen,'' etc.), Sora also uses a suffix assignment to numerals after 12 and before 20. Thirteen in Sora is expressed as ''migelboy'' (12+1), fourteen as ''migelbagu'' (12+2), etc.][ Between numerals 20 and 99, Sora adds the suffix ''kuri'' to the first constituent of the numeral. For example, 31 is expressed as ''bokuri gelmuy'' and 90 as ''unjikuri gelji''.][
The Sora number system was featured in a puzzle by Lera Boroditsky, found in the More Resources section associated with he]
"TED talk"
Writing systems
The Sora language is written using multiple systems. The Sora Sompeng script was developed in 1936 by Mangei Gomango as a native writing system created for the Sora language.
Sora is also written in the Latin script, in the Odia alphabet
The Odia script (, also ) is a Brahmic script used to write the Odia language. To a lesser extent, it is also used to write Sanskrit and other regional languages. It is one of the official scripts of the Indian Republic. The script has dev ...
in Odisha, and in the Telugu script
Telugu script (), an abugida from the Brahmic family of scripts, is used to write the Telugu language, a Dravidian language spoken in the Indian states of Andhra Pradesh and Telangana as well as several other neighbouring states. It is one ...
in Andhra Pradesh.
Sample text
The following text is from Psalm 23 of the KJV
The King James Version (KJV), also the King James Bible (KJB) and the Authorized Version (AV), is an Early Modern English translation of the Christian Bible for the Church of England, which was commissioned in 1604 and published in 1611, by ...
-derived Sora Bible.
Media coverage
Sora was one of the subjects of Ironbound Films' 2008 American documentary film '' The Linguists'', in which two linguists attempted to document
A document is a writing, written, drawing, drawn, presented, or memorialized representation of thought, often the manifestation of nonfiction, non-fictional, as well as fictional, content. The word originates from the Latin ', which denotes ...
several moribund languages.
Further reading
*Hammarström, Harald; Forkel, Robert; Haspelmath, Martin; Bank, Sebastian, eds. (2016)
"Sora"
'' Glottolog 2.7''. Jena: Max Planck Institute for the Science of Human History.
*Ramamurti, R. S. (1931). ''A Manual of the Sora (Savara) Language. Delhi: Mittal Publication.''
*Veṅkaṭarāmamūrti, G. (1986). ''Sora–English dictionary''. Delhi: Mittal Publication.
*Anderson, Gregory D.S (ed). 2008. ''The Munda languages''. Routledge Language Family Series 3.New York: Routledge. .
Notes
References
External links
Austroasiatic Languages: Munda and Mon–Khmer
{{Austro-Asiatic languages
Munda languages
Endangered languages of India