An unclassified language is a language whose
genetic affiliation to other languages has not been established. Languages can be unclassified for a variety of reasons, mostly due to a lack of reliable data but sometimes due to the confounding influence of
language contact
Language contact occurs when speakers of two or more languages or varieties interact with and influence each other. The study of language contact is called contact linguistics. Language contact can occur at language borders, between adstratum ...
, if different layers of its vocabulary or morphology point in different directions and it is not clear which represents the ancestral form of the language. Some poorly known
extinct languages, such as
Gutian and
Cacán, are simply unclassifiable, and it is unlikely the situation will ever change.
A supposedly unclassified language may turn out not to be a language at all, or even a distinct dialect, but merely a family, tribal or village name, or an alternative name for a people or language that is classified.
If a language's genetic relationship has not been established after significant documentation of the language and comparison with other languages and families, as in the case of
Basque
Basque may refer to:
* Basques, an ethnic group of Spain and France
* Basque language, their language
Places
* Basque Country (greater region), the homeland of the Basque people with parts in both Spain and France
* Basque Country (autonomous co ...
in Europe, it is considered a
language isolate – that is, it is classified as a language family of its own. An 'unclassified' language therefore is one which may still turn out to belong to an established family once better data is available or more thorough comparative research is done. Extinct unclassified languages for which little evidence has been preserved are likely to remain in limbo indefinitely, unless lost documents or a surviving speaking population are discovered.
Classification challenges
An example of a language that has caused multiple problems for classification is
Mimi of Decorse in
Chad
Chad, officially the Republic of Chad, is a landlocked country at the crossroads of North Africa, North and Central Africa. It is bordered by Libya to Chad–Libya border, the north, Sudan to Chad–Sudan border, the east, the Central Afric ...
. This language is only attested in a single word list collected ca. 1900. At first it was thought to be a
Maban language, because of similarities to
Maba, the first Maban language to be described. However, as other languages of the Maban family were described, it became clear that the similarities were solely with Maba itself, and the relationship was too distant for Mimi to be related specifically to Maba and not equally to the other Maban languages. The obvious similarities are therefore now thought to be due to borrowings from Maba, which is the socially dominant language in the area. When such loans are discounted, there is much less data to classify Mimi with, and what does remain is not particularly similar to any other language or language family. Mimi might therefore be a language isolate, or perhaps a member of some other family related to Maban in the proposed but as yet undemonstrated
Nilo-Saharan
The Nilo-Saharan languages are a proposed family of around 210 African languages spoken by somewhere around 70 million speakers, mainly in the upper parts of the Chari and Nile rivers, including historic Nubia, north of where the two tributari ...
phylum. It would be easier to address the problem with better data, but no-one has been able to find speakers of the language again.
It also happens that a language may be unclassified within an established family. That is, it may be obvious that it is, say, a
Malayo-Polynesian language, but not clear in which branch of Malayo-Polynesian it belongs. When a family consists of many similar languages with great degree of confusing contact, a large number of languages may be effectively unclassified in this manner. Families where this is a substantial problem include Malayo-Polynesian,
Bantu,
Pama–Nyungan, and
Arawakan.
Examples by reason
There are hundreds of unclassified languages, most of them extinct, although there are some, albeit relatively few, that are still spoken; in the following list, the extinct languages are labeled with a dagger ().
Absence of data
Some languages are unclassifiable, not just unclassified, because while there may be record of a language existing there may not be enough materials in it to analyze and classify, especially with now-extinct languages. (See, for example,
list of unclassified languages of South America.) Unclassifiable languages with an absence of data include:
*
Sentinelese
The Sentinelese, also known as the Sentineli and the North Sentinel Islanders, are Indigenous people who inhabit North Sentinel Island in the Bay of Bengal in the northeastern Indian Ocean. Designated a particularly vulnerable tribal group a ...
(Andaman Islands, specifically
North Sentinel Island
North Sentinel Island is one of the Andaman Islands, an Indian archipelago in the Bay of Bengal which also includes South Sentinel Island. The island is a protected area of India. It is home to the Sentinelese, an indigenous tribe in volunta ...
) – a living presumed language of an uncontacted people; assumed to be
Ongan
*
Weyto (Ethiopia) – speculated to have been
Agaw
*
Nam (Chinese–Tibetan border) – data remains undeciphered; probably
Sino-Tibetan
*
Harappan † (
Indus Valley civilisation
The Indus Valley Civilisation (IVC), also known as the Indus Civilisation, was a Bronze Age civilisation in the Northwestern South Asia, northwestern regions of South Asia, lasting from 3300 Common Era, BCE to 1300 BCE, and in i ...
33rd–13th centuries BC) – data remains undeciphered
*
Cypro-Minoan † (Cyprus 15th–10th centuries BC) – data remains undeciphered
*
Lullubi
Lullubi, Lulubi (: ''Lu-lu-bi'', : ''Lu-lu-biki'' "Country of the Lullubi"), more commonly known as Lullu, were a group of Bronze Age tribes of Hurrian and Semitic languages, Semitic origin who existed and disappeared during the 3rd millennium BC ...
(Iran)
* (Tanzania)
*
Guale –
Yamasee (US)
*
Himarimã (Brazil) – a living presumed language of an uncontacted people; assumed to be
Arawan
*
Nagarchal (India) – assumed to have been
Dravidian
*
Kwisi (Angola)
*
Ancient Cappadocian (Asia Minor) – possibly
Anatolian
*
Lycaonian (Asia Minor) – possibly
Anatolian
*
Zapotec (Jalisco) (Mexico)
*
Otomi (Jalisco) (Mexico)
*
Moksela (Indonesia) – possibly has been one of the
Central Maluku languages
Scarcity of data
Many unclassified languages are also considered unclassifiable due to the presence of some, but not enough, data to reveal close language relatives. For others there may be enough data to show the language belongs to a particular family, but not where within it, or to show the language has no close relatives, but not enough to conclude that it is a language isolate.
*
Solano (Mexico) – possibly a
language isolate
*
Cacán (Argentina)
*
Kujargé (Chad) – possibly
Afroasiatic
*
Bung (Cameroon) – most likely
Niger–Congo
*
Luo (Cameroon)
*
Komta (Nigeria)
*
Wawu (Ghana or possibly the Ivory Coast)
*
Kambojan (South Asia and Central Asia)
* (West Africa)
*
Dima-Bottego (Ethiopia)
*
Philistine
Philistines (; Septuagint, LXX: ; ) were ancient people who lived on the south coast of Canaan during the Iron Age in a confederation of city-states generally referred to as Philistia.
There is compelling evidence to suggest that the Philist ...
(Israel) – perhaps either
Afroasiatic or
Indo-European
The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
*
Iberian (Spain and southern France)
*
Minoan (ancient Crete)
*
Eteocretan (ancient Crete)
*
Hattic (Anatolia) – probably a
language isolate
*
Kaskian (Anatolia) – possibly related to Hattic
*
Kassite (Iraq) – possibly
Hurro-Urartian
*
Gutian (Zagros borderlands)
*
Hunnic (Eastern Europe and Central Asia)
*
Xiongnu
The Xiongnu (, ) were a tribal confederation of Nomad, nomadic peoples who, according to ancient Chinese historiography, Chinese sources, inhabited the eastern Eurasian Steppe from the 3rd century BC to the late 1st century AD. Modu Chanyu, t ...
(Mongolia) – possibly
Para-Yeniseian or an isolate
*
Tuoba
The Tuoba (Chinese language, Chinese) or Tabgatch (, ''Tabγač''), also known by #Names, other names, was an influential Xianbei clan in early imperial China. During the Sixteen Kingdoms after the fall of Han and the Three Kingdoms, the Tuoba e ...
(China) – possibly
Para-Mongolic
Para-Mongolic is a proposed group of languages that is considered to be an extinct sister branch of the Mongolic languages. Para-Mongolic contains certain historically attested extinct languages, among them Khitan language, Khitan and Tuyuhun lang ...
or an isolate
*
Rouran
The Rouran Khaganate ( Chinese: zh, c=, p=Róurán, label=no), also known as Ruanruan or Juan-juan ( zh, c=, p=Ruǎnruǎn, label=no) (or variously ''Jou-jan'', ''Ruruan'', ''Ju-juan'', ''Ruru'', ''Ruirui'', ''Rouru'', ''Rouruan'' or ''Tantan'') ...
(Mongolia) – possibly
Para-Mongolic
Para-Mongolic is a proposed group of languages that is considered to be an extinct sister branch of the Mongolic languages. Para-Mongolic contains certain historically attested extinct languages, among them Khitan language, Khitan and Tuyuhun lang ...
or an isolate
*
Beothuk (Newfoundland) – assumed to have been related to
Algonquian languages
The Algonquian languages ( ; also Algonkian) are a family of Indigenous languages of the Americas and most of the languages in the Algic language family are included in the group. The name of the Algonquian language family is distinguished from ...
*
Meroitic (Sudan) – possibly
Nilo-Saharan
The Nilo-Saharan languages are a proposed family of around 210 African languages spoken by somewhere around 70 million speakers, mainly in the upper parts of the Chari and Nile rivers, including historic Nubia, north of where the two tributari ...
or
Afroasiatic
*
Guanahatabey (Cuba) – presumed to have been related to
Warao (
Waroid)
*
Macorix (Dominican Republic and possibly Haiti) – presumed to have been related to
Warao (
Waroid)
*
Pankararú (Brazil) – likely a
language isolate
*
Ramanos (Bolivia)
*
Tartessian (southwest Iberian Peninsula)
*
Ligurian (ancient) (Liguria) – probably
Indo-European
The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
*
Rutulian (central Italy)
*
Elymian (western Sicily) – likely
Indo-European
The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia (e. ...
*
Sicanian (central Sicily)
*
Eteocypriot (Cyprus)
*
Tambora (Indonesia) – possibly a
language isolate
*
Karami (Papua New Guinea)
*
Makolkol (New Britain)
Unrelated to nearby languages and not commonly examined
*
Bangime (Mali)
*
Jalaa (Nigeria)
*
Kwaza (Brazil)
*
Xocó (Brazil) – not clear if it was a single language
*
Mpre
The Multistate Professional Responsibility Examination (MPRE) is a 120-minute, 60-question, multiple-choice examination designed to measure the knowledge and understanding of established standards related to a lawyer's professional conduct. It w ...
(Ghana)
Basic vocabulary unrelated to other languages
*
Bayot (Senegal)
*
Laal (Chad)
Not closely related to other languages and no academic consensus
*
Ongota (Ethiopia)
*
Shabo (Ethiopia)
*
Omaio (Tanzania)
*
Kenaboi (Malaysia)
Languages of dubious existence
*
Oropom (Uganda) (extinct, if it existed)
*
Imeraguen (Mauritania) (
Hassaniya Arabic variety with
Berber
Berber or Berbers may refer to:
Ethnic group
* Berbers, an ethnic group native to Northern Africa
* Berber languages, a family of Afro-Asiatic languages
Places
* Berber, Sudan, a town on the Nile
People with the surname
* Ady Berber (1913–196 ...
words for fishing)
*
Nemadi (Mauritania)
*
Rer Bare (Ethiopia) (extinct, if it existed)
*
Wutana (Nigeria) (extinct, if it existed)
*
Trojan (Anatolia) (extinct and as yet unattested, if it existed; possibly a
Luwian dialect or related language)
*
North Picene (Italy) (extinct, if it existed; attested in inscriptions that have been accused of being fabricated)
*
Quimbaya (Colombia) (extinct, if it existed; only one known word)
Some 'languages' turn out to be fabricated, such as the
Kukurá language of Brazil.
See also
*
:Unclassified languages
*
List of unclassified languages according to the Ethnologue
*
List of unclassified languages of North America
*
List of unclassified languages of South America
*
Language isolate
*
List of language families (including isolates and unclassified languages)
Notes
References
External links
*
Ethnologue: Unclassified languages
{{language families