HOME

TheInfoList



OR:

''Ethnologue: Languages of the World'' is an annual reference publication in print and online that provides statistics and other information on the
living language A modern language is any human language that is currently in use as a native language. The term is used in language education to distinguish between languages which are used for day-to-day communication (such as French and German) and dead cla ...
s of the world. It is the world's most comprehensive catalogue of languages. It was first issued in 1951 and is now published by
SIL International SIL Global (formerly known as the Summer Institute of Linguistics International) is an evangelical Christian nonprofit organization whose main purpose is to study, develop and document languages, especially those that are lesser-known, to expan ...
, an American
evangelical Evangelicalism (), also called evangelical Christianity or evangelical Protestantism, is a worldwide, interdenominational movement within Protestantism, Protestant Christianity that emphasizes evangelism, or the preaching and spreading of th ...
Christian non-profit organization.


Overview and content

''Ethnologue'' has been published by SIL Global (formerly known as the Summer Institute of Linguistics), a Christian
linguistic Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds ...
service organization with an international office in
Dallas Dallas () is a city in the U.S. state of Texas and the most populous city in the Dallas–Fort Worth metroplex, the List of Texas metropolitan areas, most populous metropolitan area in Texas and the Metropolitan statistical area, fourth-most ...
, Texas. The organization studies numerous minority languages to facilitate language development, and to work with speakers of such language communities in translating portions of the Bible into their languages. Despite the Christian orientation of its publisher, ''Ethnologue'' is not ideologically or theologically biased. ''Ethnologue'' includes alternative names and autonyms, the number of L1 and L2 speakers, language prestige, domains of use, literacy rates, locations, dialects, language classification, linguistic affiliations, typology, language maps, country maps, publication and use in media, availability of the
Bible The Bible is a collection of religious texts that are central to Christianity and Judaism, and esteemed in other Abrahamic religions such as Islam. The Bible is an anthology (a compilation of texts of a variety of forms) originally writt ...
in each language and dialect described,
religious affiliations Religious identity is a specific type of identity formation. Particularly, it is the sense of group membership to a religion and the importance of this group membership as it pertains to one's self-concept. Religious identity is not necessarily the ...
of speakers, a cursory description of revitalization efforts where reported, intelligibility and
lexical similarity In linguistics, lexical similarity is a measure of the degree to which the word sets of two given languages are similar. A lexical similarity of 1 (or 100%) would mean a total overlap between vocabularies, whereas 0 means there are no common words. ...
with other dialects and languages, writing scripts, an estimate of language viability using the
Expanded Graded Intergenerational Disruption Scale The Expanded Graded Intergenerational Disruption Scale (EGIDS), developed by Lewis and Simons (2010) as an expansion of Joshua Fishman's GIDS, measures a language's status in terms of Language endangerment, endangerment or development. The table ...
(EGIDS), and bibliographic resources. Coverage varies depending on language. For instance, as of 2008, information on
word order In linguistics, word order (also known as linear order) is the order of the syntactic constituents of a language. Word order typology studies it from a cross-linguistic perspective, and examines how languages employ different orders. Correlatio ...
was present for 15% of entries while religious affiliations were mentioned for 38% of languages. According to
Lyle Campbell Lyle Richard Campbell (born October 22, 1942) is an American scholar and linguist known for his studies of indigenous American languages, especially those of Central America, and on historical linguistics in general. Campbell is professor emeri ...
"language maps are highly valuable" and most country maps are of high quality and user-friendly. ''Ethnologue'' gathers information from SIL's thousands of field linguists, surveys done by linguists and literacy specialists, observations of Bible translators, and
crowdsourced Crowdsourcing involves a large group of dispersed participants contributing or producing goods or services—including ideas, votes, micro-tasks, and finances—for payment or as volunteers. Contemporary crowdsourcing often involves digit ...
contributions. SIL's field linguists use an online collaborative research system to review current data, update it, or request its removal. SIL has a team of editors by geographical area who prepare reports to Ethnologue's general editor. These reports combine opinions from SIL area experts and feedback solicited from non-SIL linguists. Editors have to find compromises when opinions differ. Most of SIL's linguists have taken three to four semesters of graduate linguistics courses, and half of them have a master's degree. They're trained by 300 PhD linguists in SIL. The determination of what characteristics define a single language depends upon
sociolinguistic Sociolinguistics is the descriptive, scientific study of how language is shaped by, and used differently within, any given society. The field largely looks at how a language changes between distinct social groups, as well as how it varies unde ...
evaluation by various scholars; as the preface to ''Ethnologue'' states, "Not all scholars share the same set of criteria for what constitutes a 'language' and what features define a '
dialect A dialect is a Variety (linguistics), variety of language spoken by a particular group of people. This may include dominant and standard language, standardized varieties as well as Vernacular language, vernacular, unwritten, or non-standardize ...
'." The criteria used by ''Ethnologue'' are
mutual intelligibility In linguistics, mutual intelligibility is a relationship between different but related language varieties in which speakers of the different varieties can readily understand each other without prior familiarity or special effort. Mutual intelli ...
and the existence or absence of a common literature or ethnolinguistic identity. The number of languages identified has been steadily increasing, from 5,445 in the 10th edition (in 1984) to 6,909 in the 16th (in 2009), partly due to governments recognizing mutually intelligible varieties as separate languages, and partly due to SIL establishing new Bible translation needs. ''Ethnologue'' codes were used as the base to create the new
ISO 639-3 ISO 639-3:2007, ''Codes for the representation of names of languages – Part 3: Alpha-3 code for comprehensive coverage of languages'', is an international standard for language codes in the ISO 639 series. It defines three-letter codes for ...
international standard. Since 2007, ''Ethnologue'' relies only on this standard, administered by SIL International, to determine what is listed as a language. In addition to choosing a primary name for a language, ''Ethnologue'' provides listings of other name(s) for the language and any dialects that are used by its speakers, government, foreigners and neighbors. Also included are any names that have been commonly referenced historically, regardless of whether a name is considered official, politically correct or offensive; this allows more complete historical research to be done. These lists of names are not necessarily complete.


History

''Ethnologue'' was founded in 1951 by Richard S. Pittman and was initially focused on minority languages, to share information on Bible translation needs. The first edition included information on 46 languages. Hand-drawn maps were introduced in the fourth edition (1953). The seventh edition (1969) listed 4,493 languages. In 1971, ''Ethnologue'' expanded its coverage to all known languages of the world. The ''Ethnologue'' database was created in 1971 at the
University of Oklahoma The University of Oklahoma (OU) is a Public university, public research university in Norman, Oklahoma, United States. Founded in 1890, it had existed in Oklahoma Territory near Indian Territory for 17 years before the two territories became the ...
under a grant from the
National Science Foundation The U.S. National Science Foundation (NSF) is an Independent agencies of the United States government#Examples of independent agencies, independent agency of the Federal government of the United States, United States federal government that su ...
. In 1974 the database was moved to
Cornell University Cornell University is a Private university, private Ivy League research university based in Ithaca, New York, United States. The university was co-founded by American philanthropist Ezra Cornell and historian and educator Andrew Dickson W ...
. Since 2000, the database has been maintained by SIL International in their Dallas headquarters. In 1997 (13th edition), the website became the primary means of access. In 1984, ''Ethnologue'' released a three-letter coding system, called 'SIL code', to identify each language that it described. This set of codes significantly exceeded the scope of other existing standards, e.g.
ISO 639-1 ISO 639-1:2002, ''Codes for the representation of names of languages—Part 1: Alpha-2 code'', is the first part of the ISO 639 series of international standards for language codes. Part 1 covers the registration of "set 1" two-letter codes. The ...
and
ISO 639-2 ISO 639-2:1998, ''Codes for the representation of names of languages — Part 2: Alpha-3 code'', is the second part of the ISO 639 International standard, standard, which lists Language code, codes for the representation of the names of languages ...
. The 14th edition, published in 2000, included 7,148 language codes. In 2002, ''Ethnologue'' was asked to work with the
International Organization for Standardization The International Organization for Standardization (ISO ; ; ) is an independent, non-governmental, international standard development organization composed of representatives from the national standards organizations of member countries. M ...
(ISO) to integrate its codes into a draft international standard. ''Ethnologue'' codes have then been adopted by ISO as the international standard,
ISO 639-3 ISO 639-3:2007, ''Codes for the representation of names of languages – Part 3: Alpha-3 code for comprehensive coverage of languages'', is an international standard for language codes in the ISO 639 series. It defines three-letter codes for ...
. The 15th edition of ''Ethnologue'' was the first edition to use this standard. This standard is now administered separately from Ethnologue. SIL International is the
registration authority Registration authorities (RAs) exist for many standards organizations, such as ISO, the Object Management Group, W3C, and others. In general, registration authorities all perform a similar function, in promoting the use of a particular standard ...
for language names and codes, according to rules established by ISO. Since then ''Ethnologue'' relies on the standard to determine what is listed as a language. In only one case, ''Ethnologue'' and the ISO standards treat languages slightly differently. ISO 639-3 considers Akan to be a macrolanguage consisting of two distinct languages, Twi and Fante, whereas ''Ethnologue'' considers Twi and Fante to be dialects of a single language (Akan), since they are mutually intelligible. This anomaly resulted because the ISO 639-2 standard has separate codes for Twi and Fante, which have separate literary traditions, and all 639-2 codes for individual languages are automatically part of 639-3, even though 639-3 would not normally assign them separate codes. In 2014, with the 17th edition, ''Ethnologue'' introduced a numerical code for language status using a framework called EGIDS (Expanded Graded Intergenerational Disruption Scale), an elaboration of Fishman's GIDS ( Graded Intergenerational Disruption Scale). It ranks a language from 0 for an international language to 10 for an
extinct language An extinct language or dead language is a language with no living native speakers. A dormant language is a dead language that still serves as a symbol of ethnic identity to an ethnic group; these languages are often undergoing a process of r ...
, i.e. a language with which nobody retains a sense of ethnic identity. In 2015, SIL's funds decreased and in December 2015, ''Ethnologue'' launched a metered
paywall A paywall is a method of restricting access to content (media), content, with a purchase or a subscription business model, paid subscription, especially news. Beginning in the mid-2010s, newspapers started implementing paywalls on their website ...
to cover its cost, as it is financially self-sustaining. Users in high-income countries who wanted to refer to more than seven pages of data per month had to buy a paid subscription. The 18th edition released that year included a new section on
language policy Language policy is both an interdisciplinary academic field and implementation of ideas about language use. Some scholars such as Joshua Fishman and Ofelia García consider it as part of sociolinguistics. On the other hand, other scholars such as ...
country by country. In 2016, ''Ethnologue'' added data about
language planning In sociolinguistics, language planning (also known as language engineering) is a deliberate effort to influence the function, structure or acquisition of languages or language varieties within a speech community.Kaplan B., Robert, and Rich ...
agencies to the 19th edition. As of 2017, ''Ethnologue'''s 20th edition described 237
language families A language family is a group of languages related through descent from a common ancestor, called the proto-language of that family. The term ''family'' is a metaphor borrowed from biology, with the tree model used in historical linguistics ana ...
including 86 language isolates and six typological categories, namely
sign language Sign languages (also known as signed languages) are languages that use the visual-manual modality to convey meaning, instead of spoken words. Sign languages are expressed through manual articulation in combination with #Non-manual elements, no ...
s, creoles,
pidgin A pidgin , or pidgin language, is a grammatically simplified form of contact language that develops between two or more groups of people that do not have a language in common: typically, its vocabulary and grammar are limited and often drawn f ...
s,
mixed language A mixed language, also referred to as a hybrid language or fusion language, is a type of contact language that arises among a bilingual group combining aspects of two or more languages but not clearly deriving primarily from any single language. ...
s,
constructed language A constructed language (shortened to conlang) is a language whose phonology, grammar, orthography, and vocabulary, instead of having developed natural language, naturally, are consciously devised for some purpose, which may include being devise ...
s, and as yet
unclassified languages An unclassified language is a language whose genetic affiliation to other languages has not been established. Languages can be unclassified for a variety of reasons, mostly due to a lack of reliable data but sometimes due to the confounding inf ...
. The early focus of the Ethnologue was on native use (L1) but was gradually expanded to cover L2 use as well. In 2019, ''Ethnologue'' disabled trial views and introduced a hard paywall to cover its nearly $1 million in annual operating costs (website maintenance, security, researchers, and SIL's 5,000 field linguists). Subscriptions start at $480 per person per year, while full access costs $2,400 per person per year. Users in low and middle-income countries as defined by the
World Bank The World Bank is an international financial institution that provides loans and Grant (money), grants to the governments of Least developed countries, low- and Developing country, middle-income countries for the purposes of economic development ...
are eligible for free access and there are discounts for libraries and independent researchers. Subscribers are mostly institutions: 40% of the world's top 50 universities subscribe to ''Ethnologue'', and it is also sold to business intelligence firms and Fortune 500 companies. The introduction of the paywall was harshly criticized by the community of linguists who rely on ''Ethnologue'' to do their work and cannot afford the subscription The same year, ''Ethnologue'' launched its contributor program to fill gaps and improve accuracy, allowing contributors to submit corrections and additions and to get a complimentary access to the website. ''Ethnologue'''s editors gradually review crowdsourced contributions before publication. As 2019 was the International Year of Indigenous Languages, this edition focused on
language loss Language attrition is the process of decreasing proficiency in or losing a language. For first or native language attrition, this process is generally caused by both isolation from speakers of the first language ("L1") and the acquisition and use o ...
: it added the date when the last fluent speaker of the language died, standardized the age range of language users, and improved the EGIDS estimates. In 2020, the 23rd edition listed 7,117 living languages, an increase of 6 living languages from the 22nd edition. In this edition, ''Ethnologue'' expanded its coverage of immigrant languages: previous editions only had full entries for languages considered to be "established" within a country. From this edition, ''Ethnologue'' includes data about the first and second languages of
refugee A refugee, according to the United Nations High Commissioner for Refugees (UNHCR), is a person "forced to flee their own country and seek safety in another country. They are unable to return to their own country because of feared persecution as ...
s, temporary foreign workers and immigrants. In 2021, the 24th edition had 7,139 modern languages, an increase of 22 living languages from the 23rd edition. Editors especially improved data about
language shift Language shift, also known as language transfer, language replacement or language assimilation, is the process whereby a speech community shifts to a different language, usually over an extended period of time. Often, languages that are perceived ...
in this edition. In 2022, the 25th edition listed a total of 7,151 living languages, an increase of 12 living languages from the 24th edition. This edition specifically improved the use of languages in education. In 2023, the 26th edition listed a total of 7,168 living languages, an increase of 17 living languages from the 25th edition. In 2024, the 27th edition listed a total of 7,164 living languages, a decrease of 4 living languages from the 26th edition.


Reception, reliability, and use

In 1986, William Bright, then editor of the journal ''
Language Language is a structured system of communication that consists of grammar and vocabulary. It is the primary means by which humans convey meaning, both in spoken and signed language, signed forms, and may also be conveyed through writing syste ...
'', wrote of ''Ethnologue'' that it "is indispensable for any reference shelf on the languages of the world". The 2003 '' International Encyclopedia of Linguistics'' described ''Ethnologue'' as "a comprehensive listing of the world's languages, with genetic classification", and follows Ethnologue's classification. In 2005, linguists Lindsay J. Whaley and Lenore Grenoble considered that ''Ethnologue'' "continues to provide the most comprehensive and reliable count of numbers of speakers of the world's languages", still they recognize that "individual language surveys may have far more accurate counts for a specific language, but ''The Ethnologue'' is unique in bringing together speaker statistics on a global scale". In 2006, computational linguists John C. Paolillo and Anupam Das conducted a systematic evaluation of available information on language populations for the
UNESCO Institute for Statistics The UNESCO Institute for Statistics (UIS) is the statistical office of UNESCO and is the UN depository for cross-nationally comparable statistics on education, science and technology, culture, and communication. The UIS was established in 1999. ...
. They reported that ''Ethnologue'' and Linguasphere were the only comprehensive sources of information about language populations and that ''Ethnologue'' had more specific information. They concluded that: "the language statistics available today in the form of the ''Ethnologue'' population counts are already good enough to be useful" According to linguist William Poser, ''Ethnologue'' was, as of 2006, the "best single source of information" on language classification. In 2008 linguists
Lyle Campbell Lyle Richard Campbell (born October 22, 1942) is an American scholar and linguist known for his studies of indigenous American languages, especially those of Central America, and on historical linguistics in general. Campbell is professor emeri ...
and Verónica Grondona highly commended ''Ethnologue'' in ''Language''. They described it as a highly valuable catalogue of the world's languages that "has become the standard reference" and whose "usefulness is hard to overestimate". They concluded that ''Ethnologue'' was "truly excellent, highly valuable, and the very best book of its sort available." In a review of ''Ethnologue'''s 2009 edition in '' Ethnopolitics'', Richard O. Collin, professor of politics, noted that "''Ethnologue'' has become a standard resource for scholars in the other social sciences: anthropologists, economists, sociologists and, obviously, sociolinguists". According to Collin, ''Ethnologue'' is "stronger in languages spoken by indigenous peoples in economically less-developed portions of the world" and "when recent in-depth country-studies have been conducted, information can be very good; unfortunately ..data are sometimes old". In 2012, linguist Asya Pereltsvaig described ''Ethnologue'' as "a reasonably good source of thorough and reliable geographical and demographic information about the world's languages". She added in 2021 that its maps "are generally fairly accurate although they often depict the linguistic situation as it once was or as someone might imagine it to be but not as it actually is". Linguist George Tucker Childs wrote in 2012 that: "''Ethnologue'' is the most widely referenced source for information on languages of the world", but he added that regarding African languages, "when evaluated against recent field experience thnologueseems at least out of date". In 2014, ''Ethnologue'' admitted that some of its data was out-of-date and switched from a four-year publication cycle (in print and online) to yearly online updates. In 2017, Robert Phillipson and Tove Skutnabb-Kangas described ''Ethnologue'' as "the most comprehensive global source list for (mostly oral) languages". According to the 2018 ''Oxford Research Encyclopedia of Linguistics'', ''Ethnologue'' is a "comprehensive, frequently updated atabaseon languages and language families'. According to quantitative linguists Simon Greenhill, ''Ethnologue'' offers, as of 2018, "sufficiently accurate reflections of speaker population size". Linguists Lyle Campbell and Kenneth Lee Rehg wrote in 2018 that ''Ethnologue'' was "the best source that lists the non-endangered languages of the world". Lyle Campbell and Russell Barlow also noted that the 2017 edition of ''Ethnologue'' "improved tsclassification markedly". They note that ''Ethnologue'''s genealogy is similar to that of the
World Atlas of Language Structures The World Atlas of Language Structures (WALS) is a database of structural (phonological, grammatical, lexical) properties of languages gathered from descriptive materials. It was first published by Oxford University Press as a book with CD-RO ...
(WALS) but different from that of the Catalogue of Endangered Languages (ELCat) and Glottolog. Linguist Lisa Matthewson commented in 2020 that ''Ethnologue'' offers "accurate information about speaker numbers". In a 2021 review of ''Ethnologue'' and Glottolog, linguist Shobhana Chelliah noted that "For better or worse, the impact of the site is indeed considerable. ..Clearly, the site has influence on the field of linguistics and beyond." She added that she, among other linguists, integrated ''Ethnologue'' in her linguistics classes." The ''
Encyclopedia of Language and Linguistics The ''Encyclopedia of Language and Linguistics'', first published in 1994 (edited by Ronald E. Asher), with a 2nd edition in 2006 (edited by Keith Brown), is an encyclopedia of all matters related to language and linguistics. Reception The ''Jo ...
'' uses ''Ethnologue'' as its primary source for the list of languages and language maps. According to linguist Suzanne Romaine, ''Ethnologue'' is also the leading source for research on
language diversity Language is a structured system of communication that consists of grammar and vocabulary. It is the primary means by which humans convey meaning, both in spoken and signed forms, and may also be conveyed through writing. Human language is ch ...
. According to ''The Oxford Handbook of Language and Society'', ''Ethnologue'' is "the standard reference source for the listing and enumeration of Endangered Languages, and for all known and "living" languages of the world"." Similarly, linguist David Bradley describes ''Ethnologue'' as "the most comprehensive effort to document the level of endangerment in languages around the world." The US
National Science Foundation The U.S. National Science Foundation (NSF) is an Independent agencies of the United States government#Examples of independent agencies, independent agency of the Federal government of the United States, United States federal government that su ...
uses ''Ethnologue'' to determine which languages are endangered. According to Hammarström et al., ''Ethnologue'' is, as of 2022, one of the three global databases documenting language endangerment with the ''
Atlas of the World's Languages in Danger The UNESCO ''Atlas of the World's Languages in Danger'' was an online publication containing a comprehensive list of the world's endangered languages. It originally replaced the ''Red Book of Endangered Languages'' as a title in print after ...
'' and the Catalogue of Endangered Languages (ELCat). The University of Hawaii Kaipuleohone language archive uses ''Ethnologue'''s metadata as well. The ''
World Atlas of Language Structures The World Atlas of Language Structures (WALS) is a database of structural (phonological, grammatical, lexical) properties of languages gathered from descriptive materials. It was first published by Oxford University Press as a book with CD-RO ...
'' uses ''Ethnologue'''s genealogical classification. The Rosetta Project uses ''Ethnologue'''s language metadata. In 2005, linguist Harald Hammarström wrote that ''Ethnologue'' was consistent with specialist views most of the time and was a catalog "of very high absolute value and by far the best of its kind". In 2011, Hammarström created
Glottolog ''Glottolog'' is an open-access online bibliographic database of the world's languages. In addition to listing linguistic materials ( grammars, articles, dictionaries) describing individual languages, the database also contains the most up-to-d ...
in response to the lack of a comprehensive language bibliography, especially in ''Ethnologue''. In 2015, Hammarström reviewed the 16th, 17th, and 18th editions of ''Ethnologue'' and described the frequent lack of citations as its only "serious fault" from a scientific perspective. He concluded: "''Ethnologue'' is at present still better than any other nonderivative work of the same scope. tis an impressively comprehensive catalogue of world languages, and it is far superior to anything else produced prior to 2009. In particular, it is superior by virtue of being explicit." According to Hammarström, as of 2016, ''Ethnologue'' and Glottolog are the only global-scale continually maintained inventories of the world's languages. The main difference is that ''Ethnologue'' includes additional information (such as speaker numbers or vitality) but lacks systematic sources for the information given. In contrast, Glottolog provides no language context information but points to primary sources for further data. Contrary to ''Ethnologue'', Glottolog does not run its own surveys, but it uses ''Ethnologue'' as one of its primary sources. As of 2019, Hammarström uses ''Ethnologue'' in his articles, noting that it "has (unsourced, but) detailed information associated with each speech variety, such as speaker numbers and map location". In 2013, responding to feedback about the lack of references, ''Ethnologue'' added a link on each language to language resources from the Open Language Archives Community (OLAC) ''Ethnologue'' acknowledges that it rarely quotes any source verbatim but cites sources wherever specific statements are directly attributed to them, and corrects missing attributions upon notification. The website provides a list of all of the references cited. In her 2021 review, Shobhana Chelliah noted that Glottolog aims to be better than ''Ethnologue'' in language classification and genetic and areal relationships by using linguists' original sources.


Editions

Starting with the 17th edition, ''Ethnologue'' has been published every year, on
February 21 Events Pre-1600 * 452 or 453 – Severianus, Bishop of Scythopolis, is martyred in Palestine. * 1245 – Thomas, the first known Bishop of Finland, is granted resignation after confessing to torture and forgery. * 1440 – The ...
, which is International Mother Language Day.


References


Bibliography

*


External links

* {{Authority control 1951 non-fiction books 1952 non-fiction books 1953 non-fiction books 1958 non-fiction books 1965 non-fiction books 1969 non-fiction books 1974 non-fiction books 1978 non-fiction books 1984 non-fiction books 1988 non-fiction books 1992 non-fiction books 1996 non-fiction books 2000 non-fiction books 2005 non-fiction books 2009 non-fiction books 2013 non-fiction books Linguistics websites Academic works about linguistics * Linguistics databases SIL International books