HOME

TheInfoList



OR:

In
linguistics Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds ...
, lexical similarity is a measure of the degree to which the word sets of two given
language Language is a structured system of communication that consists of grammar and vocabulary. It is the primary means by which humans convey meaning, both in spoken and signed language, signed forms, and may also be conveyed through writing syste ...
s are similar. A lexical similarity of 1 (or 100%) would mean a total overlap between vocabularies, whereas 0 means there are no common words. There are different ways to define the lexical similarity and the results vary accordingly. For example, ''
Ethnologue ''Ethnologue: Languages of the World'' is an annual reference publication in print and online that provides statistics and other information on the living languages of the world. It is the world's most comprehensive catalogue of languages. It w ...
''s method of calculation consists of comparing a regionally standardized wordlist (comparable to the Swadesh list) and counting those forms that show similarity in both form and meaning. Using such a method, English was evaluated to have a lexical similarity of 60% with German and 27% with French. Lexical similarity can be used to evaluate the degree of genetic relationship between two languages. Percentages higher than 85% usually indicate that the two languages being compared are likely to be related
dialect A dialect is a Variety (linguistics), variety of language spoken by a particular group of people. This may include dominant and standard language, standardized varieties as well as Vernacular language, vernacular, unwritten, or non-standardize ...
s. The lexical similarity is only one indication of the
mutual intelligibility In linguistics, mutual intelligibility is a relationship between different but related language varieties in which speakers of the different varieties can readily understand each other without prior familiarity or special effort. Mutual intelli ...
of the two languages, since the latter also depends on the degree of phonetical, morphological, and syntactical similarity. The variations due to differing wordlists weigh on this. For example, the lexical similarity between French and English is considerable in lexical fields relating to culture, whereas their similarity is smaller as far as basic (function) words are concerned. Unlike mutual intelligibility, lexical similarity can only be symmetrical.


East Asian languages

There are words borrowed from Chinese which are called ''Sino-Korean'' vocabulary, and there are new Korean words created from
Chinese characters Chinese characters are logographs used Written Chinese, to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represe ...
, and there are also words borrowed from
Sino-Japanese vocabulary Sino-Japanese vocabulary, also known as , is a subset of Japanese vocabulary that originated in Chinese language, Chinese or was created from elements borrowed from Chinese. Most Sino-Japanese words were borrowed in the 5th–9th centuries AD, from ...
. According to the '' Standard Korean Language Dictionary'' published by the National Institute of Korean Language (NIKL), Sino-Korean represents approximately 57% of the Korean vocabulary. As for Japanese, it has been estimated that about 60% of the words contained in modern Japanese dictionaries are ''Sino-Japanese'', and that about 18–20% of words used in common speech are Sino-Japanese, as measured by the National Institute for Japanese Language in its study of language use in
NHK , also known by its Romanization of Japanese, romanized initialism NHK, is a Japanese public broadcasting, public broadcaster. It is a statutory corporation funded by viewers' payments of a television licence, television license fee. NHK ope ...
broadcasts from April to June 1989.Kokuritsu Kokugo Kenkyuujo, "Terebi Hoosoo no Goi Choosa 1" (1995, Shuuei Publishing) The usage of such Sino-Japanese words also increase in formal or literary contexts, and in expressions of abstract or complex ideas. Despite the borrowing of many Chinese words into the Japanese and Korean languages, speakers of the three languages do not have enough mutual intelligibility to be able to communicate with each other. Japanese and Korean aren't tonal languages, but Chinese languages are tonal, which means that the proper pronunciation of a syllable for a word is important for communication, as well as the proper tone when pronouncing a word. When Chinese symbols (Hanzi) are used for writing in Korean (which are called "Hanja") and in Japanese (which are called "Kanji"), sometimes a few words can be understood in a sentence, but an entire sentence is highly unlikely to be understood even in writing. Japanese and Korean have their writing systems which are different from Hanzi, so entirely sentences aren't likely to be fully written in borrowed Chinese symbols.


Indo-European languages

The table below shows some lexical similarity values for pairs of selected Romance, Germanic, and Slavic languages, as collected and published by ''
Ethnologue ''Ethnologue: Languages of the World'' is an annual reference publication in print and online that provides statistics and other information on the living languages of the world. It is the world's most comprehensive catalogue of languages. It w ...
''. ''Notes:'' *Language codes are from standard ISO 639-3. *Roberto Bolognesi and Wilbert Heeringa found the average divergence between Sardinian and Italian to be around 48.7%, ranging from a minimum dialectal degree of divergence being 46.6% to the highest one of 51.1%. That would make the various dialects of Sardinian slightly more divergent from Italian than Spanish (with an average degree of divergence from Italian being around 46.0%) is. *"-" denotes that comparison data are not available. *In the case of English-French lexical similarity, at least two other studies estimate the number of English words directly inherited from French at 28.3% and 41% respectively, with respectively 28.24% and 15% of other English words derived from Latin, putting English-French lexical similarity at around 0.56, with reciprocally lower English-German lexical similarities. Another study estimates the number of English words with an Italic origin at 51%, consistent with the two previous analyses.{{cite book , last=Nation , first=I.S.P. , authorlink1= Paul Nation , date=2001, title=Learning Vocabulary in Another Language, publisher=Cambridge University Press, url=https://books.google.com/books?id=sKqx8k8gYTkC, page=477, isbn=0-521-80498-1


See also

*
Lexis (linguistics) In linguistics, the term lexis (from 'word') designates the complete set of all possible words in a language, or a particular subset of words that are grouped by some specific linguistic criteria. For example, the general term ''English lexi ...
*
Language family A language family is a group of languages related through descent from a common ancestor, called the proto-language of that family. The term ''family'' is a metaphor borrowed from biology, with the tree model used in historical linguistics ...
*
Dialect A dialect is a Variety (linguistics), variety of language spoken by a particular group of people. This may include dominant and standard language, standardized varieties as well as Vernacular language, vernacular, unwritten, or non-standardize ...
* Linguistic distance


References


''Ethnologue.com''
(lexical similarity values available at some of the individual language entries)
Definition of lexical similarity at ''Ethnologue.com''
*Rensch, Calvin R. 1992. "Calculating lexical similarity." In Eugene H. Casad (ed.), ''Windows on bilingualism '', 13-15. (Summer Institute of Linguistics and the University of Texas at Arlington Publications in Linguistics, 110). Dallas: Summer Institute of Linguistics and the University of Texas at Arlington.


Notes


External links


Most similar languages

A Similarity Database of Modern Lexicons
Lexical similarity of 331 languages Language comparison