Collocation
In corpus linguistics, a collocation is a series of words or terms that co-occur more often than would be expected by chance. In phraseology, a collocation is a type of compositional phraseme, meaning that it can be understood from the words that make it up. This contrasts with an idiom, where the meaning of the whole cannot be inferred from its parts, and may be completely unrelated. There are about seven main types of collocations: adjective + noun, noun + noun (such as collective nouns), noun + verb, verb + noun, adverb + adjective, verb + prepositional phrase (phrasal verbs), and verb + adverb. Collocation extraction is a computational technique that finds collocations in a document or corpus, using various computational linguistics elements resembling data mining. Expanded definition: Collocations are partly or fully fixed expressions that become established through repeated context-dependent use. ...
|
Phraseme
A phraseme, also called a set phrase, fixed expression, multiword expression (in computational linguistics), or idiom, is a multi-word or multi-morphemic utterance whose components include at least one that is selectionally constrained or restricted by linguistic convention such that it is not freely chosen. In the most extreme cases, there are expressions such as ''X kicks the bucket'' ≈ ‘person X dies of natural causes, the speaker being flippant about X’s demise’ where the unit is selected as a whole to express a meaning that bears little or no relation to the meanings of its parts. All of the words in this expression are chosen restrictedly, as part of a chunk. At the other extreme, there are collocations such as ''stark naked'', ''hearty laugh'', or ''infinite patience'' where one of the words is chosen freely (''naked'', ''laugh'', and ''patience'', respectively) based on the meaning the speaker wishes to express while the choice of the other (intensifying) word ('' ...
|
Longman Dictionary of Contemporary English
The ''Longman Dictionary of Contemporary English'' (''LDOCE''), first published by Longman in 1978, is an advanced learner's dictionary, providing definitions using a restricted vocabulary, helping non-native English speakers understand meanings easily. It is available in four configurations:
* Printed book
* Premium online access
* Printed book plus premium online access
* Reduced online version with no access charge (called "free" but technically "gratis": the license is still proprietary)
The dictionary is currently in its sixth edition. The premium website was revised in 2014 and 2015. It now offers over a million corpus examples (exceeding the paper version's), and includes sound files for every word, 88,000 example sentences, and various tools for study, teaching, examinations and grammar. The 9000 Most Important English Words to Learn have been highlighted via the Longman Communication 9000. The free online version was updated in 2008 and offers search (with spelling ...
|
Collocation Extraction
Collocation extraction is the task of using a computer to extract collocations automatically from a corpus. The traditional method of performing collocation extraction is to find a formula, based on the statistical quantities of the words involved, that calculates a score for every word pair. Proposed formulas include mutual information, the t-test, the z-test, the chi-squared test, and the likelihood ratio. Within the area of corpus linguistics, collocation is defined as a sequence of words or terms which co-occur more often than would be expected by chance. 'Crystal clear', 'middle management', 'nuclear family', and 'cosmetic surgery' are examples of collocated pairs of words. Some words are often found together because they make up a compound noun, for example 'riding boots', 'motor cyclist', or 'collocation extraction' itself. See also
* Collocational restriction
* Collostructional analysis
* Compound noun, adjective and verb
* Phrasal verb
* Siamese twins (English language)
* Termi ...
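The scoring approach described in this entry can be sketched with pointwise mutual information (PMI), one member of the mutual-information family of measures. This is a minimal illustration, not a reference implementation: the function name `pmi_scores`, the `min_count` cutoff, and the toy sentence are all assumptions made for the example, and only adjacent word pairs are considered.

```python
import math
from collections import Counter


def pmi_scores(tokens, min_count=2):
    """Score adjacent word pairs by pointwise mutual information (in bits).

    Hypothetical helper for illustration; min_count is an assumed cutoff
    that drops rare pairs, whose PMI estimates are unreliable.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_unigrams = len(tokens)
    n_bigrams = max(len(tokens) - 1, 1)

    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue
        p_pair = count / n_bigrams
        p_w1 = unigrams[w1] / n_unigrams
        p_w2 = unigrams[w2] / n_unigrams
        # PMI compares the observed pair probability with the probability
        # expected if the two words occurred independently.
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))
    return scores


# Toy corpus (made up for this example).
tokens = ("the crystal clear water was crystal clear and "
          "the water was cold").split()
scores = pmi_scores(tokens)
for pair, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 2))
```

A higher score means the pair occurs together more often than its individual word frequencies would predict. On a real corpus, a frequency cutoff well above 2 is usually needed, since PMI tends to favour rare pairs.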
|
Phrasal Verb
In the traditional grammar of Modern English, a phrasal verb typically constitutes a single semantic unit consisting of a verb followed by a particle (e.g., ''turn down'', ''run into'', or ''sit up''), sometimes collocated with a preposition (e.g., ''get together with'', ''run out of'', or ''feed off of''). Phrasal verbs ordinarily cannot be understood based upon the meanings of the individual parts alone but must be considered as a whole: the meaning is non-compositional and thus unpredictable. Phrasal verbs are differentiated from other classifications of multi-word verbs and free combinations by the criteria of idiomaticity, replacement by a single verb, ''wh''-question formation and particle movement. Terminology: In 1900, Frederick Schmidt referred to particle verbs in the Middle English writings of Reginald Pecock as "phrasal verbs", though apparently without intending it as a technical term. The term was popularized by Logan Pearsall Smith in ''Words and Idioms'' (1925 ...
Keyword (linguistics)
In corpus linguistics, a key word is a word which occurs in a text more often than we would expect by chance alone. Key words are calculated by carrying out a statistical test (e.g., loglinear or chi-squared) which compares the word frequencies in a text against their expected frequencies derived from a much larger corpus, which acts as a reference for general language use. Keyness is then the quality a word or phrase has of being "key" in its context. Combinations of nouns with parts of speech that human readers would not likely notice, such as prepositions, time adverbs, and pronouns, can be a relevant part of keyness. Even separate pronouns can constitute keywords. Compare this with collocation, the quality linking two words or phrases usually assumed to be within a given span of each other. Keyness is a ''textual'' feature, not a language feature (so a word has keyness in a certain textual context but may well not have keyness in other contexts, whereas a node and colloca ...
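The comparison of a word's frequency in a text against a reference corpus can be sketched as follows. The entry names loglinear and chi-squared tests; this sketch swaps in the log-likelihood ratio statistic (G2) as one commonly used alternative, so the choice of statistic, the function name `log_likelihood_keyness`, and the sample counts are all assumptions made for illustration.

```python
import math


def log_likelihood_keyness(freq_text, size_text, freq_ref, size_ref):
    """Log-likelihood ratio (G2) for one word across two corpora.

    freq_text / size_text: word count and total tokens in the study text.
    freq_ref / size_ref: word count and total tokens in the reference corpus.
    Hypothetical helper; larger values mean a bigger frequency difference.
    """
    total = size_text + size_ref
    combined = freq_text + freq_ref
    # Expected counts if the word were equally frequent in both corpora.
    expected_text = size_text * combined / total
    expected_ref = size_ref * combined / total

    g2 = 0.0
    for observed, expected in ((freq_text, expected_text),
                               (freq_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * math.log(observed / expected)
    return 2 * g2


# Made-up counts: the word is five times more frequent in the study text.
overused = log_likelihood_keyness(50, 1_000, 100, 10_000)
# Same relative frequency in both corpora, so no keyness.
neutral = log_likelihood_keyness(10, 1_000, 100, 10_000)
print(round(overused, 2), round(neutral, 2))
```

The statistic only measures how strongly the two frequencies differ. Whether the word is over-used or under-used in the text has to be read from the relative frequencies themselves.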
|
Monolingual Learner's Dictionary
A monolingual learner's dictionary (MLD) is designed to meet the reference needs of people learning a foreign language. MLDs are based on the premise that language-learners should progress from a bilingual dictionary to a monolingual one as they become more proficient in their target language, but that general-purpose dictionaries (aimed at native speakers) are inappropriate for their needs. Dictionaries for learners include information on grammar, usage, common errors, collocation, and pragmatics, which is largely missing from standard dictionaries, because native speakers tend to know these aspects of language intuitively. And while the definitions in standard dictionaries are often written in difficult language, those in an MLD use a simple and accessible defining vocabulary. History of English language MLDs: The first English MLD, published in 1935, was the ''New Method English Dictionary'' by Michael West and James Endicott, a small dictionary using a restricted defining v ...
|
Specialized Dictionary
A specialized dictionary is a dictionary that covers a relatively restricted set of phenomena. The definitive book on the subject (Cowie 2009) includes chapters on some of the dictionaries included below:
* synonyms
* pronunciations
* names (place names and personal names)
* phrases and idioms
* dialect terms
* slang
* quotations
* etymologies
* rhymes
* lyrics
Dictionaries of idioms and slang are common in most cultures. Examples include (of French) the ''Dictionnaire des expressions et locutions'', edited by Alain Rey (Paris: Le Robert 2006), and (of English) Eric Partridge's ''Dictionary of Slang and Unconventional English'' (8th edition, London: Routledge 2002). In the area of language learning, there are specialized dictionaries for aspects of language which tend to be ordinary for mother-tongue speakers but may cause difficulty for learners. These include dictionaries of phrasal verbs, such as the ''Oxford Phrasal Verbs Dictionary'' (2nd edition, Oxford University Press: 2006), an ...
Idiom
An idiom is a phrase or expression that largely or exclusively carries a figurative or non-literal meaning, rather than making any literal sense. Categorized as formulaic language, an idiomatic expression's meaning is different from the literal meanings of each word inside it. Idioms occur frequently in all languages. In English alone there are an estimated twenty-five thousand idiomatic expressions. Some well-known idioms in English are "spill the beans" (meaning "reveal secret information"), "it's raining cats and dogs" (meaning "it's raining intensely"), and "break a leg" (meaning "good luck"). Derivations: Many idiomatic expressions were meant literally in their original use, but occasionally the attribution of the literal meaning changed and the phrase itself grew away from its original roots, typically leading to a folk etymology. For instance, the ...
Macmillan English Dictionary for Advanced Learners
''Macmillan English Dictionary for Advanced Learners'', also known as ''MEDAL'', is an advanced learner's dictionary published from 2002 until 2023 by Macmillan Education. It shares most of the features of this type of dictionary: it provides definitions in simple language, using a controlled defining vocabulary; most words have example sentences to illustrate how they are typically used; and information is given about how words combine grammatically or in collocations. ''MEDAL'' also introduced a number of innovations. These include:
* "collocation boxes" giving lists of high-frequency collocates, identified using Sketch Engine software
* word frequency information, with the most frequent 7500 English words shown in red and categorised in three frequency bands, based on the idea, derived from Zipf's law, that a relatively small number of high-frequency words account for a high percentage of most texts
* "metaphor boxes", showing how the vocabulary used for expressing common c ...
|
Mutual Information
In probability theory and information theory, the mutual information (MI) of two random variables is a measure of the mutual dependence between the two variables. More specifically, it quantifies the "amount of information" (in units such as shannons (bits), nats or hartleys) obtained about one random variable by observing the other random variable. The concept of mutual information is intimately linked to that of the entropy of a random variable, a fundamental notion in information theory that quantifies the expected "amount of information" held in a random variable. Not limited to real-valued random variables and linear dependence like the correlation coefficient, MI is more general and determines how different the joint distribution of the pair (X,Y) is from the product of the marginal distributions of X and ...
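For discrete variables, the comparison between the joint distribution and the product of the marginals is the sum over all (x, y) of p(x,y) · log[ p(x,y) / (p(x) p(y)) ]. The sketch below evaluates that sum in bits (shannons). The function name `mutual_information` and the dictionary-based input format are assumptions chosen for the example, not part of any particular library.

```python
import math


def mutual_information(joint):
    """Mutual information, in bits, of a discrete joint distribution.

    joint: dict mapping (x, y) outcome pairs to probabilities that sum to 1.
    Hypothetical helper written for illustration.
    """
    # Marginal distributions, obtained by summing the joint over the other variable.
    p_x, p_y = {}, {}
    for (x, y), p in joint.items():
        p_x[x] = p_x.get(x, 0.0) + p
        p_y[y] = p_y.get(y, 0.0) + p

    # Zero-probability outcomes contribute nothing and are skipped.
    return sum(
        p * math.log2(p / (p_x[x] * p_y[y]))
        for (x, y), p in joint.items()
        if p > 0
    )


# Two independent fair coins: observing one says nothing about the other.
independent = {(0, 0): 0.25, (0, 1): 0.25, (1, 0): 0.25, (1, 1): 0.25}
# Two perfectly correlated fair coins: observing one reveals the other.
identical = {(0, 0): 0.5, (1, 1): 0.5}

print(mutual_information(independent))  # 0.0
print(mutual_information(identical))    # 1.0
```

The two cases mark the extremes for a pair of fair coins: zero bits when the joint distribution equals the product of the marginals, and one full bit when each variable determines the other.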
|
Lexicon
A lexicon (plural: lexicons, rarely lexica) is the vocabulary of a language or branch of knowledge (such as nautical or medical). In linguistics, a lexicon is a language's inventory of lexemes. The word ''lexicon'' derives from the Greek word λεξικόν (lexikon), neuter of λεξικός (lexikos) meaning 'of or for words'. Linguistic theories generally regard human languages as consisting of two parts: a lexicon, essentially a catalogue of a language's words (its wordstock); and a grammar, a system of rules which allow for the combination of those words into meaningful sentences. The lexicon is also thought to include bound morphemes, which cannot stand alone as words (such as most affixes). In some analyses, compound words and certain classes of idiomatic expressions, collocations and other phrasemes are also considered to be part of the lexicon. Dictionaries are lists of the lexicon, in alphabetical order, of a given language; usually, however, bound morphemes are not included. Size and organization: Items ...
|
Co-occurrence
In linguistics, co-occurrence or cooccurrence is an above-chance frequency of ordered occurrence of two adjacent terms in a text corpus. Co-occurrence in this linguistic sense can be interpreted as an indicator of semantic proximity or an idiomatic expression. Corpus linguistics and its statistical analyses reveal patterns of co-occurrence within a language and make it possible to work out typical collocations for its lexical items. A ''co-occurrence restriction'' is identified when linguistic elements never occur together. Analysis of these restrictions can lead to discoveries about the structure and development of a language. Co-occurrence can be seen as an extension of word counting in higher dimensions. Co-occurrence can be quantitatively described using measures such as correlation or mutual information. See also
* Distributional hypothesis
* Statistical semantics
* Idiom (language structure)
* Co-occurrence matrix
* Co-occurrence networks
* Similarity measure
* Dice ...
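The idea of co-occurrence as word counting in higher dimensions can be sketched by counting ordered pairs instead of single words. This is a minimal illustration under stated assumptions: the function name `cooccurrence_counts` and the `window` parameter are hypothetical, and `window=1` corresponds to the adjacent-term definition given in this entry, while larger windows are a common generalization that the entry itself does not describe.

```python
from collections import Counter


def cooccurrence_counts(tokens, window=1):
    """Count ordered (left, right) pairs where right follows left
    within `window` positions. Hypothetical helper for illustration.
    """
    counts = Counter()
    for i, left in enumerate(tokens):
        # Slicing past the end of the list is safe and simply yields fewer items.
        for right in tokens[i + 1 : i + 1 + window]:
            counts[(left, right)] += 1
    return counts


# Toy corpus (made up for this example).
tokens = "strong tea or strong coffee or strong tea".split()

adjacent = cooccurrence_counts(tokens)           # adjacent pairs only
wider = cooccurrence_counts(tokens, window=2)    # pairs up to two positions apart

print(adjacent.most_common(3))
```

Raw counts like these are only the starting point: deciding whether a pair occurs at an above-chance rate requires comparing the counts with the individual word frequencies, for example through a measure such as mutual information.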