Collocation

	Collocation In corpus linguistics, a collocation is a series of words or terms that co-occur more often than would be expected by chance. In phraseology, a collocation is a type of compositional phraseme, meaning that it can be understood from the words that make it up. This contrasts with an idiom, where the meaning of the whole cannot be inferred from its parts, and may be completely unrelated. An example of a phraseological collocation is the expression ''strong tea''. While the same meaning could be conveyed by the roughly equivalent ''powerful tea'', this adjective does not modify ''tea'' frequently enough for English speakers to become accustomed to its co-occurrence and regard it as idiomatic or unmarked. (By way of counterexample, ''powerful'' is idiomatically preferred to ''strong'' when modifying a ''computer'' or a ''car''.) There are about six main types of collocations: adjective + noun, noun + noun (such as collective nouns), verb + noun, ad ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Phraseme A phraseme, also called a set phrase, idiomatic phrase, multi-word expression (in computational linguistics), or idiom, is a multi-word or multi-morphemic utterance whose components include at least one that is selectionally constrained or restricted by linguistic convention such that it is not freely chosen. In the most extreme cases, there are expressions such as ''X kicks the bucket'' ≈ ‘person X dies of natural causes, the speaker being flippant about X’s demise’ where the unit is selected as a whole to express a meaning that bears little or no relation to the meanings of its parts. All of the words in this expression are chosen restrictedly, as part of a chunk. At the other extreme, there are collocations such as ''stark naked'', ''hearty laugh'', or ''infinite patience'' where one of the words is chosen freely (''naked'', ''laugh'', and ''patience'', respectively) based on the meaning the speaker wishes to express while the choice of the other (intensifying) word ('' ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Collocation Extraction Collocation extraction is the task of using a computer to extract collocations automatically from a corpus. The traditional method of performing collocation extraction is to find a formula based on the statistical quantities of those words to calculate a score associated to every word pairs. Proposed formulas are mutual information, t-test, z test, chi-squared test and likelihood ratio. Within the area of corpus linguistics, collocation is defined as a sequence of words or terms which co-occur more often than would be expected by chance. 'Crystal clear', 'middle management', 'nuclear family', and 'cosmetic surgery' are examples of collocated pairs of words. Some words are often found together because they make up a compound noun, for example 'riding boots' or 'motor cyclist'. See also * Collocational restriction * Collostructional analysis * Compound noun, adjective and verb Phrasal verb Siamese twins (English language) Terminology extraction Terminology extraction ( ... [...More Info...] [...Related Items...] OR:* [Wikipedia] [Google] [Baidu]
picture info	Phrasal Verb In the traditional grammar of Modern English, a phrasal verb typically constitutes a single semantic unit composed of a verb followed by a particle (examples: ''turn down'', ''run into'' or ''sit up''), sometimes combined with a preposition (examples: ''get together with'', ''run out of'' or ''feed off of''). Alternative terms include verb-adverb combination, verb-particle construction, two-part word/verb or three-part word/verb (depending on the number of particles) and multi-word verb. Phrasal verbs ordinarily cannot be understood based upon the meanings of the individual parts alone but must be considered as a whole: the meaning is non-compositional and thus unpredictable. Phrasal verbs are differentiated from other classifications of multi-word verbs and free combinations by criteria based on idiomaticity, replacement by a single-word verb, wh-question formation and particle movement. Types The category "phrasal verb" is mainly used in English as a second language te ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Idiom An idiom is a phrase or expression that typically presents a figurative, non-literal meaning attached to the phrase; but some phrases become figurative idioms while retaining the literal meaning of the phrase. Categorized as formulaic language, an idiom's figurative meaning is different from the literal meaning. Idioms occur frequently in all languages; in English alone there are an estimated twenty-five million idiomatic expressions. Derivations Many idiomatic expressions were meant literally in their original use, but sometimes the attribution of the literal meaning changed and the phrase itself grew away from its original roots—typically leading to a folk etymology. For instance, the phrase "spill the beans" (meaning to reveal a secret) is first attested in 1919, but has been said to originate from an ancient method of voting by depositing beans in jars, which could be spilled, prematurely revealing the results. Other idioms are deliberately figurative. For example, " br ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Macmillan English Dictionary For Advanced Learners ''Macmillan English Dictionary for Advanced Learners'', also known as ''MEDAL'', was first published in 2002 by Macmillan Education. ''MEDAL'' is an advanced learner’s dictionary and shares most of the features of this type of dictionary: it provides definitions in simple language, using a controlled defining vocabulary; most words have example sentences to illustrate how they are typically used; and information is given about how words combine grammatically or in collocations. ''MEDAL'' also introduced a number of innovations. These include: * ‘collocation boxes’ giving lists of high-frequency collocates, identified using Sketch Engine software * word frequency information, with the most frequent 7500 English words shown in red and categorised in three frequency bands, based on the idea, derived from Zipf's law, that a relatively small number of high-frequency words account for a high percentage of most texts * ‘metaphor boxes’, showing how the vocabulary used for expr ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Monolingual Learner's Dictionary A monolingual learner's dictionary (MLD) is designed to meet the reference needs of people learning a foreign language. MLDs are based on the premise that language-learners should progress from a bilingual dictionary to a monolingual one as they become more proficient in their target language, but that general-purpose dictionaries (aimed at native speakers) are inappropriate for their needs. Dictionaries for learners include information on grammar, usage, common errors, collocation, and pragmatics, which is largely missing from standard dictionaries, because native speakers tend to know these aspects of language intuitively. And while the definitions in standard dictionaries are often written in difficult language, those in an MLD use a simple and accessible defining vocabulary. History of English language MLDs The first English MLD, published in 1935, was the ''New Method English Dictionary'' by Michael West and James Endicott, a small dictionary using a restricted defining v ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Keyword (linguistics) In corpus linguistics a key word is a word which occurs in a text more often than we would expect to occur by chance alone.Scott, M. & Tribble, C., 2006, ''Textual Patterns: keyword and corpus analysis in language education'', Amsterdam: Benjamins, 55. Key words are calculated by carrying out a statistical test A statistical hypothesis test is a method of statistical inference used to decide whether the data at hand sufficiently support a particular hypothesis. Hypothesis testing allows us to make probabilistic statements about population parameters. ... (e.g., loglinear or chi-squared) which compares the word frequencies in a text against their expected frequencies derived in a much larger corpus, which acts as a reference for general language use. Keyness is then the quality a word or phrase has of being "key" in its context. Compare this with collocation, the quality linking two words or phrases usually assumed to be within a given span of each other. Keyness is a ''tex ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Mutual Information In probability theory and information theory, the mutual information (MI) of two random variables is a measure of the mutual dependence between the two variables. More specifically, it quantifies the " amount of information" (in units such as shannons (bits), nats or hartleys) obtained about one random variable by observing the other random variable. The concept of mutual information is intimately linked to that of entropy of a random variable, a fundamental notion in information theory that quantifies the expected "amount of information" held in a random variable. Not limited to real-valued random variables and linear dependence like the correlation coefficient, MI is more general and determines how different the joint distribution of the pair (X,Y) is from the product of the marginal distributions of X and Y. MI is the expected value of the pointwise mutual information (PMI). The quantity was defined and analyzed by Claude Shannon in his landmark paper "A Mathemati ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Lexicon A lexicon is the vocabulary of a language or branch of knowledge (such as nautical or medical). In linguistics, a lexicon is a language's inventory of lexemes. The word ''lexicon'' derives from Greek word (), neuter of () meaning 'of or for words'. Linguistic theories generally regard human languages as consisting of two parts: a lexicon, essentially a catalogue of a language's words (its wordstock); and a grammar, a system of rules which allow for the combination of those words into meaningful sentences. The lexicon is also thought to include bound morphemes, which cannot stand alone as words (such as most affixes). In some analyses, compound words and certain classes of idiomatic expressions, collocations and other phrases are also considered to be part of the lexicon. Dictionaries are lists of the lexicon, in alphabetical order, of a given language; usually, however, bound morphemes are not included. Size and organization Items in the lexicon are called lexemes, l ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Co-occurrence In linguistics, co-occurrence or cooccurrence is an above-chance frequency of occurrence of two terms (also known as coincidence or concurrence) from a text corpus alongside each other in a certain order. Co-occurrence in this linguistic sense can be interpreted as an indicator of semantic proximity or an idiomatic expression. Corpus linguistics and its statistic analyses reveal patterns of co-occurrences within a language and enable to work out typical collocations for its lexical items. A ''co-occurrence restriction'' is identified when linguistic elements never occur together. Analysis of these restrictions can lead to discoveries about the structure and development of a language. Co-occurrence can be seen an extension of word counting in higher dimensions. Co-occurrence can be quantitatively described using measures like correlation or mutual information. See also * Distributional hypothesis * Statistical semantics * Co-occurrence matrix * Co-occurrence networks Co-oc ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Collective Nouns In linguistics, a collective noun is a word referring to a collection of things taken as a whole. Most collective nouns in everyday speech are not specific to one kind of thing. For example, the collective noun "group" can be applied to people ("a group of people"), or dogs ("a group of dogs"), or objects ("a group of stones"). Some collective nouns are specific to one kind of thing, especially terms of venery, which identify groups of specific animals. For example, "pride" as a term of venery always refers to lions, never to dogs or cows. Other examples come from popular culture such as a group of owls, which is called a "parliament". Different forms of English handle verb agreement with collective count nouns differently. For example, users of British English generally accept that collective nouns take either singular or plural verb forms depending on context and the metonymic shift that it implies. Derivation Morphological derivation accounts for many collective words and v ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Statistical Significance In statistical hypothesis testing, a result has statistical significance when it is very unlikely to have occurred given the null hypothesis (simply by chance alone). More precisely, a study's defined significance level, denoted by \alpha, is the probability of the study rejecting the null hypothesis, given that the null hypothesis is true; and the ''p''-value of a result, ''p'', is the probability of obtaining a result at least as extreme, given that the null hypothesis is true. The result is statistically significant, by the standards of the study, when p \le \alpha. The significance level for a study is chosen before data collection, and is typically set to 5% or much lower—depending on the field of study. In any experiment or observation that involves drawing a sample from a population, there is always the possibility that an observed effect would have occurred due to sampling error alone. But if the ''p''-value of an observed effect is less than (or equal to) the significa ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]