HOME





Corpus Linguistics
Corpus linguistics is an empirical method for the study of language by way of a text corpus (plural ''corpora''). Corpora are balanced, often stratified collections of authentic, "real world", text of speech or writing that aim to represent a given linguistic variety. Today, corpora are generally machine-readable data collections. Corpus linguistics proposes that a reliable analysis of a language is more feasible with corpora collected in the field—the natural context ("realia") of that language—with minimal experimental interference. Large collections of text, though corpora may also be small in terms of running words, allow linguists to run quantitative analyses on linguistic concepts that may be difficult to test in a qualitative manner. The text-corpus method uses the body of texts in any natural language to derive the set of abstract rules which govern that language. Those results can be used to explore the relationships between that subject language and other languages ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Study Of Language
Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds and equivalent gestures in sign languages), phonology (the abstract sound system of a particular language, and analogous systems of sign languages), and pragmatics (how the context of use contributes to meaning). Subdisciplines such as biolinguistics (the study of the biological variables and evolution of language) and psycholinguistics (the study of psychological factors in human language) bridge many of these divisions. Linguistics encompasses Outline of linguistics, many branches and subfields that span both theoretical and practical applications. Theoretical linguistics is concerned with understanding the universal grammar, universal and Philosophy of language#Nature of language, fundamental nature of language and developing a general ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Henry Kučera
Henry Kučera (15 February 1925 – 20 February 2010), born Jindřich Kučera (), was a Czech-American linguist who pioneered corpus linguistics, linguistic software, a major contributor to the ''American Heritage Dictionary'', and a pioneer in the development of spell checking computer software. He is remembered in particular as one of the initiators of the Brown Corpus. Early life and education Kučera was born in Třebařov (between Pardubice and Olomouc) in Czechoslovakia and later moved with his family to Hodonín, where he studied. When the Communists came to power in February 1948, his studies in philosophy and linguistics at Charles University in the Czech capital of Prague were interrupted. He was forced to leave Czechoslovakia in April 1948 when it became clear that his political writings had placed him at risk of detention by the Communist authorities. Kučera then moved to Allied-occupied Germany where he worked under the supervision of the U.S. CIC (Counterin ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

New Zealand English
New Zealand English (NZE) is the variant of the English language spoken and written by most English-speaking New Zealanders. Its language code in ISO and Internet standards is en-NZ. It is the first language of the majority of the population. The English language was established in New Zealand by colonists during the 19th century. It is one of "the newest native-speaker variet esof the English language in existence, a variety which has developed and become distinctive only in the last 150 years". The variety of English that had the biggest influence on the development of New Zealand English was Australian English, itself derived from Southeastern England English, with considerable influence from Scottish and Hiberno-English, and with lesser influences the British prestige accent Received Pronunciation (RP) and American English. An important source of vocabulary is the Māori language of the indigenous people of New Zealand, whose contribution distinguishes New Zealand Eng ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Indian English
Indian English (IndE, IE) or English (India) is a group of English dialects spoken in the Republic of India and among the Indian diaspora and native to India. English is used by the Government of India for communication, and is enshrined in the Constitution of India. English is also an official language in seven states and seven union territories of India, and the additional official language in seven other states and one union territory. Furthermore, English is the sole official language of the Judiciary of India, unless the state governor or legislature mandates the use of a regional language, or if the President of India has given approval for the use of regional languages in courts. Before the dissolution of the British Empire on the Indian subcontinent, the term ''Indian English'' broadly referred to '' South Asian English'', also known as '' British Indian English''. Status After gaining independence from the British Raj in 1947, English remained an official lang ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


British English
British English is the set of Variety (linguistics), varieties of the English language native to the United Kingdom, especially Great Britain. More narrowly, it can refer specifically to the English language in England, or, more broadly, to the collective dialects of English throughout the United Kingdom taken as a single umbrella variety, for instance additionally incorporating Scottish English, Welsh English, and Northern Irish English. Tom McArthur (linguist), Tom McArthur in the Oxford English Dictionary, Oxford Guide to World English acknowledges that British English shares "all the ambiguities and tensions [with] the word 'British' and as a result can be used and interpreted in two ways, more broadly or more narrowly, within a range of blurring and ambiguity". Variations exist in formal (both written and spoken) English in the United Kingdom. For example, the adjective ''wee'' is almost exclusively used in parts of Scotland, north-east England, Northern Ireland, Ireland ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




LOB Corpus
The Lancaster-Oslo/Bergen (LOB) Corpus is a one-million-word collection of British English texts which was compiled in the 1970s in collaboration between the University of Lancaster, the University of Oslo, and the Norwegian Computing Centre for the Humanities, Bergen, to provide a British counterpart to the Brown Corpus compiled by Henry Kučera and W. Nelson Francis for American English in the 1960s. Its composition was designed to match the original Brown corpus in terms of its size and genres as closely as possible using documents published in the UK in 1961 by British authors. Both corpora consist of 500 samples each comprising about 2000 words in the following genres: The chief compilers of the LOB corpous were Geoffrey Leech (Lancaster University) and Stig Johansson (University of Oslo); see Leech & Johansson (2009). The corpus has been also tagged, i.e. part-of-speech In grammar, a part of speech or part-of-speech ( abbreviated as POS or PoS, also known as word class ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Bank Of English
The Bank of English (BoE) is a representative subset of the 4.5 billion words COBUILD corpus, a collection of English texts. These are mainly British in origin, but content from North America, Australia, New Zealand, South Africa and other Commonwealth countries is also being included. The majority of the texts are from written English, collected from websites, newspapers, magazines and books. There is also a large component of spoken data using material from radio, TV and informal conversations. The Bank of English totals 650 million running words. Copies of the corpus are held both at HarperCollins Publishers and the University of Birmingham. The version at Birmingham can be accessed for academic research. The Bank of English forms part of the ''Collins Word Web'' together with the French, German and Spanish corpora. See also * Corpus of Contemporary American English (COCA) * British National Corpus The British National Corpus (BNC) is a 100-million-word text corpus of sample ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

English Language Learning And Teaching
English-language learner (often abbreviated as ELL) is a term used in some English-speaking countries such as the United States and Canada to describe a person who is learning the English language and has a native language that is not English. Some educational advocates, especially in the United States, classify these students as non-native English speakers or emergent bilinguals. Various other terms are also used to refer to students who are not proficient in English, such as English as a second language (ESL), English as an additional language (EAL), limited English proficient (LEP), culturally and linguistically diverse (CLD), non-native English speaker, bilingual students, heritage language, emergent bilingual, and language-minority students. The legal term that is used in federal legislation is 'limited English proficient'. The models of instruction and assessment of students, their cultural background, and the attitudes of classroom teachers towards ELLs have all been foun ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Monolingual Learner's Dictionary
A monolingual learner's dictionary (MLD) is designed to meet the reference needs of people learning a foreign language. MLDs are based on the premise that language-learners should progress from a bilingual dictionary to a monolingual one as they become more proficient in their target language, but that general-purpose dictionaries (aimed at native speakers) are inappropriate for their needs. Dictionaries for learners include information on grammar, usage, common errors, collocation, and pragmatics, which is largely missing from standard dictionaries, because native speakers tend to know these aspects of language intuitively. And while the definitions in standard dictionaries are often written in difficult language, those in an MLD use a simple and accessible defining vocabulary. History of English language MLDs The first English MLD, published in 1935, was the ''New Method English Dictionary'' by Michael West and James Endicott, a small dictionary using a restricted defining v ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


COBUILD
COBUILD, an acronym for Collins Birmingham University International Language Database, is a British research facility set up at the University of Birmingham in 1980 and funded by Collins publishers. The facility was initially led by professor John Sinclair. The most important achievements of the COBUILD project have been the creation and analysis of an electronic corpus Corpus (plural ''corpora'') is Latin for "body". It may refer to: Linguistics * Text corpus, in linguistics, a large and structured set of texts * Speech corpus, in linguistics, a large set of speech audio files * Corpus linguistics, a branch of ... of contemporary text, the ''Collins Corpus'', later leading to the development of the Bank of English, and the production of the monolingual learner's dictionary ''Collins COBUILD English Language Dictionary'', based on the study of the COBUILD corpus and first published in 1987. A collection of other dictionaries and grammars have also been published, all based ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Dictionary
A dictionary is a listing of lexemes from the lexicon of one or more specific languages, often arranged Alphabetical order, alphabetically (or by Semitic root, consonantal root for Semitic languages or radical-and-stroke sorting, radical and stroke for Logogram, logographic languages), which may include information on definitions, usage, etymologies, pronunciations, Bilingual dictionary, translation, etc.Webster's New World College Dictionary, Fourth Edition, 2002 It is a Lexicography, lexicographical reference that shows inter-relationships among the data. A broad distinction is made between general and specialized dictionaries. Specialized dictionaries include words in specialist fields, rather than a comprehensive range of words in the language. Lexical items that describe concepts in specific fields are usually called terms instead of words, although there is no consensus whether lexicology and terminology are two different fields of study. In theory, general dictionarie ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Houghton-Mifflin
Houghton Mifflin Harcourt Company ( ; HMH) is an American publisher of textbooks, instructional technology materials, assessments, and reference works. The company is based in the Boston Financial District. It was formerly known as the Houghton Mifflin Company, but it changed its name following the 2007 acquisition of Harcourt Publishing. Prior to March 2010, it was a subsidiary of Education Media and Publishing Group Limited, an Irish-owned holding company registered in the Cayman Islands and formerly known as Riverdeep. In 2022, it was acquired by Veritas Capital, a New York-based private-equity firm. Company history In 1832, William Ticknor and John Allen purchased a bookselling business in Boston and began to involve themselves in publishing; James T. Fields joined as a partner in 1843. Fields and Ticknor gradually gathered an impressive list of writers, including Ralph Waldo Emerson, Nathaniel Hawthorne, and Henry David Thoreau. The duo formed a close relationship ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]