HOME





ICAME
The International Computer Archive of Modern and Medieval English (ICAME) is an international group of linguists and data scientists working in corpus linguistics to digitise English texts. The organisation was founded in Oslo, Norway in 1977 as the International Computer Archive of Modern English, before being renamed to its current title. The portal to their materials is hosted at the University of Bergen, where they have set out the aim of the organization to "collect and distribute information on English language material available for computer processing and on linguistic research to compile an archive of English text corpora in machine-readable form, and to make material available to research institutions." Creating computer corpora, i.e. collections of texts in machine-readable form, is the most accessible way to study both transcribed spoken language and various genres of written texts for modern scholars, including both "descriptive and more theoretically-minded linguists". ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Geoffrey Leech
Geoffrey Neil Leech FBA (16 January 1936 – 19 August 2014) was a specialist in English language and linguistics. He was the author, co-author, or editor of over 30 books and over 120 published papers. His main academic interests were English grammar, corpus linguistics, stylistics, pragmatics, and semantics. Life and career Leech was born in Gloucester, England on 16 January 1936. He was educated at Tewkesbury Grammar School, Gloucestershire, and at University College London (UCL), where he was awarded a BA (1959) and PhD (1968). He began his teaching career at UCL, where he was influenced by Randolph Quirk and Michael Halliday as senior colleagues. He spent 1964-5 as a Harkness Fellow at the Massachusetts Institute of Technology, Cambridge MA. In 1969 Leech moved to Lancaster University, UK, where he was Professor of English Linguistics from 1974 to 2001. In 2002 he became Emeritus Professor in the Department of Linguistics and English Language, Lancaster University. He ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Corpus Linguistics
Corpus linguistics is the study of a language as that language is expressed in its text corpus (plural ''corpora''), its body of "real world" text. Corpus linguistics proposes that a reliable analysis of a language is more feasible with corpora collected in the field—the natural context ("realia") of that language—with minimal experimental interference. The text-corpus method uses the body of texts written in any natural language to derive the set of abstract rules which govern that language. Those results can be used to explore the relationships between that subject language and other languages which have undergone a similar analysis. The first such corpora were manually derived from source texts, but now that work is automated. Corpora have not only been used for linguistics research, they have also been used to compile dictionaries (starting with ''The American Heritage Dictionary of the English Language'' in 1969) and grammar guides, such as '' A Comprehensive Grammar ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


International Corpus Of English
The International Corpus of English (ICE) is a set of corpora representing varieties of English from around the world. Over twenty countries or groups of countries where English is the first language or an official second language are included. History Sidney Greenbaum's goal to compile corpora that would compare the syntax of world English became the ICE project that was achieved by Professor Charles F. Meyer. Sidney Greenbaum anticipated for international teams of researchers to collect comparable national variations of English both written and spoken. Comparable variations would be British English, American English, and Indian English, that would be represented through a computer corpora. The corpora are used by researchers to compare the syntax of the varieties of English. ICE corpora completion would have comprehensive linguistic analysis of varieties of English that have emerged. Ongoing research for ICE is implemented by international teams in diversified regions. The project ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Corpus Linguistics
Corpus linguistics is the study of a language as that language is expressed in its text corpus (plural ''corpora''), its body of "real world" text. Corpus linguistics proposes that a reliable analysis of a language is more feasible with corpora collected in the field—the natural context ("realia") of that language—with minimal experimental interference. The text-corpus method uses the body of texts written in any natural language to derive the set of abstract rules which govern that language. Those results can be used to explore the relationships between that subject language and other languages which have undergone a similar analysis. The first such corpora were manually derived from source texts, but now that work is automated. Corpora have not only been used for linguistics research, they have also been used to compile dictionaries (starting with ''The American Heritage Dictionary of the English Language'' in 1969) and grammar guides, such as '' A Comprehensive Grammar ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Brown Corpus
The Brown University Standard Corpus of Present-Day American English (or just Brown Corpus) is an electronic collection of text samples of American English, the first major structured corpus of varied genres. This corpus first set the bar for the scientific study of the frequency and distribution of word categories in everyday language use. Compiled by Henry Kučera and W. Nelson Francis at Brown University, in Rhode Island, it is a general language corpus containing 500 samples of English, totaling roughly one million words, compiled from works published in the United States in 1961. History In 1967, Kučera and Francis published their classic work ''Computational Analysis of Present-Day American English'', which provided basic statistics on what is known today simply as the ''Brown Corpus''. The Brown Corpus was a carefully compiled selection of current American English, totalling about a million words drawn from a wide variety of sources. Kučera and Francis subjected it ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Wellington Corpus Of Spoken New Zealand English
The Wellington Corpus of Spoken New Zealand English is a one-million-word corpus of transcribed English compiled from materials collected between 1988 and 1994, which is made up of excerpts from a range of speakers who have lived in New Zealand since before the age of 10. The corpus was collected under the direction of linguist Janet Holmes and includes broadcast transcripts as well as informal conversations, telephone conversations, lectures, and oral history interviews. The corpus, which was distributed as part of the 1999 ICAME CD-ROM, has been used for a number of academic studies including those looking at morphology, pronoun use and language contact studies, as of the influence of Māori Māori or Maori can refer to: Relating to the Māori people * Māori people of New Zealand, or members of that group * Māori language, the language of the Māori people of New Zealand * Māori culture * Cook Islanders, the Māori people of the C ... on NZ English. References External ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Linguistic Research Institutes
Linguistics is the scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure. Linguistics is concerned with both the cognitive and social aspects of language. It is considered a scientific field as well as an academic discipline; it has been classified as a social science, natural science, cognitive science,Thagard, PaulCognitive Science, The Stanford Encyclopedia of Philosophy (Fall 2008 Edition), Edward N. Zalta (ed.). or part of the humanities. Traditional areas of linguistic analysis correspond to phenomena found in human linguistic systems, such as syntax (rules governing the structure of sentences); semantics (meaning); morphology (structure of words); phonetics (speech sounds and equivalent gestures in sign languages); phonology (the abstract sound system of a particular language); and pragmatics (how social con ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


English Corpora
English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national identity, an identity and common culture ** English language in England, a variant of the English language spoken in England * English languages (other) * English studies, the study of English language and literature * ''English'', an Amish term for non-Amish, regardless of ethnicity Individuals * English (surname), a list of notable people with the surname ''English'' * People with the given name ** English McConnell (1882–1928), Irish footballer ** English Fisher (1928–2011), American boxing coach ** English Gardner (b. 1992), American track and field sprinter Places United States * English, Indiana, a town * English, Kentucky, an unincorporated community * English, Brazoria County, Texas, an unincorporated community * ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Applied Linguistics
Applied linguistics is an interdisciplinary field which identifies, investigates, and offers solutions to language-related real-life problems. Some of the academic fields related to applied linguistics are education, psychology, communication research, information science, natural language processing, anthropology, and sociology. Domain Applied linguistics is an interdisciplinary field. Major branches of applied linguistics include bilingualism and multilingualism, conversation analysis, contrastive linguistics, language assessment, literacies, discourse analysis, language pedagogy, second language acquisition, language planning and policy, interlinguistics, stylistics, language teacher education, forensic linguistics, and translation. Journals Major journals of the field include ''Research Methods in Applied Linguistics'', ''Annual Review of Applied Linguistics'', '' Applied Linguistics'', Studies in Second Language Acquisition, ''Applied Psycholinguistics'', ' ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Anna-Brita Stenström
Anna-Brita Stenström (born 1932) is a linguist whose areas of research include corpus linguistics, sociolinguistics, pragmatics, and discourse analysis. She has initiated and co-directed three online corpora of adolescent language: The Bergen Corpus of London Teenage Language (COLT), Ungdomsspråk och språkkontakt i Norden (UNO), and Corpus Oral de Lenguaje Adolescente (COLA). She is Professor Emerita of English Linguistics at the University of Bergen, Norway. Stenström is a foreign member of the Norwegian Academy of Science and Letters The Norwegian Academy of Science and Letters ( no, Det Norske Videnskaps-Akademi, DNVA) is a learned society based in Oslo, Norway. Its purpose is to support the advancement of science and scholarship in Norway. History The Royal Frederick Unive .... Publications *Anna-Brita Stenström and Annette Myre Jørgensen, eds. 2009. ''Youngspeak in a Multilingual Perspective'' (Pragmatics & Beyond New Series) John Benjamins. *Stenström, Anna-Brita ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Part Of Speech
In grammar, a part of speech or part-of-speech ( abbreviated as POS or PoS, also known as word class or grammatical category) is a category of words (or, more generally, of lexical items) that have similar grammatical properties. Words that are assigned to the same part of speech generally display similar syntactic behavior (they play similar roles within the grammatical structure of sentences), sometimes similar morphological behavior in that they undergo inflection for similar properties and even similar semantic behavior. Commonly listed English parts of speech are noun, verb, adjective, adverb, pronoun, preposition, conjunction, interjection, numeral, article, and determiner. Other terms than ''part of speech''—particularly in modern linguistic classifications, which often make more precise distinctions than the traditional scheme does—include word class, lexical class, and lexical category. Some authors restrict the term ''lexical category'' to refer only to a par ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Lancaster-Oslo-Bergen Corpus
The Lancaster-Oslo/Bergen (LOB) Corpus is a million-word collection of British English texts which was compiled in the 1970s in collaboration between the University of Lancaster, the University of Oslo, and the Norwegian Computing Centre for the Humanities, Bergen, to provide a British counterpart to the Brown Corpus compiled by Henry Kučera and W. Nelson Francis W. Nelson Francis (October 23, 1910 – June 14, 2002) was an American author, linguist, and university professor. He served as a member of the faculties of Franklin & Marshall College and Brown University, where he specialized in Engl ... for American English in the 1960s. Its composition was designed to match the original Brown corpus in terms of its size and genres as closely as possible using documents published in the UK by British authors. Both corpora consist of 500 samples each comprising about 2000 words in the following genres: The corpus has been also tagged, i.e. part-of-speech categori ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]