Cultural Observatory
Culturomics is a form of computational lexicology that studies human behavior and cultural trends through the quantitative analysis of digitized texts. Researchers data-mine large digital archives to investigate cultural phenomena reflected in language and word usage. The term is an American neologism first described in a 2010 ''Science'' article called ''Quantitative Analysis of Culture Using Millions of Digitized Books'', co-authored by Harvard researchers Jean-Baptiste Michel and Erez Lieberman Aiden. Michel and Aiden helped create the Google Labs project Google Ngram Viewer, which uses n-grams to analyze the Google Books digital library for cultural patterns in language use over time. Because the Google Ngram data set is not an unbiased sample and does not include metadata, there are several pitfalls when using it to study language or the popularity of terms. Medical literature accounts for a large, but shifting, share of the corpus, which does not take ...
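As a rough illustration of the n-gram counting described above, the sketch below computes how large a share of each year's n-grams a given phrase accounts for. The toy corpus and the function names `ngrams` and `relative_frequency_by_year` are invented for this example; this is not the Ngram Viewer's actual implementation, and whitespace tokenization is a simplifying assumption.

```python
from collections import Counter

def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def relative_frequency_by_year(corpus, phrase):
    """Map each year to the share of that year's n-grams equal to `phrase`."""
    target = tuple(phrase.lower().split())
    n = len(target)
    result = {}
    for year, texts in corpus.items():
        counts = Counter()
        for text in texts:
            # Assumption: lowercase whitespace tokenization is good enough here.
            counts.update(ngrams(text.lower().split(), n))
        total = sum(counts.values())
        result[year] = counts[target] / total if total else 0.0
    return result

# Toy corpus (invented for illustration): year -> list of texts.
toy_corpus = {
    1900: ["the steam engine changed the world"],
    2000: ["the search engine changed the web",
           "a search engine indexes the web"],
}
freq = relative_frequency_by_year(toy_corpus, "search engine")
```

Normalizing by the year's total n-gram count, rather than reporting raw counts, keeps years with more digitized text from dominating the trend. It does not correct for the sampling bias noted above.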
|
Computational Lexicology
Computational lexicology is a branch of computational linguistics concerned with the use of computers in the study of the lexicon. It has been more narrowly described by some scholars (Amsler, 1980) as the use of computers in the study of ''machine-readable dictionaries''. It is distinguished from ''computational lexicography'', which more properly would be the use of computers in the construction of dictionaries, though some researchers have used computational lexicography as a synonym.
History
Computational lexicology emerged as a separate discipline within computational linguistics with the appearance of machine-readable dictionaries, starting with the creation of the machine-readable tapes of the ''Merriam-Webster Seventh Collegiate Dictionary'' and the ''Merriam-Webster New Pocket Dictionary'' in the 1960s by John Olney et al. at System Development Corporation. Today, computational lexicology is best known through the creation and applications of WordNet. As the comp ...
|
Prediction
A prediction (Latin ''præ-'', "before," and ''dicere'', "to say"), or forecast, is a statement about a future event or data. Predictions are often, but not always, based upon experience or knowledge. There is no universal agreement about the exact difference from "estimation"; different authors and disciplines ascribe different connotations. Future events are necessarily uncertain, so guaranteed accurate information about the future is impossible. Prediction can be useful to assist in making plans about possible developments.
Opinion
In a non-statistical sense, the term "prediction" is often used to refer to an informed guess or opinion. A prediction of this kind might be informed by the predicting person's abductive reasoning, inductive reasoning, deductive reasoning, and experience, and may be useful if that person is knowledgeable in the field. The Delphi method is a technique for eliciting such expert-judgement-based predictions in a controlled way. ...
|
Fukushima Daiichi Nuclear Disaster
The Fukushima Daiichi nuclear disaster was a nuclear accident in 2011 at the Fukushima Daiichi Nuclear Power Plant in Ōkuma, Fukushima, Japan. The proximate cause of the disaster was the 2011 Tōhoku earthquake and tsunami, which occurred on the afternoon of 11 March 2011 and remains the most powerful earthquake ever recorded in Japan. The earthquake triggered a powerful tsunami, with 13–14-meter-high waves damaging the nuclear power plant's emergency diesel generators, leading to a loss of electric power. The result was the most severe nuclear accident since the Chernobyl disaster in 1986. It was classified as level seven on the International Nuclear Event Scale (INES) after initially being classified as level five, making it the only accident other than Chernobyl to receive that classification. While the 1957 explosion at the Mayak facility (the Kyshtym disaster) was the second worst by radioactivity released, the INES ranks incidents by impact on population, so Chernobyl (335,000 ...
|
Twitter
Twitter is an online social media and social networking service owned and operated by American company Twitter, Inc., on which users post and interact with messages of up to 280 characters known as "tweets". Registered users can post, like, and retweet tweets, while unregistered users can only read public tweets. Users interact with Twitter through browser or mobile frontend software, or programmatically via its APIs. Twitter was created by Jack Dorsey, Noah Glass, Biz Stone, and Evan Williams in March 2006 and launched in July of that year. Twitter, Inc. is based in San Francisco, California and has more than 25 offices around the world. As of 2012, more than 100 million users posted 340 million tweets a day, and the service handled an average of 1.6 billion search queries per day. In 2013, it was one of the ten most-visited websites and has been de ...
|
Public Opinion
Public opinion is the collective opinion on a specific topic or voting intention relevant to a society. It is the people's views on matters affecting them.
Etymology
The term "public opinion" was derived from the French ''opinion publique'', which was first used in 1588 by Michel de Montaigne in the second edition of his ''Essays'' (ch. XXII). The French term also appears in the 1761 work ''Julie, or the New Heloise'' by Jean-Jacques Rousseau. Precursors of the phrase in English include William Temple's "general opinion" (appearing in his 1672 work ''On the Original and Nature of Government'') and John Locke's "law of opinion" (appearing in his 1689 work ''An Essay Concerning Human Understanding'').
History
The emergence of public opinion as a significant force in the political realm dates to the late 17th century, but opinion had been regarded as having singular importance much earlier. Medieval ''fama publica'' or ''vox et fama communis'' had great legal and social importance from the ...
|
Information Extraction
Information extraction (IE) is the task of automatically extracting structured information from unstructured and/or semi-structured machine-readable documents and other electronically represented sources. In most cases this activity concerns processing human language texts by means of natural language processing (NLP). Recent activities in multimedia document processing, such as automatic annotation and content extraction from images, audio, video, and documents, could also be seen as information extraction. Due to the difficulty of the problem, current approaches to IE (as of 2010) focus on narrowly restricted domains. An example is the extraction from newswire reports of corporate mergers, such as denoted by the formal relation $\mathrm{MergerBetween}(company_1, company_2, date)$, from an online news sentence such as ''"Yesterday, New York based Foo Inc. announced their acquisition of Bar Corp."'' A broad goal of IE is to allow computation to be done on the previously unstructured data. A more sp ...
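The merger example above can be sketched as a single hand-written pattern that turns one sentence shape into a structured record. The names `MergerBetween`, `_ACQUISITION`, and `extract_merger` are chosen for illustration only, and the regular expression is an assumption that covers just this phrasing; practical IE systems rely on far more robust NLP components.

```python
import re
from typing import NamedTuple, Optional

class MergerBetween(NamedTuple):
    """Structured record for the relation (company_1, company_2, date)."""
    company_1: str
    company_2: str
    date: str

# Hypothetical pattern: handles only sentences shaped like
# "<date>, [<place> based ]<buyer> announced their|its acquisition of <target>."
_ACQUISITION = re.compile(
    r"^(?P<date>\w+), (?:.+ based )?(?P<buyer>.+?) "
    r"announced (?:their|its) acquisition of (?P<target>.+?)\.?$"
)

def extract_merger(sentence: str) -> Optional[MergerBetween]:
    """Return a MergerBetween record, or None if the pattern does not match."""
    match = _ACQUISITION.match(sentence.strip())
    if match is None:
        return None
    # The date is kept as the raw word; resolving "Yesterday" to a calendar
    # date would require the publication date, which is not available here.
    return MergerBetween(match["buyer"], match["target"], match["date"])

record = extract_merger(
    "Yesterday, New York based Foo Inc. announced their acquisition of Bar Corp."
)
```

Because the pattern treats a trailing period as sentence punctuation, the target comes back as `Bar Corp` without its final period. This is one small example of why narrow, rule-based extractors are brittle outside their intended domain.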
|
Text Categorisation
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a document to one or more classes or categories. This may be done "manually" (or "intellectually") or algorithmically. The intellectual classification of documents has mostly been the province of library science, while the algorithmic classification of documents is mainly in information science and computer science. The problems are overlapping, however, and there is therefore interdisciplinary research on document classification. The documents to be classified may be texts, images, music, etc. Each kind of document possesses its special classification problems. When not otherwise speci ...
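To make the algorithmic variant of this task concrete, here is a minimal sketch of a multinomial naive Bayes text classifier with add-one smoothing. Naive Bayes is only one of many possible techniques and is swapped in here as an example; the training sentences, labels, and the function names `train` and `classify` are all invented for illustration.

```python
import math
from collections import Counter, defaultdict

def train(labelled_docs):
    """Collect per-class word counts and document counts from (text, label) pairs."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    for text, label in labelled_docs:
        doc_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocab = {word for counts in word_counts.values() for word in counts}
    return word_counts, doc_counts, vocab

def classify(text, model):
    """Return the class with the highest log-posterior under add-one smoothing."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        # Log prior: fraction of training documents carrying this label.
        score = math.log(doc_counts[label] / total_docs)
        denom = sum(word_counts[label].values()) + len(vocab)
        for word in text.lower().split():
            if word in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[label][word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Toy training set (invented for illustration).
training = [
    ("the match ended with a late goal", "sport"),
    ("the team won the league title", "sport"),
    ("parliament passed the new tax law", "politics"),
    ("the minister announced an election", "politics"),
]
model = train(training)
```

Calling `classify("a goal for the team", model)` assigns the text to a single class. Assigning a document to more than one class, as the definition above allows, would need a per-class threshold rather than a single argmax.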
|
Statistical Machine Translation
Statistical machine translation (SMT) is a machine translation paradigm where translations are generated on the basis of statistical models whose parameters are derived from the analysis of bilingual text corpora. The statistical approach contrasts with the rule-based approaches to machine translation as well as with example-based machine translation, and has more recently been superseded by neural machine translation in many applications. The first ideas of statistical machine translation were introduced by Warren Weaver in 1949, including the idea of applying Claude Shannon's information theory. Statistical machine translation was re-introduced in the late 1980s and early 1990s by researchers at IBM's Thomas J. Watson Research Center and has contributed to the significant resurgence in interest in machine translation in recent years. Before the introduction of neural machine translation, it was by far the most widely studied machine translati ...
|
Eurovision Song Contest
The Eurovision Song Contest, sometimes abbreviated to ESC and often known simply as Eurovision, is an international songwriting competition organised annually by the European Broadcasting Union (EBU), featuring participants representing primarily European countries. Each participating country submits an original song to be performed on live television and radio, transmitted to national broadcasters via the EBU's Eurovision and Euroradio networks, with competing countries then casting votes for the other countries' songs to determine a winner. Based on the Sanremo Music Festival held in Italy since 1951, Eurovision has been held annually since 1956 (apart from 2020), making it the longest-running annual international televised music competition and one of the world's longest-running television programmes. Active members of the EBU, as well as invited associate members, are eligible to compete, and 52 countries have participated at least once. Each participating broadcaster ...
|
News Coverage
News is information about current events. This may be provided through many different media: word of mouth, printing, postal systems, broadcasting, electronic communication, or through the testimony of observers and witnesses to events. News is sometimes called "hard news" to differentiate it from soft media. Common topics for news reports include war, government, politics, education, health, the environment, economy, business, fashion, entertainment, and sport, as well as quirky or unusual events. Government proclamations, concerning royal ceremonies, laws, taxes, public health, and criminals, have been dubbed news since ancient times. Technological and social developments, often driven by government communication and espionage networks, have increased the speed with which news can spread, as well as influenced its content. Throughout history, people have transported new information through oral means. Having developed in China over centuries, newspapers became est ...
|
Gender Bias
Sexism is prejudice or discrimination based on one's sex or gender. Sexism can affect anyone, but it primarily affects women and girls. There is a clear and broad consensus among academic scholars in multiple fields that sexism refers primarily to discrimination against women, and primarily affects women. See, for example:
* Defines sexism as "prejudice, stereotyping, or discrimination, typically against women, on the basis of sex".
* Defines sexism as "prejudice or discrimination based on sex or gender, especially against women and girls". Notes that "sexism in a society is most commonly applied against women and girls. It functions to maintain patriarchy, or male domination, through ideological and material practices of individuals, collectives, and institutions that oppress women and girls on the basis of sex or gender."
* Notes that "'Sexism' refers to a historically and globally pervasive form of oppression against women."
* Notes that "sexism usually refers to prejudice ...
|
Lexicon
A lexicon is the vocabulary of a language or branch of knowledge (such as nautical or medical). In linguistics, a lexicon is a language's inventory of lexemes. The word ''lexicon'' derives from the Greek word λεξικόν (''lexikon''), neuter of λεξικός (''lexikos''), meaning 'of or for words'. Linguistic theories generally regard human languages as consisting of two parts: a lexicon, essentially a catalogue of a language's words (its wordstock); and a grammar, a system of rules which allow for the combination of those words into meaningful sentences. The lexicon is also thought to include bound morphemes, which cannot stand alone as words (such as most affixes). In some analyses, compound words and certain classes of idiomatic expressions, collocations and other phrases are also considered to be part of the lexicon. Dictionaries are lists of the lexicon, in alphabetical order, of a given language; usually, however, bound morphemes are not included.
Size and organization
Items in the lexicon are called lexemes, ...