Culturomics
Culturomics is a form of computational lexicology that studies human behavior and cultural trends through the quantitative analysis of digitized texts. Researchers data mine large digital archives to investigate cultural phenomena reflected in language and word usage. The term is an American neologism first described in a 2010 ''Science'' article called ''Quantitative Analysis of Culture Using Millions of Digitized Books'', co-authored by Harvard researchers Jean-Baptiste Michel and Erez Lieberman Aiden. Michel and Aiden helped create the Google Labs project Google Ngram Viewer which uses n-grams to analyze the Google Books digital library for cultural patterns in language use over time. Because the Google Ngram data set is not an unbiased sample, and does not include metadata, there are several pitfalls when using it to study language or the popularity of terms. Medical literature accounts for a large, but shifting, share of the corpus, which does not take into account how o ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Google Ngram Viewer
The Google Books Ngram Viewer is an online search engine that charts the frequencies of any set of search strings using a yearly count of ''n''-grams found in printed sources published between 1500 and 2022 in Google's text corpora in English, Chinese (simplified), French, German, Hebrew, Italian, Russian, or Spanish. There are also some specialized English corpora, such as American English, British English, and English Fiction. The program can search for a word or a phrase, including misspellings or gibberish. The ''n''-grams are matched with the text within the selected corpus, and if found in 40 or more books, are then displayed as graph The Google Books Ngram Viewer supports searches for parts of speech and wildcards. It is routinely used in research. History In the development processes, Google teamed up with two Harvard researchers, Jean-Baptiste Michel and Erez Lieberman Aiden, and quietly released the program on December 16, 2010. Before the release, it was difficult ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Erez Lieberman Aiden
Erez Lieberman Aiden (born 1980, né Erez Lieberman) is an American research scientist active in multiple fields related to applied mathematics. He is a professor of molecular and human genetics and Emeritus McNair Scholar at the Baylor College of Medicine, and formerly a fellow at the Harvard Society of Fellows and visiting faculty member at Google. He is an adjunct professor of computer science at Rice University. Using mathematical and computational approaches, he has studied evolution in a range of contexts, including that of networks through evolutionary graph theory and languages in the field of culturomics. He has published scientific articles in a variety of disciplines. Lieberman Aiden has won awards including the Lemelson–MIT Student Prize and the American Physical Society's Award for Outstanding Doctoral Thesis Research in Biological Physics. In 2009, Lieberman Aiden was named as one of 35 top innovators under 35 by '' Technology Review'' and in 2011 he was o ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Computational Lexicology
Computational lexicology is a branch of computational linguistics, which is concerned with the use of computers in the study of lexicon. It has been more narrowly described by some scholars (Amsler, 1980) as the use of computers in the study of ''machine-readable dictionary, machine-readable dictionaries''. It is distinguished from ''computational lexicography'', which more properly would be the use of computers in the construction of dictionaries, though some researchers have used computational lexicography as synonymous. History Computational lexicology emerged as a separate discipline within computational linguistics with the appearance of machine-readable dictionaries, starting with the creation of the machine-readable tapes of the ''Merriam-Webster Seventh Collegiate Dictionary'' and the ''Merriam-Webster New Pocket Dictionary'' in the 1960s by John Olney et al. at System Development Corporation. Today, computational lexicology is best known through the creation and applicatio ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Arab Spring
The Arab Spring () was a series of Nonviolent resistance, anti-government protests, Rebellion, uprisings, and Insurgency, armed rebellions that spread across much of the Arab world in the early 2010s. It began Tunisian revolution, in Tunisia in response to corruption and economic stagnation. From Tunisia, the protests initially spread to five other countries: Libya, Egypt, Yemen, Syria and Bahrain. Rulers were deposed (Zine El Abidine Ben Ali of Tunisia, Muammar Gaddafi of Libya, and Hosni Mubarak of Egypt all in 2011, and Ali Abdullah Saleh of Yemen in 2012) and major uprisings and social violence occurred, including riots, civil wars, or insurgencies. Sustained street demonstrations took place in Morocco, Iraq, Algeria, Lebanon, Jordan, Kuwait, Oman and Sudan. Minor protests took place in Djibouti, Mauritania, Palestine, Saudi Arabia and the Western Sahara. A major slogan of the demonstrators in the Arab world is ''Ash-shab yurid isqat an-nizam, ash-shaʻb yurīd isqāṭ an- ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Fukushima Daiichi Nuclear Disaster
The Fukushima nuclear accident was a major nuclear accident at the Fukushima Daiichi Nuclear Power Plant in Ōkuma, Fukushima, Japan, which began on 11 March 2011. The cause of the accident was the 2011 Tōhoku earthquake and tsunami, which resulted in electrical grid failure and damaged nearly all of the power plant's Emergency power system, backup energy sources. The subsequent inability to sufficiently cool reactors after shutdown compromised Primary containment, containment and resulted in the release of radioactive contamination, radioactive contaminants into the surrounding environment. The accident was rated seven (the maximum severity) on the International Nuclear Event Scale by Nuclear and Industrial Safety Agency, following a report by the JNES (Japan Nuclear Energy Safety Organization). It is regarded as the worst nuclear incident since the Chernobyl disaster in 1986, which was also rated a seven on the International Nuclear Event Scale. According to the United Nati ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Twitter
Twitter, officially known as X since 2023, is an American microblogging and social networking service. It is one of the world's largest social media platforms and one of the most-visited websites. Users can share short text messages, images, and videos in Microblogging, short posts commonly known as "Tweet (social media), tweets" (officially "posts") and Like button, like other users' content. The platform also includes direct message, direct messaging, video and audio calling, bookmarks, lists, communities, a chatbot (Grok (chatbot), Grok), job search, and Spaces, a social audio feature. Users can vote on context added by approved users using the Community Notes feature. Twitter was created in March 2006 by Jack Dorsey, Noah Glass, Biz Stone, and Evan Williams (Internet entrepreneur), Evan Williams, and was launched in July of that year. Twitter grew quickly; by 2012 more than 100 million users produced 340 million daily tweets. Twitter, Inc., was based in San Francisco, C ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Public Opinion
Public opinion, or popular opinion, is the collective opinion on a specific topic or voting intention relevant to society. It is the people's views on matters affecting them. In the 21st century, public opinion is widely thought to be heavily influenced by the media; many studies have been undertaken which look at the different factors which influence public opinion. Politicians and other people concerned with public opinion often attempt to influence it using advertising or rhetoric. Opinion plays a vital role in uncovering some critical decisions. Sentiment analysis or opinion mining is a method used to mine the thoughts or feelings of the general population. One of the struggles of public opinion is how it can be influenced by misinformation. Etymology The term "public opinion" was derived from the French ', which was first used in 1588 by Michel de Montaigne, one of the most significant philosophers of the French Renaissance, in the second edition of his famous '' Essays ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Text Categorisation
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a document to one or more classes or categories. This may be done "manually" (or "intellectually") or algorithmically. The intellectual classification of documents has mostly been the province of library science, while the algorithmic classification of documents is mainly in information science and computer science. The problems are overlapping, however, and there is therefore interdisciplinary research on document classification. The documents to be classified may be texts, images, music, etc. Each kind of document possesses its special classification problems. When not otherwise specified, text classification is implied. Documents may be classified according to their subjects or according to other attributes (such as document type, author, printing year etc.). In the rest of this article only subject classification is considered ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Statistical Machine Translation
Statistical machine translation (SMT) is a machine translation approach where translations are generated on the basis of statistical models whose parameters are derived from the analysis of bilingual text corpora. The statistical approach contrasts with the rule-based approaches to machine translation as well as with example-based machine translation, that superseded the previous rule-based approach that required explicit description of each and every linguistic rule, which was costly, and which often did not generalize to other languages. The first ideas of statistical machine translation were introduced by Warren Weaver in 1949, including the ideas of applying Claude Shannon's information theory. Statistical machine translation was re-introduced in the late 1980s and early 1990s by researchers at IBM's Thomas J. Watson Research Center. Before the introduction of neural machine translation, it was by far the most widely studied machine translation method. Basis The idea b ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Eurovision Song Contest
The Eurovision Song Contest (), often known simply as Eurovision, is an international Music competition, song competition organised annually by the European Broadcasting Union (EBU) among its members since 1956. Each participating broadcaster submits an original song representing its country to be performed and broadcast live to all of them via the Eurovision (network), Eurovision and Euroradio networks, and then casts votes for the other countries' songs to determine a winner. The contest was inspired by and based on the Italian Sanremo Music Festival, held in the Italian Riviera since 1951. Eurovision has been held annually since 1956 (except for due to the COVID-19 pandemic), making it the longest-running international music competition on television and one of the world's longest-running television programmes. Active members of the EBU and invited associate members are eligible to compete; broadcasters from List of countries in the Eurovision Song Contest, 52 countries hav ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
News Coverage
News is information about current events. This may be provided through many different media: word of mouth, printing, postal systems, broadcasting, electronic communication, or through the testimony of observers and witnesses to events. News is sometimes called "hard news" to differentiate it from soft media. Subject matters for news reports include war, government, politics, education, health, economy, business, fashion, sport, entertainment, and the environment, as well as quirky or unusual events. Government proclamations, concerning royal ceremonies, laws, taxes, public health, and criminals, have been dubbed news since ancient times. Technological and social developments, often driven by government communication and espionage networks, have increased the speed with which news can spread, as well as influenced its content. Throughout history, people have transported new information through oral means. Having developed in China over centuries, newspapers became estab ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |