
The Google Knowledge Graph is a
knowledge base
In computer science, a knowledge base (KB) is a set of sentences, each sentence given in a knowledge representation language, with interfaces to tell new sentences and to ask questions about what is known, where either of these interfaces migh ...
from which
Google
Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
serves relevant information in an infobox beside its
search results
A search engine results page (SERP) is a webpage that is displayed by a search engine in response to a query by a user. The main component of a SERP is the listing of results that are returned by the search engine in response to a keyword quer ...
. This allows the user to see the answer in a glance, as an
instant answer. The data is generated automatically from a variety of sources, covering places, people, businesses, and more.
The information covered by Google's Knowledge Graph grew quickly after launch, tripling its data size within seven months (covering 570 million entities and 18 billion facts). By mid-2016, Google reported that it held 70 billion facts and answered "roughly one-third" of the 100 billion monthly searches they handled. By May 2020, this had grown to 500 billion facts on 5 billion entities.
There is no official documentation of how the Google Knowledge Graph is implemented.
According to Google, its information is retrieved from many sources, including the ''
CIA World Factbook
''The World Factbook'', also known as the ''CIA World Factbook'', is a reference resource produced by the United States' Central Intelligence Agency (CIA) with almanac-style information about the countries of the world. The official print ve ...
'' and
Wikipedia
Wikipedia is a free content, free Online content, online encyclopedia that is written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and the wiki software MediaWiki. Founded by Jimmy Wales and La ...
.
It is used to answer direct spoken questions in
Google Assistant
Google Assistant is a virtual assistant software application developed by Google that is primarily available on home automation and mobile devices. Based on artificial intelligence, Google Assistant can engage in two-way conversations, unlike ...
and
Google Home
Google Nest, previously named Google Home, is a line of smart speakers developed by Google under the Google Nest brand. The devices enable users to speak voice commands to interact with services through Google Assistant, the company's virtual ...
voice queries.
It has been criticized for providing answers with neither source attribution nor
citation
A citation is a reference to a source. More precisely, a citation is an abbreviated alphanumeric expression embedded in the body of an intellectual work that denotes an entry in the bibliographic references section of the work for the purpose o ...
s.
History
Google announced its Knowledge Graph on May 16, 2012, as a way to significantly enhance the value of information returned by Google searches.
Initially available only in English, it was expanded in December 2012 to
Spanish,
French,
German,
Portuguese,
Japanese,
Russian and
Italian.
Bengali support was added in March 2017.
The Knowledge Graph was powered in part by
Freebase.
In August 2014, ''
New Scientist
''New Scientist'' is a popular science magazine covering all aspects of science and technology. Based in London, it publishes weekly English-language editions in the United Kingdom, the United States and Australia. An editorially separate organ ...
'' reported that Google had launched a Knowledge Vault project. After publication, Google reached out to ''
Search Engine Land'' to explain that Knowledge Vault was a research report, not an active Google service. ''Search Engine Land'' expressed indications that Google was experimenting with "numerous models" for gathering meaning from text.
Google's Knowledge Vault was meant to deal with facts, automatically gathering and merging information from across the Internet into a knowledge base capable of answering direct questions, such as "Where was
Madonna
Madonna Louise Ciccone ( ; born August 16, 1958) is an American singer, songwriter, record producer, and actress. Referred to as the "Queen of Pop", she has been recognized for her continual reinvention and versatility in music production, ...
born?" In a 2014 report, the Vault was reported to have collected over 1.6 billion facts, 271 million of which were considered "confident facts" deemed to be more than 90% true. It was reported to be different from the Knowledge Graph in that it gathered information automatically instead of relying on crowd-sourced facts compiled by humans.
Features
Google Knowledge Panel
A Google Knowledge Panel which is part of Google search engine result pages, presents an overview of entities such as individuals, organizations, locations, or objects directly within the search interface. This feature uses data from Google Knowledge Graph, an extensive database that organizes and interconnects information about entities, enhancing the retrieval and presentation of relevant content to users.
Criticism
Lack of source attribution
By May 2016, knowledge boxes were appearing for "roughly one-third" of the 100 billion monthly searches the company processed.
Dario Taraborelli, head of research at the
Wikimedia Foundation
The Wikimedia Foundation, Inc. (WMF) is an American 501(c)(3) nonprofit organization headquartered in San Francisco, California, and registered there as foundation (United States law), a charitable foundation. It is the host of Wikipedia, th ...
, told ''
The Washington Post
''The Washington Post'', locally known as ''The'' ''Post'' and, informally, ''WaPo'' or ''WP'', is an American daily newspaper published in Washington, D.C., the national capital. It is the most widely circulated newspaper in the Washington m ...
'' that Google's omission of sources in its knowledge boxes "undermines people’s ability to verify information and, ultimately, to develop well-informed opinions". The publication also reported that the boxes are "frequently unattributed", such as a knowledge box on the age of actress
Betty White, which is "as unsourced and absolute as if handed down by God".
Declining Wikipedia article readership
According to ''
The Register
''The Register'' (often also called El Reg) is a British Technology journalism, technology news website co-founded in 1994 by Mike Magee (journalist), Mike Magee and John Lettice. The online newspaper's Nameplate_(publishing), masthead Logo, s ...
'' in 2014 the display of direct answers in knowledge panels alongside Google search results caused significant readership declines for
Wikipedia
Wikipedia is a free content, free Online content, online encyclopedia that is written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and the wiki software MediaWiki. Founded by Jimmy Wales and La ...
, from which the panels obtained some of their information. Also in 2014, ''
The Daily Dot
''The Daily Dot'' is a digital media company covering the culture of the Internet and the World Wide Web. It was founded by Nicholas White in 2011, and is headquartered in Austin, Texas.
The site, conceived as the Internet's "hometown newsp ...
'' noted that "Wikipedia still has no real competitor as far as actual content is concerned. All that's up for grabs are traffic stats. And as a nonprofit, traffic numbers don't equate into revenue in the same way they do for a commercial media site". After the article's publication, a spokesperson for the
Wikimedia Foundation
The Wikimedia Foundation, Inc. (WMF) is an American 501(c)(3) nonprofit organization headquartered in San Francisco, California, and registered there as foundation (United States law), a charitable foundation. It is the host of Wikipedia, th ...
, which operates Wikipedia, stated that it "welcomes" the knowledge panel functionality, that it was "looking into" the traffic drops, and that "We've also not noticed a significant drop in search engine referrals. We also have a continuing dialog with staff from Google working on the Knowledge Panel".
In his 2020 book,
Dariusz Jemielniak noted that as most Google users do not realize that many answers to their questions that appear in the Knowledge Graph come from Wikipedia, this reduces Wikipedia's popularity, and in turn limited the site's ability to raise new funds and attract new volunteers.
Bias
The algorithm has been criticized for presenting biased or inaccurate information, usually because of sourcing information from websites with high
search engine optimization
Search engine optimization (SEO) is the process of improving the quality and quantity of Web traffic, website traffic to a website or a web page from web search engine, search engines. SEO targets unpaid search traffic (usually referred to as ...
. It had been noted in 2014 that while there was a Knowledge Graph for most major historical or pseudo-historical
religious
Religion is a range of social- cultural systems, including designated behaviors and practices, morals, beliefs, worldviews, texts, sanctified places, prophecies, ethics, or organizations, that generally relate humanity to supernatural ...
figures such as
Moses
In Abrahamic religions, Moses was the Hebrews, Hebrew prophet who led the Israelites out of slavery in the The Exodus, Exodus from ancient Egypt, Egypt. He is considered the most important Prophets in Judaism, prophet in Judaism and Samaritani ...
,
Muhammad
Muhammad (8 June 632 CE) was an Arab religious and political leader and the founder of Islam. Muhammad in Islam, According to Islam, he was a prophet who was divinely inspired to preach and confirm the tawhid, monotheistic teachings of A ...
and
Gautama Buddha
Siddhartha Gautama, most commonly referred to as the Buddha (),*
*
*
was a śramaṇa, wandering ascetic and religious teacher who lived in South Asia during the 6th or 5th century BCE and founded Buddhism. According to Buddhist lege ...
, there was none for
Jesus
Jesus (AD 30 or 33), also referred to as Jesus Christ, Jesus of Nazareth, and many Names and titles of Jesus in the New Testament, other names and titles, was a 1st-century Jewish preacher and religious leader. He is the Jesus in Chris ...
, the central figure of
Christianity
Christianity is an Abrahamic monotheistic religion, which states that Jesus in Christianity, Jesus is the Son of God (Christianity), Son of God and Resurrection of Jesus, rose from the dead after his Crucifixion of Jesus, crucifixion, whose ...
. On June 3, 2021, a knowledge box identified
Kannada
Kannada () is a Dravidian language spoken predominantly in the state of Karnataka in southwestern India, and spoken by a minority of the population in all neighbouring states. It has 44 million native speakers, and is additionally a ...
as the ugliest language in India, prompting outrage from the Kannada-language community; the state of
Karnataka
Karnataka ( ) is a States and union territories of India, state in the southwestern region of India. It was Unification of Karnataka, formed as Mysore State on 1 November 1956, with the passage of the States Reorganisation Act, 1956, States Re ...
, where most Kannada speakers live, also threatened to sue Google for damaging the public image of the language. Google promptly changed the featured snippet for the search query and issued a formal apology.
See also
*
DBpedia
DBpedia (from "DB" for "database") is a project aiming to extract structured content from the information created in the Wikipedia project. This structured information is made available on the World Wide Web using OpenLink Virtuoso. DBpedia a ...
*
Google Assistant
Google Assistant is a virtual assistant software application developed by Google that is primarily available on home automation and mobile devices. Based on artificial intelligence, Google Assistant can engage in two-way conversations, unlike ...
*
Linked data
In computing, linked data is structured data which is interlinked with other data so it becomes more useful through semantic queries. It builds upon standard Web technologies such as HTTP, RDF and URIs, but rather than using them to serve web ...
*
Knowledge graph
In knowledge representation and reasoning, a knowledge graph is a knowledge base that uses a Graph (discrete mathematics), graph-structured data model or topology to represent and operate on data. Knowledge graphs are often used to store interl ...
*
Semantic integration
Semantic integration is the process of interrelating information from diverse sources, for example calendars and to do lists, email archives, presence information (physical, psychological, and social), documents of all sorts, contacts (including ...
*
Semantic network
A semantic network, or frame network is a knowledge base that represents semantic relations between concepts in a network. This is often used as a form of knowledge representation. It is a directed or undirected graph consisting of vertices, ...
*
Wikidata
Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, are able to use under the CC0 public domain ...
*
AI Overviews
References
{{Google LLC
Google Search
Internet search engines
Knowledge bases
Internet properties established in 2012
Knowledge graphs
Information