Abstract Wikipedia
   HOME

TheInfoList



OR:

Abstract Wikipedia is an in-development project of the
Wikimedia Foundation The Wikimedia Foundation, Inc. (WMF) is an American 501(c)(3) nonprofit organization headquartered in San Francisco, California, and registered there as foundation (United States law), a charitable foundation. It is the host of Wikipedia, th ...
. It aims to use
Wikifunctions Wikifunctions is a collaboratively edited catalog of computer functions to enable the creation, modification, and reuse of source code In computing, source code, or simply code or source, is a plain text computer program written in a pr ...
to create a language-independent version of
Wikipedia Wikipedia is a free content, free Online content, online encyclopedia that is written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and the wiki software MediaWiki. Founded by Jimmy Wales and La ...
using its
structured data A data model is an abstract model that organizes elements of data and standardizes how they relate to one another and to the properties of real-world entities. For instance, a data model may specify that the data element representing a car be ...
. First conceived in 2020 (with a precursor proposal in 2013), Abstract Wikipedia has been under active development ever since, with the related project of Wikifunctions launched in 2023. Nevertheless, the project has proved controversial. As envisioned, Abstract Wikipedia would consist of "Constructors" (templates for abstract statements), "Content" (the abstract statements themselves), and "Renderers" (which would automatically translate abstract statements into
natural language A natural language or ordinary language is a language that occurs naturally in a human community by a process of use, repetition, and change. It can take different forms, typically either a spoken language or a sign language. Natural languages ...
).


History


Conception (2013–2020)

On 7 August 2013,
Denny Vrandečić Zdenko "Denny" Vrandečić (born 27 February 1978 in Stuttgart, Germany) is a Croatian computer scientist. He was a co-developer of Semantic MediaWiki and Wikidata, the lead developer of the Wikifunctions project, and an employee of the Wikime ...
, the co-founder of
Wikidata Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, are able to use under the CC0 public domain ...
, suggested "an extension of the template system" where template calls would expand into content based on the language of the user. For example, a template call such as could be variously expanded by ''Template:F12/en'' into "''Berlin is the capital of Germany''.", and by ''Template:F12/de'' into "" This has been viewed as a predecessor of Abstract Wikipedia proper. Vrandečić proposed it again in a
Google Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
working paper in April 2020, formally proposed in May 2020 (as ''Wikilambda''). It was approved by the Wikimedia Foundation Board of Trustees in July 2020 as ''Abstract Wikipedia''.


Development (2020–present)

In April 2021, Vrandečić published an overview of the system in the computer science journal ''
Communications of the ACM ''Communications of the ACM'' (''CACM'') is the monthly journal of the Association for Computing Machinery (ACM). History It was established in 1958, with Saul Rosen as its first managing editor. It is sent to all ACM members. Articles are i ...
''. In January 2023, ''The Signpost'' reported on the slow progress of the Abstract Wikipedia project. According to an evaluation by four
Google Fellows Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
working on the project, it was at a "substantial risk of failure" due to its poor technical plan. The Google Fellows recommended that Abstract Wikipedia be decoupled from Wikifunctions, that Wikifunctions refine MediaWiki's support for programming in Lua rather than having a completely new language, and that Abstract Wikipedia converge on a unified approach to
natural language generation Natural language generation (NLG) is a software process that produces natural language output. A widely cited survey of NLG methods describes NLG as "the subfield of artificial intelligence and computational linguistics that is concerned with the ...
(NLG) that builds on open source software if possible. The Wikimedia Foundation staff responded to this report by completely rejecting the idea that Abstract Wikipedia and Wikifunctions could be separated, and accusing the Google Fellows of making "fallacies and false comparisons". The Wikimedia Foundation also stated that using existing NLG pipelines like
Grammatical Framework Grammatical Framework (GF) is a programming language for writing grammars of natural languages. GF is capable of parsing and generating texts in several languages simultaneously while working from a language-independent representation of meaning. ...
could not support certain languages such as the Niger–Congo B languages, and would also "replicate the trends of an imperialist English-focused Western-thinking industry." On 26 July 2023,
Wikifunctions Wikifunctions is a collaboratively edited catalog of computer functions to enable the creation, modification, and reuse of source code In computing, source code, or simply code or source, is a plain text computer program written in a pr ...
officially launched to the general public.


Design


Technical components

The Abstract Wikipedia project would consist of three main components: # Constructors, which enable abstract statements. The Abstract Wikipedia team prefers that these are hosted in Wikifunctions. # Content, which consists of abstract calls to Constructors, with values for each slot. These are preferably hosted in
Wikidata Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, are able to use under the CC0 public domain ...
. # Renderers (one per language), which convert the abstract Content into text in that particular language. These are, like Constructors, also preferably hosted in Wikifunctions. Each version of Wikipedia, once Abstract Wikipedia is deployed, could choose between three options: # Implicit integration with Abstract Wikipedia. There would be a special page called ''Special:Abstract'' that would display content automatically translated from Abstract Wikipedia into the local language. This content would be linkable and searchable. Furthermore, a new magic word LINK_TO_Q would be added in order to enable linking to Abstract Wikipedia content. # Explicit integration with Abstract Wikipedia. In this scenario, to create a new article, the editor would add a sitelink on Wikidata to a not-yet-existing page. This would create a "virtual article" in mainspace that would be pre-populated with content from Abstract Wikipedia automatically translated into the local language. This "virtual article" would have a URL similar to that of a real article, and would also be linkable and searchable just like a real article. # No integration with Abstract Wikipedia.


Example

As a preliminary example, content from Abstract Wikipedia could look like: Article( content: Instantiation( instance: (), class: Object_with_modifier_and_of( object: center, modifier: And_modifier( conjuncts: [cultural, commercial, financial ), of: () ) ), Ranking( subject: (), rank: 4, object: (), by: (), local_constraint: (), after: ) ] ) This would translate into English as "San Francisco is the cultural, commercial, and financial center of Northern California. It is the fourth-most populous city in California, after Los Angeles, San Diego and San Jose."


See also

*
Semantic Web The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable. To enable the encoding o ...
* IEML


References


External links

* * Project updates on Meta-Wiki {{Wikimedia Foundation Wikipedia Wikimedia projects