An infobox is a digital or physical

table Table may refer to: * Table (database), how the table data arrangement is used within the databases * Table (furniture), a piece of furniture with a flat surface and one or more legs * Table (information), a data arrangement with rows and column ...

used to collect and present a subset of information about its subject, such as a

document A document is a writing, written, drawing, drawn, presented, or memorialized representation of thought, often the manifestation of nonfiction, non-fictional, as well as fictional, content. The word originates from the Latin ', which denotes ...

. It is a

structured document A structured document is an electronic document where some method of markup language, markup is used to identify the whole and parts of the document as having various meanings beyond their formatting. For example, a structured document might identi ...

containing a set of attribute–value pairs, and in

Wikipedia Wikipedia is a free content, free Online content, online encyclopedia that is written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and the wiki software MediaWiki. Founded by Jimmy Wales and La ...

represents a summary of information about the subject of an article. In this way, they are comparable to data tables in some aspects. When presented within the larger document it summarizes, an infobox is often presented in a sidebar format. An infobox may be implemented in another document by transcluding it into that document and specifying some or all of the attribute–value pairs associated with that infobox, known as

parameterization In mathematics, and more specifically in geometry, parametrization (or parameterization; also parameterisation, parametrisation) is the process of finding parametric equations of a curve, a surface (mathematics), surface, or, more generally, a ma ...

Wikipedia

An infobox may be used to summarize the information of an article on

. They are used on similar articles to ensure consistency of presentation by using a common format. Originally, infoboxes (and templates in general) were used for

page layout In graphic design, page layout is the arrangement of visual elements on a page. It generally involves organizational principles of composition to achieve specific communication objectives. The high-level page layout involves deciding on the ...

purposes. An infobox may be transcluded into an article by specifying the value for some or all of its

parameters A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...

. The parameter name used must be the same as that specified in the infobox template, but any value may be associated to it. The name is delimited from the value by an

equals sign The equals sign (British English) or equal sign (American English), also known as the equality sign, is the mathematical symbol , which is used to indicate equality. In an equation it is placed between two expressions that have the same valu ...

. The parameter name may be regarded as an attribute of the article's subject. On Wikipedia, an infobox is transcluded into an article by enclosing its name and attribute–value pairs within a double set of braces. The

MediaWiki MediaWiki is free and open-source wiki software originally developed by Magnus Manske for use on Wikipedia on January 25, 2002, and further improved by Lee Daniel Crocker,mailarchive:wikipedia-l/2001-August/000382.html, Magnus Manske's announc ...

software on which Wikipedia operates then parses the document, for which the infobox and other templates are processed by a

template processor A template processor (also known as a template engine or template parser) is software designed to combine ''template''s with data (defined by a data model) to produce resulting documents or programs. The language that the templates are writte ...

. This is a template engine which produces a

web document A web page (or webpage) is a Web document that is accessed in a web browser. A website typically consists of many web pages linked together under a common domain name. The term "web page" is therefore a metaphor of paper pages bound together in ...

and a style sheet used for presentation of the document. This enables the design of the infobox to be separated from the content it manipulates; that is, the design of the template may be updated without affecting the information within it, and the new design will automatically propagate to all articles that transclude the infobox. Usually, infoboxes are formatted to appear in the top-right corner of a Wikipedia article in the desktop view, or at the top in the mobile view. Placement of an infobox within the

wikitext A wiki ( ) is a form of hypertext publication on the internet which is collaboratively edited and managed by its audience directly through a web browser. A typical wiki contains multiple pages that can either be edited by the public or li ...

of an article is important for accessibility. A

best practice A best practice is a method or technique that has been generally accepted as superior to alternatives because it tends to produce superior results. Best practices are used to achieve quality as an alternative to mandatory standards. Best practice ...

is to place them following ''disambiguation'' templates (those that direct readers to articles about topics with similar names) and maintenance templates (such as that marking an article as unreferenced), but before all other content. Baeza-Yates and King say that some editors find templates such as infoboxes complicated, as the template may hide text about a property or resource that the editor wishes to change; this is exacerbated by chained templates, that is templates transcluded within other templates. As of August 2009, English Wikipedia used about infobox templates that collectively used more than attributes. Since then, many have been merged, to reduce redundancy. As of June 2013, there were at least transclusions of the parent Infobox template, used by some, but not all, infoboxes, on articles. The name of an Infobox is typically "Infobox enre; however, widely used infoboxes may be assigned shorter names, such as "taxobox" for taxonomy.

Machine learning

About 44.2% of Wikipedia articles contained an infobox in 2008, and about 33% in 2010. Automated semantic knowledge extraction using

machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...

algorithms is used to "extract machine-processable information at a relatively low complexity cost". However, the low coverage makes it more difficult, though this can be partially overcome by complementing article data with that in

categories Category, plural categories, may refer to: General uses *Classification, the general act of allocating things to classes/categories Philosophy *Category of being * ''Categories'' (Aristotle) *Category (Kant) *Categories (Peirce) *Category (Vais ...

in which the article is included. The

French Wikipedia The French Wikipedia () is the French-language edition of Wikipedia, the free online encyclopedia. This edition was started on 23 March 2001, two months after the official creation of Wikipedia. It has :fr:Special:Statistics, encyclopedia artic ...

initiated the project ''Infobox Version 2'' in May 2011.The project is hosted on the

page Infobox/V2. Knowledge obtained by machine learning can be used to improve an article, such as by using automated software suggestions to editors for adding infobox data. The iPopulator project created a system to add a value to an article's infobox parameter via an automated parsing of the text of that article.

DBpedia DBpedia (from "DB" for "database") is a project aiming to extract structured content from the information created in the Wikipedia project. This structured information is made available on the World Wide Web using OpenLink Virtuoso. DBpedia a ...

uses

structured content Structured content is information or content that is organized in a predictable way and is usually classified with metadata. XML is a common storage format, but structured content can also be stored in other standard or proprietary formats. Whe ...

extracted from infoboxes by machine learning algorithms to create a resource of

linked data In computing, linked data is structured data which is interlinked with other data so it becomes more useful through semantic queries. It builds upon standard Web technologies such as HTTP, RDF and URIs, but rather than using them to serve web ...

in the

Semantic Web The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable. To enable the encoding o ...

; it has been described by

Tim Berners-Lee Sir Timothy John Berners-Lee (born 8 June 1955), also known as TimBL, is an English computer scientist best known as the inventor of the World Wide Web, the HTML markup language, the URL system, and HTTP. He is a professorial research fellow a ...

as "one of the more famous" components of the linked data project. Machine extraction creates a triple consisting of a subject, predicate or relation, and object. Each attribute-value pair of the infobox is used to create an RDF statement using an

ontology Ontology is the philosophical study of existence, being. It is traditionally understood as the subdiscipline of metaphysics focused on the most general features of reality. As one of the most fundamental concepts, being encompasses all of realit ...

. This is facilated by the narrower gap between Wikipedia and an ontology than exists between unstructured or free text and an ontology. The semantic relationship between the subject and object is established by the predicate. In the example infobox, the triple ("crostata", type, "tart") indicates that a

crostata ''Crostata'' () is an Italian baked tart or pie. The earliest known use of ''crostata'' in its modern sense can be traced to the cookbooks ''Libro de Arte Coquinaria'' (''Book of the Art of Cooking'') by Martino da Como, published , and ''C ...

is a type of

tart A tart is a baked dish consisting of a filling over a pastry base with an open top not covered with pastry. The pastry is usually shortcrust pastry; the filling may be sweet or savoury, though modern tarts are usually fruit-based, sometimes with ...

. The article's topic is used as the subject, the parameter name is used as the predicate, and the parameter's value as the object. Each type of infobox is mapped to an ontology class, and each property (parameter) within an infobox is mapped to an ontology property. These mappings are used when parsing a Wikipedia article to extract data.

Metadata

Link to infobox article posted into teams

Presenting the basic facts of an article within an infobox allows the facts to be presented in a machine-friendly way, allowing extra functionality such as when a link to a Wikipedia article is pasted into a compatible program; instead of just the link itself being posted, other information such as the article's lead image is also posted.

Citations

Works cited

* * * * * * *

Wikipedia

Machine learning

Metadata

Citations

Works cited

Further reading