Various binary formats have been proposed as compact representations for XML (''Extensible Markup Language''). Using a binary XML format generally reduces the verbosity of XML documents, and with it the cost of parsing, but hinders the use of ordinary text editors and third-party tools to view and edit the document. There are several competing formats, but none has yet emerged as a ''de facto'' standard, although the World Wide Web Consortium adopted EXI as a Recommendation on 10 March 2011.
Binary XML is typically used in applications where the performance of standard XML is insufficient, but where the ability to convert the document to and from a form (XML) that ''is'' easily viewed and edited is still valued. Other advantages may include enabling random access and indexing of XML documents.
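The round trip between XML and a compact binary form can be sketched with a toy encoder. The example below is only an illustration and does not follow EXI, Fast Infoset, or any other real format: the opcodes, the byte layout, and the `encode`/`decode` names are all assumptions made up for this sketch. It stores each element name once in a string table and refers to it by index, which is one common way such formats reduce verbosity. Attributes, namespaces, and text that follows a child element are ignored.

```python
import struct
import xml.etree.ElementTree as ET

# Hypothetical opcodes for this toy format only (not taken from any standard).
OP_START, OP_TEXT, OP_END = 1, 2, 3


def encode(xml_text: str) -> bytes:
    """Encode element names and leading text of an XML string as toy binary."""
    root = ET.fromstring(xml_text)
    names = []            # string table: each tag name is stored once
    body = bytearray()

    def visit(elem):
        if elem.tag not in names:
            names.append(elem.tag)
        # Start-of-element record: opcode plus 16-bit index into the table.
        body.extend(struct.pack(">BH", OP_START, names.index(elem.tag)))
        text = (elem.text or "").strip()
        if text:
            raw = text.encode("utf-8")
            body.extend(struct.pack(">BH", OP_TEXT, len(raw)) + raw)
        for child in elem:
            visit(child)
        body.append(OP_END)

    visit(root)
    table = bytearray(struct.pack(">H", len(names)))
    for name in names:
        raw = name.encode("utf-8")
        table.extend(struct.pack(">H", len(raw)) + raw)
    return bytes(table + body)


def decode(data: bytes) -> str:
    """Rebuild the XML text from the toy binary form produced by encode()."""
    pos = 0
    (count,) = struct.unpack_from(">H", data, pos)
    pos += 2
    names = []
    for _ in range(count):
        (length,) = struct.unpack_from(">H", data, pos)
        pos += 2
        names.append(data[pos:pos + length].decode("utf-8"))
        pos += length

    stack = []            # currently open elements
    root = None
    while pos < len(data):
        op = data[pos]
        pos += 1
        if op == OP_START:
            (idx,) = struct.unpack_from(">H", data, pos)
            pos += 2
            elem = ET.Element(names[idx])
            if stack:
                stack[-1].append(elem)
            else:
                root = elem
            stack.append(elem)
        elif op == OP_TEXT:
            (length,) = struct.unpack_from(">H", data, pos)
            pos += 2
            stack[-1].text = data[pos:pos + length].decode("utf-8")
            pos += length
        elif op == OP_END:
            stack.pop()
        else:
            raise ValueError(f"unknown opcode {op}")
    return ET.tostring(root, encoding="unicode")


# Sample document invented for this illustration.
SAMPLE = (
    "<log>"
    "<entry><level>info</level><message>started</message></entry>"
    "<entry><level>warn</level><message>low disk</message></entry>"
    "</log>"
)

if __name__ == "__main__":
    packed = encode(SAMPLE)
    print(len(SAMPLE.encode("utf-8")), "bytes as XML,", len(packed), "bytes encoded")
    print(decode(packed) == SAMPLE)
```

Because repeated tag names cost only a few bytes each after their first occurrence, the saving grows with the number of repeated elements. The encoded bytes are not readable in a text editor, which is the trade-off described above.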
The major challenge for binary XML is to create a single, widely adopted standard. The International Organization for Standardization (ISO) and the International Telecommunication Union (ITU) published the Fast Infoset standard in 2007 and 2005, respectively. Another standard (ISO/IEC 23001-1), known as Binary MPEG format for XML (BiM), was standardized by the ISO in 2001. BiM is used by many ETSI standards for digital TV and mobile TV. The Open Geospatial Consortium provides a Binary XML Encoding Specification (currently a Best Practice Paper) optimized for geo-related data (GML), as well as a benchmark comparing the performance of Fast Infoset, EXI, BXML and deflate when encoding and decoding AIXM.
Alternatives to binary XML include using traditional file compression methods on XML documents (for example gzip), or using an existing standard such as ASN.1. Traditional compression methods, however, offer only the advantage of reduced file size, without the advantage of decreased parsing time or random access. ASN.1/PER forms the basis of Fast Infoset, which is one binary XML standard. There are also hybrid approaches (e.g., VTD-XML) that attach a small index file to an XML document to eliminate the overhead of parsing.
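The difference between plain compression and an index-based hybrid can be sketched in a few lines of Python. This is only a sketch under stated assumptions: the sample document and the function names `gzip_roundtrip`, `build_item_index`, and `read_item` are invented for illustration, and the index is a plain list of byte offsets, not the token-descriptor format that VTD-XML actually uses. The regular expression works only because the sample is flat and regular; it is not a general XML scanner.

```python
import gzip
import re
import xml.etree.ElementTree as ET

# Sample document invented for this illustration.
DOC = (
    "<catalog>"
    + "".join(f"<item><id>{i}</id><name>part-{i}</name></item>" for i in range(200))
    + "</catalog>"
).encode("utf-8")


def gzip_roundtrip(doc):
    """Compress the document, then read it back.

    Returns the compressed size and the parsed root. Reading still requires
    decompressing everything and parsing the whole document.
    """
    packed = gzip.compress(doc)
    root = ET.fromstring(gzip.decompress(packed))
    return len(packed), root


def build_item_index(doc):
    """Record (offset, length) for every <item> element in the raw bytes."""
    return [
        (m.start(), m.end() - m.start())
        for m in re.finditer(rb"<item>.*?</item>", doc)
    ]


def read_item(doc, index, n):
    """Parse only the n-th <item>, using the side index for random access."""
    offset, length = index[n]
    return ET.fromstring(doc[offset:offset + length])


if __name__ == "__main__":
    packed_size, root = gzip_roundtrip(DOC)
    print(len(DOC), "bytes raw,", packed_size, "bytes gzipped,", len(root), "items parsed")

    index = build_item_index(DOC)
    print(read_item(DOC, index, 150).findtext("name"))
```

The gzip path shrinks the file but leaves the reader with the same full parse as before. The index path leaves the XML text untouched and editable, and lets a reader jump straight to one element, at the cost of keeping the index in sync with the document.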
Binary XML Efforts
Projects and file formats related to the notion of binary XML include:
* BiM Standard, from the ISO, developed by the MPEG working group
* Fast Infoset, a standard published by ISO/IEC and ITU-T
* Efficient XML Interchange (EXI), a W3C Recommendation, based on work by Efficient XML from AgileDelta, Inc.
* Extensible Binary Meta Language (EBML) from Matroska
* WAP Binary XML (WBXML)
* .NET Binary Format: XML Data Structure from Microsoft; the implementation is included in .NET Framework 3.0 and later
* Binary Markup Language (BML)
Other projects that have functionality related to (or competing with) binary representations include:
* VTD-XML from XimpleWare and the VTD-XML project
* Protocol Buffers from Google
* Apache Thrift
* Data Distribution Service from the Object Management Group (OMG)
* Apache Avro for Big Data
* Android application package, which uses an undocumented binary XML format
See also
* BSON
* MessagePack
* CBOR
* Canonical S-expressions