XDXF (XML Dictionary eXchange Format) is a project to unite all existing open dictionaries and provide both users and developers with a universal XML-based format, convertible from and to other popular formats like Mova, PtkDic, and StarDict.
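To illustrate what an XML-based dictionary format looks like in practice, the following is a minimal sketch of reading entries with Python's standard library. It assumes the simple "visual" layout, in which each article is an `<ar>` element whose headword sits in a `<k>` child; the sample data, the `ENG` language codes, and the function names are hypothetical, and real XDXF files may use additional elements that this sketch ignores.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data. The element names (<xdxf>, <full_name>,
# <description>, <ar>, <k>) are assumed from the simple "visual" layout.
SAMPLE_XDXF = """<xdxf lang_from="ENG" lang_to="ENG" format="visual">
  <full_name>Sample dictionary</full_name>
  <description>Two made-up entries for illustration.</description>
  <ar><k>apple</k>A round fruit.</ar>
  <ar><k>book</k>A set of printed pages.</ar>
</xdxf>
"""


def article_text(ar):
    """Return the text of an article, leaving out its <k> headwords."""
    parts = [ar.text or ""]
    for child in ar:
        if child.tag != "k":
            # Keep the text of nested markup, dropping the tags themselves.
            parts.append("".join(child.itertext()))
        # Text that follows a child element is stored in its tail.
        parts.append(child.tail or "")
    return "".join(parts).strip()


def read_articles(xdxf_text):
    """Map every headword to the text of the article it belongs to."""
    root = ET.fromstring(xdxf_text)
    entries = {}
    for ar in root.iter("ar"):
        definition = article_text(ar)
        for key in ar.findall("k"):
            headword = "".join(key.itertext()).strip()
            entries[headword] = definition
    return entries


if __name__ == "__main__":
    for headword, definition in read_articles(SAMPLE_XDXF).items():
        print(f"{headword}: {definition}")
```

Because the function searches for `<ar>` elements at any depth, it would also pick up articles that are wrapped in a container element, although any metadata specific to such layouts is not interpreted.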
Available dictionaries
As of December 15, 2006, the XDXF project repository contained 615 dictionaries, totaling 936,189,613 bytes in compressed form and 24,804,355 articles.
Software
GUIs
The XDXF file format is used by Alpus, SimpleDict, and GoldenDict. StarDict also has basic support for XDXF, starting with version 2.4.6.
Converters
There are numerous converters, including pyglossary, xdxf2slob, and others. Initially, the project had its own converter, but it was deprecated.
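As a rough idea of what a conversion into XDXF involves, the sketch below builds a small XDXF document from tab-separated "headword, definition" lines. It is not how pyglossary or xdxf2slob work; the function name, the default `ENG` language codes, and the use of the simple `<ar>`/`<k>` layout are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def tab_lines_to_xdxf(lines, full_name="Untitled", lang_from="ENG", lang_to="ENG"):
    """Build an XDXF-style document from 'headword<TAB>definition' lines.

    Hypothetical helper: the element and attribute names are assumed
    from the simple "visual" layout.
    """
    root = ET.Element(
        "xdxf",
        {"lang_from": lang_from, "lang_to": lang_to, "format": "visual"},
    )
    ET.SubElement(root, "full_name").text = full_name

    for line in lines:
        line = line.rstrip("\n")
        if "\t" not in line:
            continue  # skip blank or malformed lines
        headword, definition = line.split("\t", 1)
        ar = ET.SubElement(root, "ar")
        key = ET.SubElement(ar, "k")
        key.text = headword
        # The definition follows the closing </k> tag, so it is the tail.
        key.tail = definition

    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    sample = ["apple\tA round fruit.\n", "book\tA set of printed pages.\n"]
    print(tab_lines_to_xdxf(sample, full_name="Sample dictionary"))
```

ElementTree escapes characters such as `&` and `<` when serializing, so definitions containing them still produce well-formed XML.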
Alternatives
Many languages serve a similar purpose, e.g., the Lexical Markup Framework (XML and other serializations), OntoLex (RDF), DICT (text format), or the dicML markup languages. As for dicML and XDXF, neither concept is specified completely. For example, XDXF lacks elements to annotate possible hyphenations, while the recent working draft of dicML does not include elements to describe the etymology of words.
External links
Project site
XDXF dictionaries repository
XDXF Standard
Easy XDXF Dictionary – Free dictionary for iPhone