The Common Locale Data Repository (CLDR) is a project of the Unicode Consortium to provide locale data in XML format for use in computer applications. CLDR contains locale-specific information that an operating system will typically provide to applications.
CLDR is written in the Locale Data Markup Language (LDML).
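As a rough illustration of what LDML data looks like to a consuming program, the following Python sketch reads translated language and territory names from a small XML fragment. The fragment is hand-written for this example in the style of an LDML locale file rather than copied from a CLDR release, and the helper name `display_names` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Hand-written fragment in the style of an LDML locale file for English.
# It is an illustrative assumption, not an excerpt from a CLDR release.
LDML_SAMPLE = """\
<ldml>
  <identity>
    <language type="en"/>
  </identity>
  <localeDisplayNames>
    <languages>
      <language type="de">German</language>
      <language type="fr">French</language>
    </languages>
    <territories>
      <territory type="DE">Germany</territory>
      <territory type="FR">France</territory>
    </territories>
  </localeDisplayNames>
</ldml>
"""


def display_names(ldml_text, group, item):
    """Return {code: translated name} for one display-name group.

    `group` and `item` are the element names to walk, for example
    "languages" and "language".
    """
    root = ET.fromstring(ldml_text)
    path = f"localeDisplayNames/{group}/{item}"
    return {el.get("type"): el.text for el in root.findall(path)}


language_names = display_names(LDML_SAMPLE, "languages", "language")
territory_names = display_names(LDML_SAMPLE, "territories", "territory")
print(language_names)
print(territory_names)
```

Real locale files are much larger and inherit values from parent locales, which this sketch does not attempt to model.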
CLDR is maintained by a technical committee that includes employees from IBM, Apple, Google, Microsoft, and some government-based organizations. The committee is chaired by John Emmons, of IBM; Mark Davis, of Google, is vice-chair.
Details
Among the types of data that CLDR includes are the following:
* Translations for language names
* Translations for territory and country names
* Translations for currency names, including singular/plural modifications
* Translations for weekday, month, era, and period of day, in full and abbreviated forms
* Translations for time zones and example cities (or similar) for time zones
* Translations for calendar fields
* Patterns for formatting/parsing dates or times of day
* Exemplar sets of characters used for writing the language
* Patterns for formatting/parsing numbers
* Rules for language-adapted collation
* Rules for spelling out numbers as words
* Rules for formatting numbers in traditional numeral systems (such as Roman and Armenian numerals)
* Rules for transliteration between scripts, much of it based on BGN/PCGN romanization
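One item in the list above, currency names with singular/plural modifications, can be sketched as follows. The XML fragment is again hand-written in the style of LDML, the function names are hypothetical, and the plural rule is a deliberate simplification that only handles English integers. CLDR's own plural rules are more detailed.

```python
import xml.etree.ElementTree as ET

# Hand-written fragment in the style of LDML currency data (an assumption
# for illustration, not an excerpt from a CLDR release).
CURRENCY_SAMPLE = """\
<ldml>
  <numbers>
    <currencies>
      <currency type="USD">
        <displayName>US Dollar</displayName>
        <displayName count="one">US dollar</displayName>
        <displayName count="other">US dollars</displayName>
      </currency>
    </currencies>
  </numbers>
</ldml>
"""


def english_plural_category(n):
    """Simplified assumption: for English integers, only 1 is 'one'."""
    return "one" if n == 1 else "other"


def currency_name(ldml_text, code, amount):
    """Pick the display name of currency `code` that fits `amount`."""
    root = ET.fromstring(ldml_text)
    for cur in root.findall("numbers/currencies/currency"):
        if cur.get("type") != code:
            continue
        # Map plural category -> name; the uncounted name is keyed by None.
        names = {dn.get("count"): dn.text for dn in cur.findall("displayName")}
        category = english_plural_category(amount)
        # Fall back from the exact category to "other", then the plain name.
        return names.get(category) or names.get("other") or names.get(None)
    raise KeyError(code)


one_dollar = f"1 {currency_name(CURRENCY_SAMPLE, 'USD', 1)}"
five_dollars = f"5 {currency_name(CURRENCY_SAMPLE, 'USD', 5)}"
print(one_dollar)
print(five_dollars)
```

In practice, applications usually get this behavior through a library built on CLDR data, such as ICU, rather than by reading the XML directly.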
The information is currently used in International Components for Unicode, Apple's macOS, LibreOffice, MediaWiki, and IBM's AIX, among other applications and operating systems.
CLDR overlaps somewhat with ISO/IEC 15897 (POSIX locales). POSIX locale information can be derived from CLDR by using some of CLDR's conversion tools.
CLDR covers more than 400 languages.
External links
Common Locale Data Repository, the informational webpage of the CLDR project
Locale Data Markup Language