CLDR Algorithm
   HOME

TheInfoList



OR:

The Common Locale Data Repository (CLDR) is a project of the
Unicode Consortium The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California, U.S. Its primary purpose is to maintain and publish the Unicode Standard which was developed with the in ...
to provide locale data in
XML Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing data. It defines a set of rules for encoding electronic document, documents in a format that is both human-readable and Machine-r ...
format for use in computer applications. CLDR contains locale-specific information that an
operating system An operating system (OS) is system software that manages computer hardware and software resources, and provides common daemon (computing), services for computer programs. Time-sharing operating systems scheduler (computing), schedule tasks for ...
will typically provide to applications. CLDR is written in the Locale Data Markup Language (LDML). CLDR is maintained by a technical committee which includes employees from IBM, Apple, Google, Microsoft, and some government-based organizations. The committee is chaired by John Emmons, of IBM; Mark Davis, of Google, is vice-chair.


Details

Among the types of data that CLDR includes are the following: * Translations for language names * Translations for territory and country names * Translations for currency names, including singular/plural modifications * Translations for
weekday The weekdays and weekend are the complementary parts of the week, devoted to labour and rest, respectively. The legal weekdays (British English), or workweek (American English), is the part of the seven-day week devoted to working. In most o ...
,
month A month is a unit of time, used with calendars, that is approximately as long as a natural phase cycle of the Moon; the words ''month'' and ''Moon'' are cognates. The traditional concept of months arose with the cycle of Moon phases; such lunar mo ...
,
era An era is a span of time. Era or ERA may also refer to: * Era (geology), a subdivision of geologic time * Calendar era Education * Academy of European Law (German: '), an international law school * ERA School, in Melbourne, Australia * E ...
, period of day, in full and abbreviated forms * Translations for time zones and example cities (or similar) for time zones * Translations for calendar fields * Patterns for formatting/parsing dates or times of day * Exemplar sets of characters used for writing the language * Patterns for formatting/parsing numbers * Rules for language-adapted
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fi ...
* Rules for spelling out numbers as words * Rules for formatting numbers in traditional numeral systems (such as
Roman Roman or Romans most often refers to: *Rome, the capital city of Italy *Ancient Rome, Roman civilization from 8th century BC to 5th century AD *Roman people, the people of Roman civilization *Epistle to the Romans, shortened to Romans, a letter w ...
and
Armenian numerals Armenian numerals form a historic numeral system created using the majuscules (uppercase letters) of the Armenian alphabet. There was no notation for zero in the old system, and the numeric values for individual letters were added together ...
) * Rules for
transliteration Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus '' trans-'' + '' liter-'') in predictable ways, such as Greek → and → the digraph , Cyrillic → , Armenian → or L ...
between scripts, much of it based on
BGN/PCGN romanization BGN/PCGN romanization are the systems for romanization and Roman-script spelling conventions adopted by the United States Board on Geographic Names (BGN) and the Permanent Committee on Geographical Names for British Official Use (PCGN). The syste ...
The information is currently used in
International Components for Unicode International Components for Unicode (ICU) is an open-source project of mature C/ C++ and Java libraries for Unicode support, software internationalization, and software globalization. ICU is widely portable to many operating systems and envir ...
,
Apple An apple is a round, edible fruit produced by an apple tree (''Malus'' spp.). Fruit trees of the orchard or domestic apple (''Malus domestica''), the most widely grown in the genus, are agriculture, cultivated worldwide. The tree originated ...
's
macOS macOS, previously OS X and originally Mac OS X, is a Unix, Unix-based operating system developed and marketed by Apple Inc., Apple since 2001. It is the current operating system for Apple's Mac (computer), Mac computers. With ...
,
LibreOffice LibreOffice () is a free and open-source office productivity software suite developed by The Document Foundation (TDF). It was created in 2010 as a fork of OpenOffice.org, itself a successor to StarOffice. The suite includes applications ...
,
MediaWiki MediaWiki is free and open-source wiki software originally developed by Magnus Manske for use on Wikipedia on January 25, 2002, and further improved by Lee Daniel Crocker,mailarchive:wikipedia-l/2001-August/000382.html, Magnus Manske's announc ...
, and
IBM International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
's
AIX Aix or AIX may refer to: Computing * AIX, a line of IBM computer operating systems *Alternate index, for an IBM Virtual Storage Access Method key-sequenced data set * Athens Internet Exchange, a European Internet exchange point Places Belg ...
, among other applications and operating systems. CLDR overlaps somewhat with
ISO/IEC 15897 ISO/IEC 15897 (''Procedures for the registration of cultural elements'') is an ISO/ IEC standard for the registration of new POSIX locales and POSIX charmaps. This standard is the result of a "fast-track" adoption of CEN ENV 12005, a European pre ...
(
POSIX The Portable Operating System Interface (POSIX; ) is a family of standards specified by the IEEE Computer Society for maintaining compatibility between operating systems. POSIX defines application programming interfaces (APIs), along with comm ...
locales). POSIX locale information can be derived from CLDR by using some of CLDR's conversion tools. The CLDR covers 400+ languages.


References


External links


Common Locale Data Repository
the informational webpage of the CLDR project
Locale Data Markup Language
{{Unicode navigation Unicode Date and time representation Internationalization and localization