ISO 14651
   HOME

TheInfoList



OR:

'
ISO/IEC 14651:2016
'', ''Information technology -- International string ordering and comparison -- Method for comparing character strings and description of the common template tailorable ordering'', is an ISO/
IEC The International Electrotechnical Commission (IEC; in French: ''Commission électrotechnique internationale'') is an international standards organization that prepares and publishes international standards for all electrical, electronic and r ...
standard specifying an
algorithm In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for performing ...
that can be used when comparing two strings. This comparison can be used when collating a set of strings. The standard also specifies a datafile specifying the comparison order, the ''Common Tailorable Template'', CTT. The comparison order is supposed to be tailored for different languages (hence the CTT is regarded as a ''template'' and not a default, though the empty tailoring, not changing any weighting, is appropriate in many cases), since different languages have incompatible ordering requirements. One such tailoring is
European ordering rules The European ordering rules (EOR / EN 13710), define an ordering for strings written in languages that are written with the Latin, Greek and Cyrillic alphabets. The standard covers languages used by the European Union, the European Free Trade A ...
(EOR), which in turn is supposed to be tailored for different European languages. The ''Common Tailorable Template'' (''CTT'') datafile of this ISO/IEC standard is aligned with the ''Default Unicode Collation Entity Table'' (DUCET) datafile of the
Unicode collation algorithm The Unicode collation algorithm (UCA) is an algorithm defined in Unicode Technical Report #10, which is a customizable method to produce binary keys from strings representing text in any writing system and language that can be represented with Uni ...
(UCA) specified in ''Unicode Technical Standard #10''. This is the fourth edition of the standard and was published on 2016-02-15, corrected on 2016-05-01 and covers up to and including Unicode 8.0. One additional amendment '
Amd.1:2017
'' was published in September 2017 and covers up to and including Unicode 9.0.


See also

*
Collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office filin ...
*
European ordering rules The European ordering rules (EOR / EN 13710), define an ordering for strings written in languages that are written with the Latin, Greek and Cyrillic alphabets. The standard covers languages used by the European Union, the European Free Trade A ...
*
ISO/IEC JTC 1/SC 2 ISO/IEC JTC 1/SC 2 Coded character sets is a standardization subcommittee of the Joint Technical Committee ISO/IEC JTC 1 of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC), that devel ...
*
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...


External links and references


ISO site, "ISO/IEC 14651:2016"
ISO/IEC 14651:2016 and Amd.1:2017 ar



String collation algorithms #14651 Unicode algorithms Collation {{standard-stub