Details
In 8-bit TeX engines the font encoding has to match the encoding of the hyphenation patterns, which is where this encoding is most commonly used. In LaTeX one can switch to this encoding with \usepackage[T1]{fontenc}, while in ConTeXt MkII it is already the default encoding. In modern engines such as XeTeX and LuaTeX, Unicode is fully supported and the 8-bit font encodings are obsolete.
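As a rough sketch, a preamble for an 8-bit engine such as pdfLaTeX might select the encoding as shown below. It assumes the encoding goes by its LaTeX name T1. The babel language option and the sample word are illustrative choices only and do not come from the text above.

```latex
% Minimal sketch for an 8-bit engine (pdfLaTeX).
% With XeLaTeX or LuaLaTeX this font-encoding setup is not needed.
\documentclass{article}
\usepackage[T1]{fontenc}     % select the T1 font encoding for output
\usepackage[utf8]{inputenc}  % read the source file as UTF-8
\usepackage[ngerman]{babel}  % hyphenation patterns; German is only an example
\begin{document}
% With T1, accented letters are single glyphs, so a word such as this
% one can be hyphenated by the loaded patterns.
Größenverhältnisse
\end{document}
```

The font encoding is loaded before babel here so that the hyphenation patterns and the font agree on the same 8-bit slots, which is the matching requirement described above.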
Character set
Notes
* Hexadecimal values under the characters in the table are the Unicode character codes.
* The first 12 characters are often used as combining characters.
Supported languages
The encoding supports most European languages written in the Latin alphabet. Notable exceptions are:
* Esperanto (using IL3)
* Latvian and Lithuanian (using L7X)
* Welsh
Languages with slightly suboptimal support include:
* Galician, Portuguese and Spanish – due to the lack of the characters ª and º, which are not simply superscript versions of lowercase "a" and "o" (true superscripts are thinner) and are often underlined
* Croatian, Bosnian and Serbian – due to the shared use of the slot for Đ
* Turkish – due to dotless i having different uppercase and lowercase pairings than in other languages
References
External links
Encoding file
Overview of encodings in LaTeX