There are two conventional sets of ASCII substitutions for the letters in the Esperanto alphabet that have diacritics, as well as a number of graphic work-arounds. The diacritics of Esperanto were designed with a French manual typewriter in mind, as French was the international language at the time Esperanto was developed. French typewriters have a dead key for the circumflex that can be used in combination with any other key. In handwritten Esperanto, the diacritics pose no problem. However, since the Esperanto letters with diacritics do not appear on standard computer keyboard layouts (French computer keyboards, unlike manual typewriters, typically assign the circumflex only to letters that bear it in French orthography), various alternative methods have been devised for inputting them or substituting them in type.

The original method, suggested by Zamenhof for people who did not have access to a French typewriter, was a set of digraphs in ''h'', now known as the "Zamenhof-system" or "h-system". With the rise of computer word processing, the so-called "x-system" has become equally popular. With the advent of Unicode and more easily customized computer keyboards, the need for such workarounds has lessened.


ASCII transliteration systems

There are two alternative orthographies in common use, which replace the circumflex letters with either ''h'' digraphs or ''x'' digraphs. Another system sometimes noted is a 'QWXY system'; this is a carry-over from an early Esperanto keyboard app, with which the Q, W, X and Y keys were assigned to four of the letters with diacritics, and the key sequences TX and DY to the remaining two. There are also graphic work-arounds such as approximating the circumflexes with carets.


H-system

The original method of working around the diacritics was developed by the creator of Esperanto himself, L. L. Zamenhof. He recommended using ''u'' in place of ''ŭ'', and digraphs with ''h'' for the circumflex letters. For example, ''ŝ'' is replaced by ''sh'', as in ''shanco'' for ''ŝanco'' (chance). Where proper orthography has ''sh'', the letters should be separated with an apostrophe or a hyphen, as in ''ses-hora'' (six-hour) or ''flug'haveno'' (airport). (Lenio Marobin, PY3DF (2008), 'Morsa kodo kaj Esperanto – rekolekto de artikoloj iam aperintaj', ILERA Bulteno n-o 70, p-o 04.)

Unfortunately, simplistic ASCII-based rules for sorting words fail badly when sorting h-digraphs, because lexicographically words in ''ĉ'' should follow all words in ''c'' and precede words in ''d''. The word ''ĉu'' should be placed after ''ci'', but sorted in the h-system, ''chu'' would appear before ''ci''.
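The sorting problem described above can be reproduced in a few lines. The following is a minimal Python sketch, not a reference implementation: the names `H_SYSTEM`, `to_h_system` and `esperanto_key` are hypothetical, the mapping table assumes the usual h-digraphs (with plain ''u'' for ''ŭ''), and the apostrophe or hyphen separation rule is deliberately left out.

```python
# Hypothetical table: each accented letter and its h-system spelling.
H_SYSTEM = {
    "ĉ": "ch", "ĝ": "gh", "ĥ": "hh", "ĵ": "jh", "ŝ": "sh", "ŭ": "u",
    "Ĉ": "Ch", "Ĝ": "Gh", "Ĥ": "Hh", "Ĵ": "Jh", "Ŝ": "Sh", "Ŭ": "U",
}

# The 28 letters of the Esperanto alphabet in dictionary order.
ESPERANTO_ALPHABET = "abcĉdefgĝhĥijĵklmnoprsŝtuŭvz"
RANK = {letter: i for i, letter in enumerate(ESPERANTO_ALPHABET)}


def to_h_system(text: str) -> str:
    """Replace accented letters with h-digraphs (separation rule not handled)."""
    return "".join(H_SYSTEM.get(ch, ch) for ch in text)


def esperanto_key(word: str) -> list:
    """Sort key that ranks letters by Esperanto alphabet position.

    Characters outside the alphabet are ranked after all letters.
    """
    return [RANK.get(ch, len(RANK)) for ch in word.lower()]


words = ["ĉu", "ci"]
print(sorted(words, key=esperanto_key))     # dictionary order
print(sorted(to_h_system(w) for w in words))  # plain ASCII order of h-digraphs
```

Sorting with `esperanto_key` places ''ci'' before ''ĉu'', whereas a plain string sort of the h-system forms places ''chu'' before ''ci''.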


X-system

A more recent system for typing in Esperanto is the so-called "x-system", which uses ''x'' instead of ''h'' for the digraphs, including ''ux'' for ''ŭ''. For example, ''ŝ'' is represented by ''sx'', as in ''sxi'' for ''ŝi'' and ''sxanco'' for ''ŝanco''.

X-digraphs solve these problems of the h-system:
# ''x'' is not a letter in the Esperanto alphabet, so its use introduces no ambiguity.
# The digraphs are now nearly always correctly sorted after their single-letter counterparts; for example, ''sxanco'' (for ''ŝanco'') comes after ''super'', while h-system ''shanco'' comes before it.
The sorting only fails in the infrequent case of a ''z'' in compound or unassimilated words; for example, the compound word ''reuzi'' ("to reuse") would be sorted after ''reuxmatismo'' (for ''reŭmatismo'', "rheumatism").

The x-system has become as popular as the h-system, but it has long been perceived as being contrary to the Fundamento de Esperanto. However, in its 2007 decision, the Akademio de Esperanto issued general permission for the use of surrogate systems for the representation of the diacritical letters of Esperanto, under the condition that this is done only "when the circumstances do not permit the use of proper diacritics, and when due to a special need the h-system fixed in the Fundamento is not convenient." This provision covers situations such as using the x-system as a technical solution (to store data in plain ASCII) yet still displaying proper Unicode characters to the end user.

A practical problem of digraph substitution that the x-system does not completely resolve is the complication of bilingual texts. ''ux'' for ''ŭ'' is especially problematic when used alongside French text, because many French words end in ''aux'' or ''eux''. ''Aux,'' for example, is a word in both languages (''aŭ'' in Esperanto). Any automatic conversion of the text will alter the French words as well as the Esperanto. A few English words like "auxiliary" and "Euxine" can also suffer from such search-and-replace routines. One common solution, such as the one used in Wikipedia's MediaWiki software since the intervention of Brion Vibber in January 2002, is to use ''xx'' to escape the ''ux'' to ''ŭ'' conversion, e.g. "auxx" produces "aux". A few people have also proposed using "vx" instead of "ux" for ''ŭ'' to resolve this problem, but this variant of the system is rarely used.
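The conversion and its escape rule can be illustrated with a short Python sketch. This is an assumption-laden illustration and not MediaWiki's actual code: the names `X_SYSTEM` and `from_x_system` are hypothetical, and the rule implemented is simply "letter plus one ''x'' becomes the accented letter, letter plus two ''x'' becomes the letter followed by a literal ''x''".

```python
import re

# Hypothetical table: base letter of each x-digraph and its accented form.
X_SYSTEM = {"c": "ĉ", "g": "ĝ", "h": "ĥ", "j": "ĵ", "s": "ŝ", "u": "ŭ"}

# A convertible letter followed by one or two x's (either case).
_X_DIGRAPH = re.compile(r"([cghjsu])(xx?)", re.IGNORECASE)


def from_x_system(text: str) -> str:
    """Convert x-digraphs to accented letters; a doubled x escapes the conversion."""

    def replace(match):
        letter, xs = match.group(1), match.group(2)
        if len(xs) == 2:
            # Escaped: keep the base letter and emit a single literal x.
            return letter + "x"
        accented = X_SYSTEM[letter.lower()]
        return accented.upper() if letter.isupper() else accented

    return _X_DIGRAPH.sub(replace, text)


print(from_x_system("sxanco"))     # Esperanto word, converted
print(from_x_system("auxx"))       # escaped, stays a French "aux"
print(from_x_system("auxiliary"))  # unescaped English word gets damaged
```

The last call shows the bilingual-text problem: without the escape, the ''ux'' inside "auxiliary" is converted just like an Esperanto digraph.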


Graphic work-arounds

There are several ''ad hoc'' workarounds used in email or on the internet, where the proper letters are often not supported, as seen also in non-ASCII orthographies such as German. These "slipped-hat" conventions make use of the caret (^) or greater than sign (>) to represent the circumflex. For example, ''ŝanco'' may be written ''^sanco, s^anco,'' or ''s>anco.'' However, they have generally fallen out of favor.

Before the internet age, it had been proposed to shift the caret onto the following vowel, since French circumflex vowels are supported in printing houses. That is, one would write ''ehôsângôj cîujâude'' for ''eĥoŝanĝoj ĉiuĵaŭde.'' (''Plena Analiza Gramatiko,'' end of section 4: ''Cê la sângôj okazintaj en la cî-landa vojkodo, cîuj automobilistoj zorge informigû pri la jûsaj instrukcioj.'') However, this proposal has never been adopted.
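The three "slipped-hat" spellings of ''ŝanco'' shown above can be normalized with one regular expression. The Python sketch below is only illustrative: `CIRCUMFLEX` and `from_slipped_hat` are hypothetical names, only the five circumflex letters are handled (''ŭ'' carries a breve and is left out), and the pattern would also misfire on unrelated uses of `^` or `>` next to those letters.

```python
import re

# Hypothetical table of the five circumflex letters.
CIRCUMFLEX = {"c": "ĉ", "g": "ĝ", "h": "ĥ", "j": "ĵ", "s": "ŝ"}

# Either a caret before the letter, or a caret / greater-than sign after it.
_SLIPPED_HAT = re.compile(r"\^([cghjs])|([cghjs])[\^>]", re.IGNORECASE)


def from_slipped_hat(text: str) -> str:
    """Replace ^s, s^ and s> style spellings with the accented letter."""

    def replace(match):
        letter = match.group(1) or match.group(2)
        accented = CIRCUMFLEX[letter.lower()]
        return accented.upper() if letter.isupper() else accented

    return _SLIPPED_HAT.sub(replace, text)


for spelling in ("^sanco", "s^anco", "s>anco"):
    print(spelling, "->", from_slipped_hat(spelling))
```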


See also

* Inputting Esperanto


External links


* eoconv – a tool to convert text between various orthographic substitutions