Esperanto
Esperanto (, ) is the world's most widely spoken Constructed language, constructed international auxiliary language. Created by L. L. Zamenhof in 1887 to be 'the International Language' (), it is intended to be a universal second language for ...
is written in a
Latin-script alphabet
A Latin-script alphabet (Latin alphabet or Roman alphabet) is an alphabet that uses Letter (alphabet), letters of the Latin script. The 21-letter archaic Latin alphabet and the 23-letter classical Latin alphabet belong to the oldest of this gr ...
of twenty-eight letters, with upper and lower case. This is supplemented by
punctuation
Punctuation marks are marks indicating how a piece of writing, written text should be read (silently or aloud) and, consequently, understood. The oldest known examples of punctuation marks were found in the Mesha Stele from the 9th century BC, c ...
marks and by various
logograms, such as the
digits 0–9, currency signs such as $ € ¥ £ ₷, and
mathematical symbol
A mathematical symbol is a figure or a combination of figures that is used to represent a mathematical object, an action on mathematical objects, a relation between mathematical objects, or for structuring the other symbols that occur in a formula ...
s. The creator of Esperanto,
L. L. Zamenhof, declared a principle of "one letter, one sound", though this is a general rather than strict guideline.
[Kalocsay & Waringhien, ''Plena analiza gramatiko'', § 17]
Twenty-two of the letters are identical in form to letters of the English alphabet (''q, w, x,'' and ''y'' being omitted). The remaining six have
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
al marks: ''
ĉ,
ĝ,
ĥ,
ĵ,
ŝ,'' and ''
ŭ'' – that is, ''c, g, h, j,'' and ''s
circumflex
The circumflex () is a diacritic in the Latin and Greek scripts that is also used in the written forms of many languages and in various romanization and transcription schemes. It received its English name from "bent around"a translation of ...
,'' and ''u
breve
A breve ( , less often , grammatical gender, neuter form of the Latin "short, brief") is the diacritic mark , shaped like the bottom half of a circle. As used in Ancient Greek, it is also called , . It resembles the caron (, the wedge or in ...
.''
Standard alphabet
Standard Esperanto orthography uses the
Latin script
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
.
Sound values
The letters have approximately the sound values of the
IPA, with the exception of ''c'' and the letters with diacritics: ''
ĉ'' , ''ĝ'' , ''ĥ'' , ''ĵ'' , ''ŝ'' , ''
ŭ'' . ''J'' transcribes two sounds, consonantal (the English ''y'' sound, as in ''you'') and vocalic .
There is a nearly one-to-one correspondence of letter to sound. Diphthongs such as ''aŭ'' and ''eŭ'' require two letters. has dual consonantal and vocalic use, equivalent to both consonantal and vocalic . For those who consider to be a phoneme, Esperanto contains one consonantal
digraph as well, . Allophony has been noted in the vowels (for example in open vs
closed syllable
A syllable is a basic unit of organization within a sequence of speech sounds, such as within a word, typically defined by linguists as a ''nucleus'' (most often a vowel) with optional sounds before or after that nucleus (''margins'', which are ...
s) and is found in the place assimilation of and , the latter of which for example is frequently pronounced before ''g'' and ''k'' for speakers of language backgrounds that do the same.
Phonemic change is perhaps limited to voicing assimilation of
obstruent
An obstruent ( ) is a speech sound such as , , or that is formed by ''obstructing'' airflow. Obstruents contrast with sonorants, which have no such obstruction and so resonate. All obstruents are consonants, but sonorants include vowels as well ...
s, as in the sequence ''kz'' of ''ekzemplo'', ('(an) example') which is 'inevitably' pronounced in normal speech, though purists try to pronounce it . In Zamenhof's writing, obstruents with different voicing such as ''k'' and ''z'' do not meet in compound
lexical word
In grammar, a part of speech or part-of-speech (Abbreviation, abbreviated as POS or PoS, also known as word class or grammatical category) is a category of words (or, more generally, of lexical items) that have similar grammar, grammatical propert ...
s, but rather are separated by an epenthetic vowel, such as ''o'', to avoid such assimilation, though placing such letters together is common among speakers whose language background allows it.
Non-Esperantized names are given an Esperanto approximation of their original pronunciation, at least by speakers without command of the original language. Hard is read as ''k'', as ''kv'', as ''v'', as ''ks'', and as ''j'' if a consonant, or as ''i'' if a vowel. The English digraph is read as ''t''. When there is no close equivalent, the difficult sounds may be given the Esperanto values of the letters in the orthography or roman transcription, accommodating the constraints of Esperanto phonology. So, for example, ''
Winchester
Winchester (, ) is a City status in the United Kingdom, cathedral city in Hampshire, England. The city lies at the heart of the wider City of Winchester, a local government Districts of England, district, at the western end of the South Downs N ...
'' (the English city) is pronounced (and may be spelled) ''Vinĉester'' , as Esperanto ''ŭ'' does not occur at the beginning of ordinary words. ''
Changzhou
Changzhou is a prefecture-level city in southern Jiangsu, China. It was previously known as Yanling, Lanling, and Jinling. Located on the southern bank of the Yangtze River, Changzhou borders the provincial capital of Nanjing to the west, Zhen ...
'' generally becomes ''Ĉanĝo'' , as Esperanto has no ''ng'' or ''ou'' sound. There are no strict rules, however; speakers may try for greater fidelity, for example by pronouncing the ''g'' and ''u'' in ''Changzhou'': ''Ĉangĝoŭ'' (despite there being no ''g'' sound in the Chinese pronunciation). The original stress may be kept, if it is known.
Origin
The script resembles Western Slavic Latin alphabets but uses
circumflex
The circumflex () is a diacritic in the Latin and Greek scripts that is also used in the written forms of many languages and in various romanization and transcription schemes. It received its English name from "bent around"a translation of ...
es instead of
caron
A caron or háček ( ), is a diacritic mark () placed over certain letters in the orthography of some languages, to indicate a change of the related letter's pronunciation.
Typographers tend to use the term ''caron'', while linguists prefer ...
s for the letters ''ĉ'', ''ĝ'', ''ĥ'', ''ĵ'', and ''ŝ''. Also, the non-Slavic bases of the letters ''ĝ'' and ''ĵ'', rather than Slavic ''dž'' and ''ž'', help preserve the printed appearance of Latinate and Germanic vocabulary such as ''ĝenerala'' "general" (adjective) and ''ĵurnalo'' "journal". The letter ''v'' stands for either ''v'' or ''w'' of other languages. The letter ''ŭ'' of the diphthongs ''aŭ'' and ''eŭ'' resemble the
Belarusian Łacinka alphabet
The Belarusian Latin alphabet or Łacinka (from , BGN/PCGN: , ) for the Latin script in general is the Latin script as used to write Belarusian. It is similar to the Sorbian alphabet and incorporates features of the Polish and Czech alph ...
.
Zamenhof took advantage of the fact that typewriters for the French language (which, in his lifetime, served as an international ''lingua franca'' for educated people) possess a
dead key
A dead key is a special kind of modifier key on a mechanical typewriter, or computer keyboard, that is typically used to attach a specific diacritic to a base letter (alphabet), letter. The dead key does not generate a (complete) grapheme, charact ...
for the circumflex diacritic: thus, anyone who could avail himself of a French typewriter could type ''ĉ ĝ ĥ ĵ ŝ'' and their uppercase counterparts with no problem. French typewriters also include the letter , which Francophone Esperantists have long used as a substitute for Esperanto ''ŭ''. With the advent of personal computers, French-language keyboards still possess a dead-key ^, but whether it can be used to type Esperanto consonants may depend on the underlying software. Zamenhof's choice of accented letters was familiar to the speakers of some Slavic languages, for instance, Czech and Slovak, where the sounds of Esperanto ''ĉ'' and ''ŝ'' are represented by the letters ''č'' and ''š'', respectively; and Belarusian, because Esperanto ''ŭ'' bears the same relation to ''u'' as Belarusian Cyrillic ''ў'' bears to ''у''.
Geographic names may diverge from English spelling, especially for the letters ''x'', ''w'', ''qu'' and ''gu'', as in ''Vaŝintono'' "
Washington, D.C.
Washington, D.C., formally the District of Columbia and commonly known as Washington or D.C., is the capital city and federal district of the United States. The city is on the Potomac River, across from Virginia, and shares land borders with ...
", ''Meksiko'' "
Mexico City
Mexico City is the capital city, capital and List of cities in Mexico, largest city of Mexico, as well as the List of North American cities by population, most populous city in North America. It is one of the most important cultural and finan ...
", and ''Gvatemalo'' "
Guatemala
Guatemala, officially the Republic of Guatemala, is a country in Central America. It is bordered to the north and west by Mexico, to the northeast by Belize, to the east by Honduras, and to the southeast by El Salvador. It is hydrologically b ...
". Other spelling differences appear when Esperanto words are based on the pronunciation rather than the spelling of English place names, such as ''Brajtono'' for
Brighton
Brighton ( ) is a seaside resort in the city status in the United Kingdom, city of Brighton and Hove, East Sussex, England, south of London.
Archaeological evidence of settlement in the area dates back to the Bronze Age Britain, Bronze Age, R ...
.
Variations
Since all letters with diacritics are unique, they are often simplified in
handwriting
Handwriting in Italian schools (XXth - XXIst century)
Handwriting is the personal and unique style of writing with a writing instrument, such as a pen or pencil in the hand. Handwriting includes both block and cursive styles and is separa ...
. The most common diacritic to be simplified is the circumflex, which often appears more like a
macron or
acute accent
The acute accent (), ,
is a diacritic used in many modern written languages with alphabets based on the Latin alphabet, Latin, Cyrillic script, Cyrillic, and Greek alphabet, Greek scripts. For the most commonly encountered uses of the accen ...
(e.g. ''ḡ'' or ''ǵ'' instead of ''ĝ'').
Names of the letters of the alphabet
Zamenhof simply tacked an ''-o'' onto each consonant to create the name of the letter, with the vowels representing themselves: ''a, bo, co, ĉo, do, e, fo,'' etc. The diacritics are frequently mentioned overtly. For instance, ''ĉ'' may be called ''ĉo ĉapela'' or ''co ĉapela,'' from ''ĉapelo'' (a hat), and ''ŭ'' may be called ''ŭo luneta'' or ''u luneta,'' from ''luno'' (a moon) plus the diminutive ''-et-.'' This is the only system that is widely accepted and in practical use.
The letters of the
ISO basic Latin alphabet
The ISO basic Latin alphabet is an international standard (beginning with ISO/IEC 646) for a Latin-script alphabet that consists of two sets (uppercase and lowercase) of 26 letters, codified in various national and international standards and u ...
not found in the Esperanto alphabet have distinct names, much as letters of the Greek alphabet do. , , are ''kuo, ikso, ipsilono''; has been called ''duobla vo'' (double V), ''vavo'' (using Waringhien's name of ''va'' below), ''vuo'' (proposed by Sergio Pokrovskij), ''germana vo'' (German V), and ''ĝermana vo'' (
Germanic V).
However, while this is fine for
initialism
An acronym is a type of abbreviation consisting of a phrase whose only pronounced elements are the initial letters or initial sounds of words inside that phrase. Acronyms are often spelled with the initial letter of each word in all caps wi ...
s such as ''ktp''
otopofor ''etc.,'' it can be problematic when spelling out names. For example, several consonantal distinctions are difficult for many nationalities, who normally rely on the fact that Esperanto seldom uses these sounds to distinguish words (that is, they do not form many
minimal pair
In phonology, minimal pairs are pairs of words or phrases in a particular language, spoken or signed, that differ in only one phonological element, such as a phoneme, toneme or chroneme, and have distinct meanings. They are used to demonstrate t ...
s). Thus the pairs of letter names ''ĵo–ĝo, ĥo–ho'' (or ''ĥo–ko), co–ĉo'' (or ''co–so, co–to), lo–ro'', and ''ŭo–vo'' (or ''vo–bo'') are problematic. In addition, over a noisy telephone connection, it quickly becomes apparent that
voicing distinctions can be difficult to make out: noise confounds the pairs ''po–bo, to–do, ĉo–ĝo, ko–go, fo–vo, so–zo, ŝo–ĵo,'' as well as the nasals ''mo–no.''
There have been several proposals to resolve this problem.
Gaston Waringhien proposed changing the vowel of voiced
obstruent
An obstruent ( ) is a speech sound such as , , or that is formed by ''obstructing'' airflow. Obstruents contrast with sonorants, which have no such obstruction and so resonate. All obstruents are consonants, but sonorants include vowels as well ...
s to ''a'', so that at least voicing is not problematic. Also changed to ''a'' are ''h, n, r'', distinguishing them from ''ĥ, m, l''. The result is perhaps the most common alternative in use:
::''a, ba, co, ĉo, da, e, fo, ga, ĝa, ha, ĥo, i, jo, ĵa, ko, lo, mo, na, o, po, ra, so, ŝo, to, u, ŭo, va, za''
However, this still requires overt mention of the diacritics, and even so does not reliably distinguish ''ba–va, co–so, ĉo–ŝo,'' or ''ĝa–ĵa''.
The proposal closest to international norms (and thus the easiest to remember) that clarifies all the above distinctions is a modification of a proposal by
Kálmán Kalocsay. As with Zamenhof, vowels stand for themselves, but it follows the international standard of placing vowel ''e'' after a consonant by default ''(be, ce, de, ge),'' but before
sonorant
In phonetics and phonology, a sonorant or resonant is a speech sound that is produced with continuous, non-turbulent airflow in the vocal tract; these are the manners of articulation that are most often voiced in the world's languages. Vowels a ...
s ''(el, en)'' and
voiceless
In linguistics, voicelessness is the property of sounds being pronounced without the larynx vibrating. Phonologically, it is a type of phonation, which contrasts with other states of the larynx, but some object that the word phonation implies v ...
fricative
A fricative is a consonant produced by forcing air through a narrow channel made by placing two articulators close together. These may be the lower lip against the upper teeth, in the case of ; the back of the tongue against the soft palate in ...
s ''(ef, es).'' The vowel ''a'' is used for and the voiceless
plosives
In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases.
The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lip ...
, , , after the international names ''ha'' for and ''ka'' for ; the French name ''ĵi'' is used for , the Greek name ''ĥi (chi)'' for , and the English name ''ar'' for . The letter has the ''i'' vowel of ''ĵi,'' distinguishing it from , but the other voiced fricative, , does not, to avoid the problem of it
palatalizing and being confused with ''ĵi.'' The diphthong offglide is named ''eŭ,'' the only real possibility given Esperanto
phonotactics
Phonotactics (from Ancient Greek 'voice, sound' and 'having to do with arranging') is a branch of phonology that deals with restrictions in a language on the permissible combinations of phonemes. Phonotactics defines permissible syllable struc ...
besides ''aŭ,'' which, as the word for "or", could cause confusion. The letter is called ''om'' to distinguish it from ; the vowel ''o'' alliterates well in the alphabetical sequence ''el, om, en, o, pa.'' There are other patterns to the vowels in the
ABC rhyme: The lines start with ''a i a i'' and finish with ''a a e e.'' The letters with diacritics are placed at the end of the rhyme, taking the place of ''w, x, y'' in other Latin alphabets, so as not to disrupt the pattern of letters many people learned as children. All this makes the system more easily memorized than competing proposals. The modified Kalocsay
abecedary is:
:::::''a, be, ce, de, e, ef, ge, ha,''
:::::''i, je, ka, el, om, en, o, pa,''
:::::''ar, es, ta, u, vi, ĉa, ĝe,''
:::::''ĥi kaj ĵi, eŝ, eŭ kaj ze,''
:::::''plus ku', ikso, ipsilono,''
:::::''jen la abece-kolono.''
''(kaj'' means "and". The last line reads: ''here is the ABC column)''
Where letters are still confused, such as ''es'' vs ''eŝ'' or ''a'' vs ''ha,'' mention can be made of the diacritic ''(eŝ ĉapela),'' or to the manner of articulation of the sound ''(ha brueta'' "breathy aitch"). Quite commonly, however, people will use the ''aitch as in house'' strategy used in English.
Spelling alphabets
Another strategy is to use a
spelling alphabet
A spelling alphabet (#Terminology, also called by various other names) is a set of words used to represent the Letter (alphabet), letters of an alphabet in Speech, oral communication, especially over a two-way radio or telephone. The words chosen t ...
(''literuma alfabeto''), which substitutes ordinary words for letters. The following words are sometimes seen:
From a German–Esperanto dictionary by
Erich-Dieter Krause:
:''Asfalto, Barbaro, Centimetro, Ĉefo, Doktoro, Elemento, Fabriko, Gumo, Ĝirafo, Hotelo, Ĥaoso, Insekto, Jubileo, Ĵurnalo, Kilogramo, Legendo, Maŝino, Naturo, Oktobro, Papero, Kuo, Rekordo, Salato, Ŝilingo, Triumfo, Universo, Universo-hoketo, Vulkano, Ĝermana vo, Ikso, Ipsilono, Zinko''
A proposal by
Simon Edward Adrian Payne in ''
Monato'':
:''akvo, baldaŭ, cedro, ĉirkaŭ, dolĉa, eĥo, fajfi, golfo, ĝis, hejme, ĥoro, iĝi, jaĥto, ĵuri, korpo, lingvo, morgaŭ, nokto, ofte, pelvo, kuo, riĉa, sankta, ŝaŭmi, tempo, uzi, ŭa-ŭa, vespo, vavo, ikso, ipsilono, zorgi''
A proposal by
Gerrit François Makkink, in which most words are tetrasyllabic so that the syllable beginning with the letter in question receives secondary stress (though only in ''Varsovio'' do both stressed syllables begin with the letter):
:''Akademio, bondeziro, centjariĝo, Ĉe-metodo, delegito, Esperanto, Fundamento, gramatiko, ĝisrevido, harmonio, ĥorkantado, internacia, jubileo, ĵurnalisto, kalendaro, Ludoviko, modernigo, necesejo, okupita, propagando, kuo, redaktoro, sekretario, ŝatokupo, telefono, universala, u-supersigno, Varsovio, vuo, ikso, ipsilono, Zamenhofo''
The
International League of Esperantist Radio Amateurs (ILERA) uses the following adaptation of the
International Radiotelephony Spelling Alphabet (ICAO and NATO "phonetic" alphabet):
:''alfa, bravo, carli, delta, eko, fokstrot, golf, hotel, india, juliet, kilo, lima, majk, november, oskar, papa, kebek, romeo, siera, tango, uniform, viktor, ŭiski'' ~ ''viski, eksrej, janki, zulu''
ILERA also modifies the numerals ''ses'' '6' and ''sep'' '7' to ''sis'' and ''sepen'' to make them more distinct, and uses the nominal form ''nulo'' for zero.
ASCII transliteration
There are two common conventions for inputting and typesetting Esperanto in the ISO basic Latin alphabet when proper orthography is inconvenient. Zamenhof had suggested replacing the circumflex letters with
digraphs in ''h'', the so-called "h-system", thus: ''ch, gh, hh, jh, sh'' for ''ĉ, ĝ, ĥ, ĵ, ŝ'' and ''u'' for ''ŭ'', with an apostrophe or hyphen to disambiguate actual sequences of these letters (e.g. ''ses-hora'').
With the advent of computer word-processing, the so-called "x-system", with digraphs in ''x'' for all diacritics, has become equally popular: ''cx, gx, hx, jx, sx, ux''. The words ''ŝanĝi'' "to change" and ''ĵaŭde'' "on Thursday" are written ''shanghi'', ''jhaude'' and ''sxangxi'', ''jxauxde'', respectively, in the two systems. The h-system has a more conventional appearance, but because the letter ''x'' does not occur in Esperanto, it is fairly straightforward to automatically convert text written in the x-system into standard orthography; it also produces better results with alphabetic sorting.
Punctuation
As with most languages, punctuation is not completely standardized, but in Esperanto there is the additional complication of multiple competing national traditions.
Commas are frequently used to introduce subordinate clauses (that is, before ''ke'' "that" or the ''ki-'' correlatives):
:''Mi ne scias, kiel fari tion.'' (I don't know how to do that.)
The comma is also used for the
decimal point
FIle:Decimal separators.svg, alt=Four types of separating decimals: a) 1,234.56. b) 1.234,56. c) 1'234,56. d) ١٬٢٣٤٫٥٦., Both a comma and a full stop (or period) are generally accepted decimal separators for international use. The apost ...
, while thousands are separated by non-breaking spaces: '','' or sometimes by apostrophes: ''Li enspezis $3'300'000.''
The
question mark
The question mark (also known as interrogation point, query, or eroteme in journalism) is a punctuation, punctuation mark that indicates a question or interrogative clause or phrase in many languages.
History
The history of the question mark is ...
(?) and the
exclamation mark
The exclamation mark (also known as exclamation point in American English) is a punctuation mark usually used after an interjection or exclamation to indicate strong feelings or to show wikt:emphasis, emphasis. The exclamation mark often marks ...
(!) are used at the end of a clause and may be internal to a sentence. Question words generally come at the beginning of a question, obviating the need for
Spanish-style inverted question marks.
Periods may be used to indicate
initialism
An acronym is a type of abbreviation consisting of a phrase whose only pronounced elements are the initial letters or initial sounds of words inside that phrase. Acronyms are often spelled with the initial letter of each word in all caps wi ...
s: ''k.t.p.'' or ''ktp'' for ''kaj tiel plu'' (et cetera), but not abbreviations that retain the grammatical suffixes. Instead, a hyphen optionally replaces the missing letters: ''D-ro'' or ''Dro'' for ''Doktoro'' (Dr). With ordinal numerals, the adjectival ''a'' and accusative ''n'' may be superscripted: ''13a'' or ''13
a'' (13th). The abbreviation ''k'' is used without a period for ''kaj'' (and); the
ampersand
The ampersand, also known as the and sign, is the logogram , representing the grammatical conjunction, conjunction "and". It originated as a typographic ligature, ligature of the letters of the word (Latin for "and").
Etymology
Tradi ...
(&) is not found.
Roman numeral
Roman numerals are a numeral system that originated in ancient Rome and remained the usual way of writing numbers throughout Europe well into the Late Middle Ages. Numbers are written with combinations of letters from the Latin alphabet, ea ...
s are also avoided.
The
hyphen
The hyphen is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation.
The hyphen is sometimes confused with dashes (en dash , em dash and others), which are wider, or with t ...
is also occasionally used to clarify compounds, and to join grammatical suffixes to proper names that haven't been Esperantized or don't have a nominal ''-o'' suffix, such as the accusative on ''Kalocsay-n'' or ''Kálmán-on.'' The proximate particle ''ĉi'' used with correlatives, such as ''ĉi tiu'' 'this one' and ''ĉi tie'' 'here', may be poetically used with nouns and verbs as well ''(ĉi jaro'' 'this year', ''esti ĉi'' 'to be here'), but if these phrases are then changed to adjectives or adverbs, a hyphen is used: ''ĉi-jare'' 'this year', ''ĉi-landa birdo'' 'a bird of this land'.
Quotation mark
Quotation marks are punctuation marks used in pairs in various writing systems to identify direct speech, a quotation, or a phrase. The pair consists of an opening quotation mark and a closing quotation mark, which may or may not be the sam ...
s show the greatest variety of any punctuation. The use of Esperanto quotation marks was never stated in Zamenhof's work; it was assumed that a printer would use whatever was available, usually the national standard of the printer's country.
Em dash
The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen ...
es (—...),
guillemets
Guillemets (, also , , ) are a pair of punctuation marks in the form of sideways double Chevron (insignia), chevrons, and , used as quotation marks in some languages. In some of these languages, "single" guillemets, and , are used for a quotat ...
( or reversed ), double quote marks ( and German-style ) and more are all found. Since the age of word-processing, however, American-style quotation marks are the most widespread. Quotations may be introduced with either a
comma
The comma is a punctuation mark that appears in several variants in different languages. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical; others give it the appearance of a miniature fille ...
or a
colon.
Time and date format is not standardized among Esperantists, but internationally unambiguous formats such as 1970-01-01 (ISO) or 1-jan-1970 are preferred when the date is not spelled out in full ("la 1-a de januaro 1970").
Capitalization
Capitalization
Capitalization ( North American spelling; also British spelling in Oxford) or capitalisation (Commonwealth English; all other meanings) is writing a word with its first letter as a capital letter (uppercase letter) and the remaining letters in ...
is used for the first word of a sentence and for proper names when used as nouns. Names of months, days of the week, ethnicities, languages, and the adjectival forms of proper names are not typically capitalized ''(anglo'' "an Englishman", ''angla'' "English", ''usona'' "US American"), though national norms may override such generalizations. Titles are more variable: both the Romance style of capitalizing only the first word of the title and the English style of capitalizing all lexical words are found.
All capitals or small capitals are used for
acronym
An acronym is a type of abbreviation consisting of a phrase whose only pronounced elements are the initial letters or initial sounds of words inside that phrase. Acronyms are often spelled with the initial Letter (alphabet), letter of each wor ...
s and
initialism
An acronym is a type of abbreviation consisting of a phrase whose only pronounced elements are the initial letters or initial sounds of words inside that phrase. Acronyms are often spelled with the initial letter of each word in all caps wi ...
s of proper names, like ''
TEJO,'' but not common expressions like ''ktp'' (etc.). Small capitals are also a common convention for
family name
In many societies, a surname, family name, or last name is the mostly hereditary portion of one's personal name that indicates one's family. It is typically combined with a given name to form the full name of a person, although several give ...
s, to avoid the confusion of varying national naming conventions: Kálmán,
Leslie Kwok-wing.
Camel case
The writing format camel case (sometimes stylized autological, autologically as camelCase or CamelCase, also known as camel caps or more formally as medial capitals) is the practice of writing phrases without spaces or punctuation and with cap ...
, with or without a hyphen, may occur when a prefix is added to a proper noun: ''la geZamenhofoj'' (the Zamenhofs), ''pra-Esperanto'' (
Proto-Esperanto). It is also used for
Russian-style syllabic acronyms, such as the name ''ReVo'' for ''Reta Vortaro'' ("Internet Dictionary"), which is homonymous with ''revo'' (dream). Occasionally mixed capitalization will be used for orthographic puns, such as ''espERAnto,'' which stands for the ''esperanta radikala asocio'' (Radical Esperanto Association).
Zamenhof contrasted informal ''ci'' with formal, and capitalized, ''Vi'' as the second-person singular pronouns. However, lower-case ''vi'' is now used as the second-person pronoun regardless of number.
Spesmilo symbol

Unique to the Esperanto script is the ''
spesmilo'' (1000
specie) sign, , an ''Sm''
monogram
A monogram is a motif (visual arts), motif made by overlapping or combining two or more letters or other graphemes to form one symbol. Monograms are often made by combining the initials of an individual or a company, used as recognizable symbo ...
for an obsolete international unit of auxiliary Esperanto currency used by a few British and Swiss banks before World War I. It is often transcribed as ''Sm'', usually italic.
Braille, fingerspelling, and Morse code
Esperanto versions of
braille
Braille ( , ) is a Tactile alphabet, tactile writing system used by blindness, blind or visually impaired people. It can be read either on embossed paper or by using refreshable braille displays that connect to computers and smartphone device ...
and
Morse code
Morse code is a telecommunications method which Character encoding, encodes Written language, text characters as standardized sequences of two different signal durations, called ''dots'' and ''dashes'', or ''dits'' and ''dahs''. Morse code i ...
include the six diacritic letters.
An Esperanto braille magazine, ''Aŭroro,'' has been published since 1920.
In Morse code, a dot is added to C and J to derive Ĉ and Ĵ, a dash–dot is added to G and S to derive Ĝ and Ŝ, a dash is added to U to derive Ǔ, and the four dots of H are changed to four dashes for Ĥ. However, users often substitute these novel letters with digraphs ''ch, gh, jh, sh'' etc.
[Lenio Marobin, PY3DF (2008]
'Morsa kodo kaj Esperanto – rekolekto de artikoloj iam aperintaj'
ILERA Bulteno n-o 70, p-o 04.

There is a proposed
manual alphabet
Fingerspelling (or dactylology) is the representation of the letters of a writing system, and sometimes numeral systems, using only the hands. These manual alphabets (also known as finger alphabets or hand alphabets) have often been used in deaf ...
as part of the ''
Signuno'' project. Signuno, as a signed variant of Esperanto, is itself a manual
logographic
In a written language, a logogram (from Ancient Greek 'word', and 'that which is drawn or written'), also logograph or lexigraph, is a written character that represents a semantic component of a language, such as a word or morpheme. Chinese c ...
Esperanto orthography. The majority of letters of the manual alphabet resemble those of the
American manual alphabet, but also of the
French manual alphabet
The French manual alphabet is an alphabet used for French Sign Language (LSF), both to distinguish LSF words and to sign French language, French words in LSF.
The alphabet has the following letters:
Image:LSF LettreA.jpg, alt=A fist with thumb ...
and others. The diacritic letters Ĉ, Ĝ, Ĥ, Ĵ, Ŝ, Ŭ have their own signs, and J and Z are distinct from other alphabets, as none of the Signuno letters involve motion.
Digits are formed by extending the fingers from the index to the pinkie for 1 to 4, from the pinkie to the thumb (keeping the middle finger down) for 6 to 9, and from the thumb to index for 11 and 12; the last two are used for months and hours. Zero is represented by the fist, 5 by the whole hand extended, and 10 as the letter X.
Other scripts

The
Shavian alphabet
The Shavian alphabet ( ; also known as the Shaw alphabet) is a Constructed writing system, constructed alphabet conceived as a way to provide simple, phonemic orthography for the English language to replace the inefficiencies and difficultie ...
, which was designed for English, was modified for use with Esperanto by John Wesley Starling. Though not widely used, at least one booklet has been published with sample Shavian texts. Not all letters are equivalent to their English values, and special forms of the letters and have been added for the accusative case ending and verbal inflections; the grammatical endings and the words ''la'' 'the', ''aŭ'' 'or' and ''kaj'' 'and' are written as ligatures.
The vowels necessarily differ from English. Esperanto ''a e i o u'' take the letters for English , with more regard to graphic symmetry than phonetic faithfulness in the cases of ''o'' and ''u''. ''C'' takes the letter for , the
Castilian value of ''c'' before ''e'' and ''i'', and ''ĥ'' that for , the inverse of the letter for . The most divergent letters are those for ''m'' and ''n'', which are in English, but which are graphically better suited to be distinct letters than English Shavian .
The US television series ''
Resident Alien
In law, an alien is generally any person (including an organization) who is not a citizen or a national of a specific country, although definitions and terminology differ across legal systems.
Lexicology
The term "alien" is derived from the L ...
'' uses an invented script that does not distinguish ''u'' and ''v'', and ignores diacritics, to transcribe Esperanto as the alien language. It is written right to left.

The
Cyrillic script
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic languages, Slavic, Turkic languages, Turkic, Mongolic languages, Mongolic, Uralic languages, Uralic, C ...
has also been adapted to write Esperanto.
[Ager, Simon]
Esperanto Cyrillic (Есперанто-цирила)
/ref>
The 2017 Japanese-language visual novel
A visual novel (VN) is a form of digital interactive fiction. Visual novels are often associated with the medium of video games, but are not always labeled as such themselves. They combine a textual narrative with static or animated illustratio ...
''The Expression Amrilato
''The Expression Amrilato'', known in Japan as , is a Japanese Bishōjo game, bishōjo visual novel developed by SukeraSparo in 2017. It's a romantic game that contains text in the international language Esperanto, which the game calls ''Julia ...
'' and its 2021 sequel ''Distant Memoraĵo'' feature a language named ''Juliamo'' that is actually Esperanto in a modified Latin alphabet.
Computer input
The Esperanto alphabet is part of the Latin-3 and Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
character sets, and is included in WGL4.
The code points and HTML entities for the Esperanto characters with diacritics and the spesmilo sign are:
See also
*Orthography
An orthography is a set of convention (norm), conventions for writing a language, including norms of spelling, punctuation, Word#Word boundaries, word boundaries, capitalization, hyphenation, and Emphasis (typography), emphasis.
Most national ...
* Ĉ, Ĝ, Ĥ, Ĵ, Ŝ, Ŭ, ₷
Notes
References
{{Language orthographies
Orthography
An orthography is a set of convention (norm), conventions for writing a language, including norms of spelling, punctuation, Word#Word boundaries, word boundaries, capitalization, hyphenation, and Emphasis (typography), emphasis.
Most national ...
Latin-script orthographies