A caron or háček ( ), is a
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
mark () placed over certain letters in the orthography of some languages, to indicate a change of the related letter's pronunciation.
Typographers tend to use the term ''caron'', while linguists prefer the
Czech
Czech may refer to:
* Anything from or related to the Czech Republic, a country in Europe
** Czech language
** Czechs, the people of the area
** Czech culture
** Czech cuisine
* One of three mythical brothers, Lech, Czech, and Rus
*Czech (surnam ...
word '.
The symbol is common in the
Baltic
Baltic may refer to:
Peoples and languages
*Baltic languages, a subfamily of Indo-European languages, including Lithuanian, Latvian and extinct Old Prussian
*Balts (or Baltic peoples), ethnic groups speaking the Baltic languages and/or originatin ...
,
Slavic,
Finnic,
Samic and
Berber
Berber or Berbers may refer to:
Ethnic group
* Berbers, an ethnic group native to Northern Africa
* Berber languages, a family of Afro-Asiatic languages
Places
* Berber, Sudan, a town on the Nile
People with the surname
* Ady Berber (1913–196 ...
language families.
Its use differs according to the orthographic rules of a language. In most Slavic and other European languages it indicates present or historical
palatalization (e → ě; [] → []), iotation, or postalveolar consonant, postalveolar articulation (c → č; → ). In Salishan languages, it often represents a uvular consonant (x → x̌; [] → ). When placed over vowel symbols, the caron can indicate a contour
tone
Tone may refer to:
Visual arts and color-related
* Tone (color theory), a mix of tint and shade, in painting and color theory
* Tone (color), the lightness or brightness (as well as darkness) of a color
* Toning (coin), color change in coins
* ...
, for instance the falling and then rising tone in the
Pinyin
Hanyu Pinyin, or simply pinyin, officially the Chinese Phonetic Alphabet, is the most common romanization system for Standard Chinese. ''Hanyu'' () literally means 'Han Chinese, Han language'—that is, the Chinese language—while ''pinyin' ...
romanization
In linguistics, romanization is the conversion of text from a different writing system to the Latin script, Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and tra ...
of Standard Chinese, Mandarin Chinese. It is also used to decorate symbols in mathematics, where it is often pronounced ("check").
The caron is shaped approximately like a small letter "v". For
serif
In typography, a serif () is a small line or stroke regularly attached to the end of a larger stroke in a letter or symbol within a particular font or family of fonts. A typeface or "font family" making use of serifs is called a serif typeface ( ...
typefaces, the caron generally has one of two forms: either symmetrical, essentially identical to an inverted
circumflex
The circumflex () is a diacritic in the Latin and Greek scripts that is also used in the written forms of many languages and in various romanization and transcription schemes. It received its English name from "bent around"a translation of ...
; or with the left stroke thicker than the right, like the usual serif form of the letter "v" (, but without serifs). The latter form is often preferred by Czech designers for use in
Czech
Czech may refer to:
* Anything from or related to the Czech Republic, a country in Europe
** Czech language
** Czechs, the people of the area
** Czech culture
** Czech cuisine
* One of three mythical brothers, Lech, Czech, and Rus
*Czech (surnam ...
, while for other uses the symmetrical form tends to predominate, as it does also among
sans-serif
In typography and lettering, a sans-serif, sans serif (), gothic, or simply sans letterform is one that does not have extending features called "serifs" at the end of strokes. Sans-serif typefaces tend to have less stroke width variation than ...
typefaces.
The caron is not to be confused with the
breve
A breve ( , less often , grammatical gender, neuter form of the Latin "short, brief") is the diacritic mark , shaped like the bottom half of a circle. As used in Ancient Greek, it is also called , . It resembles the caron (, the wedge or in ...
(, which is curved rather than angled):
Names
Different disciplines generally refer to this diacritic mark by different names. Typography tends to use the term ''caron''. Linguistics more often uses the
Czech word . Pullum's and Ladusaw's ''
Phonetic Symbol Guide
The ''Phonetic Symbol Guide'' is a book by Geoffrey Pullum and William Ladusaw that explains the histories and uses of the symbols of various phonetic transcription conventions. It was published in 1986, with a second edition in 1996, by the Univ ...
'' uses the term ''wedge''.
The term ''caron'' is used in the official names of
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
characters (e.g., ""). The
Unicode Consortium
The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California, U.S. Its primary purpose is to maintain and publish the Unicode Standard which was developed with the in ...
explicitly states that the reason for this is unknown, but its earliest known use was in the
United States Government Printing Office
The United States Government Publishing Office (USGPO or GPO), formerly the United States Government Printing Office, is an agency of the Legislature, legislative branch of the Federal government of the United States, United States federal gove ...
Style Manual of 1967, and it was later used in character sets such as DIN 31624 (1979), ISO 5426 (1980), ISO/IEC 6937 (1983) and ISO/IEC 8859-2 (1985). Its actual origin remains obscure, but some have suggested that it may derive from a fusion of
caret
Caret () is the name used familiarly for the character provided on most QWERTY keyboards by typing . The symbol has a variety of uses in programming and mathematics. The name "caret" arose from its visual similarity to the original proofre ...
and
macron. Though this may be
folk etymology
Folk etymology – also known as (generative) popular etymology, analogical reformation, (morphological) reanalysis and etymological reinterpretation – is a change in a word or phrase resulting from the replacement of an unfamiliar form by a mo ...
, it is plausible, particularly in the absence of other suggestions. A Unicode technical note states that the name "hacek" should have been used instead.
The ''
Oxford English Dictionary
The ''Oxford English Dictionary'' (''OED'') is the principal historical dictionary of the English language, published by Oxford University Press (OUP), a University of Oxford publishing house. The dictionary, which published its first editio ...
'' gives 1953 as the earliest appearance in English for . In
Czech
Czech may refer to:
* Anything from or related to the Czech Republic, a country in Europe
** Czech language
** Czechs, the people of the area
** Czech culture
** Czech cuisine
* One of three mythical brothers, Lech, Czech, and Rus
*Czech (surnam ...
, () means 'small
hook
A hook is a tool consisting of a length of material, typically metal, that contains a portion that is curved/bent back or has a deeply grooved indentation, which serves to grab, latch or in any way attach itself onto another object. The hook's d ...
', the
diminutive
A diminutive is a word obtained by modifying a root word to convey a slighter degree of its root meaning, either to convey the smallness of the object or quality named, or to convey a sense of intimacy or endearment, and sometimes to belittle s ...
form of (, 'hook')". The name appears in most English dictionaries, but they treat the long mark (
acute accent
The acute accent (), ,
is a diacritic used in many modern written languages with alphabets based on the Latin alphabet, Latin, Cyrillic script, Cyrillic, and Greek alphabet, Greek scripts. For the most commonly encountered uses of the accen ...
) differently. British dictionaries, such as the ''
OED
The ''Oxford English Dictionary'' (''OED'') is the principal historical dictionary of the English language, published by Oxford University Press (OUP), a University of Oxford publishing house. The dictionary, which published its first editio ...
'', ''
ODE
An ode (from ) is a type of lyric poetry, with its origins in Ancient Greece. Odes are elaborately structured poems praising or glorifying an event or individual, describing nature intellectually as well as emotionally. A classic ode is structu ...
'', ''
CED'', write (with the mark) in the headwords, while American ones, such as the ''
Merriam-Webster
Merriam-Webster, Incorporated is an list of companies of the United States by state, American company that publishes reference work, reference books and is mostly known for Webster's Dictionary, its dictionaries. It is the oldest dictionary pub ...
'', ''
NOAD'', ''
AHD'', incorrectly omit the acute and write , however, the ''NOAD'' gives as an alternative spelling.
In
Slovak it is called (, i.e., 'softener' or '
palatalization mark'), in
Croatian or ('angled hook' or 'small angled hook'), in
Serbian or ('angled hook' or 'small angled hook'), in
Slovenian ('little
roof
A roof (: roofs or rooves) is the top covering of a building, including all materials and constructions necessary to support it on the walls of the building or on uprights, providing protection against rain, snow, sunlight, extremes of tempera ...
') or ('little hook'), in
Lithuanian ('little bird') or ('little
jackdaw
Jackdaws are two species of bird in the genus ''Coloeus'' closely related to, but generally smaller than, crows and ravens ('' Corvus''). They have a blackish crown, wings, and tail, with the rest of their plumage paler.Madge & Burn (1994) 136� ...
'), in
Estonian
Estonian may refer to:
* Something of, from, or related to Estonia, a country in the Baltic region in northern Europe
* Estonians, people from Estonia, or of Estonian descent
* Estonian language
* Estonian cuisine
* Estonian culture
See also ...
('roof'), in
Finnish ('hat'), and in
Lakota
Lakota may refer to:
*Lakota people, a confederation of seven related Native American tribes
*Lakota language
Lakota ( ), also referred to as Lakhota, Teton or Teton Sioux, is a Siouan languages, Siouan language spoken by the Lakota people of ...
('wedge').
Origin
The caron evolved from the
dot above
When used as a diacritic mark, the term dot refers to the glyphs "combining dot above" (, and "combining dot below" (
which may be combined with some letters of the extended Latin alphabets in use in
a variety of languages. Similar marks are ...
diacritic, which
Jan Hus
Jan Hus (; ; 1369 – 6 July 1415), sometimes anglicized as John Hus or John Huss, and referred to in historical texts as ''Iohannes Hus'' or ''Johannes Huss'', was a Czechs, Czech theologian and philosopher who became a Church reformer and t ...
introduced into
Czech orthography
Czech orthography is a system of rules for proper formal writing (orthography) in Czech. The earliest form of separate Latin script specifically designed to suit Czech was devised by Czech theologian and church reformist Jan Hus, the namesake of ...
(along with the
acute accent
The acute accent (), ,
is a diacritic used in many modern written languages with alphabets based on the Latin alphabet, Latin, Cyrillic script, Cyrillic, and Greek alphabet, Greek scripts. For the most commonly encountered uses of the accen ...
) in his ''
De Orthographia Bohemica
''De orthographia bohemica'' () is a Latin language, Latin work published between 1406 and 1412. It is attributed to Charles University rector and reformer Jan Hus. The book codified the Czech language's modern spelling and orthography and had de ...
'' (1412). The original form still exists in
Polish ''
ż''. However, Hus's work was hardly known at that time, and ''háček'' became widespread only in the 16th century with the introduction of printing.
Usage
For the fricatives ''š'' , ''ž'' , and the affricate ''č'' only, the caron is used in most northwestern Uralic languages that use the Latin alphabet, such as
Karelian,
Veps,
Northern Sami
Northern may refer to the following:
Geography
* North, a point in direction
* Northern Europe, the northern part or region of Europe
* Northern Highland, a region of Wisconsin, United States
* Northern Province, Sri Lanka
* Northern Range, a ...
, and
Inari Sami (although not in
Southern Sami
Southern may refer to:
Businesses
* China Southern Airlines, airline based in Guangzhou, China
* Southern Airways, defunct US airline
* Southern Air, air cargo transportation company based in Norwalk, Connecticut, US
* Southern Airways Express ...
).
Estonian
Estonian may refer to:
* Something of, from, or related to Estonia, a country in the Baltic region in northern Europe
* Estonians, people from Estonia, or of Estonian descent
* Estonian language
* Estonian cuisine
* Estonian culture
See also ...
and
Finnish use ''š'' and ''ž'' (but not ''č''), but only for transcribing foreign names and loanwords (albeit common loanwords such as or 'check'); the sounds (and letters) are native and common in Karelian, Veps, and Sami.
In
Italian
Italian(s) may refer to:
* Anything of, from, or related to the people of Italy over the centuries
** Italians, a Romance ethnic group related to or simply a citizen of the Italian Republic or Italian Kingdom
** Italian language, a Romance languag ...
, ''š'', ''ž'', and ''č'' are routinely used as in Slovenian to transcribe
Slavic names in the
Cyrillic script
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic languages, Slavic, Turkic languages, Turkic, Mongolic languages, Mongolic, Uralic languages, Uralic, C ...
since in native Italian words, the sounds represented by these letters must be followed by a vowel, and Italian uses ''ch'' for , not . Other
Romance languages
The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
, by contrast, tend to use their own orthographies, or in a few cases such as Spanish, borrow English ''sh'' or ''zh''.
The caron is also used in the
Romany alphabet. The
Faggin-Nazzi writing system for
Friulian makes use of the caron over the letters ''c'', ''g'', and ''s''.
The caron is also often used as a diacritical mark on consonants for
romanization
In linguistics, romanization is the conversion of text from a different writing system to the Latin script, Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and tra ...
of text from non-Latin writing systems, particularly in the
scientific transliteration
Scientific transliteration, variously called ''academic'', ''linguistic'', ''international'', or ''scholarly transliteration'', is an international system for transliteration of text from the Cyrillic script to the Latin script (romanization). Th ...
of Slavic languages. Philologists and the standard Finnish orthography often prefer using it to express sounds for which English require a digraph (''sh, ch'', and ''zh'') because most Slavic languages use only one character to spell the sounds (the key exceptions are Polish ''
sz'' and ''
cz''). Its use for that purpose can even be found in the United States because certain
atlas
An atlas is a collection of maps; it is typically a bundle of world map, maps of Earth or of a continent or region of Earth. Advances in astronomy have also resulted in atlases of the celestial sphere or of other planets.
Atlases have traditio ...
es use it in romanization of foreign
place name
Toponymy, toponymics, or toponomastics is the study of '' toponyms'' (proper names of places, also known as place names and geographic names), including their origins, meanings, usage, and types. ''Toponym'' is the general term for a proper nam ...
s. On the typographical side, Š/š and Ž/ž are likely the easiest among non-Western European diacritic characters to adopt for Westerners because the two are part of the
Windows-1252
Windows-1252 or CP-1252 ( Windows code page 1252) is a legacy single-byte character encoding that is used by default (as the "ANSI code page") in Microsoft Windows throughout the Americas, Western Europe, Oceania, and much of Africa.
Initially ...
character encoding.
Esperanto
Esperanto (, ) is the world's most widely spoken Constructed language, constructed international auxiliary language. Created by L. L. Zamenhof in 1887 to be 'the International Language' (), it is intended to be a universal second language for ...
uses the
circumflex
The circumflex () is a diacritic in the Latin and Greek scripts that is also used in the written forms of many languages and in various romanization and transcription schemes. It received its English name from "bent around"a translation of ...
over ''c'', ''g'', ''h'', ''j'', and ''s'' in similar ways; the circumflex was chosen because there was no caron on most Western European
typewriter
A typewriter is a Machine, mechanical or electromechanical machine for typing characters. Typically, a typewriter has an array of Button (control), keys, and each one causes a different single character to be produced on paper by striking an i ...
s, but the circumflex existed on
French ones.
It is also used as an accent mark on vowels to indicate the
tone
Tone may refer to:
Visual arts and color-related
* Tone (color theory), a mix of tint and shade, in painting and color theory
* Tone (color), the lightness or brightness (as well as darkness) of a color
* Toning (coin), color change in coins
* ...
of a syllable. The main example is in
Pinyin
Hanyu Pinyin, or simply pinyin, officially the Chinese Phonetic Alphabet, is the most common romanization system for Standard Chinese. ''Hanyu'' () literally means 'Han Chinese, Han language'—that is, the Chinese language—while ''pinyin' ...
for
Chinese in which it represents a falling-rising tone. It is used in transliterations of
Thai to indicate a rising tone.
Phonetics
The caron represents a rising tone in the
International Phonetic Alphabet
The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standard written representation ...
. It is used in the
Uralic Phonetic Alphabet
Finno-Ugric transcription (FUT) or the Uralic Phonetic Alphabet (UPA) is a phonetic transcription or notational system used predominantly for the transcription and reconstruction of Uralic languages. It was first published in 1901 by Eemil Nesto ...
for indicating postalveolar consonants and in
Americanist phonetic notation
Americanist phonetic notation, also known as the North American Phonetic Alphabet (NAPA), the Americanist Phonetic Alphabet or the American Phonetic Alphabet (APA), is a system of phonetic notation originally developed by European and American ...
to indicate various types of pronunciation.
The caron below represents
voicing.
Writing and printing carons
In printed Czech and Slovak text, the caron combined with certain letters (lower-case ť, ď, ľ, and upper-case Ľ) is reduced to a small stroke or apostrophe. That is optional in handwritten text. Latin fonts are typically set to display this way by default. In some applications, using the
combining grapheme joiner
The combining grapheme joiner (CGJ), is a Unicode character that has no visible glyph and is "default ignorable" by applications. Its name is a misnomer and does not describe its function: the character does not join graphemes. Its purpose is to ...
, U+034F, between the letter and the combining mark, as in t͏̌, d͏̌, l͏̌, may prevent the caron from looking like a small stroke of the canonical characters.
In
Lazuri orthography, the lower-case ''k'' with caron sometimes has its caron reduced to a stroke while the lower-case ''t'' with caron preserves its caron shape.
Although the stroke looks similar to an
apostrophe
The apostrophe (, ) is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for two basic purposes:
* The marking of the omission of one o ...
, the
kerning
In typography, kerning is the process of adjusting the spacing between Character (symbol), characters in a Typeface#Proportion, proportional font, usually to achieve a visually pleasing result. Kerning adjusts the space between individual le ...
is significantly different. Using an apostrophe in place of a caron can be perceived as very unprofessional, but it is still often found on imported goods meant for sale in the Czech Republic and Slovakia (compare t’ to ť, L’ahko to Ľahko). (Apostrophes appearing as palatalization marks in some
Finnic languages
The Finnic or Baltic Finnic languages constitute a branch of the Uralic language family spoken around the Baltic Sea by the Baltic Finnic peoples. There are around 7 million speakers, who live mainly in Finland and Estonia.
Traditionally, ...
, such as
Võro and
Karelian, are not forms of caron either.) Foreigners also sometimes mistake the caron for the
acute accent
The acute accent (), ,
is a diacritic used in many modern written languages with alphabets based on the Latin alphabet, Latin, Cyrillic script, Cyrillic, and Greek alphabet, Greek scripts. For the most commonly encountered uses of the accen ...
(compare Ĺ to Ľ, ĺ to ľ).
In Balto-Slavic languages
The following are the
Czech
Czech may refer to:
* Anything from or related to the Czech Republic, a country in Europe
** Czech language
** Czechs, the people of the area
** Czech culture
** Czech cuisine
* One of three mythical brothers, Lech, Czech, and Rus
*Czech (surnam ...
and
Slovak letters and
digraphs with the caron (Czech: , Slovak: ):
*
Č/č (pronounced , similar to 'ch' in ''cheap'': , which means
Czech Republic
The Czech Republic, also known as Czechia, and historically known as Bohemia, is a landlocked country in Central Europe. The country is bordered by Austria to the south, Germany to the west, Poland to the northeast, and Slovakia to the south ...
)
*
Š/š (pronounced , similar to 'sh' in ''she'': in )
*
Ž/ž (pronounced , similar to 's' in ''treasure'': 'sorrow')
*
Ř/ř (only in Czech: a special voiced or unvoiced fricative trill or , the former transcribed as in pre-1989 IPA:
Antonín Dvořák
Antonín Leopold Dvořák ( ; ; 8September 18411May 1904) was a Czech composer. He frequently employed rhythms and other aspects of the folk music of Moravia and his native Bohemia, following the Romantic-era nationalist example of his predec ...
)
*
Ď/ď,
Ť/ť,
Ň/ň (palatals, pronounced , , , slightly different from palatalized consonants as found in Russian): , 'The Devil and a beheaded horse')
*
Ľ/ľ (only in Slovak, pronounced as palatal : , 'businessman')
*
DŽ/Dž/dž (considered a single letter in Slovak, Macedonian,
Croatian, and
Serbian, two letters in Czech, pronounced "jungle" - identical to the ''j'' sound in ''jungle'' and the ''g'' in ''genius'', found mostly in borrowings.)
*
Ě/ě (only in Czech) indicates mostly palatalization of preceding consonant:
** , , are , , ;
** but is or , and , , , are .
* Furthermore, until the 19th century,
Ǧ/ǧ was used to represent while
G/g was used to represent .
In
Lower Sorbian
Lower may refer to:
* ''Lower'' (album), 2025 album by Benjamin Booker
*Lower (surname)
*Lower Township, New Jersey
*Lower Receiver (firearms)
*Lower Wick
Lower Wick is a small hamlet located in the county of Gloucestershire, England. It is sit ...
and
Upper Sorbian
Upper Sorbian (), occasionally referred to as Wendish (), is a minority language spoken by Sorbs in the historical province of Upper Lusatia, today part of Saxony, Germany. It is a West Slavic language, along with Lower Sorbian, Czech, Poli ...
, the following letters and digraphs have the caron:
* Č/č (pronounced like 'ch' in ''cheap'')
* Š/š (pronounced like 'sh' in ''she'')
* Ž/ž (pronounced like 's' in ''treasure'')
* Ř/ř (only in Upper Sorbian: pronounced like 'sh' in ''she'')
* Tř/tř (digraph, only in Upper Sorbian, soft (palatalized) sound)
* Ě/ě (pronounced like 'e' in ''bed'')
Balto-Slavic,
Croatian,
Serbian,
Slovenian,
Latvian and
Lithuanian use č, š and ž. The digraph dž is also used in these languages but is considered a separate letter only in Croatian and Serbian. The
Belarusian Lacinka alphabet
The Belarusian Latin alphabet or Łacinka (from , BGN/PCGN: , ) for the Latin script in general is the Latin script as used to write Belarusian. It is similar to the Sorbian alphabet and incorporates features of the Polish and Czech alphabe ...
also contains the digraph dž (as a separate letter), and Latin transcriptions of
Bulgarian and
Macedonian may use them at times, for transcription of the letter-combination ДЖ (Bulgarian) and the letter Џ (Macedonian).
In Uralic languages
In the
Finnic languages
The Finnic or Baltic Finnic languages constitute a branch of the Uralic language family spoken around the Baltic Sea by the Baltic Finnic peoples. There are around 7 million speakers, who live mainly in Finland and Estonia.
Traditionally, ...
,
Estonian
Estonian may refer to:
* Something of, from, or related to Estonia, a country in the Baltic region in northern Europe
* Estonians, people from Estonia, or of Estonian descent
* Estonian language
* Estonian cuisine
* Estonian culture
See also ...
(and transcriptions to
Finnish) uses Š/š and Ž/ž, and
Karelian uses Č/č, Š/š and Ž/ž. Dž is not a separate letter. Č is present because it may be phonemically
geminate
In phonetics and phonology, gemination (; from Latin 'doubling', itself from '' gemini'' 'twins'), or consonant lengthening, is an articulation of a consonant for a longer period of time than that of a singleton consonant. It is distinct from ...
: in Karelian, the phoneme 'čč' is found, and is distinct from 'č', which is not the case in Finnish or Estonian, for which only one length is recognized for 'tš'. (Incidentally, in transcriptions, Finnish orthography has to employ complicated notations like or even the to express Karelian .) On some Finnish keyboards, it is possible to write those letters by typing ''s'' or ''z'' while holding right
Alt key
The Alt key (pronounced or ) on a computer keyboard is used to change (alternate) the function of other pressed keys. Thus, the Alt key is a modifier key, used in a similar fashion to the Shift key. For example, simply pressing ''A'' will ty ...
or
AltGr key
AltGr (also Alt Graph) is a modifier key found on computer keyboards. It is primarily used to type characters that are used less frequently in the language that the keyboard is designed for, such as foreign currency symbols, typographic mar ...
, though that is not supported by the Microsoft Windows keyboard device driver KBDFI.DLL for the Finnish language. The Finnish multilingual keyboard layout allows typing the letters
Š/š and
Ž/ž by pressing AltGr+'+S for
š and AltGr+'+Z for
ž.
In Estonian, Finnish and Karelian these are ''not''
palatalized but postalveolar consonants. For example, Estonian (palatalized) is distinct from (postalveolar).
Palatalization is typically ignored in spelling, but some Karelian and Võro orthographies use an
apostrophe
The apostrophe (, ) is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for two basic purposes:
* The marking of the omission of one o ...
(') or an acute accent (´). In Finnish and Estonian, ''š'' and ''ž'' (and in Estonian, very rarely ''č'') appear in loanwords and foreign
proper names
A proper noun is a noun that identifies a single entity and is used to refer to that entity (''Africa''; ''Jupiter''; ''Sarah (given name), Sarah''; ''Walmart'') as distinguished from a common noun, which is a noun that refers to a Class (philoso ...
only and when not available, they can be substituted with 'h': 'sh' for 'š', in print.
In the orthographies of the
Sami languages
Acronyms
* SAMI, ''Synchronized Accessible Media Interchange'', a closed-captioning format developed by Microsoft
* Saudi Arabian Military Industries, a government-owned defence company
* South African Malaria Initiative, a virtual expertise ...
, the letters Č/č, Š/š and Ž/ž appear in
Northern Sami
Northern may refer to the following:
Geography
* North, a point in direction
* Northern Europe, the northern part or region of Europe
* Northern Highland, a region of Wisconsin, United States
* Northern Province, Sri Lanka
* Northern Range, a ...
,
Inari Sami and
Skolt Sami. Skolt Sami also uses three other consonants with the caron: Ǯ/ǯ (
ezh-caron) to mark the voiced postalveolar affricate (plain Ʒ/ʒ marks the alveolar affricate ), Ǧ/ǧ to mark the voiced palatal affricate and Ǩ/ǩ the corresponding voiceless palatal affricate . More often than not, they are geminated: ''vuäǯǯad'' "to get". The orthographies of the more southern Sami languages of Sweden and Norway such as
Lule Sami
Lule Sámi (, , ) is a Uralic- Sámi language spoken around the Lule River in Sweden and in the northern parts of Nordland county in Norway. In Norway it is especially seen in Hamarøy Municipality (formerly Tysfjord Municipality), where Lule ...
do not use caron, and prefer instead the
digraphs ''tj'' and ''sj''.
Finno-Ugric transcription
Most other Uralic languages (including
Kildin Sami Kildin may refer to:
* Kildin Island
Kildin (also Kilduin; , North Sami: Gieldasuolu) is a small Russian island in the Barents Sea, off the Russian shore and about 120 km from Norway. Administratively, Kildin belongs to the Murmansk Obla ...
) are normally written with
Cyrillic
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
instead of the Latin script. In their scientific
transcription, the
Finno-Ugric Transcription /
Uralic Phonetic Alphabet
Finno-Ugric transcription (FUT) or the Uralic Phonetic Alphabet (UPA) is a phonetic transcription or notational system used predominantly for the transcription and reconstruction of Uralic languages. It was first published in 1901 by Eemil Nesto ...
however employs the letters ''š'', ''ž'' and occasionally ''č'', ''ǯ'' (alternately ''tš'', ''dž'') for the postalveolar consonants. These serve as basic letters, and with further diacritics are used to transcribe also other fricative and affricate sounds.
Retroflex consonant
A retroflex () or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated between the alveolar ridge and the hard palate. They are sometimes referred to as cerebral consona ...
s are marked by a caron and an
underdot
When used as a diacritic mark, the term dot refers to the glyphs "combining dot above" (, and "combining dot below" (
which may be combined with some letters of the extended Latin alphabets in use in
a variety of languages. Similar marks are ...
(''ṣ̌'', ''ẓ̌'' = IPA , ),
alveolo-palatal
In phonetics, alveolo-palatal (alveolopalatal, ''alveo-palatal'' or ''alveopalatal'') consonants, sometimes synonymous with pre-palatal consonants, are intermediate in articulation between the coronal and dorsal consonants, or which have simu ...
(palatalized postalveolar) consonants by a caron and an
acute (''š́'', ''ž́'' = IPA , ). Thus, for example, the postalveolar consonants of the
Udmurt language, normally written as Ж/ж, Ӝ/ӝ, Ӵ/ӵ, Ш/ш are in Uralic studies normally transcribed as ''ž'', ''ǯ'', ''č'', ''š'' respectively, and the alveolo-palatal consonants normally written as Зь/зь, Ӟ/ӟ, Сь/сь, Ч/ч are normally transcribed as ''ž́'', ''ǯ́'', ''š́'', ''č́'' respectively.
In other languages
In the
Berber Latin alphabet
The Berber Latin alphabet () is the version of the Latin alphabet used to write the Berber languages. It was adopted in the 19th century, using a variety of letters.
History
The Berber languages were originally written using the ancient ''Libyco- ...
of the
Berber language
The Berber languages, also known as the Amazigh languages or Tamazight, are a branch of the Afroasiatic language family. They comprise a group of closely related but mostly mutually unintelligible languages spoken by Berber communities, who ar ...
(North Africa) the following letters and digraphs are used with the caron:
* Č/č (pronounced like the English "ch" in China)
* Ǧ/ǧ (pronounced like the English "j" in the words "joke" and "James")
* Ř/ř (only in
Riffian Berber: pronounced ) (no English equivalent).
Finnish Kalo uses Ȟ/ȟ.
Lakota
Lakota may refer to:
*Lakota people, a confederation of seven related Native American tribes
*Lakota language
Lakota ( ), also referred to as Lakhota, Teton or Teton Sioux, is a Siouan languages, Siouan language spoken by the Lakota people of ...
uses Č/č, Š/š, Ž/ž, Ǧ/ǧ (voiced post-velar fricative) and Ȟ/ȟ (plain post-velar fricative).
Indonesian uses ě (e with caron) informally to mark the
schwa ().
Many alphabets of African languages use the caron to mark the rising tone, as in the
African reference alphabet
The African Reference Alphabet is a largely defunct continent-wide guideline for the creation of Latin alphabets for African languages. Two variants of the initial proposal (one in English and a second in French) were made at a 1978 UNESCO-organi ...
.
Outside of the Latin alphabet, the caron is also used for
Cypriot Greek
Cypriot Greek (, or ) is the variety of Modern Greek that is spoken by the majority of the Cypriot populace and Greek Cypriot diaspora. It is considered a divergent dialect as it differs from Standard Modern Greek in various aspects of its ...
letters that have a different sound from
Standard Modern Greek
The linguistic varieties of Modern Greek can be classified along two principal dimensions. First, there is a long tradition of sociolectal variation between the natural, popular spoken language on the one hand and archaizing, learned written forms ...
: σ̌ κ̌ π̌ τ̌ ζ̌ in words like ('and'), ('cat').
Other transcription and transliteration systems
The
DIN 31635
DIN 31635 is a (DIN) standard for the transliteration of the Arabic alphabet adopted in 1982. It is based on the rules of the (DMG) as modified by the International Orientalist Congress 1935 in Rome. The most important differences from English-ba ...
standard for transliteration of Arabic uses Ǧ/ǧ to represent the letter . ''
'', on account of the inconsistent pronunciation of
J in European languages, the variable pronunciation of the letter in
educated Arabic , and the desire of the DIN committee to have a one-to-one correspondence of Arabic to Latin letters in its system.
Romanization of
Pashto
Pashto ( , ; , ) is an eastern Iranian language in the Indo-European language family, natively spoken in northwestern Pakistan and southern and eastern Afghanistan. It has official status in Afghanistan and the Pakistani province of Khyb ...
uses Č/č, Š/š, Ž/ž, X̌/x̌, to represent the letters چ, ش, ژ, ښ, respectively. Additionally, Ṣ̌/ṣ̌ and Ẓ̌/ẓ̌ are used by the southern Pashto dialect only (replaced by X̌/x̌ and Ǵ/ǵ in the north).
The latter Š/š is also used to transcribe the phoneme in
Sumerian and
Akkadian cuneiform, and the phoneme in
Semitic languages
The Semitic languages are a branch of the Afroasiatic languages, Afroasiatic language family. They include Arabic,
Amharic, Tigrinya language, Tigrinya, Aramaic, Hebrew language, Hebrew, Maltese language, Maltese, Modern South Arabian language ...
represented by the letter
shin (Phoenician

and its descendants).
The caron is also used in
Mandarin Chinese
Mandarin ( ; zh, s=, t=, p=Guānhuà, l=Mandarin (bureaucrat), officials' speech) is the largest branch of the Sinitic languages. Mandarin varieties are spoken by 70 percent of all Chinese speakers over a large geographical area that stretch ...
pinyin
Hanyu Pinyin, or simply pinyin, officially the Chinese Phonetic Alphabet, is the most common romanization system for Standard Chinese. ''Hanyu'' () literally means 'Han Chinese, Han language'—that is, the Chinese language—while ''pinyin' ...
romanization and orthographies of several other
tonal language
Tone is the use of pitch in language to distinguish lexical or grammatical meaning—that is, to distinguish or to inflect words. All oral languages use pitch to express emotional and other para-linguistic information and to convey emphasi ...
s to indicate the "falling-rising"
tone
Tone may refer to:
Visual arts and color-related
* Tone (color theory), a mix of tint and shade, in painting and color theory
* Tone (color), the lightness or brightness (as well as darkness) of a color
* Toning (coin), color change in coins
* ...
(similar to the pitch made when asking "Huh?"). The caron can be placed over the vowels: ǎ, ě, ǐ, ǒ, ǔ, ǚ. The alternative to a caron is a number 3 after the syllable: = , as the "falling-rising" tone is the third tone in
Mandarin
Mandarin or The Mandarin may refer to:
Language
* Mandarin Chinese, branch of Chinese originally spoken in northern parts of the country
** Standard Chinese or Modern Standard Mandarin, the official language of China
** Taiwanese Mandarin, Stand ...
.
The caron is used in the
New Transliteration System of
D'ni
''Myst'' is a franchise centered on a series of adventure game, adventure video games. The first game in the series, ''Myst'', was released in 1993 by brothers Rand Miller, Rand and Robyn Miller and their video game company Cyan Worlds, Cyan, In ...
in the symbol š to represent the sound (English "sh").
A-caron (ǎ) is also used to transliterate the
Cyrillic
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
letter
Ъ () in
Bulgarian—it represents the
mid back unrounded vowel
The mid back unrounded vowel is a type of vowel sound, used in some spoken languages. Although there is no dedicated symbol in the International Phonetic Alphabet that represents the exact mid back unrounded vowel between close-mid and open-mid ...
.
Caron marks a falling and rising tone (bǔ, bǐ) in
Fon languages.
Letters with caron
Unicode encodes a number of cases of "letter with caron" as
precomposed character
A precomposed character (alternatively composite character or decomposable character) is a Unicode entity that can also be defined as a sequence of one or more other characters. A precomposed character may typically represent a letter with a diac ...
s and these are displayed below. In addition, many more symbols may be composed using the
combining character
In digital typography, combining characters are Character (computing), characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritic, diacritical marks (including c ...
facility ( and ) that may be used with any letter or other diacritic to create a customised symbol but this does not mean that the result has any real-world application; such customised characters are not shown in the table.
There are a number of Cyrillic letters with caron but they do not have precomposed characters and thus must be generated using the combining character method. These are:
В̌в̌;
Ǯǯ;
Г̌г̌;
Ғ̌ғ̌;
Д̌д̌;
З̌з̌;
Р̌р̌;
Т̌т̌;
Х̌х̌
Software
Unicode
For legacy reasons, most letters that carry carons are
precomposed character
A precomposed character (alternatively composite character or decomposable character) is a Unicode entity that can also be defined as a sequence of one or more other characters. A precomposed character may typically represent a letter with a diac ...
s in
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
, but a caron can also be added to any letter by using the
combining character
In digital typography, combining characters are Character (computing), characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritic, diacritical marks (including c ...
, for example: b̌ q̌ J̌. The
modifier letter version is encoded with .
The characters Č, č, Ě, ě, Š, š, Ž, ž are a part of the
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
Latin Extended-A
Latin Extended-A is a Unicode block and is the third block of the Unicode standard. It encodes Latin letters from the Latin ISO character sets other than Latin-1 (which is already encoded in the Latin-1 Supplement block) and also legacy characte ...
set because they occur in Czech and other official languages in Europe, while the rest are in
Latin Extended-B
Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version ...
, which often causes an inconsistent appearance.
Unicode also encodes , for example: p̬.
A combining double caron was proposed for inclusion in April, 2024.
See also
*
Acute accent
The acute accent (), ,
is a diacritic used in many modern written languages with alphabets based on the Latin alphabet, Latin, Cyrillic script, Cyrillic, and Greek alphabet, Greek scripts. For the most commonly encountered uses of the accen ...
*
Apostrophe
The apostrophe (, ) is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for two basic purposes:
* The marking of the omission of one o ...
*
Breve
A breve ( , less often , grammatical gender, neuter form of the Latin "short, brief") is the diacritic mark , shaped like the bottom half of a circle. As used in Ancient Greek, it is also called , . It resembles the caron (, the wedge or in ...
*
Caret
Caret () is the name used familiarly for the character provided on most QWERTY keyboards by typing . The symbol has a variety of uses in programming and mathematics. The name "caret" arose from its visual similarity to the original proofre ...
*
Circumflex accent
The circumflex () is a diacritic in the Latin and Greek scripts that is also used in the written forms of many languages and in various romanization and transcription schemes. It received its English name from "bent around"a translation of ...
*
Sicilicus
A sicilicus was an old Latin diacritical mark, , like a reversed C (Ɔ) placed above a letter and evidently deriving its name from its shape like a little sickle (which is ''wiktionary:sicilis#Latin, sicilis'' in Latin). The ancient sources say t ...
*
Soft sign
The soft sign (Ь ь; italics: ) is a letter in the Cyrillic script that is used in various Slavic languages. In Old Church Slavonic, it represented a short or reduced front vowel. However, over time, the specific vowel sound it denote ...
(ь)
Notes
References
External links
*
*
{{Latin script, , caron
Greek-script diacritics
Latin-script diacritics