Virama ( ्, ) is a
Sanskrit
Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
phonological concept to suppress the
inherent vowel
An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol.
There are many known abugida scripts, including most of the Brahmic scripts and Kharosthi, the c ...
that otherwise occurs with every consonant letter, commonly used as a generic term for a codepoint in Unicode, representing either
# halanta, hasanta or explicit virāma, a
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
in many
Brahmic scripts
The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout South Asia, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India and are used b ...
, including the
Devanagari
Devanagari ( ; in script: , , ) is an Indic script used in the Indian subcontinent. It is a left-to-right abugida (a type of segmental Writing systems#Segmental systems: alphabets, writing system), based on the ancient ''Brāhmī script, Brā ...
and
Bengali scripts, or
# saṃyuktākṣara (
Sanskrit
Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
: संयुक्ताक्षर) or implicit virama, a conjunct consonant or ligature.
Unicode schemes of scripts writing
Mainland Southeast Asia languages, such as that of
Burmese script
Burmese may refer to:
* Something of, from, or related to Myanmar, a country in Southeast Asia
* Burmese people
* Burmese language
* Burmese alphabet
* Burmese cuisine
* Burmese culture
Animals
* Burmese cat
* Burmese chicken
* Burmese (horse)
...
and of
Tibetan script
The Tibetan script is a segmental writing system, or '' abugida'', forming a part of the Brahmic scripts, and used to write certain Tibetic languages, including Tibetan, Dzongkha, Sikkimese, Ladakhi, Jirel and Balti. Its exact origins ...
, generally do not group the two functions together.
Names
The name is
Sanskrit
Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
for "cessation, termination, end". As a
Sanskrit
Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
word, it is used in place of several language-specific terms, such as:
Usage
In
Devanagari
Devanagari ( ; in script: , , ) is an Indic script used in the Indian subcontinent. It is a left-to-right abugida (a type of segmental Writing systems#Segmental systems: alphabets, writing system), based on the ancient ''Brāhmī script, Brā ...
and many other
Indic scripts
The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout South Asia, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India and are used b ...
, a virama is used to cancel the
inherent vowel
An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol.
There are many known abugida scripts, including most of the Brahmic scripts and Kharosthi, the c ...
of a consonant letter and represent a consonant without a vowel, a "dead" consonant. For example, in Devanagari,
# is a consonant letter, ''ka'',
#् is a virāma; therefore,
# (''ka'' + virāma) represents a dead consonant ''k''.
If this ''k'' is further followed by another consonant letter, for example, ṣa ष, the result might look like , which represents ''kṣa'' as ''ka'' + (visible) virāma + ''ṣa''. In this case, two elements ''k'' क् and ''ṣa'' ष are simply placed one by one, side by side. Alternatively, ''kṣa'' can be also written as a
ligature Ligature may refer to:
Language
* Ligature (writing), a combination of two or more letters into a single symbol (typography and calligraphy)
* Ligature (grammar), a morpheme that links two words
Medicine
* Ligature (medicine), a piece of suture us ...
, which is actually the preferred form.
Generally, when a dead consonant letter C
1 and another consonant letter C
2 are conjoined, the result may be:
#A fully conjoined ligature of C
1+C
2;
#Half-conjoined—
#*C
1-conjoining: a modified form (half form) of C
1 attached to the original form (full form) of C
2
#*C
2-conjoining: a modified form of C
2 attached to the full form of C
1; or
#Non-ligated: full forms of C
1 and C
2 with a visible virama.
If the result is fully or half-conjoined, the (conceptual) virama which made C
1 dead becomes invisible, logically existing only in a
character encoding
Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
scheme such as
ISCII
Indian Standard Code for Information Interchange (ISCII) is a coding scheme for representing various writing systems of India. It encodes the main Indic scripts and a Roman transliteration. The supported scripts are: Eastern Nagari, Bengali–Ass ...
or
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
. If the result is not ligated, a virama is visible, attached to C
1, actually written.
Basically, those differences are only glyph variants, and the three forms are
semantically identical. Although there may be a preferred form for a given consonant cluster in each language and some scripts do not have some kind of ligatures or half forms at all, it is generally acceptable to use a nonligature form instead of a ligature form even when the latter is preferred if the font does not have a glyph for the ligature. In some other cases, whether to use a ligature or not is just a matter of taste.
The virāma in the sequence C
1 + virāma + C
2 may thus work as an invisible control character to ligate C
1 and C
2 in Unicode. For example,
*''ka'' क + virāma + ṣa ष = ''kṣa''
is a fully conjoined ligature. It is also possible that the virāma does not ligate C
1 and C
2, leaving the full forms of C
1 and C
2 as they are:
*''ka'' + virama + ''ṣa'' = ''kṣa''
is an example of such a non-ligated form.
The sequences ङ्क ङ्ख ङ्ग ङ्घ , in common Sanskrit orthography, should be written as conjuncts (the virāma and the top cross line of the second letter disappear, and what is left of the second letter is written under the ङ and joined to it).
End of word
The
inherent vowel
An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol.
There are many known abugida scripts, including most of the Brahmic scripts and Kharosthi, the c ...
is not always pronounced, in particular at the end of a word (
schwa deletion). No virāma is used for vowel suppression in such cases. Instead, the orthography is based on Sanskrit where all inherent vowels are pronounced, and leaves to the reader of modern languages to delete the schwa when appropriate.
[Akira Nakanishi: Writing Systems of the World, , pp. 48.]
See also
*
Sukun
The Arabic script has numerous diacritics, which include consonant pointing known as (, ), and supplementary diacritics known as (, ). The latter include the vowel marks termed (, ; , ', ).
The Arabic script is a modified abjad, where all ...
, a similar diacritic in Arabic script
*
Zero consonant
In orthography, a zero consonant, silent initial, or null-onset letter is a consonant letter that does not correspond to a consonant sound, but is required when a word or syllable starts with a vowel (i.e. has a null onset). Some abjads, abugid ...
References
External links
Blog: Sorting it all Out
{{Navbox diacritical marks
Brahmic diacritics