HOME

TheInfoList



OR:

The Tamil script ( ) is an
abugida An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
script that is used by
Tamils The Tamils ( ), also known by their endonym Tamilar, are a Dravidian peoples, Dravidian ethnic group who natively speak the Tamil language and trace their ancestry mainly to the southern part of the Indian subcontinent. The Tamil language is o ...
and Tamil speakers in
India India, officially the Republic of India, is a country in South Asia. It is the List of countries and dependencies by area, seventh-largest country by area; the List of countries by population (United Nations), most populous country since ...
,
Sri Lanka Sri Lanka, officially the Democratic Socialist Republic of Sri Lanka, also known historically as Ceylon, is an island country in South Asia. It lies in the Indian Ocean, southwest of the Bay of Bengal, separated from the Indian subcontinent, ...
,
Malaysia Malaysia is a country in Southeast Asia. Featuring the Tanjung Piai, southernmost point of continental Eurasia, it is a federation, federal constitutional monarchy consisting of States and federal territories of Malaysia, 13 states and thre ...
,
Singapore Singapore, officially the Republic of Singapore, is an island country and city-state in Southeast Asia. The country's territory comprises one main island, 63 satellite islands and islets, and one outlying islet. It is about one degree ...
and elsewhere to write the
Tamil language Tamil (, , , also written as ''Tamizhil'' according to linguistic pronunciation) is a Dravidian language natively spoken by the Tamil people of South Asia. It is one of the longest-surviving classical languages in the world,. "Tamil is one of ...
. It is one of the
official scripts of the Indian Republic There are several official scripts of India, which are either used officially by the Union government or by the state governments. The official languages of the Indian Union are Hindi and English, whereas the Devanagari script is used to write Hin ...
. Certain minority languages such as Saurashtra, Badaga, Irula and Paniya are also written in the Tamil script.


Characteristics

The Tamil script has 12
vowel A vowel is a speech sound pronounced without any stricture in the vocal tract, forming the nucleus of a syllable. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness a ...
s (, , "soul-letters"), 18
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract, except for the h sound, which is pronounced without any stricture in the vocal tract. Examples are and pronou ...
s (, , "body-letters") and one special character, the (, ). is called "அக்கு", ''akku,'' and is classified in Tamil orthography as being neither a consonant nor a vowel. However, it is listed at the end of the vowel set. The script is syllabic, not
alphabet An alphabet is a standard set of letter (alphabet), letters written to represent particular sounds in a spoken language. Specifically, letters largely correspond to phonemes as the smallest sound segments that can distinguish one word from a ...
ic, and is written from left to right.


History

The Tamil script, like the other
Brahmic scripts The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout South Asia, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India and are used b ...
, is thought to have evolved from the original
Brahmi script Brahmi ( ; ; ISO 15919, ISO: ''Brāhmī'') is a writing system from ancient India. "Until the late nineteenth century, the script of the Aśokan (non-Kharosthi) inscriptions and its immediate derivatives was referred to by various names such as ...
. The earliest inscriptions which are accepted examples of Tamil writing date to the
Ashoka Ashoka, also known as Asoka or Aśoka ( ; , ; – 232 BCE), and popularly known as Ashoka the Great, was List of Mauryan emperors, Emperor of Magadha from until #Death, his death in 232 BCE, and the third ruler from the Mauryan dynast ...
n period. The script used by such inscriptions is commonly known as the Tamil-Brahmi or "Tamili script" and differs in many ways from standard Ashokan Brahmi. For example, early Tamil-Brahmi, unlike Ashokan Brahmi, had a system to distinguish between pure
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract, except for the h sound, which is pronounced without any stricture in the vocal tract. Examples are and pronou ...
s (''m'', in this example) and consonants with an inherent
vowel A vowel is a speech sound pronounced without any stricture in the vocal tract, forming the nucleus of a syllable. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness a ...
(''ma'', in this example). In addition, according to Iravatham Mahadevan, early Tamil Brahmi used slightly different vowel markers, had extra characters to represent letters not found in
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
and omitted letters for sounds not present in Tamil such as voiced consonants and aspirates. Inscriptions from the 2nd century use a later form of Tamil-Brahmi, which is substantially similar to the writing system described in the ''
Tolkāppiyam ''Tolkāppiyam'', also romanised as ''Tholkaappiyam'' ( , ''lit.'' "ancient poem"), is the oldest extant Tamil grammar text and the oldest extant long work of Tamil literature. It is the earliest Tamil text mentioning Gods, perhaps linked to ...
'', an ancient Tamil grammar. Most notably, they used the ''puḷḷi'' to suppress the inherent vowel. The Tamil letters thereafter evolved towards a more rounded form and by the 5th or 6th century, they had reached a form called the ''early vaṭṭeḻuttu''. The modern Tamil script does not, however, descend from that script. In the 4th century, the Pallava dynasty created a new script called Pallava script for Tamil and the Grantha alphabet evolved from it, adding the Vaṭṭeḻuttu alphabet for sounds not found to write Sanskrit. Parallel to Grantha alphabet a new script (Chola-Pallava script, which evolved to modern Tamil script) again emerged in Pallava and Chola territories resembling the same glyph development like Grantha, however, heavily reduced in its shapes and not overtaking non-native Tamil sounds, thus, ultimately descending from Pallava script. By the 8th century, the new scripts supplanted Vaṭṭeḻuttu in the Pallava and Chola kingdoms which lay in the north portion of the Tamil-speaking region. However, Vaṭṭeḻuttu continued to be used in the southern portion of the Tamil-speaking region, in the Chera and Pandyan kingdoms until the 11th century, when the Pandyan kingdom was conquered by the Cholas who inherited while being feudatory of Pallavas for a short time. With the fall of Pallava kingdom, the Chola dynasty pushed the Chola-Pallava script as the de facto script. Over the next few centuries, the Chola-Pallava script evolved into the modern Tamil script. The Grantha and its parent script influenced the Tamil script notably. The use of palm leaves as the primary medium for
writing Writing is the act of creating a persistent representation of language. A writing system includes a particular set of symbols called a ''script'', as well as the rules by which they encode a particular spoken language. Every written language ...
led to changes in the script. The scribe had to be careful not to pierce the leaves with the stylus while writing because a leaf with a hole was more likely to tear and decay faster. As a result, the use of the ''puḷḷi'' to distinguish pure consonants became rare, with pure consonants usually being written as if the inherent vowel were present. Similarly, the vowel marker () called: , a half-rounded ''u'' which occurs at the end of some words and in the medial position in certain compound words, marking a shortened ''u'' sound, also fell out of use and was replaced by the marker for the simple ''u'' ()''.'' The ''puḷḷi'' () did not fully reappear until the introduction of
printing Printing is a process for mass reproducing text and images using a master form or template. The earliest non-paper products involving printing include cylinder seals and objects such as the Cyrus Cylinder and the Cylinders of Nabonidus. The ...
, but the marker ''kuṟṟiyal-ukaram'' () never came back for this purpose into use although its usage is retained in certain grammatical conceptual words whereas the sound itself still exists and plays an important role in Tamil prosody. The forms of some of the letters were simplified in the 19th century to make the script easier to typeset. In the 20th century, the script was simplified even further in a series of reforms, which regularised the vowel markers used with consonants by eliminating special markers and most irregular forms.


Relationship with other Indic scripts

The Tamil script differs from other Brahmi-derived scripts in a number of ways. Unlike every other Brahmic script, it does not regularly represent voiced or aspirated
stop consonant In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases. The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lip ...
s as these are not
phoneme A phoneme () is any set of similar Phone (phonetics), speech sounds that are perceptually regarded by the speakers of a language as a single basic sound—a smallest possible Phonetics, phonetic unit—that helps distinguish one word fr ...
s of the Tamil language even though voiced and fricative
allophone In phonology, an allophone (; from the Greek , , 'other' and , , 'voice, sound') is one of multiple possible spoken soundsor '' phones''used to pronounce a single phoneme in a particular language. For example, in English, the voiceless plos ...
s of stops do appear in spoken Tamil. Thus the character ''k'', for example, represents but can also be pronounced [] or [] based on the rules of Tamil phonology. A separate set of characters appears for these sounds when the Tamil script is used to write Sanskrit or other languages. Also unlike other Brahmi scripts, the Tamil script rarely uses
typographic ligature In writing and typography, a ligature occurs where two or more graphemes or letters are joined to form a single glyph. Examples are the characters and used in English and French, in which the letters and are joined for the first ligature ...
s to represent conjunct consonants, which are far less frequent in Tamil than in other Indian languages. Where they occur, conjunct consonants are written by writing the character for the first consonant, adding the ''puḷḷi'' to suppress its inherent vowel, and then writing the character for the second consonant. There are a few exceptions, namely ''kṣa'' and ''śrī''.
ISO 15919 ISO 15919 is an international standard for the romanization of Indic scripts. Published in 2001, it is part of a series of romanization standards by the International Organization for Standardization. Overview Relation to other systems ...
is an international standard for the transliteration of Tamil and other Indic scripts into Latin characters. It uses diacritics to map the much larger set of Brahmic consonants and vowels to the Latin script.


Letters


Basic consonants

Consonants are called the "body" (''mei'') letters. The consonants are classified into three categories: ''vallinam'' (hard consonants), ''mellinam'' (soft consonants, including all nasals), and ''itayinam'' (medium consonants). There are some lexical rules for the formation of words. The ''
Tolkāppiyam ''Tolkāppiyam'', also romanised as ''Tholkaappiyam'' ( , ''lit.'' "ancient poem"), is the oldest extant Tamil grammar text and the oldest extant long work of Tamil literature. It is the earliest Tamil text mentioning Gods, perhaps linked to ...
'' describes such rules. Some examples: a word cannot end in certain consonants, and cannot begin with some consonants including r-, l- and ḻ-; there are six nasal consonants in Tamil: a
velar nasal The voiced velar nasal, also known as eng, engma, or agma (from Greek 'fragment'), is a type of consonantal sound used in some spoken languages. It is the sound of ''ng'' in English ''sing'' as well as ''n'' before velar consonants as in ''E ...
ங், a
palatal nasal The voiced palatal nasal is a type of consonant used in some spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is , a lowercase letter ''n'' with a leftward-pointing tail protruding from the bottom ...
ஞ், a retroflex nasal ண், a
dental nasal The voiced alveolar nasal is a type of consonantal sound used in numerous spoken languages. The symbol in the International Phonetic Alphabet that represents dental, alveolar, and postalveolar nasals is , and the equivalent X-SAMPA symbol ...
ந், a bilabial nasal ம், and an alveolar nasal ன். The order of the alphabet (strictly
abugida An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
) in Tamil closely matches that of the nearby languages both in location and linguistics, reflecting the common origin of their scripts from Brahmi. Tamil language has 18 consonants - ''mey eluttukkal''. Traditional grammarians have classified these 18 into three groups of 6 letters each. This classification is done based on the method of articulation and hence the nature of these letters. ''Vallinam'' (hard group), ''mellinam'' (soft group) and ''idaiyinam'' (medium group). All consonants are pronounced for a half unit (māttirai) time length when isolated (consonants combined with vowels will be pronounced with the time length of the vowel).


Extra consonants used in Tamil

The Tamil speech has incorporated many phonemes that were not part of the
Tolkāppiyam ''Tolkāppiyam'', also romanised as ''Tholkaappiyam'' ( , ''lit.'' "ancient poem"), is the oldest extant Tamil grammar text and the oldest extant long work of Tamil literature. It is the earliest Tamil text mentioning Gods, perhaps linked to ...
classification. The letters used to write these sounds, known as Grantha, are used as part of Tamil. These are taught from elementary school and incorporated in Tamil All Character Encoding (TACE16). There is also the compound (), equivalent to in
Devanagari Devanagari ( ; in script: , , ) is an Indic script used in the Indian subcontinent. It is a left-to-right abugida (a type of segmental Writing systems#Segmental systems: alphabets, writing system), based on the ancient ''Brāhmī script, Brā ...
. Combinations of consonants with (, , equivalent to nuqta) are occasionally used to represent phonemes of foreign languages, especially to write
Islamic Islam is an Abrahamic religions, Abrahamic monotheistic religion based on the Quran, and the teachings of Muhammad. Adherents of Islam are called Muslims, who are estimated to number Islam by country, 2 billion worldwide and are the world ...
and
Christian A Christian () is a person who follows or adheres to Christianity, a Monotheism, monotheistic Abrahamic religion based on the life and teachings of Jesus in Christianity, Jesus Christ. Christians form the largest religious community in the wo ...
texts. For example: asif = , azārutīn̠ = ,
Genghis Khan Genghis Khan (born Temüjin; August 1227), also known as Chinggis Khan, was the founder and first khan (title), khan of the Mongol Empire. After spending most of his life uniting the Mongols, Mongol tribes, he launched Mongol invasions and ...
= . A nuqta-like diacritic is used while writing the
Badaga language Badaga is a southern Dravidian language spoken by the Badagas, Badaga people of the Nilgiris district of Tamil Nadu. The language is closely related to the Kannada language with heavy influence from the Tamil language, Tamil language. Of al ...
and double dot nuqta for the
Irula language Irula is a Dravidian language spoken by the Irulas who inhabit the area of the Nilgiri mountains, in the states of Tamil Nadu, Kerala, and Karnataka, India. It is closely related to Tamil. It is written in the Tamil script. Origins The la ...
to transcribe its sounds. There has also been effort to differentiate voiced and voiceless consonants through subscripted numbers – two, three, and four which stand for the unvoiced aspirated, voiced, voiced aspirated respectively. This was used to transcribe Sanskrit words in Sanskrit–Tamil books, as shown in the table below.Sharma, Shriramana. (2010a)
''Proposal to encode characters for Extended Tamil''.
/ref>Sharma, Shriramana. (2010c)
''Follow-up #2 to Extended Tamil proposal''.
/ref> The Unicode Standard uses superscripted digits for the same purpose, as in , , and .


Vowels

Vowels are also called the 'life' (''uyir'') or 'soul' letters. Together with the consonants (''mei'', which are called 'body' letters), they form compound, syllabic (
abugida An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
) letters that are called 'living' or 'embodied' letters (''uyir mei'', i.e. letters that have both 'body' and 'soul'). Tamil language has 12 vowels which are divided into short and long (five of each type) and two
diphthong A diphthong ( ), also known as a gliding vowel or a vowel glide, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of ...
s.


Compound form

Using the consonant 'k' as an example: The special letter , represented by three dots, is called or ''aḵ''. It originally represented an archaic Tamil retention of the Dravidian sound ḥ, which has been lost in almost all modern Dravidian languages, and in Tamil traditionally serves a purely grammatical function, but in modern times it has come to be used as a diacritic to represent foreign sounds. For example, is used for the English sound ''f'', not found in Tamil. It also served before palm leaves became the primary writing medium for words ending with an inherent consonsant-vowel ''u'' as a pronouncing rule for a short ''u'', called'. Following consonants rendered this behaviour: , , , , , . Instead of writing like in modern days without any markers, for example (), it was written with a preceding , like . Another archaic Tamil letter , represented by a small hollow circle and called , is the '' Anusvara''. It was traditionally used as a
homorganic In phonetics, a homorganic consonant (from Latin and ) is a consonant sound that is articulated in the same place of articulation as another. For example, , and are homorganic consonants of one another since they share the bilabial place of ...
nasal when in front of a consonant, and either as a bilabial nasal () or alveolar nasal () at the end of a word, depending on the context. The long () vowels are about twice as long as the short () vowels. The
diphthong A diphthong ( ), also known as a gliding vowel or a vowel glide, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of ...
s are usually pronounced about one and a half times as long as the short vowels, though some grammatical texts place them with the long () vowels. As can be seen in the compound form, the vowel sign can be added to the right, left or both sides of the consonants. It can also form a ligature. These rules are evolving and older use has more ligatures than modern use. What you actually see on this page depends on your font selection; for example,
Code2000 Code2000 is a serif and pan- Unicode digital font, which includes characters and symbols from a very large range of writing systems. As of the current version 1.176 released in 2023, Code2000 is designed and implemented by James Kass to inclu ...
will show more ligatures than Latha. There are proponents of script reform who want to eliminate all ligatures and let all vowel signs appear on the right side. Unicode encodes the character in logical order (always the consonant first), whereas legacy 8-bit encodings (such as TSCII) prefer the written order. This makes it necessary to reorder when converting from one encoding to another; it is not sufficient simply to map one set of code points to the other.


Compound table of Tamil letters

The following table lists vowel ( or life) letters across the top and consonant ( or body) letters along the side, the combination of which gives all Tamil compound () letters.


Writing order


Numerals and symbols

Apart from the usual numerals (from 0 to 9), Tamil also has numerals for 10, 100 and 1000. Symbols for
fraction A fraction (from , "broken") represents a part of a whole or, more generally, any number of equal parts. When spoken in everyday English, a fraction describes how many parts of a certain size there are, for example, one-half, eight-fifths, thre ...
and other number-based concepts can also be found.


Unicode

Tamil script was added to the
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
Standard in October 1991 with the release of version 1.0.0. The Unicode block for Tamil is U+0B80–U+0BFF. Grey areas indicate non-assigned code points. Most of the non-assigned code points are designated reserved because they are in the same relative position as characters assigned in other South Asian script blocks that correspond to phonemes that don't exist in the Tamil script. Efforts to unify the Grantha script with Tamil have been made;Sharma, Shriramana. (2010b)
''Follow-up to Extended Tamil proposal L2/10-256R''.
/ref> however the proposals triggered discontent by some. Eventually, considering the sensitivity involved, it was determined that the two scripts should be encoded independently, except for the numerals. Proposals to encode characters used for fractional values in traditional
accounting Accounting, also known as accountancy, is the process of recording and processing information about economic entity, economic entities, such as businesses and corporations. Accounting measures the results of an organization's economic activit ...
practices were submitted. Although discouraged by the ICTA of Sri Lanka, the proposal was recognized by the Government of
Tamil Nadu Tamil Nadu (; , TN) is the southernmost States and union territories of India, state of India. The List of states and union territories of India by area, tenth largest Indian state by area and the List of states and union territories of Indi ...
and were added to the Unicode Standard in March 2019 with the release of version 12.0. The Unicode block for Tamil Supplement is U+11FC0–U+11FFF:


Syllabary

Like other South Asian scripts in Unicode, the Tamil encoding was originally derived from the
ISCII Indian Standard Code for Information Interchange (ISCII) is a coding scheme for representing various writing systems of India. It encodes the main Indic scripts and a Roman transliteration. The supported scripts are: Eastern Nagari, Bengali–Ass ...
standard. Both
ISCII Indian Standard Code for Information Interchange (ISCII) is a coding scheme for representing various writing systems of India. It encodes the main Indic scripts and a Roman transliteration. The supported scripts are: Eastern Nagari, Bengali–Ass ...
and
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
encode Tamil as an
abugida An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
. In an abugida, each basic character represents a consonant and default vowel. Consonants with a different vowel or bare consonants are represented by adding a modifier character to a base character. Each code point representing a similar phoneme is encoded in the same relative position in each South Asian script block in Unicode, including Tamil. Because Unicode represents Tamil as an abugida all the pure consonants (consonants with no associated vowel) and syllables in Tamil can be represented by combining multiple Unicode code points, as can be seen in the Unicode Tamil Syllabary below. In Unicode 5.1, named sequences were added for all Tamil consonants and syllables. Unicode 5.1 also has a named sequence for the Tamil ligature SRI (''śrī''), ஶ்ரீ, written using ஶ (''śa''). The name of this sequence is TAMIL SYLLABLE SHRII and is composed of the Unicode sequence U+0BB6 U+0BCD U+0BB0 U+0BC0. The ligature can also be written using ஸ (''sa'') to create an identical ligature ஸ்ரீ composed of the Unicode sequence U+0BB8 U+0BCD U+0BB0 U+0BC0; but this is discouraged by the Unicode standard.


See also

* Simplified Tamil script * Tamil phonology *
Tamil Keyboard The Tamil language, Tamil keyboard layout, keyboard is used in computers and mobile devices to input text in the Tamil script. The keyboard layout approved by the Government of Tamil Nadu is Tamil 99. The InScript keyboard is the keyboard layout ...
* Tamil Braille * Tamil letters (on Tamil Wikibooks) *
Tamil numerals The Tamil language has Numeral (linguistics), number words and dedicated symbols for them in the Tamil script. Basic numbering in Tamil Zero Old Tamil possesses a special numerical character for zero ''(see #Old Tamil numerals, Old Tamil ...
* Tamil units of measurement * Grantha script * Vatteluttu script * Tamil-Brahmi * Pallava script * Kolezhuthu *
Arwi Arwi ( ) or Arabu-Tamil (, is an Arabic-influenced dialect of the Tamil language written with an extension of the Arabic alphabet, with extensive lexical and phonetic influences from the Arabic language. Arwi has been used extensively by the ...
* Tamil bell *
Malayalam Script Malayalam script (; / ) is a Brahmic scripts, Brahmic script used to write Malayalam, the principal language of Kerala, India, spoken by 45 million people. It is a Dravidian language spoken in the Indian state of Kerala and the union ter ...
*
ISO 15919 ISO 15919 is an international standard for the romanization of Indic scripts. Published in 2001, it is part of a series of romanization standards by the International Organization for Standardization. Overview Relation to other systems ...


Notes


References

* *


External links


Findings at Keeladi Site dates back to 6th Century BCETamil Alphabet & Basics
PDF) *


Unicode Chart
– For Tamil (
PDF Portable document format (PDF), standardized as ISO 32000, is a file format developed by Adobe Inc., Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, computer hardware, ...
)
TACE 16
(PDF)
Learn TamilTamil LettersTamil Unicode Keyboard
{{DEFAULTSORT:Tamil Script Script, Tamil Brahmic scripts