Tamil/Tamil Script

In Tamil, there are 30 characters. The Tamil alphabet has 12 vowels and 18 consonants. (The practice of taking the Cartesian product of the vowels and consonants, and counting every combination as a letter, is also widely seen. The supporting argument is that a consonant used with a vowel forms a separate letter. But of course one cannot use any consonant without suffixing it with a vowel!)

The vowels are divided into short and long (five of each type) and two diphthongs (ஐ and ஔ). The consonants are classified into three categories with six in each: vallinam (hard), mellinam (soft or nasal), and idaiyinam (medium). Unlike Devanagari, Tamil has neither conjunct consonants nor aspirated and voiced stops. Some scholars have suggested that in Sentamil (which refers to Tamil as it existed before Sanskrit words were borrowed), stops were voiceless at the start of a word and voiced otherwise. However, no such distinction is observed by modern Tamil speakers.

The script is sometimes called Vattezhuthu, literally "round writing". This characteristic has partly to do with the fact that in ancient times writing involved carving with a sharp point on palm leaves (olaichuvadi), and it was apparently easier to produce curves than straight lines by this method. The script is syllabic, in the sense that each letter is a syllable. However, the sign for each syllable is derived from that of its underlying consonant; thus the script is of the abugida type. Some syllables are written by modifying the shape of the consonant in a way that is specific to the vowel, others by adding a vowel-specific suffix to the consonant, yet others by adding a prefix, and finally some vowels require both a prefix and a suffix. In every case the vowel sign differs from the form the vowel takes when standing alone. An overdot (the pulli), equivalent to the Devanagari virama sign, suppresses the inherent trailing 'a' sound of the consonant sign, leaving a pure consonant.
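The behaviour of the overdot and the dependent vowel signs can be illustrated with a few Unicode code points. This is a minimal sketch; the helper names `pure_consonant` and `syllable` are made up for illustration and are not part of any library.

```python
import unicodedata

# Code points from the Unicode Tamil block.
KA = "\u0B95"        # the consonant sign, read "ka" with its inherent 'a'
PULLI = "\u0BCD"     # the overdot
SIGN_I = "\u0BBF"    # dependent sign for the vowel i
LETTER_I = "\u0B87"  # the same vowel standing alone


def pure_consonant(consonant):
    """Suppress the inherent 'a' by appending the overdot (hypothetical helper)."""
    return consonant + PULLI


def syllable(consonant, vowel_sign=""):
    """Consonant plus a dependent vowel sign; no sign means the inherent 'a'."""
    return consonant + vowel_sign


print(pure_consonant(KA))        # k
print(syllable(KA))              # ka
print(syllable(KA, SIGN_I))      # ki
print(unicodedata.name(PULLI))   # Unicode names the overdot after the virama
print(unicodedata.name(SIGN_I), "vs", unicodedata.name(LETTER_I))
```

Note that the dependent sign and the standalone vowel are distinct characters, which mirrors the statement that the vowel symbol in a syllable differs from the vowel standing alone.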

There are some lexical rules for the formation of words. For example, a word cannot end in certain consonants and cannot begin with certain others, including 'r', 'l' and 'll'. There are also two consonants for the dental 'n'; which one is used depends on whether the 'n' occurs at the start of the word and on the letters around it.

The Tamil letters


Basic Consonants


Consonants are also called the 'body' letters.


Consonant Sound Category
க ka vallinam
ங nga mellinam
ச cha vallinam
ஞ nja mellinam
ட tta vallinam
ண nnna mellinam
த tha vallinam
ந na mellinam
ப pa vallinam
ம ma mellinam
ய ya idaiyinam
ர ra idaiyinam
ல la idaiyinam
வ va idaiyinam
ழ zha idaiyinam
ள lla idaiyinam
ற rra vallinam
ன nna mellinam
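The three-way classification in the table above can be written down as a small lookup. This is only a sketch: the dictionary `CATEGORIES` and the function `category_of` are illustrative names, and the letters are grouped exactly as the table lists them.

```python
# Each category holds six of the 18 basic consonants, as in the table above.
CATEGORIES = {
    "vallinam": "கசடதபற",    # hard
    "mellinam": "ஙஞணநமன",    # soft or nasal
    "idaiyinam": "யரலவழள",   # medium
}


def category_of(consonant):
    """Return the category name for a basic consonant, or None if not found."""
    for name, letters in CATEGORIES.items():
        if consonant in letters:
            return name
    return None


for name, letters in CATEGORIES.items():
    print(name, len(letters), " ".join(letters))
```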

Borrowed consonants


Also called Grantha letters, these are used exclusively for writing words borrowed from Sanskrit, English, and other languages. Of course, not all such words include these letters.


Consonant Sound
ஜ ja
ஷ sha
ஸ sa
ஹ ha

Vowels


Vowels are also called the 'life' or 'soul' letters. Together with the consonants (which are called 'body' letters), they form compound, syllabic (abugida) letters that are called 'living' letters (i.e. letters that have both 'body' and 'soul').

Isolated Form

Vowel Sound
அ Short a
ஆ Long a or aa
இ Short i or e
ஈ Long i or ee
உ Short u
ஊ Long u or uu
எ Short e or ae
ஏ Long e or aee
ஐ Diphthong ai (also considered long)
ஒ Short o
ஓ Long o or oo
ஔ Diphthong au (also considered long)

Compound Form


The table below uses the consonant 'k' as an example.

Compound Transliteration
க் k
க ka
கா kaa
கி ki
கீ kii
கு ku
கூ kuu
கெ ke
கே kee
கை kai
கொ ko
கோ koo
கௌ kau
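The same pattern applies to every basic consonant, so the table above can be generated rather than typed. The sketch below builds the row for one consonant and the full grid of 18 consonants by 12 vowels; `compound_row` and `compound_grid` are hypothetical helper names, and the vowel signs are given as Unicode escapes so that they are unambiguous.

```python
PULLI = "\u0BCD"  # overdot that marks a pure consonant

# The 18 basic consonants in the order of the consonant table.
CONSONANTS = "கஙசஞடணதநபமயரலவழளறன"

# Dependent vowel signs for the 12 vowels; the empty string stands for
# the inherent short 'a', which needs no sign.
VOWEL_SIGNS = [
    "",        # a
    "\u0BBE",  # aa
    "\u0BBF",  # i
    "\u0BC0",  # ii
    "\u0BC1",  # u
    "\u0BC2",  # uu
    "\u0BC6",  # e
    "\u0BC7",  # ee
    "\u0BC8",  # ai
    "\u0BCA",  # o
    "\u0BCB",  # oo
    "\u0BCC",  # au
]


def compound_row(consonant):
    """Pure consonant followed by its 12 vowel compounds, as in the table."""
    return [consonant + PULLI] + [consonant + sign for sign in VOWEL_SIGNS]


def compound_grid():
    """Every consonant-vowel compound: the Cartesian product of both sets."""
    return [c + sign for c in CONSONANTS for sign in VOWEL_SIGNS]


print(" ".join(compound_row("\u0B95")))   # the 'k' row
print(len(compound_grid()))               # 18 * 12 compounds
```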

The special letter ஃ (aytham, pronounced 'akh') is rarely used by itself. It normally serves a purely grammatical function, as the independent form of the dot that suppresses the inherent 'a' sound of a plain consonant.

The long ('nedil') vowels are about twice as long as the short ('kuRil') vowels. The diphthongs are usually pronounced about 1.5 times as long as the short vowels, though some grammatical texts place them with the long ('nedil') vowels.

As can be seen in the compound form, the vowel sign can be added to the right, left or both sides of the consonants. It can also form a ligature. These rules are evolving and older use has more ligatures than modern use. What you actually see on this page depends on your font selection. 'Code 2000' will show more ligatures than 'Latha'.
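The vowel signs that sit on both sides of the consonant are, in Unicode, canonically equivalent to a left part followed by a right part. A short sketch using Python's standard `unicodedata` module makes this visible; `sign_parts` is an illustrative helper name.

```python
import unicodedata


def sign_parts(sign):
    """Names of the pieces a vowel sign splits into under NFD normalization."""
    return [unicodedata.name(ch) for ch in unicodedata.normalize("NFD", sign)]


# The three two-sided vowel signs: o, oo, au.
for sign in ("\u0BCA", "\u0BCB", "\u0BCC"):
    print(unicodedata.name(sign), "->", sign_parts(sign))

# A one-sided sign is left unchanged by the decomposition.
print(sign_parts("\u0BBE"))
```

How the pieces are actually drawn, and whether a ligature is formed, is still decided by the font, as noted above.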

There are proponents of script reform who want to eliminate all ligatures and let all vowel signs appear on the right side.

Unicode encodes each syllable in logical order (always the consonant first), whereas legacy 8-bit encodings (like TSCII) prefer the written order. Transcoding between the two is therefore a problem: characters have to be reordered, not just remapped.
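The reordering step can be sketched as follows. The function `to_visual_order` is a hypothetical illustration of moving left-side vowel signs in front of their consonant; it works purely on Unicode code points and does not reproduce the byte values of TSCII or any other legacy encoding.

```python
import unicodedata

# Vowel signs that are written to the left of the consonant (e, ee, ai).
LEFT_SIGNS = {"\u0BC6", "\u0BC7", "\u0BC8"}


def to_visual_order(text):
    """Rearrange logical-order text into left-to-right written order.

    Two-sided signs are first split into their left and right parts (NFD),
    then each left-side sign is moved before the consonant it follows.
    Assumes well-formed input in which a sign directly follows its consonant.
    """
    out = []
    for ch in unicodedata.normalize("NFD", text):
        if ch in LEFT_SIGNS and out:
            out.insert(len(out) - 1, ch)  # place the sign before the consonant
        else:
            out.append(ch)
    return "".join(out)


def code_points(text):
    """Readable listing of the code points in a string."""
    return " ".join("U+%04X" % ord(ch) for ch in text)


ko = "\u0B95\u0BCA"  # the syllable 'ko' in logical order
print(code_points(ko))
print(code_points(to_visual_order(ko)))
```

A real converter would also need the inverse transformation and the legacy encoding's own code table, both of which are outside this sketch.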

Digraphs

Digraph IPA
ஞ்ச் /ntʃ/
ந்த் /nð/

Tamil in Unicode


The Unicode range for Tamil is U+0B80 ... U+0BFF.

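      0 1 2 3 4 5 6 7 8 9 A B C D E F
B80   · · ஂ ஃ · அ ஆ இ ஈ உ ஊ · · · எ ஏ
B90   ஐ · ஒ ஓ ஔ க · · · ங ச · ஜ · ஞ ட
BA0   · · · ண த · · · ந ன ப · · · ம ய
BB0   ர ற ல ள ழ வ ஶ ஷ ஸ ஹ · · · · ா ி
BC0   ீ ு ூ · · · ெ ே ை · ொ ோ ௌ ் · ·
BD0   ௐ · · · · · · ௗ · · · · · · · ·
BE0   · · · · · · ௦ ௧ ௨ ௩ ௪ ௫ ௬ ௭ ௮ ௯
BF0   ௰ ௱ ௲ ௳ ௴ ௵ ௶ ௷ ௸ ௹ ௺ · · · · ·

(· marks an unassigned code point; vowel signs are shown without a base consonant.)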
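The contents of this range can also be listed programmatically. The sketch below uses Python's standard `unicodedata` module to collect every code point in the range that has a name; the exact set depends on the Unicode version bundled with the Python interpreter, and `assigned` is an illustrative function name.

```python
import unicodedata

TAMIL_BLOCK = range(0x0B80, 0x0C00)  # U+0B80 through U+0BFF inclusive


def assigned(block=TAMIL_BLOCK):
    """Return (code point, name) pairs for the named characters in the block."""
    pairs = []
    for cp in block:
        name = unicodedata.name(chr(cp), "")  # "" when the code point is unassigned
        if name:
            pairs.append((cp, name))
    return pairs


for cp, name in assigned():
    print("U+%04X  %s" % (cp, name))
```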
