dc.contributor.author |
Santhosh Kumar, G |
|
dc.contributor.author |
Sheena Kurian, K |
|
dc.contributor.author |
Mary, Priya Sebastian |
|
dc.date.accessioned |
2014-07-21T08:47:22Z |
|
dc.date.available |
2014-07-21T08:47:22Z |
|
dc.date.issued |
2009 |
|
dc.identifier.uri |
http://dyuthi.cusat.ac.in/purl/4185 |
|
dc.description.abstract |
Suffix separation plays a vital role in improving the quality of
training in statistical machine translation (SMT) from English into Malayalam.
The morphological richness and the agglutinative nature of Malayalam make it
necessary to retrieve the root word from its inflected form in the training
process. The suffix separation process accomplishes this task by scrutinizing the
Malayalam words and by applying sandhi rules. In this paper, various
handcrafted rules designed for the suffix separation process in English-Malayalam
SMT are presented. These rules are classified by
the Malayalam syllable preceding the suffix in the inflected form of the word
(check_letter). Suffixes beginning with vowel sounds, such as ആൽ, ഉടെ and ഇൽ,
are the main focus of this process. By examining the check_letter in a
word, the suffix separation rules can be directly applied to extract the root
words. The quick lookup table provided in this paper can be used as a guideline
for implementing suffix separation in the Malayalam language. |
en_US |
dc.description.sponsorship |
Cochin University of Science and Technology |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
Cochin University of Science and Technology |
en_US |
dc.subject |
suffix separation |
en_US |
dc.subject |
sandhi rules |
en_US |
dc.subject |
English Malayalam translation |
en_US |
dc.subject |
vowels |
en_US |
dc.subject |
consonants |
en_US |
dc.title |
A Classification of Sandhi Rules for Suffix Separation in Malayalam |
en_US |
dc.type |
Article |
en_US |
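
The procedure outlined in the abstract (inspect the check_letter that precedes a vowel-initial suffix, then apply the matching sandhi rule to recover the root word) can be sketched roughly as follows. This is a minimal Python sketch under stated assumptions, not the paper's implementation: the `RULES` table, the function name `separate_suffix`, and the two rules shown are hypothetical illustrations based on common Malayalam inflections, and the paper's own lookup table is not reproduced here. Malayalam strings are written as Unicode escapes so that the code points are unambiguous.

```python
# Hypothetical rule table (illustrative only, not taken from the paper).
# Each entry: (surface ending = check_letter + suffix as written,
#              text that replaces the ending to restore the root,
#              canonical form of the separated suffix).
RULES = [
    # "...ttil" -> "...m" + "il", e.g. marattil -> maram + il
    ("\u0d24\u0d4d\u0d24\u0d3f\u0d7d", "\u0d02", "\u0d07\u0d7d"),
    # "...yude" -> "..." + "ude", e.g. kuttiyude -> kutti + ude
    ("\u0d2f\u0d41\u0d1f\u0d46", "", "\u0d09\u0d1f\u0d46"),
]


def separate_suffix(word):
    """Return (root, suffix) if a rule matches, else (word, None)."""
    for ending, root_tail, suffix in RULES:
        # Require at least one character of stem before the ending.
        if word.endswith(ending) and len(word) > len(ending):
            return word[: -len(ending)] + root_tail, suffix
    return word, None


# Example inputs, spelled out as code points.
MARAM = "\u0d2e\u0d30\u0d02"                          # maram (tree)
MARATTIL = "\u0d2e\u0d30" + RULES[0][0]               # marattil (in the tree)
KUTTI = "\u0d15\u0d41\u0d1f\u0d4d\u0d1f\u0d3f"        # kutti (child)
KUTTIYUDE = KUTTI + RULES[1][0]                       # kuttiyude (child's)

if __name__ == "__main__":
    for w in (MARATTIL, KUTTIYUDE, MARAM):
        print(w, "->", separate_suffix(w))
```

A fuller implementation would presumably key the table on the check_letter and the suffix separately, as the abstract's classification suggests; the flat list of endings above is just the simplest structure that shows the idea.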