A framework for translating English text into Malayalam using statistical models

Dyuthi/Manakin Repository

A framework for translating English text into Malayalam using statistical models

Show simple item record

dc.contributor.author Santhosh Kumar, G
dc.contributor.author Mary, Priya Sebastian
dc.contributor.author Sheena Kurian, K
dc.date.accessioned 2014-07-21T04:35:46Z
dc.date.available 2014-07-21T04:35:46Z
dc.date.issued 2011
dc.identifier.uri http://dyuthi.cusat.ac.in/purl/4150
dc.description Procedia Technology 00 (2011) 000–000,2nd International Conference on Communication, Computing & Security en_US
dc.description.abstract A methodology for translating text from English into the Dravidian language, Malayalam using statistical models is discussed in this paper. The translator utilizes a monolingual Malayalam corpus and a bilingual English/Malayalam corpus in the training phase and generates automatically the Malayalam translation of an unseen English sentence. Various techniques to improve the alignment model by incorporating the morphological inputs into the bilingual corpus are discussed. Removing the insignificant alignments from the sentence pairs by this approach has ensured better training results. Pre-processing techniques like suffix separation from the Malayalam corpus and stop word elimination from the bilingual corpus also proved to be effective in producing better alignments. Difficulties in translation process that arise due to the structural difference between the English Malayalam pair is resolved in the decoding phase by applying the order conversion rules. The handcrafted rules designed for the suffix separation process which can be used as a guideline in implementing suffix separation in Malayalam language are also presented in this paper. Experiments conducted on a sample corpus have generated reasonably good Malayalam translations and the results are verified with F measure, BLEU and WER evaluation metrics en_US
dc.description.sponsorship Cochin Unniversity of Science and Technology en_US
dc.language.iso en en_US
dc.publisher Elsevier en_US
dc.subject Alignment en_US
dc.subject English Malayalam Translation en_US
dc.subject PoS Tagging en_US
dc.subject Statistical Machine Translation en_US
dc.subject Suffix Separation en_US
dc.title A framework for translating English text into Malayalam using statistical models en_US
dc.type Article en_US


Files in this item

Files Size Format View Description
A framework for ... ing statistical models.pdf 578.8Kb PDF View/Open pdf

This item appears in the following Collection(s)

Show simple item record

Search Dyuthi


Advanced Search

Browse

My Account