Extension schemes for the Alignment Model of English-Malayalam Statistical Machine Translator

dc.contributor.author Santhosh Kumar, G
dc.contributor.author Mary, Priya Sebastian
dc.contributor.author Sheena Kurian, K
dc.date.accessioned 2014-07-21T05:13:31Z
dc.date.available 2014-07-21T05:13:31Z
dc.date.issued 2012
dc.identifier.uri http://dyuthi.cusat.ac.in/purl/4160
dc.description 2012 International Conference on Advances in Computing and Communications en_US
dc.description.abstract In Statistical Machine Translation from English to Malayalam, an unseen English sentence is translated into its equivalent Malayalam sentence using statistical models. An English-Malayalam parallel corpus is used in the training phase. Word-to-word alignments have to be set among the sentence pairs of the source and target languages before subjecting them to training. This paper deals with certain techniques that can be adopted for improving the alignment model of SMT. Methods to incorporate parts-of-speech information into the bilingual corpus have eliminated many of the insignificant alignments. Identifying the named entities and cognates present in the sentence pairs has also proved advantageous while setting up the alignments. The presence of Malayalam words with predictable translations has further reduced the insignificant alignments. Moreover, the reduction of unwanted alignments has brought better training results. Experiments conducted on a sample corpus have generated reasonably good Malayalam translations, and the results are verified with the F-measure, BLEU and WER evaluation metrics. en_US
dc.description.sponsorship Cochin University of Science and Technology en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.subject alignment en_US
dc.subject training en_US
dc.subject machine translation en_US
dc.subject English Malayalam translation en_US
dc.title Extension schemes for the Alignment Model of English-Malayalam Statistical Machine Translator en_US
dc.type Article en_US


Files in this item

Files Size Format View Description
Extension schem ... cal Machine Translator.pdf 214.0Kb PDF View/Open pdf
