DSpace About DSpace Software
 

Dyuthi @ CUSAT >
e-SCHOLARSHIP >
Computer Science >
Faculty >
Dr.Santhosh Kumar G >

Please use this identifier to cite or link to this item: http://purl.org/purl/4150

Title: A framework for translating English text into Malayalam using statistical models
Authors: Santhosh Kumar, G
Mary, Priya Sebastian
Sheena Kurian, K
Keywords: Alignment
English Malayalam Translation
PoS Tagging
Statistical Machine Translation
Suffix Separation
Issue Date: 2011
Publisher: Elsevier
Abstract: A methodology for translating text from English into the Dravidian language, Malayalam using statistical models is discussed in this paper. The translator utilizes a monolingual Malayalam corpus and a bilingual English/Malayalam corpus in the training phase and generates automatically the Malayalam translation of an unseen English sentence. Various techniques to improve the alignment model by incorporating the morphological inputs into the bilingual corpus are discussed. Removing the insignificant alignments from the sentence pairs by this approach has ensured better training results. Pre-processing techniques like suffix separation from the Malayalam corpus and stop word elimination from the bilingual corpus also proved to be effective in producing better alignments. Difficulties in translation process that arise due to the structural difference between the English Malayalam pair is resolved in the decoding phase by applying the order conversion rules. The handcrafted rules designed for the suffix separation process which can be used as a guideline in implementing suffix separation in Malayalam language are also presented in this paper. Experiments conducted on a sample corpus have generated reasonably good Malayalam translations and the results are verified with F measure, BLEU and WER evaluation metrics
Description: Procedia Technology 00 (2011) 000–000,2nd International Conference on Communication, Computing & Security
URI: http://dyuthi.cusat.ac.in/purl/4150
Appears in Collections:Dr.Santhosh Kumar G

Files in This Item:

File Description SizeFormat
A framework for translating English text into Malayalam using statistical models.pdfpdf578.8 kBAdobe PDFView/Open
View Statistics

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback