DSpace About DSpace Software
 

Dyuthi @ CUSAT >
e-SCHOLARSHIP >
Computer Science >
Faculty >
Dr.Santhosh Kumar G >

Please use this identifier to cite or link to this item: http://purl.org/purl/4140

Title: Alignment Model and Training Technique in SMT from English to Malayalam
Authors: Santhosh Kumar, G
Sheena Kurian, K
Mary, Priya Sebastian
Keywords: Alignment
Parallel Corpus
PoS Tagging
Malayalam
Statistical Machine Translation
Issue Date: 30-Aug-2010
Abstract: This paper investigates certain methods of training adopted in the Statistical Machine Translator (SMT) from English to Malayalam. In English Malayalam SMT, the word to word translation is determined by training the parallel corpus. Our primary goal is to improve the alignment model by reducing the number of possible alignments of all sentence pairs present in the bilingual corpus. Incorporating morphological information into the parallel corpus with the help of the parts of speech tagger has brought around better training results with improved accuracy
URI: http://dyuthi.cusat.ac.in/purl/4140
Appears in Collections:Dr.Santhosh Kumar G

Files in This Item:

File Description SizeFormat
Alignment Model and Training Technique in SMT from English to Malayalam.pdfpdf379.74 kBAdobe PDFView/Open
View Statistics

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback