Dyuthi @ CUSAT >
Ph.D THESES >
Faculty of Technology >
Please use this identifier to cite or link to this item:
http://purl.org/purl/3808
|
Title: | Studies on a syntax based approach for translation between structurally different languages and the development of a prototype for Malayalam to English translation |
Authors: | Latha, R. Nair Dr. David, Peter S Dr.Sumam Mary,Idicula |
Keywords: | Direct Machine Translation Rule Based Machine Translation Corpus Based Machine Translation ,Language Morphology and Morphological Analysis |
Issue Date: | Mar-2013 |
Publisher: | Cochin University of Science and Technology |
Abstract: | This thesis summarizes the results on the studies on a syntax based approach for translation between Malayalam, one of Dravidian languages and English and also on the development of the major modules in building a prototype machine translation system from Malayalam to English. The development of the system is a pioneering effort in Malayalam language unattempted by previous researchers. The computational models chosen for the system is first of its kind for Malayalam language.
An in depth study has been carried out in the design of the computational models and data structures needed for different modules: morphological analyzer , a parser, a syntactic structure transfer module and target language sentence generator required for the prototype system. The generation of list of part of speech tags, chunk tags and the hierarchical dependencies among the chunks required for the translation process also has been done. In the development process, the major goals are: (a) accuracy of translation (b) speed and (c) space. Accuracy-wise, smart tools for handling transfer grammar and translation standards including equivalent words, expressions, phrases and styles in the target language are to be developed. The grammar should be optimized with a view to obtaining a single correct parse and hence a single translated output. Speed-wise, innovative use of corpus analysis, efficient parsing algorithm, design of efficient Data Structure and run-time frequency-based rearrangement of the grammar which substantially reduces the parsing and generation time are required. The space requirement also has to be minimised |
Description: | Department of Computer Science,
Cochin University of Science and Technology |
URI: | http://dyuthi.cusat.ac.in/purl/3808 |
Appears in Collections: | Faculty of Technology
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|