Dyuthi @ CUSAT >
e-SCHOLARSHIP >
Computer Science >
Faculty >
Dr. Sumam Mary Idicula >
Please use this identifier to cite or link to this item:
http://purl.org/purl/4090
|
Title: | Development Of A Pos Tagger For Malayalam-An Experience |
Authors: | Sumam, Mary Idicula Soumya, S Manju, K |
Keywords: | Dravidian Language Morphemes HMM Viterbi Tagset. |
Issue Date: | 2009 |
Publisher: | IEEE |
Abstract: | A Parts of Speech tagger for Malayalam which uses
a stochastic approach has been proposed. The tagger makes
use of word frequencies and bigram statistics from a corpus.
The morphological analyzer is used to generate a tagged
corpus due to the unavailability of an annotated corpus in
Malayalam. Although the experiments have been performed on
a very small corpus, the results have shown that the statistical
approach works well with a highly agglutinative language like
Malayalam |
Description: | 2009 International Conference on Advances in Recent Technologies in Communication and Computing |
URI: | http://dyuthi.cusat.ac.in/purl/4090 |
Appears in Collections: | Dr. Sumam Mary Idicula
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|