PERFORMANCE OF DIFFERENT CLASSIFIERS IN SPEECH RECOGNITION

PERFORMANCE OF DIFFERENT CLASSIFIERS IN SPEECH RECOGNITION

Dyuthi/Manakin Repository

PERFORMANCE OF DIFFERENT CLASSIFIERS IN SPEECH RECOGNITION

Title:	PERFORMANCE OF DIFFERENT CLASSIFIERS IN SPEECH RECOGNITION
Author:	Poulose Jacob,K; Sonia, Sunny; David, Peter S
Abstract:	Speech is the most natural means of communication among human beings and speech processing and recognition are intensive areas of research for the last five decades. Since speech recognition is a pattern recognition problem, classification is an important part of any speech recognition system. In this work, a speech recognition system is developed for recognizing speaker independent spoken digits in Malayalam. Voice signals are sampled directly from the microphone. The proposed method is implemented for 1000 speakers uttering 10 digits each. Since the speech signals are affected by background noise, the signals are tuned by removing the noise from it using wavelet denoising method based on Soft Thresholding. Here, the features from the signals are extracted using Discrete Wavelet Transforms (DWT) because they are well suitable for processing non-stationary signals like speech. This is due to their multi- resolutional, multi-scale analysis characteristics. Speech recognition is a multiclass classification problem. So, the feature vector set obtained are classified using three classifiers namely, Artificial Neural Networks (ANN), Support Vector Machines (SVM) and Naive Bayes classifiers which are capable of handling multiclasses. During classification stage, the input feature vector data is trained using information relating to known patterns and then they are tested using the test data set. The performances of all these classifiers are evaluated based on recognition accuracy. All the three methods produced good recognition accuracy. DWT and ANN produced a recognition accuracy of 89%, SVM and DWT combination produced an accuracy of 86.6% and Naive Bayes and DWT combination produced an accuracy of 83.5%. ANN is found to be better among the three methods.
Description:	IJRET \| APR 2013 Volume: 2 Issue: 4,590 - 597
URI:	http://dyuthi.cusat.ac.in/purl/3914
Date:	2013-04

Files in this item

Files	Size	Format	View	Description
PERFORMANCE OF ... IN SPEECH RECOGNITION.pdf	433.1Kb	PDF	View/Open	PdF

PERFORMANCE OF DIFFERENT CLASSIFIERS IN SPEECH RECOGNITION

Dyuthi/Manakin Repository

PERFORMANCE OF DIFFERENT CLASSIFIERS IN SPEECH RECOGNITION

Files in this item

This item appears in the following Collection(s)

Search Dyuthi

Browse

All of Dyuthi

This Collection

My Account