Professional Documents
Culture Documents
Synopsis 3
Synopsis 3
Synopsis 3
Faculty of Engineering
PACIFIC ACADEMY OF HIGHER EDUCATION AND
RESEARCH UNIVERSITY, UDAIPUR.
1
SUMMARY
Abstract
Introduction
Review of Literature
Research Gaps
Scope of Research
Research Objectives
Hypothesis
Research Plan
References
ABSTRACT
INTRODUCTION
REVIEW OF LITERATURE
Remarkable Observations in the review of work are as
follows
REVIEW OF LITERATURE
Remarkable Observations in the review of work are as
follows
REVIEW OF LITERATURE
Remarkable Observations in the review of work are as
follows
10
REVIEW OF LITERATURE
Remarkable Observations in the review of work are
as follows
11
RESEARCH GAPS
ANN architecture
recognition rate.
12
SCOPE OF RESEARCH
13
RESEARCH OBJECTIVES
The objective of our research is to investigate the combined
performance of Wavelet Transform (WT) and Artificial Neural
Network (ANN) for Isolated Marathi Digits so as to improve
accuracy of speech recognition system.
14
ISOLATED
DIGIT
RECOGNITI
ON
15
RESEARCH
METHODOLOGY
16
Hypothesis
The objective of our research is to investigate the
combined performance of wavelet transform and artificial
neural network (ANN) for isolated Marathi digits so as to
improve accuracy of speech recognition system.
2.
3.
4.
17
RESEARCH PLAN
Activity
Literature Survey
Study of Software Tools like
MATLAB/SIMULINK, Neural
Network Toolbox and its
MATLAB link
Survey of Existing Methods and
Algorithms
Suggesting techniques for
removing limitations in existing
algorithms
Simulation of combined
strategies
Comparing results of developed
strategies with existing
algorithms
Performance evaluation and
implementation
Documentation
Review & Research Paper
Preparation &
Presentation/Publication
18
REFERENCES
[1]
[2]
Education, 2002.
R. M. Rao, A. S. Bopardikar, Wavelet Transform, Pearson Education, 2005.
[3]
[4]
[5]
[6]
19
REFERENCES
[7]
Gil
Lopes,
Recognition
[8]
Fernando
in
Noisy
Ribeiro,
Paulo
Environment,
Carvalho,
Whistle
Universidade
do
Sound
Minho,
Systems
on
FPGA-Based
Embedded
Systems
with
SOC
20
REFERENCES
[12] W. M. Campbell, J. P. Campbell, D. A. Reynolds, E. Singer, P. A. TorresCarrasquillo, Support Vector Machines for Speaker and Language
Recognition, in Elsevier Journal of Computer Speech & Language,
vol. 20, issue 2/3, pp 210 229, 2006.
[13] Siddheshwar S. Gangonda, Dr. Prachi Mukherji, Speech Processing for Marathi
Numeral Recognition using MFCC and DTW Features, International Journal of
Engineering Research and Applications (IJERA), pp.218-222, March 2012.
[14] Wahyu Kusuma R., Prince Brave Guhyapati V., Simulation Voice Recognition
System for controlling Robotic Applications, Journal of Theoretical and Applied
Information Technology,vol.39, no.2,pp. 188-196, May 2012.
[15] Thiang and Suryo Wijoyo, Speech Recognition Using Linear Predictive Coding
and Artificial Neural Network for Controlling Movement of Mobile Robot,
International Conference on Information and Electronics Engineering, vol.6,
pp.179-183, 2011.
[16] Bishnu Prasad Das, Ranjan Parekh, Recognition of Isolated Words using
Features based on LPC, MFCC, ZCR and STE, with Neural Network Classifiers,
International Journal of Modern Engineering Research, vol.2, pp.854-858, MayJune 2012.
21
REFERENCES
[17
[18
Arizona, 1998.
Paul A.K., Das D., Kamal M.M., Bangla Speech Recognition System Using LPC
[19
[20
[21
October 2012.
Jagannath H Nirmal, Mukesh A Zaveri, Suprava Patnaik and Pramod H
[22
REFERENCES
[23]
[24]
Faculty
of
Electrical
Engineering
and
Computing,
Haar
Wavelets
and
Proper
Orthogonal
Decomposition,
[26]
Rotterdam, 2009.
Beng T Tan, Robert lang, Hieko Schroder, Andrew Spray, Phillip Dermody,
Applying Wavelet Analysis to Speech Segmentation and Classification,
[27]
23
REFERENCES
24
Thank you
25
Training phase accepts speech samples from different people and trains the
system to create acoustic models for each word in vocabulaey.TP undergoes
through two stages Data preparation & Recording data.
Verification Phase display some random numbers then check for pronouns
number.
Some time system consists of speech processing inclusive of digit boundary
and recognition which uses zero crossing and energy techniques. Mel
Frequency Cepstral Coefficients (MFCC) vectors are used to provide an
estimate of the vocal tract filter. Meanwhile dynamic time warping (DTW) is
used to detect the nearest recorded voice.
The general methodology of audio classification involves extracting
discriminatory features from the audio data and feeding them to a pattern
classifier. Different approaches and various kinds of audio features were
proposed with varying success rates. The features can be extracted either
directly from the time domain signal or from a transformation domain
depending upon the choice of the signal analysis approach. Some of the audio
features that have been successfully used for audio classification include Mel
Frequency Cepstral Coefficients (MFCC).
26
27
28