AI - Voice Recognition

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 19

VOICE

RECOGNITION

M.SURIYAPRAKASH
16CSE29
III - CSE
KINGS COLLEGE OF ENGINEERING
INTRODUCTION

• A user gives a predefined voice instruction


to the system through microphone, the
system understand this command and
execute the required function.
• It facilitates the user to run windows through
your voice without use of keyboard or
mouse.
KEY TERMS

• Speaking Modes
o Isolated Words
o Continuous Speech
• Vocabulary sizes
• Language Model
• Acoustic Model
• Dictionary
SPHINX

• Sphinx is a set of Java classes used in background to


recognize the voice.

• It is open source and is provided by Java,

• Sphinx is built on JSAPI.

• It uses HMM algorithm and BNF grammar.


OVERALL PROCESSING
HIDDEN MARKOV
MODEL (HMM)

• Real-world has structures and processes which have (or


produce) observable outputs:

o Usually sequential (process unfolds over time)


o Cannot see the event producing the output
Example: speech signals
HMM Uses
• Speech recognition:
Recognizing spoken words and phrases

• Text processing:
Parsing raw records into structured records

• Bioinformatics:
Protein sequence prediction

• Financial:
o Stock market forecasts (price pattern prediction)
o Comparison shopping services
Parser

Verifier

THE LEXICON Coarse


labeler
Lexicon

Anchor
Matcher
Generat
or
Front End

Lattice
Integrator

• The lexicon (dictionary) is stored in the form of a phonetic


network.
• The sources of pronunciations that have been used:
o On-line phonetic dictionary, such as the Shop
o Dictionary. Letter-to-sound compiler (The Talk System).

• The current CMU lexicon is constructed using a base over


150 rules covering several types of phenomena:
o Including co-articulator
o phenomena. Front-end
characteristics.

You might also like