Download as ppt, pdf, or txt
Download as ppt, pdf, or txt
You are on page 1of 11

VOICE SEARCH

using python

B Pavan kumar
16BD1A051R
Why Speech?
• No visual contact required
• No special equipment required
• Can be done while doing other things
Speech Processing
• Speech Recognition

• Speaker Recognition and text format


Speech Coding
• Compress a Speech File

• MP3 Format to text format

• Buy using Google Voice Api


• We used a modules called
SpeechRecognition, pyAudio
Speech Synthesis
Speech waveform from words

Speaker Quality and Accent


Speech Recognition
• Convert a sound waveform to words
• The most relevant and important task in
the industry
• 90% in lab conditions, much lower in
factory conditions
Speaker Recognition
• Acceptable as a verification technique
• How would this be different from Speech
recognition?
– Speaker Quality
– Prosody(pattern or rythm)
– Pitch
Audio Engineering
• Adding effects to sound
• Clarity of reproduction
• A Big industry with players like – Dolby,
Bose, Phillips etc

• Voice Morphing!

SOURCE TARGET CONV 1 CONV 2


Out put
Future Enhancement
• Future I want to develop by a speech by
telling file recongnisation
Thank you

You might also like