Download as pdf or txt
Download as pdf or txt
You are on page 1of 14

PERSONAL VOICE ASSISTANT

IN PYTHON

SUBMITTED BY-
MIHIR PRATAP SINGH
RAVI AGARWAL
ANANT SARASWAT
CONTENT
1.Introduction
2.How does an intelligent personal assistant work?
3.Existing system and proposed system
4.Dataflow diagrams
5.Automatic Speech Recognition System Model in Google AP
6.Use case diagrams
7.Modules
8.Conclusion
01

INTRODUCTION
A virtual assistant is a technology based on artificial intelligence. The software uses a device’s microphone to
receive voice requests while the voice output takes place at the speaker. But the most exciting thing happens
between these two actions.

It is a combination of several different technologies: voice recognition, voice analysis and language processing.
It is completely developed using one of the most powerful language python.
How does an intelligent
personal assistant work?

1.User asks a personal assistant to perform a task.


2.The natural language audio signal is converted into digital data that can be analyzed
by the software.
3.Compared with a database of the software using an innovative algorithm to find a
suitable answer.
4.This database is located on distributed servers in cloud networks. For this reason ,it
must have a reliable Internet connection.
EXISTING SYSTEM V/S
PROPOSED SYSTEM
DATA FLOW DIAGRAM
DFD-0
DFD-1
DFD-2
Automatic Speech Recognition System
Model in Google API
Feature Extraction

1. Feature Extraction is a common to extract a set of features from speech signal.

2.Classification of Feature extraction is carried out on the set of features instead of the
speech signals themselves.

3.The feature extraction stage seeks to provide a compact representation of the speech
waveform. This form should minimize the loss of information that discriminates between
words, and provide a good match with the distributional assumptions made by the
acoustic models.
ACOUSTIC MODELS
Acoustic model is a relationship between audio signal and phoneme
Phoneme means one of the smallest unit of speech that make one word different from another word

PRONUNCIATION DICTIONARY
The act or result of producing the sounds of speech, including articulation, stress, and intonation
A phonetic transcription of a given word, sound, etc.
An accepted standard of the sound and stress patterns of a word, phrase, etc.

LANGUAGE MODELS
The language model provides context to distinguish between words and phrases that sound similar.

For example, In American English the phrases “recognize speach” and “wreck a nice beach” sound similar , but
mean different things.
USE CASE DIAGRAMS
MODULES

Speech recognition
Process
and system utilities ( psutil )
PlaySound


Protocol client ( smtplib )
SMTP
Google Text To
Speech ( gtts )

CONCLUSION

Voice Controlled Personal Assistant System will use the Natural language processing and
can be integrated with artificial intelligence techniques to achieve a smart assistant that can
control the computer and applications and even solve user queries using web searches.. It
can be designed to minimize the human efforts to interact with many other subsystems,
which would otherwise have to be performed manually. By achieving this, the system will
make human life comfortable

You might also like