Professional Documents
Culture Documents
Virtual Assistant: Project Bachelor of Technology CSE
Virtual Assistant: Project Bachelor of Technology CSE
PROJECT
BACHELOR OF
TECHNOLOGY
CSE
SUBMITTED BY -
Santanu dass (2002618)
Rashau rani (2002587)
Divyanshi grover (2002423)
Deepanshu gupta (2002410)
S.no
INDEX
1
INTRODUCTION
METHODOLOGY
3
OUTPUT
1. INTRODUCTION
Alexa
Alexa, Amazon's virtual assistant, is built into the Amazon Echo line of smart speakers. You can
also find it on some third-party speakers from brands like Sonos. You can ask the Echo questions
like, "Alexa, who is hosting SNL this week?" You can also ask it to play a song, make a phone call,
or control your smart home devices. It has a feature called "multi-room music," which lets you play
the same tunes from each of your Echo speakers.
Alexa recognizes a handful of wake words, including "Alexa," "Amazon," "Computer," "Echo,"
and "Ziggy."
You can also configure the Amazon Echo with third-party apps, so you can use it to call an Uber,
pull up a recipe, or lead you through a workout.
Bixby
Google Assistant
Google Assistant is available on many Android phones, including Google Pixel smartphones, as
well as the Google Home smart speaker, and some third-party speakers from brands including
JBL. You can even set it up on an iPhone.
You can interact with Google Assistant on your smartwatch, laptop, and TV. While you can use
specific voice commands, it also responds to a conversational tone and follow-up questions.
Google Assistant interacts with a multitude of apps and smart home devices.
Siri
Python Backend:
The python backend gets the output from the speech
recognition module and then identifies whether the
command or the speech output is an API Call and Context
Extraction. The output is then sent back to the python
backend to give the required output to the user.
API calls
API stands for Application Programming Interface. An
API is a software intermediary that allows two
applications to talk to each other. In other words, an API
is a messenger that delivers your request to the provider
that you’re requesting it from and then delivers the
response back to you.
Content Extraction
Context extraction (CE) is the task of automatically
extracting structured information from unstructured
and/or semi-structured machine-readable documents. In
most cases, this activity concerns processing human
language texts using natural language processing (NLP).
Recent activities in multimedia document processing like
automatic annotation and content extraction out of
images/audio/video could be seen as context extraction
TEST RESULTS.
Text-to-speech module
Text-to-Speech (TTS) refers to the ability of computers to
read text aloud. A TTS Engine converts written text to a
phonemic representation, then converts the phonemic
representation to waveforms that can be output as sound.
TTS engines with different languages, dialects and
specialized vocabularies are available through third-party
publishers.
Features of virtual assistant:
It opens google.
It opens music list.
It opens Wikipedia.