Project Title: Synopsis

A
Project
SYNOPSIS
on
PROJECT TITLE
Submitted by
Student Name Student Name Student Name

Reg. No: Reg. No: Reg. No:
Section: Section: Section:
Roll No.: Roll No.: Roll No.:
Under the guidance of
GUIDE NAME GUIDE NAME

(External if any) (Internal)
&
Designation Designation Department Name
Company Name Institute Name
DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING
MANIPAL INSTITUTE OF TECHNOLOGY

Manipal Academy of Higher Education
MANIPAL – 576104, KARNATAKA, INDIA
Details of the organization
(with postal address):
Name of Guide with contact

details and email address:
Date of commencement of the

project:
Signature of Guide with seal:

1. Introduction
Language is man's most important means of communication and speech its primary
medium. A speech signal is a complex combination of a variety of airborne pressure
waveforms. This complex pattern must be detected by the human auditory system and
decoded by the brain. This can be done by using a combination of audio and visual cues to
perceive speech more effectively. The project aims to emulate this mechanism in human –
machine communication systems by exploiting the acoustic and visual properties of human
speech.
2. Need for the project
Current speech recognition engines employing only acoustic features are not 100%
robust. Visual cues can be used to undermine the ambiguity in the auditory modality. Hence a
flexible and reliable system for speech perception can be designed which finds a variety of
applications in:
 Dictation systems
 Voice Based Communications in tele-banking, voice mail, data-base query systems,
information retrieval systems, etc.
 System Control in automobiles, robotics, airplanes, etc.
 Security systems for speaker verification
3. Objective
Recognize 10 English words (speaker independent) with at least 90% accuracy in a noisy
environment.
4. Methodology
The project is carried out in into following parts
 Processing of Audio Signals
o Detection of end points to demarcate word boundaries

o Analysis of various acoustic features such as pitch and formants, energy and time
difference of speech signals, etc.
o Extraction of selected features
 Processing of Video Signals
o Demarcate frames from the video sequence

o Identify faces, and then lip regions
o Extract features from the lip profile
 Recognition of Speech by synchronizing Audio and Visual Data
o Synchronize audio and video features for pattern recognition using standardized
algorithms
o Train the system to recognize the spoken word under adverse acoustic conditions.
5. Project Schedule
Sample schedule for six-month duration
o Processing of audio signals

o Feature extraction from the chosen training database
Jan 2023
o Pattern recognition and signature extraction from the features
o Training the HMM with the training set
o Processing of video signals
Feb 2023 o Feature extraction from the chosen training database
o Pattern recognition and signature extraction from the features
o Synchronize audio and video features for pattern recognition
Mar 2023
o Extension of training data set to 10 words
Apr 2023 o Up gradation of system for speaker independent applications
o Performance analysis by comparing results of audio-only
May 2023
approach with that of joint audio-visual approach
Jun 2023 o Documentation
6. References
1. Tsuhan Chen, "Audiovisual Speech Processing, Lip Reading and Lip synchronization",
IEEE Signal Processing Magazine, January 2001.
2. R.Chellapa, C.L. Wilson and S. Sirohoey, ‘Human and Machine Recognition of
Faces : A survey’, Proceedings of the IEEE, vol 83, no.5 May 1995
Student Details
Student Name
Register Number Section / Roll No
Email Address Phone No (M)
Student Name
Register Number Section / Roll No
Email Address Phone No (M)
Project Details
Project Title
Project Duration Date of reporting
Organization Details
Organization Name
Full postal address with
pin code
Website address
External guide details(if any)
Name
Designation
Contact number
Email Address

Project Title: Synopsis

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Project Title: Synopsis

Uploaded by

Copyright:

Available Formats

A

Student Name Student Name Student Name

Under the guidance of

GUIDE NAME GUIDE NAME

DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING

MANIPAL INSTITUTE OF TECHNOLOGY

Name of Guide with contact

Date of commencement of the

Signature of Guide with seal:

2. Need for the project

The project is carried out in into following parts

 Processing of Audio Signals

o Detection of end points to demarcate word boundaries

 Processing of Video Signals

o Demarcate frames from the video sequence

Sample schedule for six-month duration

o Processing of audio signals

You might also like