Integration of OCR With TTS

You might also like

Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 6

ON

Integration of OCR with TTS

Under The Guidance of


Mr. Tushar Patnaik
Sr. Project Engineer
CDAC NOIDA

Submitted By
Simple Batra
M. Tech. (CSE) – IIIrd Semester
0161184807

Centre for Development of Advanced Computing, Noida

Affiliated to
Guru Gobind Singh Indraprastha University
Kashmere Gate, Delhi – 110006
ON

Integration of OCR with TTS

Submitted By
Simple Batra
M. Tech. (CSE) – IIIrd Semester
0161184807

Mr. Tushar Patnaik Mr. Munish Kumar


Sr. Lecturer Sr. Lecturer
School of IT Head, M.Tech Division
CDAC NOIDA School Of IT
CDAC NOIDA

Centre for Development of Advanced Computing, Noida

Affiliated to
Guru Gobind Singh Indraprastha University
Kashmere Gate, Delhi – 110006

1. Title of The Project

“Integration of OCR with TTS”

2. Problem Definition

The aim is to identify the mechanism involved in integrating OCR (Optical


Character Recognition) for Hindi language with TTS (Text-To-Speech). TTS
provides the interface for digitized output of OCR and enables access to digital
content in the voice mode.

3. Motivation

An OCR (Optical Character Recognition) is responsible for converting


an image of text, such as a scanned paper document or electronic fax file, into
computer-editable text.The function of Text-To-Speech (TTS) System is to
convert the given text to a spoken waveform.
Upon a request from the user, this text is sent to a Text-To-Speech
(TTS) system for conversion into a speech signal.
Integration of TTS and OCR in different languages like English,
Telugu and Tamil has already been done. To facilitate the increase in work in
our national language Hindi it is necessary to develop such software which can
enable the TTS tool to capture the digitized output of OCR of Hindi document
and can convert the Hindi text into voice or can speak it in human voice.
Development of such a system in Hindi can become a good source of
accessing the information for those who can understand only Hindi.
4. Objective & Scope of the Project.

Objective: The objective is to design and implement code for integration of


TTS tool with the OCR for Hindi i.e. enable coupling of TTS with OCR
Scope: Scope of this project is-
 Can provide access of digital content to illiterate people
 Can prove very helpful for vision impaired people for enabling the
learning work.

5. Abstract Architecture

Input
Document Of
Hindi OCR

Speech TTS

5. Methodology

Study all available open source TTS tools


• Study all available open source OCR
• Study Text Selection algorithms.
• Study Hindi Speech Synthesis algorithms
• Approaches of Text To Speech conversion
• Study Unit Selection algorithm supported by Festvox used for Building
voices.
6. Gantt Chart:

1 Aug 1 Sep 1 Oct 1 Nov 4 dec

Literature survey of the project problem

Design and Analysis of algorithm

Initialization of code
9. References

1. A text to speech interface for Universal Digital Library


PRAHALLAD Kishore1,2, BLACK Alan1
(1Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA
15217, USA)
(2International Institute of Information Technology, Hyderabad, AP, 500019,
India)

2. Building Hindi and Telugu Voices using Festvox


S.P. Kishore, Rajeev Sangal and M. Srinivas
Language Technologies Research Center
International Institute of Information Technology Hyderabad

3. High-quality text-to-speech synthesis: an overview.


Thierry DUTOIT
Faculte Polytechnique de Mons, TCTS Lab
31, bvd Dolez, B-7000 MONS (Belgium)

4. www.festvox.org/festvox/index.html, 2000.

You might also like