
IMAGE TO SPEECH WEB BASED APPLICATION USING PYTHON

Abstract

Recent advances in image processing and natural language processing focus on developing smart
systems that improve quality of life. This project proposes an approach for recognizing and
extracting text from images and converting that text to speech. The incoming image is first enhanced
by grayscale conversion. The project then relies on two components: the Google Cloud Vision API,
which extracts the text from the image, and gTTS (Google Text-to-Speech), a Python library and CLI
tool that interfaces with Google Translate's text-to-speech API and converts the extracted text to
speech. The proposed approach is tested on images of different scenes, ranging from documents to
natural scenes. The results are promising: they suggest that the approach is accurate and robust,
and they encourage its practical use in real-world scenarios.
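
The pipeline described above (grayscale conversion, text extraction, speech synthesis) could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes the `Pillow`, `google-cloud-vision`, and `gTTS` packages are installed and that Google Cloud credentials are configured. The function names, the PNG re-encoding, and the pluggable-step design are all assumptions made for the example.

```python
import io
from typing import Callable


def enhance_to_grayscale(image_bytes: bytes) -> bytes:
    """Grayscale step. Assumes Pillow is installed; re-encodes as PNG (an assumption)."""
    from PIL import Image  # imported lazily so the module loads without Pillow

    with Image.open(io.BytesIO(image_bytes)) as img:
        out = io.BytesIO()
        img.convert("L").save(out, format="PNG")  # "L" = 8-bit grayscale
    return out.getvalue()


def extract_text_with_vision(image_bytes: bytes) -> str:
    """OCR step. Assumes google-cloud-vision (v2+) and configured credentials."""
    from google.cloud import vision

    client = vision.ImageAnnotatorClient()
    response = client.text_detection(image=vision.Image(content=image_bytes))
    if response.error.message:
        raise RuntimeError(response.error.message)
    annotations = response.text_annotations
    # When text is found, the first annotation holds the full detected text.
    return annotations[0].description if annotations else ""


def synthesize_with_gtts(text: str, out_path: str, lang: str = "en") -> None:
    """Speech step. Assumes gTTS is installed and the network is reachable."""
    from gtts import gTTS

    gTTS(text=text, lang=lang).save(out_path)  # writes an MP3 file


def image_to_speech(
    image_bytes: bytes,
    out_path: str,
    preprocess: Callable[[bytes], bytes] = enhance_to_grayscale,
    extract_text: Callable[[bytes], str] = extract_text_with_vision,
    synthesize: Callable[[str, str], None] = synthesize_with_gtts,
) -> str:
    """Run the three steps in order and return the recognized text."""
    text = extract_text(preprocess(image_bytes)).strip()
    if not text:
        raise ValueError("no text found in image")
    synthesize(text, out_path)
    return text
```

With the default steps, a call such as `image_to_speech(open("page.jpg", "rb").read(), "page.mp3")` would need network access and valid credentials (the file names here are hypothetical). Passing the steps as parameters keeps the orchestration easy to exercise with stand-in functions.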
