Welcome to Scribd!

Image To Speech Web Based Application Using Python

Uploaded by

0% found this document useful (0 votes)

3 views1 page

This document proposes a web-based application using Python that can extract text from images using Google Vision API, convert the extracted text to speech using the gTTS Python library, and interface with Google's text-to-speech API. The application first enhances images using grayscale conversion before extracting text. It was tested on various image types like documents and natural scenes, with promising results showing the accuracy and robustness of using image recognition, text extraction, and text-to-speech conversion in a single proposed algorithm.

Original Description:

Image to speech

Original Title

IMAGE TO SPEECH WEB BASED APPLICATION USING PYTHON

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

0% found this document useful (0 votes)

3 views1 page

Image To Speech Web Based Application Using Python

Uploaded by

dk2940016

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

Jump to Page

You are on page 1of 1

Search inside document

IMAGE TO SPEECH WEB BASED APPLICATION USING PYTHON

Abstract

The recent technological advancements in the field of Image Processing and Natural Language

Processing are focusing on developing smart systems to improve the quality of life. In this project, an

effective approach is suggested for text recognition and extraction from images and text to speech

conversion. The incoming image is firstly enhanced by employing gray scale conversion. The two library

files are used in the project , one is Google vision API it is used to extract the text from images and

another is gTTS (Google Text-to-Speech), a Python library and CLI tool to interface with Google

Translates text-to-speech API that is used to convert the extracted text to speech. The proposed

algorithm is tested on images representing different scenes ranging from documents to natural scenes.

Promising results have been reported which prove the accuracy and robustness of the proposed

algorithm and encourage its practical implementation in real world scenarios.

KDnuggets 10 ChatGPT Projects Cheat Sheet
Document11 pages
KDnuggets 10 ChatGPT Projects Cheat Sheet
Misa Nguyễn
No ratings yet
Major Project Synopsis
Document14 pages
Major Project Synopsis
Agra
No ratings yet
House Predictor J Ask I Rat
Document30 pages
House Predictor J Ask I Rat
Abhishek thapa
No ratings yet
Hardik Review Paper
Document8 pages
Hardik Review Paper
HArdik Negi
No ratings yet
A News Reader Web APP (Machine Learning) : Submitted By-Rahul Bansal and Tushar Baheti
Document16 pages
A News Reader Web APP (Machine Learning) : Submitted By-Rahul Bansal and Tushar Baheti
WebDev Coder
No ratings yet
Python
Document23 pages
Python
Manish Goyal
No ratings yet
Project CSE
Document29 pages
Project CSE
1095Ayush
No ratings yet
Mayank PPT Gui Language Editor Using Python
Document20 pages
Mayank PPT Gui Language Editor Using Python
Mayank Gupta
No ratings yet
Synopsis On: B.S. Anangpuria Educational Institute Alampur, Faridabad
Document5 pages
Synopsis On: B.S. Anangpuria Educational Institute Alampur, Faridabad
sahil
No ratings yet
Python:: Python Is An Interpreted, High-Level, General-Purpose Programming Language. Created by
Document24 pages
Python:: Python Is An Interpreted, High-Level, General-Purpose Programming Language. Created by
Shreyas M.O
No ratings yet
Resonate Website On Text To Speech
Document8 pages
Resonate Website On Text To Speech
IJRASETPublications
No ratings yet
Smart translation
Document24 pages
Smart translation
Chandan R
No ratings yet
Software Environment
Document11 pages
Software Environment
King K
No ratings yet
RecurrentGPT Language Based AI For Long Text Generation
Document6 pages
RecurrentGPT Language Based AI For Long Text Generation
My Social
No ratings yet
Workshop On Python Programming: By: Sumanta Biswas
Document111 pages
Workshop On Python Programming: By: Sumanta Biswas
sherrymian
No ratings yet
Python Used by Organization
Document11 pages
Python Used by Organization
tanoj
No ratings yet
Id CARD GENRATOR FILE
Document26 pages
Id CARD GENRATOR FILE
ROCKY
No ratings yet
INTERNSHIP
Document12 pages
INTERNSHIP
Brijesh Revella
No ratings yet
Programming Intro
Document21 pages
Programming Intro
Hoorain Sajjad
No ratings yet
Python Programming Tutorial by Easylearning Guru
Document26 pages
Python Programming Tutorial by Easylearning Guru
AbhishekKumarSingh
No ratings yet
Rev Chat GPT
Document49 pages
Rev Chat GPT
Azul Quintanilla
No ratings yet
IoT Based Smart Book Reader For Visually Impaired
Document7 pages
IoT Based Smart Book Reader For Visually Impaired
IJRASETPublications
No ratings yet
2021 mmtlrl-1 6
Document9 pages
2021 mmtlrl-1 6
Aparajita Aggarwal
No ratings yet
An Adaptive Approach to Text to Image
Document5 pages
An Adaptive Approach to Text to Image
ankithah98
No ratings yet
AI Trends of May 2023 You Need To Know by Gonzalo Recio Medium
Document1 page
AI Trends of May 2023 You Need To Know by Gonzalo Recio Medium
s5xmbgnxn6
No ratings yet
WWW Javatpoint Com Python Interview Questions
Document50 pages
WWW Javatpoint Com Python Interview Questions
GEN GENTLE INFAMOUS
No ratings yet
Overview
Document2 pages
Overview
mamtadevi07764
No ratings yet
Voice - Assistant - Research Paper
Document4 pages
Voice - Assistant - Research Paper
utpalchoudhary177
No ratings yet
Taller Búsqueda de Fuentes Biblioteca
Document6 pages
Taller Búsqueda de Fuentes Biblioteca
Samuel Losada Infante
No ratings yet
Chatbot For Generation of Subtitles in Regional Languages: Shreyas B, Aravinth P, Kripakaran P
Document4 pages
Chatbot For Generation of Subtitles in Regional Languages: Shreyas B, Aravinth P, Kripakaran P
17druva M
No ratings yet
Research On The Relations Between Machine Translation and Human Translation
Document7 pages
Research On The Relations Between Machine Translation and Human Translation
carolina
No ratings yet
Python
Document2 pages
Python
MEHALA S
No ratings yet
1 s2.0 S2667345223000317 Main
Document10 pages
1 s2.0 S2667345223000317 Main
Surat Penda
No ratings yet
Speech Emotion Recognition and Classification Using Deep Learning
Document39 pages
Speech Emotion Recognition and Classification Using Deep Learning
John Cena
100% (1)
Python
Document111 pages
Python
Monalisa Chatterjee
100% (3)
A Large Scale Dataset
Document7 pages
A Large Scale Dataset
Rashi MrBRD
No ratings yet
Jobst - 2022 - Efficient GitHub Crawling Using The GraphQL API
Document16 pages
Jobst - 2022 - Efficient GitHub Crawling Using The GraphQL API
David Fritsch
No ratings yet
Nabhi Jain Computer Ppt (1)
Document13 pages
Nabhi Jain Computer Ppt (1)
nabhijain9
No ratings yet
Python Course in Trivandrum
Document9 pages
Python Course in Trivandrum
Mubin A
No ratings yet
Geetha Internship
Document17 pages
Geetha Internship
Geetha
No ratings yet
Meta Title Python
Document5 pages
Meta Title Python
UDAYAKUMAR
No ratings yet
Python Introduction
Document65 pages
Python Introduction
Krutika Salotagi
No ratings yet
Sujets PFE NYUAD
Document3 pages
Sujets PFE NYUAD
chakibking66
No ratings yet
HP's Project
Document57 pages
HP's Project
ALBERT J
No ratings yet
Python Chatbot Project: January 2022
Document6 pages
Python Chatbot Project: January 2022
VISHNU PRASHANTH REDDY S
No ratings yet
Python Chat Bot Project
Document6 pages
Python Chat Bot Project
Gagana
100% (1)
Advik Report On Python
Document49 pages
Advik Report On Python
Prints Bindings
No ratings yet
Individual Project - Mason Leary
Document15 pages
Individual Project - Mason Leary
api-273094620
No ratings yet
Methodology To Use in Speech To Text Python - Google Search PDF
Document1 page
Methodology To Use in Speech To Text Python - Google Search PDF
Godspower Uhuaba
No ratings yet
Python Based Recognition of Sign
Document10 pages
Python Based Recognition of Sign
19bcs2856
No ratings yet
R VS Python
Document12 pages
R VS Python
Tamim Iqbal
No ratings yet
Srs Main Icg Akash
Document22 pages
Srs Main Icg Akash
savan3019
No ratings yet
About Python For AI and ML
Document3 pages
About Python For AI and ML
Hitesh Son Son
No ratings yet
Get Best Python Training in Chennai From Softlogic Systems
Document7 pages
Get Best Python Training in Chennai From Softlogic Systems
Softlogic
No ratings yet
Report
Document25 pages
Report
humayounsaeed008
No ratings yet
Crafting Applications with Chat GPT API
From Everand
Crafting Applications with Chat GPT API
Mike Gold
No ratings yet
Python Chatbot Project
Document6 pages
Python Chatbot Project
Bhavani Reddy
No ratings yet
Image Recognition and Its Language Translation Using OCR
Document8 pages
Image Recognition and Its Language Translation Using OCR
Darshan Dharmik
No ratings yet
A Greater Foundation for Machine Learning Engineering: The Hallmarks of the Great Beyond in Pytorch, R, Tensorflow, and Python
From Everand
A Greater Foundation for Machine Learning Engineering: The Hallmarks of the Great Beyond in Pytorch, R, Tensorflow, and Python
Dr. Ganapathi Pulipaka
No ratings yet
Your First Python Program
From Everand
Your First Python Program
Alexander Paz
No ratings yet
Idea Presentation Format SIH2023 College
Document5 pages
Idea Presentation Format SIH2023 College
dk2940016
No ratings yet
Review Calculator
Document10 pages
Review Calculator
dk2940016
No ratings yet
Encryption Algorithm
Document62 pages
Encryption Algorithm
dk2940016
No ratings yet
Car Parking
Document45 pages
Car Parking
dk2940016
No ratings yet
Advanced Image To Speech Conversion
Document46 pages
Advanced Image To Speech Conversion
dk2940016
No ratings yet