Welcome to Scribd!

Project

Uploaded by

0% found this document useful (0 votes)

29 views8 pages

This document discusses developing an end-to-end neural network approach for captioning complex images. The core problem areas are computer vision and generating detailed descriptions of entire scenes from images. Existing methods struggle with detecting multiple overlapping objects and labeling complex, dense images. The proposed approach uses a convolutional network to localize regions of interest, with a recurrent neural network language model to generate natural language captions describing each region and the full scene. The goal is an architecture that jointly performs localization and generation of descriptive label sequences for complex images.

Original Description:

proj

Copyright

Available Formats

PPTX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pptx, pdf, or txt

0% found this document useful (0 votes)

29 views8 pages

Project

Uploaded by

Shreyas Kash Prince

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pptx, pdf, or txt

Jump to Page

You are on page 1of 8

Search inside document

AN END-TO-END TRAINING BASED APPROACH FOR CAPTIONING

COMPLEX IMAGES USING NEURAL NETWORKS

Shreyas V Kashyap - 1BG13CS097

Swaroop Gupta B.A -1BG13CS113
Spandana Rao B.A - 1BG13CS103

Under the Guidance of

Smt.Jebah Jaykumar
AREA OF RESEARCH

We have hereby chose the area of research to be a combination of the following

research extensive areas.

Machine Learning

Image Processing
CORE AREA OF PROBLEM

Our core area of problem is

Computer Vision

Program a computer to "understand" a scene or features in an image.

Concerned with the automatic extraction, analysis and understanding of useful

information from a single image or a sequence of images
RELEVANCE OF THE PROBLEM

Object detection

Currently the existing state of the art methods can detect a single object or
multiple non overlapping objects

This makes the detection useless for any analysis of an entire scene

Object labelling

The labelling of single prominent object in an image

Unable to describe a scene in a complex dense detailed image

APPLICATION OF THE PROBLEM

Currently the problem persists in the following applications of Computer Vision

Image search

The existing image search works only on the file name of an image, and not on the
details of the scene
This should be overcome and the search should happen only on the basis of what is
there in the image, instead of the filename

Image scene detection

The existing Image detection, detects a single prominent part of the image and
cannot detect if there are variations of viewpoint.
This should be overcome and multiple objects need to be detected and the whole
scene has to be described.
PROBLEM STATEMENT

Our ability to effortlessly describe all aspects of an image relies on a strong semantic
understanding of a visual scene and all of its elements. However, despite numerous
potential applications, this ability remains a challenge for our state of the art visual
recognition systems

Our goal is to design an architecture that jointly localizes regions of interest and the
describes each with natural language

Architecture is composed of a Convolutional Network, an efficient dense localization

layer, and Recurrent Neural Network language model that generates the label
sequences for the complex images.
INPUT / OUTPUT EXAMPLE

Sample Input :

Output of existing System :

Output Expected :
THANK YOU

Shreyas V Kashyap - Amusement Park
Document47 pages
Shreyas V Kashyap - Amusement Park
Shreyas Kash Prince
43% (7)
Chapters (Word) Header As Well
Document50 pages
Chapters (Word) Header As Well
naikjaikishan
No ratings yet
Image Dehazing Using Artificial Intelligence and Multi Exposure
Document50 pages
Image Dehazing Using Artificial Intelligence and Multi Exposure
naikjaikishan
No ratings yet
B.N.M. Institute of Technology: An End-To-End Training Based Approach For Captioning Complex Images Using Neural Networks
Document1 page
B.N.M. Institute of Technology: An End-To-End Training Based Approach For Captioning Complex Images Using Neural Networks
Shreyas Kash Prince
No ratings yet
Deep Learning
Document9 pages
Deep Learning
Anonymous xMYE0TiNBc
No ratings yet
Image Segmentation For Object Detection Using Mask R-CNN in Colab
Document5 pages
Image Segmentation For Object Detection Using Mask R-CNN in Colab
GRD Journals
No ratings yet
Zaid Ubay Siregar PDF
Document4 pages
Zaid Ubay Siregar PDF
muhammad fajar alwi
No ratings yet
Image Preprocessing For Efficient Training of YOLO Deep Learning Networks
Document3 pages
Image Preprocessing For Efficient Training of YOLO Deep Learning Networks
Tic ON
No ratings yet
Attendance Marking System Using Image Recognition: Professor: Sanjay Srivastava
Document15 pages
Attendance Marking System Using Image Recognition: Professor: Sanjay Srivastava
Ghanshyam s.nair
No ratings yet
Multiple Object Detection and Tracking: 1912405@nec - Edu.in 1912036@nec - Edu.in 1912011@nec - Edu.in
Document7 pages
Multiple Object Detection and Tracking: 1912405@nec - Edu.in 1912036@nec - Edu.in 1912011@nec - Edu.in
SRIRAM P
No ratings yet
Block-Based Feature-Level Multi-Focus Image Fusion
Document6 pages
Block-Based Feature-Level Multi-Focus Image Fusion
Anonymous 1aqlkZ
No ratings yet
Real-Time Object Detection Using Deep Learning and Open CV
Document4 pages
Real-Time Object Detection Using Deep Learning and Open CV
Anand Balagar [021]
No ratings yet
Facial Recognition Using Deep Learning
Document6 pages
Facial Recognition Using Deep Learning
International Journal of Innovative Science and Research Technology
No ratings yet
1902 08546
Document5 pages
1902 08546
Dinuo Liao
No ratings yet
Automated Caption Generator For The Visually Imapired: Abstract: Automated Captioning of Photos Is A Mission That
Document6 pages
Automated Caption Generator For The Visually Imapired: Abstract: Automated Captioning of Photos Is A Mission That
anchal
No ratings yet
Understanding Semantic Segmentation With UNET - by Harshall Lamba - Towards Data Science
Document33 pages
Understanding Semantic Segmentation With UNET - by Harshall Lamba - Towards Data Science
dasrisita17
No ratings yet
Structure-Aware Motion Deblurring Using
Document14 pages
Structure-Aware Motion Deblurring Using
pradogel000
No ratings yet
Recognition of Front and Rear IJEIR-13
Document4 pages
Recognition of Front and Rear IJEIR-13
Vivek Deshmukh
No ratings yet
Poster 2
Document1 page
Poster 2
Kishan Senjaliya
No ratings yet
1406 6247 PDF
Document12 pages
1406 6247 PDF
Mihai Ilie
No ratings yet
How It Works
Document9 pages
How It Works
RavenKunレイブン
No ratings yet
Recurrent Models of Visual Attention
Document12 pages
Recurrent Models of Visual Attention
omonait17
No ratings yet
1 s2.0 S0141938221000974 Main
Document6 pages
1 s2.0 S0141938221000974 Main
kamrulhasanbdyahoo.com
No ratings yet
Finalreport
Document56 pages
Finalreport
Saranya Raj
No ratings yet
Computer Vision and Image Understanding: Deli Pei, Zhenguo Li, Rongrong Ji, Fuchun Sun
Document10 pages
Computer Vision and Image Understanding: Deli Pei, Zhenguo Li, Rongrong Ji, Fuchun Sun
tvboxsmart new
No ratings yet
Image Captioning Using R-CNN & LSTM Deep Learning Model
Document4 pages
Image Captioning Using R-CNN & LSTM Deep Learning Model
International Journal of Innovative Science and Research Technology
No ratings yet
Evpplus Whitepaper M1-467
Document9 pages
Evpplus Whitepaper M1-467
Raquel Bedoya
No ratings yet
Real-Time Face Pose Estimation: Jamaldonado12@espe - Edu.ec Agoaq@espe - Edu.ec Msramrezc@espe - Edu.ec
Document13 pages
Real-Time Face Pose Estimation: Jamaldonado12@espe - Edu.ec Agoaq@espe - Edu.ec Msramrezc@espe - Edu.ec
Jandres Maldonado
No ratings yet
Motion Blur Detection and Removal in Images
Document3 pages
Motion Blur Detection and Removal in Images
International Journal of Innovative Science and Research Technology
No ratings yet
Deep Learning Based Object Detection and Recognition Framework For The Visually-Impaired
Document5 pages
Deep Learning Based Object Detection and Recognition Framework For The Visually-Impaired
Abdunabi Muhamadiev
No ratings yet
Ai Review 2 DA 2
Document10 pages
Ai Review 2 DA 2
Babi Nunnaguppala
No ratings yet
Iris Segmantation
Document23 pages
Iris Segmantation
عمر اسامه محمد نصر ٢٠١٠٧٨٤٨
No ratings yet
Real Time Object Detection Using Deep Learning Andmachine Learning Project
Document56 pages
Real Time Object Detection Using Deep Learning Andmachine Learning Project
Shashwat srivastava
No ratings yet
NNDL Unit 5
Document21 pages
NNDL Unit 5
T. S kesav
No ratings yet
Generating Super-Resolution Images Using Computer Vision Approaches
Document6 pages
Generating Super-Resolution Images Using Computer Vision Approaches
abiseban
No ratings yet
Ex 3 SRS
Document5 pages
Ex 3 SRS
Vaibhav Puri
No ratings yet
Narrative Paragraph Generation
Document13 pages
Narrative Paragraph Generation
sid202pk
No ratings yet
Sagar Institute of Research & Technology Department of Electronics & Communication
Document13 pages
Sagar Institute of Research & Technology Department of Electronics & Communication
Shanu Prakash
No ratings yet
CNN Model For Image Classification Using Resnet: Dr. Senbagavalli M & Swetha Shekarappa G
Document10 pages
CNN Model For Image Classification Using Resnet: Dr. Senbagavalli M & Swetha Shekarappa G
TJPRC Publications
No ratings yet
Image and Video Face Retrieval With Query Image Using Convolutional Neural Network Features
Document8 pages
Image and Video Face Retrieval With Query Image Using Convolutional Neural Network Features
IAES IJAI
No ratings yet
SR22804211151
Document8 pages
SR22804211151
Paréto Bessanh
No ratings yet
Research Article: Image Enhancement Method Based On Deep Learning
Document9 pages
Research Article: Image Enhancement Method Based On Deep Learning
JONAS
No ratings yet
Soft Computing
Document11 pages
Soft Computing
Chandra Reddy
No ratings yet
Object Detection
Document29 pages
Object Detection
Bab
No ratings yet
Le y Yang - Tiny ImageNet Visual Recognition Challenge
Document6 pages
Le y Yang - Tiny ImageNet Visual Recognition Challenge
musicalización pacífico
No ratings yet
Document 39
Document7 pages
Document 39
Harsh Modi
No ratings yet
Main Paper
Document6 pages
Main Paper
077bct002.aakrit
No ratings yet
Object Detection and Trackinfg in Videos: N. Rasathi
Document8 pages
Object Detection and Trackinfg in Videos: N. Rasathi
IJITJournals
No ratings yet
A Comprehensive Guide To Deep Neural Network-Based Image Captions
Document17 pages
A Comprehensive Guide To Deep Neural Network-Based Image Captions
International Journal of Innovative Science and Research Technology
No ratings yet
Contour Based Tracking
Document20 pages
Contour Based Tracking
Sahil Singh
No ratings yet
YOLO V3 ML Project
Document15 pages
YOLO V3 ML Project
Annie Shukla
No ratings yet
Cite 69 Memory Enhanced Global-Local Aggregation For Video Object Detection - CVPR - 2020 - Paper
Document10 pages
Cite 69 Memory Enhanced Global-Local Aggregation For Video Object Detection - CVPR - 2020 - Paper
姜华为
No ratings yet
Neural Network Based Image Retrieval With Multiple Instance Leaning Techniques
Document2 pages
Neural Network Based Image Retrieval With Multiple Instance Leaning Techniques
Felipe Camilo Galindo Melo
No ratings yet
Deep Learning For X Ray Image To Text Generation
Document4 pages
Deep Learning For X Ray Image To Text Generation
Editor IJTSRD
No ratings yet
Object Detection & Recognition Techniques and Process: Ankush Bhalla, Sambhav Jain, and Shyam Prasad Gupta
Document5 pages
Object Detection & Recognition Techniques and Process: Ankush Bhalla, Sambhav Jain, and Shyam Prasad Gupta
Aditya Anand
No ratings yet
Image Processing
Document5 pages
Image Processing
Arun Ece
No ratings yet
Machinelearning Unit 4
Document6 pages
Machinelearning Unit 4
yogesh
No ratings yet
2019-Deep Optics For Monocular Depth Estimation and 3D Object Detection
Document10 pages
2019-Deep Optics For Monocular Depth Estimation and 3D Object Detection
cynorr rain
No ratings yet
PA-GAN: A Patch-Attention Based Aggregation Network For Face Recognition in Surveillance
Document10 pages
PA-GAN: A Patch-Attention Based Aggregation Network For Face Recognition in Surveillance
GOURISREE M
No ratings yet
Image Segmentation in Deep Learning
Document12 pages
Image Segmentation in Deep Learning
Hema malini
No ratings yet
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
B.N.M. Institute of Technology: An End-To-End Training Based Approach For Captioning Complex Images Using Neural Networks
Document1 page
B.N.M. Institute of Technology: An End-To-End Training Based Approach For Captioning Complex Images Using Neural Networks
Shreyas Kash Prince
No ratings yet
Claytronics A Synthetic Reality
Document4 pages
Claytronics A Synthetic Reality
Shreyas Kash Prince
No ratings yet
Placements 2012
Document3 pages
Placements 2012
Shreyas Kash Prince
No ratings yet
CS 3340 Written Assignment Unit 5
Document4 pages
CS 3340 Written Assignment Unit 5
pohambadaniel
No ratings yet
Routing Code
Document3 pages
Routing Code
Anil Kumar Reddy Chintha
No ratings yet
Unit 3
Document37 pages
Unit 3
Anuj Sood
No ratings yet
2 Security+Concepts
Document10 pages
2 Security+Concepts
moama
No ratings yet
ch2 PC
Document44 pages
ch2 PC
fariha2002
No ratings yet
Operation Manual: CMCP575-XXX-XXX Speed Transmitter
Document8 pages
Operation Manual: CMCP575-XXX-XXX Speed Transmitter
Yeral Poblete
No ratings yet
Agile Flash Cards
Document24 pages
Agile Flash Cards
Raj
No ratings yet
IP Subnetting: Shahzad Rashid NOC Executive Engineer (TXN) Telenor Pakistan
Document14 pages
IP Subnetting: Shahzad Rashid NOC Executive Engineer (TXN) Telenor Pakistan
Shahzad Rashid
No ratings yet
Advantech Adam 3600 Intelligent Rtu Ebook
Document33 pages
Advantech Adam 3600 Intelligent Rtu Ebook
rusnardi hanif raditya
No ratings yet
Siemens SIMATIC PCS 7
Document3 pages
Siemens SIMATIC PCS 7
Jemerald
No ratings yet
Assignment # 2
Document8 pages
Assignment # 2
Muhammad Irfan
No ratings yet
OffSec Course Catalog 2022
Document10 pages
OffSec Course Catalog 2022
kishorekishore5283
No ratings yet
CS 5 - RTOS-21.08.2021 - Priority Based Scheduling and Laxity Strategie For Dependent Tasks
Document20 pages
CS 5 - RTOS-21.08.2021 - Priority Based Scheduling and Laxity Strategie For Dependent Tasks
neetika gupta
No ratings yet
SB ACS Managed Security Services
Document4 pages
SB ACS Managed Security Services
Yawovi
No ratings yet
Free PDF Reader & Viewer - Online Download - Foxit Software
Document5 pages
Free PDF Reader & Viewer - Online Download - Foxit Software
BaTop BaTop
100% (1)
Ihstat - Bayes Readme
Document1 page
Ihstat - Bayes Readme
Marcus Braga
No ratings yet
Pan Os New Features
Document156 pages
Pan Os New Features
always_red
No ratings yet
421-4-5 GS 20121214
Document3 pages
421-4-5 GS 20121214
sparkCE
No ratings yet
Aim Documents
Document2 pages
Aim Documents
biyyamobulreddy
No ratings yet
3 8086 Instructions
Document135 pages
3 8086 Instructions
saksham mahajan
No ratings yet
AI 102 Dump1
Document201 pages
AI 102 Dump1
Pawan N
No ratings yet
Compiler Design: Dr. M. Moshiul Hoque Dept. of CSE, CUET
Document53 pages
Compiler Design: Dr. M. Moshiul Hoque Dept. of CSE, CUET
AYAN CHAKRABORTY 1604098
No ratings yet
NOSQL
Document6 pages
NOSQL
AKSHAY Kumar
No ratings yet
Esp32-Mini-1 Datasheet en
Document32 pages
Esp32-Mini-1 Datasheet en
Mr Ghost
No ratings yet
Best of Oracle Security 2022
Document56 pages
Best of Oracle Security 2022
pb
No ratings yet
v1 Covered
Document16 pages
v1 Covered
Dr. V. Padmavathi Associate Professor
No ratings yet
GA-E6010N: User's Manual
Document32 pages
GA-E6010N: User's Manual
Miguel Angel Palos M.
No ratings yet
Hetauda School of Management and Social Sciences
Document2 pages
Hetauda School of Management and Social Sciences
Biju aryal
No ratings yet
Computer Graphics Polygon
Document86 pages
Computer Graphics Polygon
Ayush Modi
No ratings yet
Workday Advance Report Tool
Document10 pages
Workday Advance Report Tool
Haritha
No ratings yet