Image Caption Generator
Presented By:
Krupa Patel (190633116001)
Riya Soni (190633116004)
Maitri Patel (190633116003)
Moniska Patel (180630116014)
Guided By: Prof. Pritesh Pandey
Content:
Abstract
Introduction
Canvas
What is CNN?
What is LSTM?
CNN LSTM Model
Image Caption Generator Model
Abstract:
When you see an image, your brain can easily tell what the
image is about, but can a computer tell what the image
represents? Computer vision researchers have worked on
this problem extensively, and for a long time it was considered out of reach.
With advances in deep learning techniques, the
availability of large datasets, and greater computing power, we can now
build models that generate captions for an image.
• Encoder-decoder model
• Multimodal layer
Encoder-decoder model: machine translation
Encoder-decoder model: image caption
Use the encoder-decoder structure:
• Encoder (RNN): transforms the source-language sentence into a
rich fixed-length vector
• Decoder (RNN): takes the output of the encoder as input and
generates the target sentence
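
The two bullets above can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the model used in this project: the weights are random and untrained, tokens are integer ids, and all names (`encode`, `decode`, `rnn_step`, the vocabulary and hidden sizes, the start/end token ids) are hypothetical. It only shows the data flow: the encoder compresses an input sequence of any length into one fixed-length vector, and the decoder starts from that vector and emits output tokens one at a time (greedy choice) until it produces an end token or reaches a length limit.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Random weight matrix (untrained; for illustration only)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Matrix-vector product on plain Python lists."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def one_hot(index, size):
    """One-hot vector for an integer token id."""
    vec = [0.0] * size
    vec[index] = 1.0
    return vec


def rnn_step(w_in, w_hid, x, h):
    """One simple RNN step: h_new = tanh(W_in x + W_hid h)."""
    return [math.tanh(a + b) for a, b in zip(matvec(w_in, x), matvec(w_hid, h))]


def encode(tokens, vocab_size, hidden_size, w_in, w_hid):
    """Encoder: read the whole source sequence, return the final hidden state."""
    h = [0.0] * hidden_size
    for tok in tokens:
        h = rnn_step(w_in, w_hid, one_hot(tok, vocab_size), h)
    return h  # fixed-length vector, independent of len(tokens)


def decode(context, vocab_size, w_in, w_hid, w_out, start_id, end_id, max_len):
    """Decoder: start from the encoder output and greedily emit target tokens."""
    h = context
    token = start_id
    output = []
    for _ in range(max_len):
        h = rnn_step(w_in, w_hid, one_hot(token, vocab_size), h)
        scores = matvec(w_out, h)
        token = max(range(vocab_size), key=lambda i: scores[i])  # greedy pick
        if token == end_id:
            break
        output.append(token)
    return output


# Hypothetical sizes and token ids, chosen only for this sketch.
SRC_VOCAB, TGT_VOCAB, HIDDEN = 6, 5, 8
START_ID, END_ID, MAX_LEN = 0, 1, 10

rng = random.Random(0)
enc_w_in = make_matrix(HIDDEN, SRC_VOCAB, rng)
enc_w_hid = make_matrix(HIDDEN, HIDDEN, rng)
dec_w_in = make_matrix(HIDDEN, TGT_VOCAB, rng)
dec_w_hid = make_matrix(HIDDEN, HIDDEN, rng)
dec_w_out = make_matrix(TGT_VOCAB, HIDDEN, rng)

context_short = encode([1, 2], SRC_VOCAB, HIDDEN, enc_w_in, enc_w_hid)
context_long = encode([1, 2, 3, 4, 5], SRC_VOCAB, HIDDEN, enc_w_in, enc_w_hid)
generated = decode(context_long, TGT_VOCAB, dec_w_in, dec_w_hid, dec_w_out,
                   START_ID, END_ID, MAX_LEN)
print(len(context_short), len(context_long), generated)
```

Both context vectors have the same length even though the inputs differ in length, which is the "fixed-length vector" property named above. For image captioning, the same decoder idea applies, with the encoder output replaced by a feature vector computed from the image by a CNN.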
Encoder-decoder model in time:
Multimodal layer
Conclusion and future work:
We analyzed and modified the LRCN (Long-term Recurrent
Convolutional Network) image captioning method. To understand
the method in depth, we decomposed it into three parts: the CNN,
the RNN, and sentence generation. For each part, we modified or
replaced the component to see its influence on the final result.