Professional Documents
Culture Documents
Project
Project
By
Machine Learning
Image Processing
CORE AREA OF PROBLEM
Computer Vision
Object detection
Currently the existing state of the art methods can detect a single object or
multiple non overlapping objects
This makes the detection useless for any analysis of an entire scene
Object labelling
Image search
The existing image search works only on the file name of an image, and not on the
details of the scene
This should be overcome and the search should happen only on the basis of what is
there in the image, instead of the filename
The existing Image detection, detects a single prominent part of the image and
cannot detect if there are variations of viewpoint.
This should be overcome and multiple objects need to be detected and the whole
scene has to be described.
PROBLEM STATEMENT
Our ability to effortlessly describe all aspects of an image relies on a strong semantic
understanding of a visual scene and all of its elements. However, despite numerous
potential applications, this ability remains a challenge for our state of the art visual
recognition systems
Our goal is to design an architecture that jointly localizes regions of interest and the
describes each with natural language
Sample Input :
Output Expected :
THANK YOU