Professional Documents
Culture Documents
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
CS-6350
Prof. Sukhendu Das
Deptt. of Computer Science and Engg.,
IIT Madras, Chennai 600036.
Email:
sdas@iitm.ac.in
URL: //www.cse.iitm.ac.in/~sdas
//www.cse.iitm.ac.in/~vplab/computer_vision.html
1
INTRODUCTION
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
Contents to be covered
Introduction
Neighborhood and Connectivity of pixels
Fourier Theory, Filtering in spatial and spectral domains
3D transformations, projection and stereo
Histogram based image processing
Concepts in Edge Detection
Hough Transform
Scale-Space - Image Pyramids
Feature extraction (recent trends) detectors and descriptors
Image segmentation
Texture analysis using Gabor filters
Pattern Recognition
Bag of Words and Prob. Graphical Models
Object Recognition
Motion Analysis
Use slides as brief :
Shape from Shading
Points, comments, links
Wavelet transform
Reconstruction - affine, model-based
Registration and Matching
20 Solid Modelling;
22. Hardware;
21. Color
23. Morphology
References
1. Digital Image Processing; R. C. Gonzalez and R. E.
Woods; Addison Wesley; 1992+.
2. 3-D Computer Vision; Y. Shirai; Springer-Verlag, 1984.
3. Digital Image Processing and Computer Vision; Robert J.
Schallkoff; John Wiley and Sons; 1989+.
4. Pattern Recognition: Statistical. Structural and Neural
Approaches; Robert J. Schallkoff; John Wiley and Sons;
1992+.
5. Computer Vision: A Modern Approach; D. A. Forsyth and
J. Ponce; Pearson Education; 2003.
6. Computer Vision: Algorithms and Applications by
Richard Szeliski; Springer-Verlag London Limited 2011 .
7.
References (Contd..)
Journals:
IEEE-T-PAMI ( Transactions on Pattern Analysis and
Machine Intelligence)
IEEE-T-IP ( Transactions on Image processing)
PR (Pattern Recognition)
PRL (Pattern Recognition Letters)
CVGIP ( Computer Vision, Graphics & Image
Processing)
IJCV (International Journal of Computer Vision)
Online links
1. CV online: http://homepages.inf.ed.ac.uk/rbf/CVonline
2. Computer Vision Homepage:
http://www-2.cs.cmu.edu/afs/cs/project/cil/ftp/html/vision.html5
15 - 20
35 40
TPA -
35 - 40
TUTS -
05 - 10
___________________
Total
100
Image
Digitizer
light
Computer
system
Reflected
light
Computer
Vision
Models,
Object/Scene
representation
Images,
scenes,
pictures
Visualization
VLSI &
Architecture
CG
ANN
Optimization
Techniques
DIP
Computer
Vision
Parallel and
Distributed
Processing
PR
Probability
&
Fuzzy
11
ML
Computational
Neurosciences
GPU
Fuzzy
& Soft computing
ANN
Optimization
Methods
PR
Computer
Graphics
DSP
Prob.
& Stat.
Linear algebra;
Subspaces
Mass
storage
Digitizer
Image
Processor
Digital
Computer
Operator
Console
Display
Hard copy
device
14
16
A digital Image
Image is an array of integers: f(x,y)
where, x,y
{0,1,.,Imax-1},
{0,1,..,N-1}
(spatial
sampling)
and
brightness
(quantization) values.
The elements of such an array are called pixels
(picture elements).
The storage requirement for an image depends on
the spatial resolution and number of bits necessary for
pixel quantization.
The
processing
of
an
image
depends
on
the
Compression,
(ii)
Segmentation,
(iii)
Recognition and
(iv)
motion.
18
Character Recognition,
Document processing,
Commercial (signature & seal verification) application,
Biometry and Forensic (authentication: recognition
and verification of persons using face, palm &
fingerprint),
Pose and gesture identification,
Automatic inspection of industrial products,
Industrial process monitoring,
Biomedical Engg. (Diagnosis and surgery),
Military surveillance and target identification,
Navigation and mobility (for robots and unmanned
vehicles - land, air and underwater),
22
Vehicle Segmentation
Anti-forging Stamps
Scratch Detection
Vehicle Categorization
Security Monitoring
Pattern Recognition
for Objects, scenes;
Feature extraction:
Canny, GHT, Snakes,
DWT, Corners,
SIFT, GLOH, LESH;
Multi-sensor data,
Decision and feature fusion;
Steganography and
Watermarking;
24
image enhancement,
image reconstruction
feature extraction,
computational geometry,
image segmentation,
image morphology,
image matching,
Neuro-fuzzy techniques,
image synthesis,
computational geometry,
image representation,
26
27
Results of
Segmentation
Input Image
Segmented map
before integration
Results
Handdrawn
SNAKECUT Extraction of a
Foreground Object with holes
Our proposed approach for
segmentation of an object with a hole,
using a combination of
(i) Active Contour and (ii) GrabCut
Here, objective is
to crop the soldier
from the input image
Cropped image should
not contain any part of
the background
30
Snake
Output
GrabCut Output
SnakeCut
Output
31
Method 1
Image
Unsupervised Saliency
IT
FT
CA
GB
IS
RC
HFT
SF
Proposed
GT
Method 2
Image
PARAM
MR
wCrt
Proposed
GT
Results of top 20 image retrievals (arranged in row-major order) shown for visual comparative
study, using: (a) query image from the PASCAL datasets; (b) MTH (2010); (c) MSD (2011), (d)
SLAR (2012); (e) CDH (2013); and (f) our proposed RADAR framework. Erroneous results are
highlighted using a red template
Gallery
Image
Landmark
Localization
VJFD
Probe
Image
Detection
of Face
Parts
Gallery
FR_SURV
Probe
EDA
MDS
MDS
COMP_DEG
COMP_DEG
SIFT : Result
Object detection
IMRN
IMT
IMT
IMR
44
IMRN
IMR
IMT
45
IEEE TRANSACTIONS ON
COMMUNICATIONS,
VOL. COM-34, NO. 11, NOVEMBER 1986
Classified Vector Quantization of Images
BHASKAR RAMAMURTHI
& Allen G.
46
Thank you
47
48