Lecture 2 Digital Image Processing

Digital Image Processing
Basics
Dr. Surya Prakash

surya@iiti.ac.in
CS419/619: Computer Vision

 Digital Image processing
– concerned with taking
• Image (one array of pixels) as input and
• producing another image (array of pixels) as output
– This transformation in some way represents
an improvement to the original image
Image Processing
Transformation
Input output
2 CS419/619: Computer Vision
 Improvement of Pictorial Information
– improve the contrast of the image
– remove noise

 Improvement of Pictorial Information …contd
– remove blurring caused by movement of the camera
during image acquisition
– correction of geometrical distortions caused by the lens

Digital Image-representation
 How do we store an image in computer?
Intensity or
colour of each tiny
square can be
stored in an array
2D array for

grayscale images
3D array for

colour images

Digital Image-representation
Magnified view
 A digital image is a 2-D array of intensity values, say f(x, y)

 f(x,y) represents 2-D intensity function discretized
– in spatial coordinates (spatial sampling) and
– brightness (quantization) values. An Image represented in the
form of a 2-D grid

Pixel
 Pixels:
– The elements of such an array (2-D grid) are
called pixels (picture elements)
– Smallest unit of an image
 Storage Requirement of an Image:
– The storage requirement for an image depends on
the
• Spatial resolution (image rows, columns) and
• Number of bits necessary for pixel quantization

Gray Levels in an Image
 A digital Image is an array of integers:
– f(x,y) ∈ {0,1,….,Imax-1},
– x, y ∈ {0,1,…..,N-1}
 N is the resolution of the image and Imax is the
level of discretized brightness value
 Note:
– Larger the value of N, more is the clarity of the
picture (larger resolution), but more data to be
analyzed in the image

Digital Image - Storage Size
 Gray-level Image
– If the image is a gray-level (8-bit per pixel)
image, then it requires N2 Bytes for storage
 Color Image
– If the image is a color image – RGB
representation, each pixel requires 3 Bytes of
storage space.
– Hence color image of size N × N would be 3N2
bytes

Digital Image - Storage Size
Storage space Storage space

Image Size needed for needed for color
grayscale image image
64 x 64 4 KB 12 KB
256 x 256 64 KB 192 KB
512 x 512 256 KB 768 KB

Levels in Image Processing
 Three main categories of Task in CV
– There are generally three main categories of
tasks involved in a complete computer vision
system.
Image processing
 They are:
– Low level processing
– Intermediate level processing
– High level processing
Computer Vision

Levels of Image Processing
 Low level processing:
– Involves image processing tasks in which the quality of the
image is improved
– This improvement benefits the human observers and higher
level routines to perform better
Enhanced
Input Image
Image

Low Level Processing – Example
Input Image Enhanced Image

 Intermediate level processing:
– Involves the processes of feature extraction and
pattern detection tasks.
– The algorithms used here are chosen and tuned in
a manner as may be required to assist the final
tasks of high level vision.

Intermediate Level Vision – Example
 Feature Extraction
– Ridge ending
– Ridge bifurcation
Enhanced Image

 High level vision:
– Involves autonomous interpretation of scenes for
• pattern classification
• recognition and
• identification of objects in the scenes
• any other information required for human
understanding
– A top down approach, rather than a bottom-up
approach, is used in the design of these systems
in many applications
• Top down – general to specific
• Bottom up – specific to general

High level vision – Example
Minutia points defined as Line shows the matching minutiae points between
[(x, y,), type ] two fingerprints

4 Important High-level DIP Tasks
 There exists four broad categories of
tasks in digital image processing:
– Segmentation
– Recognition
– Compression
– Motion analysis

Segmentation
Examples
Segmentation
 It deals with the process of fragmenting the
image into homogeneous meaningful parts,
regions or sub-images
 Techniques used for segmentation
– generally based on the analysis of the histogram
of images using gray level values as features.
– Other features used for segmentation are
• edges or lines,
• colors and
• textures.

Recognition
 It deals with identification
or classification of objects
in an image for the purpose
of interpretation or
identification
Recognition of various
objects present in the
scene

Object Recognition Approaches
 Recognition is based on models, which
represent an object
 A system is trained (using HMM, GMM, ANN etc.) to

learn or store the models, based on training samples
– HMM: Hidden Morkov Model
– GMM: Gaussian Mixture Model
– ANN: Artificial Neural Network
 The test data is matched with all such models to

identify the object with a certain measure of
confidence

Compression
 It involves methodologies for efficient storage and
retrieval of image data, which occupies large disk
space
 Image compression may be
– Lossy or
– lossless
 Lossless compression is preferred where every
bit is important
– for archival purposes
– medical applications
– technical drawings

Compression
 Lossy compression methods:
– Used especially when low bit rates required
– introduces compression artifacts
– suitable for natural images such as photographs in
applications where minor (sometimes
imperceptible) loss of fidelity is acceptable to
achieve a substantial reduction in bit rate
 Note:
– The lossy compression that produces
imperceptible differences may be called visually
lossless

Compression
 Typical methods of image compression are:
• Run length coding (Lossless)
• Huffman Coding (Lossless)
• Discrete Cosine Transform (DCT) based (Lossy)
– Used in JPEG compression
• Wavelet based (Lossy)
 Typical methods of video compression are:
• MPEG-I, II, IV & VII
• H.264 or MPEG-4 Part 10, Advanced Video
Coding (MPEG-4 AVC)
– It is a block-oriented motion-compensation-based video
compression standard

Motion Analysis
 Also known as dynamic scene analysis
 It involves tracking and estimation of the

path of movement of object/s from a
sequence of frames
 Typical methods for motion analysis use
• Optical flow
• Kalman filter
• Mean-shift tracking (Kernel-based tracking)
• Kanade–Lucas–Tomasi (KLT) feature tracker
Motion Analysis
 Motion analysis is based on
– (i) tracking
– (ii) obtaining correspondence between
frames and
 And then
– (iii) estimating the motion parameters and
– (iv) structure of moving objects

End

Lecture 2 Digital Image Processing

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Lecture 2 Digital Image Processing

Uploaded by

Copyright:

Available Formats

Digital Image Processing

Dr. Surya Prakash

CS419/619: Computer Vision

3 CS419/619: Computer Vision

– correction of geometrical distortions caused by the lens

4 CS419/619: Computer Vision

2D array for

3D array for

5 CS419/619: Computer Vision

 A digital image is a 2-D array of intensity values, say f(x, y)

6 CS419/619: Computer Vision

7 CS419/619: Computer Vision

8 CS419/619: Computer Vision

9 CS419/619: Computer Vision

Storage space Storage space

10 CS419/619: Computer Vision

11 CS419/619: Computer Vision

12 CS419/619: Computer Vision

Input Image Enhanced Image

13 CS419/619: Computer Vision

14 CS419/619: Computer Vision

15 CS419/619: Computer Vision

16 CS419/619: Computer Vision

17 CS419/619: Computer Vision

18 CS419/619: Computer Vision

20 CS419/619: Computer Vision

21 CS419/619: Computer Vision

 A system is trained (using HMM, GMM, ANN etc.) to

 The test data is matched with all such models to

22 CS419/619: Computer Vision

23 CS419/619: Computer Vision

24 CS419/619: Computer Vision

25 CS419/619: Computer Vision

 It involves tracking and estimation of the

27 CS419/619: Computer Vision

28 CS419/619: Computer Vision

You might also like