Professional Documents
Culture Documents
Multimedia Systems: Multimedia Databases - Image Processing Basics
Multimedia Systems: Multimedia Databases - Image Processing Basics
Multimedia Databases
Outline
• Image Processing Basics
– Image Features
– Image Segmentation
• Additional Reference: Wasfi Al-Khatib, Y.
Francis Day, Arif Ghafoor, and P. Bruce Berra.
Semantic modeling and knowledge representation
in multimedia databases. IEEE Transactions on
Knowledge and Data Engineering, 11(1):64-80,
1999.
Image Processing
• Image processing involves the analysis of
scenes or the reconstruction of models from
images representing 2D or 3D objects.
– Image Analysis
• Image Segmentation
• Image Recognition
• We will look at image processing from a
database perspective.
– Objective: Design of robust image processing and
recognition techniques to support semantic modeling,
knowledge representation, and querying of images.
Semantic Modeling and Knowledge
Representation in Image Databases
•Feature Extraction.
•Salient Object Identification.
•Content-Based Indexing and Retrieval. •
Query Formulation and Processing. Multi-
Level Abstraction
Semantic Modeling
Object
Recognition Layer Object Models Object Recognition
Process
Feature
Extraction Layer
Feature Specification Feature Extraction
Process
Multimedia
Data
Image Data
Still Video Frames
Global Features
• Generally emphasize coarse-grained pattern
matching techniques.
• Transform the whole image into a functional
representation.
• Finer details within individual parts of the image are
ignored.
• Examples: Color histograms and coherence vectors,
Texture, Fast Fourier Transform, Hough Transform,
and Eigenvalues.
Color Histogram
• How many pixels of the image take a
specific color
– In order to control the number of colors, the
domain is discretized
• E.g. consider the value of the two leftmost bits in
each color channel (RGB).
+j 1⎟⎟⎞⎠
Texture
• Texture is a small surface structure
– Natural or artificial
– Regular or irregular
• Examples include
– Wood barks
– Knitting patterns – The surface of a
sponge Texture Examples
– Artificial/pe
riodic
– Artificial/no
n-periodic
– Photographi
c/pseudo-
periodic
– Photographi
c/random –
Photographic/structured – Inhomogeneous
(non-texture) Texture
• Two basic approaches to study texture
Global Features
• Advantages:
– Simple.
– Low computational complexity.
• Disadvantages: – Low accuracy
Local Features
• Images are segmented into a collection of smaller
regions, with each region representing a potential
object of interest (fine-grained).
• An object of interest may represent a simple
semantic object (e.g. a round object).
• Choice of features is domain specific:
– X-ray imaging, GIS, ...etc require spatial features (e.g.
shapes [may be calculated through edges] and
dimensions.)
– Paintings, MMR imaging, ...etc may use color features in
specific regions of the image.
Edge Detection
• A given input image E is used to gradually
compute a (zero-initialized) output image A.
– A convolution mask runs across E pixel by pixel
and links the entries in the mask at each position
that M occupies in E with the gray value of the
underlying image dots.
– The result of the linkage (and the subsequent sum
across all products from the mask entry and the
gray value of the underlying image pixel) is
written to the output image A.
Convolution
• Convolution is a simple mathematical operation which is
fundamental to many common image processing operators.
• Convolution provides a way of `multiplying together' two
arrays of numbers, generally of different sizes, but of the
same dimensionality, to produce a third array of numbers
of the same dimensionality.
• This can be used in image processing to implement
operators whose output pixel values are simple linear
combinations of certain input pixel values.
• The convolution is performed by sliding the kernel over
the image, generally starting at the top left corner, so as to
move the kernel through all the positions where the kernel
fits entirely within the boundaries of the image.
Convolution Computation
• If the image E has M rows and N columns, and the
kernel K has m rows and n columns, then the size
Similarity Metrics
• Minkowski Distance 1
⎛F ⎞
r r
⎜∑x[i]− y[i] ⎟
⎝ i=1 ⎠
• Weighted Distance
– Average Distance
Image Segmentation
• Assigning a unique number to “object” pixels
based on different intensities or colors in the
foreground and the background regions of an
image
– Can be used in the object recognition process, but
it is not object recognition on its own
• Segmentation Methods
– Pixel oriented methods – Edge oriented methods –
Region oriented methods
– ....
Pixel-Oriented Segmentation
• Gray-values of pixels are studied in isolation
• Looks at the gray-level histogram of an image and
finds one or more thresholds in the histogram
– Ideally, the histogram has a region without pixels,
which is set as the threshold, and hence the image is
divided into a foreground and a background based on
that (Bimodal Distribution)
• Major drawback of this approach is that object and
background histograms overlap.
– Bimodal distribution rarely occurs in nature.
Edge-Oriented Segmentation
• Segmentation is carried out as follows
– Edges of an image are extracted (using Canny
operators, e.g.)
– Edges are connected to form closed contours
around the objects.
• Hough Transform
Region-Oriented Segmentation
• A major disadvantage of the previous
approaches is the lack of “spatial” relationship
considerations of pixels.
– Neighboring pixels normally have similar properties
• The segmentation (region-growing) is carried out
as follows
1N N 2
• With standard deviation σk = n2 ∑∑i= =1 j 1(P(i, j)−mk ) •
– Semantic networks
– Mathematical logic
– Constraints
– Inclusion hierarchies – Frames.
Semantic Networks
• First introduced to represent the meanings of English
sentences in terms of words and relationships between
them.
• Semantic networks are graphs of nodes representing
concepts that are linked together by arcs representing
relationships between these concepts.