Professional Documents
Culture Documents
Vector Quantization K Means Nearest Neig
Vector Quantization K Means Nearest Neig
Vector Quantization K Means Nearest Neig
Nearest-Neighbor Rules
Vector quantization (VQ)
Is a classical quantization technique from signal processing that
allows the modeling of probability density functions by the
distribution of prototype vectors. It was originally used for
data compression.
It works by dividing a large set of points (vectors) into groups
having approximately the same number of points closest to them.
Each group is represented by its centroid point, as in k-means and
some other clustering algorithms.
Vector quantization (VQ) has been commonly used in the compression
of image and speech signals. In vector quantization, a reproduction
vector (codevector) in a predesigned set of vectors (codebook)
approximates each set (vector) of the input signal. This representative
codevector, the nearest neighbor of the source vector, gives the least
dissimilarity (distortion) among all the codevectors in the codebook. In
vector quantization, compression is achieved by transmitting or storing
the indices associated to the codevectors instead of the codevectors
because of the far fewer bits required for the indices. The following
Figure 1 shows the principle of the resulting encoder and decoder.
Vector quantization is the extension of scalar quantization. Basically in signal
processing converting a analog source - continuous time and amplitude - into a
digital source - discrete time and amplitude - results in two
operations: sampling and quantization. In the following context, a source is
regarded as stochastic process described by a probability density function. A
particularly realization of such a source is a signal. So sampling means to convert
a continuous-time signal into a discrete-time signal – a set of data samples.
Quantization means to convert a continuous-amplitude signal into a discrete-
amplitude signal.
Vector quantization is characterized by its dimension, equal to the number of data
samples in a set, which is quantized jointly as a single vector. Then vector
quantization means to approximate an infinite set of vectors by a limited set of
vectors. This approximation can be regarded as a lossy compression method
characterized by its distortion and its compression rate. The distortion measures
the loss of information induced by the quantization. For images is this loss a
degradation in details. The compression rate measures the gain in representing the
samples by passing from an infinite set to a limited set. In the sense of
transmission, a large compression rate implies a low bit rate and low compression
rate implies a large bit rate to transmit the same number of data samples in the
same time.
Principles of Vector Quantization