
Fundamentals of Multimedia Signal Processing

Lecture 1
Digital Signal Processing: Basics
Hoang Van Xiem, Ph.D.
xiemhoang@gmail.com

Hanoi, 2018
Outline

1. DSP: Introduction
2. Fourier Transform
3. Discrete Cosine Transform (DCT)
4. DCT in Image and Video Coding: A case study

Hoang Van Xiem, Ph.D., VNU-UET Multimedia Signal Processing


DSP: Introduction

• Digital signal: a sequence of numbers that represent samples of a continuous variable in a domain such as time, space, or frequency.

[Figure: audio signal, image signal, digital image signal]

DSP: Introduction

• Signal sampling: converting a continuous signal into a discrete one and assigning an appropriate digital value to each sample. It involves two steps:
  - Discretization
  - Quantization
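The two steps above can be sketched as follows. This is only an illustration under assumed values that do not come from the lecture: a 1 Hz sine wave as the continuous signal, an 8 Hz sampling rate, and a 3-bit uniform quantizer over the range [-1, 1]. The function names `sample` and `quantize` are hypothetical.

```python
import math

def sample(signal, fs, duration):
    """Discretization: evaluate a continuous-time function at t = n / fs."""
    n_samples = int(round(fs * duration))
    return [signal(n / fs) for n in range(n_samples)]

def quantize(samples, bits, lo=-1.0, hi=1.0):
    """Uniform quantization: map each sample to one of 2**bits integer codes."""
    levels = 2 ** bits
    step = (hi - lo) / (levels - 1)
    codes = []
    for s in samples:
        s = min(max(s, lo), hi)            # clip to the quantizer range
        codes.append(round((s - lo) / step))
    return codes, step

# Assumed input: a 1 Hz sine sampled at 8 Hz for one second, 3 bits per sample.
samples = sample(lambda t: math.sin(2 * math.pi * t), fs=8, duration=1.0)
codes, step = quantize(samples, bits=3)

# Map the integer codes back to amplitudes to see the quantization error.
reconstructed = [-1.0 + c * step for c in codes]
max_error = max(abs(s - r) for s, r in zip(samples, reconstructed))
print(codes, max_error)
```

With a uniform quantizer of this kind, the error of any in-range sample is at most half a quantization step, so adding bits shrinks the error.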

• The Nyquist–Shannon sampling theorem states that a band-limited signal can be exactly reconstructed from its samples if the sampling frequency is greater than twice the highest frequency component in the signal.
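A small sketch of what goes wrong when the condition is violated. The frequencies are assumptions chosen for illustration: a 3 Hz cosine sampled at 4 Hz (below 2 × 3 Hz) produces exactly the same samples as a 1 Hz cosine, so the two cannot be told apart after sampling; at 8 Hz the samples differ. The helper names are hypothetical.

```python
import math

def sample_cosine(freq_hz, fs_hz, n_samples):
    """Return n_samples of cos(2*pi*freq*t) taken at t = n / fs."""
    return [math.cos(2 * math.pi * freq_hz * n / fs_hz) for n in range(n_samples)]

def same(a, b, tol=1e-9):
    """True if the two sample sequences agree element by element."""
    return all(abs(p - q) < tol for p, q in zip(a, b))

# fs = 4 Hz is below 2 * 3 Hz: the 3 Hz tone aliases onto 1 Hz.
aliased = same(sample_cosine(3, 4, 16), sample_cosine(1, 4, 16))

# fs = 8 Hz exceeds 2 * 3 Hz: the two tones give different samples.
distinct = not same(sample_cosine(3, 8, 16), sample_cosine(1, 8, 16))

print(aliased, distinct)
```

Both flags evaluate to `True`: the undersampled sequences coincide, while the adequately sampled ones do not.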

Fourier Transform

• Goal: decompose a time-domain signal into its constituent frequencies (frequency domain).

• Formulation:
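One commonly used form of the continuous-time Fourier transform pair is shown here. The notation is an assumption and not necessarily the lecture's own: $x(t)$ is the time-domain signal, $X(f)$ its spectrum, $f$ the frequency in hertz, and $i$ the imaginary unit. Other texts use angular frequency and different scaling factors.

```latex
X(f) = \int_{-\infty}^{\infty} x(t)\, e^{-i 2\pi f t}\, dt,
\qquad
x(t) = \int_{-\infty}^{\infty} X(f)\, e^{i 2\pi f t}\, df
```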

Discrete Fourier Transform

• Goal: convert a finite sequence of equally spaced samples of a function into a same-length sequence of equally spaced samples of the discrete-time Fourier transform (DTFT), which is a complex-valued function of frequency.

• Formulation:
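A standard way to write the DFT and its inverse is given here, assuming the usual convention with a negative exponent in the forward transform and the $1/N$ factor in the inverse. The symbols are assumed for illustration: $x_n$ are the $N$ input samples and $X_k$ the $N$ frequency-domain coefficients.

```latex
X_k = \sum_{n=0}^{N-1} x_n\, e^{-i 2\pi k n / N},
\qquad
x_n = \frac{1}{N} \sum_{k=0}^{N-1} X_k\, e^{i 2\pi k n / N},
\qquad k, n = 0, 1, \dots, N-1
```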

Exercises

• Let N = 4 and x = [1, 2 − i, −i, −1 + 2i]. Compute the discrete Fourier transform X of x.

Solution
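One way to check the exercise numerically is a direct O(N²) evaluation of the DFT sum. This sketch assumes the common convention X[k] = Σ x[n]·exp(−2πi·k·n/N) with no scaling on the forward transform and takes x = [1, 2 − i, −i, −1 + 2i]; the function name `dft` is hypothetical.

```python
import cmath

def dft(x):
    """Naive DFT: X[k] = sum over n of x[n] * exp(-2*pi*i*k*n/N)."""
    N = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * cmath.pi * k * n / N) for n in range(N))
        for k in range(N)
    ]

# Input vector from the exercise (Python writes the imaginary unit as j).
x = [1, 2 - 1j, -1j, -1 + 2j]
X = dft(x)

for k, value in enumerate(X):
    print(k, value)
```

Under that convention the result works out to X = [2, −2 − 2i, −2i, 4 + 4i], up to floating-point rounding.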

Discrete Cosine Transform (DCT)

• DCT: expresses a finite sequence of data points as a sum of cosine functions oscillating at different frequencies.

• DCTs are important in numerous science and engineering applications, including lossy compression of audio (e.g. MP3) and images (e.g. JPEG), where small high-frequency components can be discarded.

• The DCT is a Fourier-related transform similar to the discrete Fourier transform (DFT), but it uses only real numbers.
DCT in video coding: Transform coding

• Block-wise transform coding
  - Orthonormal transform
  - DCT – Type II
  - Thresholding coding
  - DCT coding artifact

Block-wise transform coding

Figure credit: B. Girod, EE398A Image and Video Compression

Orthonormal transform

 Forward transform

 Inverse transform
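In matrix form, a forward and inverse orthonormal transform are usually written as shown here. The notation is assumed for illustration and may differ from the lecture's: $\mathbf{x}$ is the vector of input samples, $\mathbf{y}$ the vector of transform coefficients, and $A$ a real $N \times N$ transform matrix.

```latex
\mathbf{y} = A\,\mathbf{x} \quad \text{(forward)},
\qquad
\mathbf{x} = A^{-1}\mathbf{y} = A^{T}\mathbf{y} \quad \text{(inverse)},
\qquad
A^{T}A = A A^{T} = I
```

Because $A$ is orthonormal, the transform preserves energy: $\lVert \mathbf{y} \rVert^2 = \lVert \mathbf{x} \rVert^2$.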

DCT – Type II
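A minimal sketch of a 1-D DCT-II and its inverse in pure Python. It assumes the orthonormal scaling (factor sqrt(1/N) for k = 0 and sqrt(2/N) otherwise), which is one of several conventions in use; the function names and the sample pixel values are hypothetical.

```python
import math

def dct_matrix(N):
    """Rows are the orthonormal DCT-II basis vectors:
    A[k][n] = s(k) * cos(pi * (2n + 1) * k / (2N))."""
    A = []
    for k in range(N):
        scale = math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)
        A.append([scale * math.cos(math.pi * (2 * n + 1) * k / (2 * N))
                  for n in range(N)])
    return A

def dct(x):
    """Forward transform y = A x."""
    A = dct_matrix(len(x))
    return [sum(a * v for a, v in zip(row, x)) for row in A]

def idct(y):
    """Inverse transform x = A^T y (valid because A is orthonormal)."""
    N = len(y)
    A = dct_matrix(N)
    return [sum(A[k][n] * y[k] for k in range(N)) for n in range(N)]

x = [52, 55, 61, 66, 70, 61, 64, 73]   # made-up row of 8 pixel values
y = dct(x)
x_back = idct(y)
print([round(c, 2) for c in y])
```

With this scaling the first coefficient equals the sum of the samples divided by sqrt(N), and the inverse is simply the transpose of the forward matrix.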

Thresholding coding
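Threshold coding can be sketched as: transform an 8x8 block with a 2-D DCT, set every coefficient whose magnitude falls below a threshold to zero, and reconstruct from what remains. The example below is an illustration under assumptions, not the lecture's procedure: it uses the orthonormal DCT-II, a made-up smooth gradient block, and an arbitrary threshold of 2.0. All function names are hypothetical.

```python
import math

N = 8

def dct_matrix(n):
    """Orthonormal DCT-II matrix (rows are basis vectors)."""
    A = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        A.append([scale * math.cos(math.pi * (2 * j + 1) * k / (2 * n))
                  for j in range(n)])
    return A

def matmul(P, Q):
    """Plain matrix product of two lists of lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

def transpose(M):
    return [list(row) for row in zip(*M)]

A = dct_matrix(N)
At = transpose(A)

def dct2(block):
    """2-D DCT of a square block: Y = A X A^T."""
    return matmul(matmul(A, block), At)

def idct2(coeffs):
    """Inverse 2-D DCT: X = A^T Y A."""
    return matmul(matmul(At, coeffs), A)

def threshold(coeffs, t):
    """Keep coefficients with |c| >= t, zero out the rest."""
    return [[c if abs(c) >= t else 0.0 for c in row] for row in coeffs]

# Made-up smooth 8x8 block (a gentle diagonal gradient).
block = [[100 + 5 * r + 3 * c for c in range(N)] for r in range(N)]

coeffs = dct2(block)
kept = threshold(coeffs, t=2.0)
approx = idct2(kept)

n_kept = sum(1 for row in kept for c in row if c != 0.0)
max_err = max(abs(block[r][c] - approx[r][c])
              for r in range(N) for c in range(N))
print(n_kept, max_err)
```

For a smooth block like this one, only a handful of the 64 coefficients survive the threshold while the reconstruction error stays below one gray level, which is why discarding small coefficients compresses well.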

DCT coding artifact

Homework

• Build a program that simulates the DCT as applied to image processing.
  - Show the DCT coefficient values after the transform for several 8x8 image blocks.

Lecture References

1. Wikipedia.com

Recap

• Homework assignment: 15%
• Project: 30%
• Midterm exam: 20%
• Final exam: 35%
