Dundigal, Hyderabad - 500 043 Industry Oriented Mini Project

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 24

INSTITUTE OF AERONAUTICAL ENGINEERING

(Autonomous)
Dundigal, Hyderabad - 500 043
ELECTRONICS AND COMMUNICATION ENGINEERING
INDUSTRY ORIENTED MINI PROJECT
III B.Tech II Semester
2018 - 2019

TITLE

SPEECH ENHANCEMENT IN TERMS OF OBJECTIVE


QUALITY MEASURES USING DISCRETE WAVELET
TRANSFORM WITH HYBRID THRESHOLDING

Mentor : Dr. S China Venkateswarlu


Student’s Name: Shweta Verma
Student’s Roll No.: 15951A04J3
EXISTING SPEECH ENHANCEMENT
METHODS OR TECHNIQUES OR
ALGORITHM
 Spectral Subtraction
 Multiband spectral subtraction
 Wiener filter
BLOCK DIAGRAM
WAVELET
WINDOWING

 Windowing is the process of taking a small subset of a


larger dataset, for processing and analysis. A naive
approach, the rectangular window, involves simply
truncating the dataset before and after the window,
while not modifying the contents of the window at all.
 Windows considered in the project are as follows
1.Dolph-Chebyshev window
2.Gaussian window
3.Kaiser window
WHAT IS A WAVELET?

 A wavelet is a rapidly decaying wave like oscillation that


has zero mean.
 Unlike sinusoids which extends till infinity, a wavelet
exists for finite duration.
 Wavelets are more localized in time and frequency
compared to sinusoids.
 Wavelets come in different sizes and shapes.
KEY WAVELET CONCEPTS
 Scaling
 Shifting
 Scaling refers to the process of shrinking or stretching
the signal in time.
 ‘S’ is the scale factor used to depict scaling.
=> ψ(t/s)s
 ‘S’ is inversely proportional to the frequency.
 Stretched wavelets helps in capturing the slowly varying
changes in the signal while the compressed wavelets
helps in capturing the abrupt changes in the signal.

Stretched wavelet Compressed wavelet


 Shifting the wavelet means delaying or advancing the
onset of the wavelet along the signal.
 A shifted wavelet is expressed using the expression
=>Ø(t-k)
 The above equations means that the wavelet is shifted
and centered to ‘k’.
WAVELET TRANSFORM

 Continuous wavelet transform


 Discrete wavelet transform
DISCRETE WAVELET TRANSFORM

 DWT is ideal for de-noising and compressing signals and


images, as it helps in representing many naturally
occurring signals and images with fewer coefficients.
 This enables the sparser representation.
 The base scale in DWT is set to 2, we can vary the base
scale value by raising it to the power of an integer ‘j’, as
shown below:
scaling =>2j where j=(1,2,3,…)
Translation => 2j m where m=(1,2,3,…)
(Translation occurs at inter multiple i.e., ‘m’)
 The DWT process is equivalent to comparing a signal to
discrete multi rate filter banks.
 Conceptually here it is how it works…

 Where,
h[n] represents high pass filter
g[n] represents low pass filter
2 represents down sampling by a factor 2
 DWT equation is given as
 DWT that consists in successively decomposing the
original signal into low frequency components and high
frequency components correspondingly by the mean of
low-pass filter analysis and the high-pass filter analysis.

 The high frequency components (details coefficients) are


not analyzed any further, however the low frequency
components (approximation coefficients) are next
decomposed into new approximation coefficients and
detail coefficients via the following equations,
HYBRID THRESHOLDING
 After performing the transformation method(DWT), de-
noising involves thresholding which is the most important
step in a transform based de-noising
 It consists of eliminating the coefficients of the DWT
transform inferior to a given threshold. There are different
methods of thresholding, such as the hard and the soft
thresholding which are the frequently used methods.
 In this work, we have used the global threshold proposed by
Donoho and Johnstone
=> 2log √ (nλσ)
Where ‘n’ designates the noisy signal length
‘σ’ represents the estimate of the noise standard deviation,
given by:
=> σ = MAD/0.6745
Where the MAD is the absolute median.
 The Soft Thresholding was carried out on the DWT
coefficients before reconstructing the signal.

 Hard Thresholding is given by,

 Finally, we conduct inverse DWT to get the reconstruct


speech signal by grouping back the approximate and
detailed coefficients which are currently noise free.
INVERSE DISCRETE WAVELET
TRANSFORM

 The original speech signal is reconstructed using the


inverse discrete wavelet transform (IDWT). Firstly, the
approximation and detail coefficients are up-sampled
through a factor of 2, and next, in that order filtered by
the low-pass analysis filter h[n] and high pass filter g[n].
OVERLAP ADD METHOD

 In signal processing, the overlap–add method (OA,


OLA) is an efficient way to evaluate the
discrete convolution of a very long signal x[n] with
a finite impulse response (FIR) filter h[n]:
OBJECTIVE MEASURES

1. SNR: SNR defined as the ratio of signal power to the


noise power, often expressed in decibels(dB).

2. Segmental SNR: The Seg-SNR is the frame-based SNR


3. Weighted Spectral Slope Distance: WSS distance
measure computes the weighted difference betweenthe
spectral slopes in each frequency band. The spectral slope
is obtained as the difference between adjacent spectral
magnitudes in decibels.

4. Log Likelihood Ratio: The log-likelihood function is


typically used to derive the maximum likelihood estimator
of the parameter The LLR measure is defined as
5. Cepstrum distance: The Cepstrum distance [20]
provides an estimate of the log spectral distance between
two spectra. It is defined as

6.Frequency segmental SNR: It is expressed by the


following equation,
OBTAINED RESULTS
0 -1.4
-CHEBYSHEV-WIN -CHEBYSHEV-WIN
--HAMMING-WIN --HAMMING-WIN
-1.6
-1

-1.8

-2
-2

SEG-SNR
SNR

-3 -2.2

-2.4
-4

-2.6
-5
-2.8

-6 -3
0 10 20 30 40 50 60 70 80 0 10 20 30 40 50 60 70 8
CHEBYSHEV WINDOW alpa VALUE CHEBYSHEV WINDOW alpa VALUE
THANK YOU

You might also like