Professional Documents
Culture Documents
Signal
Signal
net/publication/318224324
CITATIONS READS
4 7,260
1 author:
Mittal Darji
G H Patel College of Engineering and Technology (GCET)
8 PUBLICATIONS 15 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Mittal Darji on 06 July 2017.
CSEIT1722347 | Received : 03 May 2017 | Accepted : 11 May 2017 | May-June-2017 [(2)3: 227-230 ] 227
Rather than its actual value at a time, its variations over
Audio signal varies with the time. They can be the time can provide information that is more useful. It
represented either in time domain or in frequency provides the representation of variations of amplitude
domain. Each of our sound sensation is related with over the time. It can be calculated as,
one or more spectral or temporal property of sound.
Hence, features of both, time domain and frequency STE = ∑ | | | | (2)
domain are jointly required for representation of sound.
3) Spectral Centroid
III. AUDIO FEATURES
It indicates where most of energy of the signal is. It
Heart of classification is to find features, which can measures the brightness of the signal. Due to its
provide a good discrimination among various classes. effectiveness of describing spectral shape, centroid has
Years of research on audio signal features has given 2 been used in classification of audio. It can be given as,
broad categories of features: physical features and
perceptual features. Features can also be classified as ∑ [ ]| [ ]|
static and dynamic. Static features are the measurement C= (3)
∑ | [ ]|
of feature of signal at a particular time. Long time
variations in static features can provide dynamic Where, N is the number of FFT points, [ ] is STFT
features. In this paper, physical and perceptual types of of frame r and f[K] is the frequency of bean k.
features are focused on.
4) Spectral Roll-off
A. Physical Features
This is another measure of spectral shape of signal. It
Physical features are such low-level features, which can
was first used as feature to separate voiced and
directly be measured from frequency or time
unvoiced signal. It is calculated as,
representation of audio signal. They are physical as
R = f[K]
they can be computed directly from the amplitude
values or other spectral values of signal.
If K is the largest bean having,
1) Zero Crossing Rate
∑ | [ ]| ∑ | [ ]| (4)
It is the rate of sign-changes along a signal. It is
defined as the number of times zero crossed within a 5) Spectral Flux
frame. [6]
Spectral flux is also called as delta spectrum
ZCR is an important feature for many classifications magnitude. It measures rate of change in spectral shape.
like separating signal into voiced and unvoiced, gender It can be calculated as the frame-to-frame magnitude
classification etc. it provides useful information at very spectral difference. High value of flux indicated sudden
cost. It can be calculated as, change in magnitude.