ML CEP Group 3
Submitted by:
Shehroz Jajja 2019-EE-081
Ahmad Daniyal 2019-EE-084
Noman Khalid 2019-EE-166
Ammar Akhtar 2019-EE-167
Group No. 03
List of Figures
1 Introduction
1.1 Speaker Recognition System
1.2 Mel Frequency Cepstral Coefficients (MFCC)
1.3 Support Vector Machines
2 Methodology
2.1 Audio Segmentation
2.2 Extracting Features from Audio Samples
2.3 Train and Test Data
2.4 Classification
List of Figures
Chapter 1
Introduction
To separate the two classes of data points, there are many possible hyperplanes that
could be chosen. Our objective is to find the plane that has the maximum margin, i.e.,
the maximum distance to the nearest data points of both classes. Maximizing the margin
provides some tolerance, so that future data points can be classified with
more confidence.
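The margin idea above can be illustrated with a minimal sketch. The toy points and the two candidate lines are hypothetical values chosen for illustration only; they are not taken from the project data. The helper `margin` (an assumed name) measures the smallest distance from any point to a candidate hyperplane, so the candidate with the larger value is the better separator in the maximum-margin sense.

```python
import math


def margin(w, b, points):
    """Smallest distance from any point to the hyperplane w.x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return min(
        abs(sum(wi * xi for wi, xi in zip(w, p)) + b) / norm
        for p in points
    )


# Hypothetical toy data: two classes in a 2-D feature space.
class_a = [(0.0, 0.0), (0.0, 1.0)]
class_b = [(2.0, 0.0), (2.0, 1.0)]
points = class_a + class_b

# Two candidate separating lines: x1 = 1 (centered) and x1 = 0.5 (off-center).
centered = margin((1.0, 0.0), -1.0, points)
off_center = margin((1.0, 0.0), -0.5, points)

# Both lines separate the classes, but the centered one leaves more room.
print("centered margin:", centered)
print("off-center margin:", off_center)
```

Both lines classify the toy points correctly; the centered line is preferred only because its nearest points are farther away.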
Hyperplanes are decision boundaries that help classify the data points. Data points
falling on either side of the hyperplane can be attributed to different classes. Also, the
dimension of the hyperplane depends upon the number of features. If the number of
input features is 2, then the hyperplane is just a line. If the number of input features is
3, then the hyperplane becomes a two-dimensional plane. With more than 3 features,
the hyperplane becomes difficult to visualize.
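Although a hyperplane in many dimensions is hard to picture, the decision rule itself does not change with the number of features. The sketch below assumes a hyperplane written as w.x + b = 0; the function name `side` and the example weights are hypothetical, chosen only to show the same rule applied to 2 and to 3 features.

```python
def side(w, b, x):
    """Return +1 or -1 depending on which side of w.x + b = 0 the point x lies."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score >= 0 else -1


# 2 features: the hyperplane is the line x1 + x2 - 1 = 0.
line_label = side((1.0, 1.0), -1.0, (2.0, 2.0))

# 3 features: the hyperplane is the plane x1 + x2 + x3 - 1 = 0.
plane_label = side((1.0, 1.0, 1.0), -1.0, (0.0, 0.0, 0.0))

print(line_label, plane_label)
```

The same function works unchanged for any number of features, because only the length of `w` and `x` varies.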
Chapter 2
Methodology
2.4 Classification
Classification is performed over 8 speakers. A linear kernel is sufficient to separate
the data: in our case, the accuracy of the support vector machine comes out to be
95.87 percent. The SVM is imported from the sklearn Python library. A 10-second audio
clip recorded by microphone is used as a test to check the speaker ID predicted by the
classifier.
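A minimal sketch of this classification step with sklearn is given below. It is not the project's code: the feature matrix is synthetic placeholder data, and the number of features (13), the samples per speaker, the train/test split ratio, and all variable names are assumptions made for illustration. It does not reproduce the accuracy reported above.

```python
import numpy as np
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

N_SPEAKERS = 8            # from the report
N_FEATURES = 13           # assumed size of one feature vector
SAMPLES_PER_SPEAKER = 40  # assumed

# Synthetic stand-in for extracted audio features: one cluster per speaker.
rng = np.random.default_rng(0)
centers = rng.normal(scale=5.0, size=(N_SPEAKERS, N_FEATURES))
X = np.vstack(
    [c + rng.normal(size=(SAMPLES_PER_SPEAKER, N_FEATURES)) for c in centers]
)
y = np.repeat(np.arange(N_SPEAKERS), SAMPLES_PER_SPEAKER)

# Hold out part of the data to estimate accuracy (split ratio is an assumption).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Linear-kernel support vector machine, as described in the text.
clf = SVC(kernel="linear")
clf.fit(X_train, y_train)
accuracy = accuracy_score(y_test, clf.predict(X_test))

# A new recording would be reduced to one feature vector and passed to predict.
new_sample = centers[3] + rng.normal(size=N_FEATURES)
speaker_id = int(clf.predict(new_sample.reshape(1, -1))[0])

print("test accuracy:", accuracy)
print("predicted speaker id:", speaker_id)
```

In a real run, `X` would hold the features extracted from the segmented audio and `new_sample` the features of the 10-second microphone recording.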