Professional Documents
Culture Documents
Inform
Inform
yamnet
neural network
Carlos Saldana.
2022
CONTENIDO
INTRODUCTION ........................................................................................................................ 3
PROBLEM ................................................................................................................................... 3
SOLUTION .................................................................................................................................. 3
RESULTS ..................................................................................................................................... 3
REFERENCES ............................................................................................................................. 6
INTRODUCTION
YAMNET is a pretrained acoustic detection model trained by Dan Ellis on the AudioSet dataset which
contains labelled data from more than 2 million Youtube videos. It employs the MobileNet_v1 depth-wise-
separable convolution architecture. This pretrained model is readily available in Tensorflow Hub, which
includes TFLite(lite model for mobile) and TF.js(running on the web) versions. [1]
PROBLEM
3. Identify ONLY sounds from the folders (like '\kick', '\snare', '\hihat'
“Ideally, just add lines to my transfer learning example, and delete YAMNets Drum category and
all of Drum's sub categories (like Bass drum)”
SOLUTION
1.The principal problem I ´ve found was the samples taken, I created a file .m where to built new
sound file with more time of duration for each one, only it was made for the 3 folders. But you could
made more folders for more sounds save, the file called, snare, kick, hihat respectively
2.The new file let yamnet work like we are thinking, the tag are well, drum kit 157, snare drum 160
hihat 167 is into a excel file when you download the yamnet folder, and it could be rewrite, but no
es recommendable because it is a midi classification, a midi is a type of file to write sound, and the
classification made is a general midi sound.
3.The yamnet identify only sound from the folder kick, snare, and hihat.
RESULTS
3. Identify ONLY sounds from the folders '\kick', '\snare', '\hihat') train and validation
3. Contend of rar
REFERENCES
[1] M. Rustagi, “Guide to YAMNet : Sound Event Classifier,” Analytics India Magazine, Jun. 08, 2021.
https://analyticsindiamag.com/guide-to-yamnet-sound-event-classifier/ (accessed Nov. 14, 2022).