
Work:

YAMNet neural network

Carlos Saldana

2022
CONTENTS

INTRODUCTION
PROBLEM
SOLUTION
RESULTS
REFERENCES
INTRODUCTION

YAMNet is a pretrained acoustic event detection model trained by Dan Ellis on the AudioSet dataset, which contains labelled data from more than 2 million YouTube videos. It employs the MobileNet_v1 depthwise-separable convolution architecture. The pretrained model is readily available on TensorFlow Hub, which also provides TFLite (a lite model for mobile) and TF.js (for running on the web) versions. [1]
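To see why the MobileNet_v1 architecture keeps YAMNet small, it helps to compare parameter counts for one layer. The following Python sketch (the report's own scripts are MATLAB .m files) uses an illustrative 3x3 layer with assumed channel counts, not YAMNet's actual layer dimensions:

```python
# Rough parameter-count comparison for one convolutional layer, to show why
# MobileNet_v1's depthwise-separable convolutions make YAMNet lightweight.
# Kernel size and channel counts below are illustrative assumptions.

def standard_conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out

def depthwise_separable_params(k, c_in, c_out):
    """Depthwise k x k conv (one filter per input channel)
    followed by a 1x1 pointwise convolution."""
    return k * k * c_in + c_in * c_out

k, c_in, c_out = 3, 64, 128
std = standard_conv_params(k, c_in, c_out)        # 73728 weights
sep = depthwise_separable_params(k, c_in, c_out)  # 8768 weights
print(std, sep, round(std / sep, 1))              # roughly an 8x reduction
```

For this assumed layer shape the separable version needs about 8x fewer weights, which is the main reason the architecture suits mobile (TFLite) deployment.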

PROBLEM

1. Read in an audio signal

2. Call classifySound to return the detected sounds

3. Identify ONLY sounds from the folders (like '\kick', '\snare', '\hihat')

“Ideally, just add lines to my transfer learning example, and delete YAMNet's Drum category and
all of Drum's subcategories (like Bass drum).”
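The filtering in step 3 can be sketched as follows. classifySound is a MATLAB Audio Toolbox function, so this Python version only illustrates the logic; the label strings are assumed to match the three folder names:

```python
# Sketch of step 3: keep only detections whose labels match the three
# training folders. The detected_labels list stands in for the output of
# MATLAB's classifySound, assumed here to be a list of label strings.

TARGET_LABELS = {"kick", "snare", "hihat"}

def keep_target_sounds(detected_labels):
    """Filter detected sound labels down to the three drum classes."""
    return [label for label in detected_labels if label.lower() in TARGET_LABELS]

detections = ["Snare", "Speech", "hihat", "Music", "kick"]
print(keep_target_sounds(detections))  # ['Snare', 'hihat', 'kick']
```

Everything else YAMNet reports (speech, music, and so on) is simply discarded, which matches the request to ignore all categories outside the three folders.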

SOLUTION

1. The main problem I found was with the samples taken. I created a .m file to build a new sound
file with a longer duration for each one; this was done only for the 3 folders, but you could
make more folders to save more sounds. The files are called snare, kick, and hihat respectively.
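A minimal sketch of what those .m files do: extend each short drum hit to a fixed 1-second duration by appending silence. The 16 kHz sample rate (which YAMNet expects) and the plain-list audio representation are assumptions; the real scripts operate on audio files in MATLAB:

```python
# Pad (or truncate) each drum sample to exactly 1 second, mirroring the
# role of the snare/kick/hihat .m scripts described above.

FS = 16000            # assumed sample rate (YAMNet expects 16 kHz mono audio)
TARGET_SAMPLES = FS   # 1 second of audio

def pad_to_one_second(samples):
    """Zero-pad (or truncate) a mono sample list to exactly 1 second."""
    if len(samples) >= TARGET_SAMPLES:
        return samples[:TARGET_SAMPLES]
    return samples + [0.0] * (TARGET_SAMPLES - len(samples))

hit = [0.5] * 4000            # a 0.25 s drum hit at 16 kHz
padded = pad_to_one_second(hit)
print(len(padded) / FS)       # 1.0
```

Running the same padding over every file in each folder produces the uniform-length training set used later in the transfer learning example.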

2. With the new files, YAMNet works the way we intended. The labels are right: drum kit is 157, snare drum 160,
and hi-hat 167 in a spreadsheet included when you download the yamnet folder. It could be rewritten, but that is
not recommended, because it is a MIDI classification: MIDI is a type of file for writing sound, and the
classification used is a General MIDI sound set.
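Looking up those class indices programmatically can be sketched like this. The CSV fragment below is made up but follows the (index, mid, display_name) format of the class-map file that ships with the YAMNet download; the mid values here are placeholders:

```python
# Hedged sketch: find the class indices for the drum labels in a
# YAMNet-style class map. CLASS_MAP_CSV is an invented fragment in the
# format of the class-map spreadsheet mentioned above.
import csv
import io

CLASS_MAP_CSV = """index,mid,display_name
157,/m/xxx,Drum kit
160,/m/yyy,Snare drum
167,/m/zzz,Hi-hat
"""

def class_indices(csv_text, wanted_names):
    """Map display names to their class indices."""
    reader = csv.DictReader(io.StringIO(csv_text))
    wanted = {name.lower() for name in wanted_names}
    return {row["display_name"]: int(row["index"])
            for row in reader if row["display_name"].lower() in wanted}

print(class_indices(CLASS_MAP_CSV, ["Snare drum", "Hi-hat"]))
# {'Snare drum': 160, 'Hi-hat': 167}
```

Reading the indices from the file like this avoids hard-coding them, so the code survives even if the class map is rewritten.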

3. YAMNet identifies only sounds from the kick, snare, and hihat folders.

4. The results are shown in the following screen captures.

RESULTS

1. Read in an audio signal (up to 1 second in duration)


I had to erase some samples (hihat 7, 49, 54, 55, 57, 67, 68, 81, 84 and 88) because they need to be recorded again
with a little silent time after the sound; they can be replaced in the oldsample folder.
If you decide to do it, you have to follow these steps:
1. Record these sounds again, with at least 0.5 seconds of silence
2. Replace the files in the hithat folder of the oldsample folder
3. Copy the hithat folder into drumtrain
4. Run the hihat.m file
5. Replace the files in the folders inside newsample_1second
6. And begin the training again.

2. Identify ONLY sounds from the folders ('\kick', '\snare', '\hihat'), train and validation

3. Contents of the .rar

The exampletransferlearningYAMNEt_cas file is the modified one, and hithat.m, kick.m, snare.m are
the files used to create the new sample folders (up to 1 second in duration).
I give you:
The .m files to create the new sample sounds
The network, Net = DrumNet.mat
The new files with up to 1 second of duration
The exampletransferlearningYAMNEt_cas file. Before running this file, change the file path.
Mel spectrograms generated from audioIn are returned as a 96-by-64-by-1-by-K array, where:

• 96 –– the number of 25 ms frames in each mel spectrogram
• 64 –– the number of mel bands spanning 125 Hz to 7.5 kHz
• K –– the number of mel spectrograms; it depends on the length of audioIn, the number of
channels in audioIn, as well as OverlapPercentage
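The value of K can be sanity-checked numerically. This sketch assumes YAMNet's standard framing (25 ms windows, 10 ms hop, 96 frames per patch) and simple floor-based edge handling, which may differ in detail from the MATLAB implementation:

```python
# Estimate K, the number of 96-by-64 mel spectrograms produced from a
# mono clip, under assumed YAMNet framing constants.

FS = 16000              # assumed sample rate
WIN = int(0.025 * FS)   # 400-sample (25 ms) analysis window
HOP = int(0.010 * FS)   # 160-sample (10 ms) hop
FRAMES_PER_PATCH = 96   # frames in each mel spectrogram

def num_spectrograms(num_samples, overlap_percentage=50.0):
    """Estimate K for a mono signal of num_samples samples."""
    frames = (num_samples - WIN) // HOP + 1
    patch_hop = max(1, round(FRAMES_PER_PATCH * (1 - overlap_percentage / 100)))
    if frames < FRAMES_PER_PATCH:
        return 0
    return (frames - FRAMES_PER_PATCH) // patch_hop + 1

print(num_spectrograms(FS))      # 1: a 1-second clip yields a single patch
print(num_spectrograms(3 * FS))  # 5: a 3-second clip at 50% patch overlap
```

This also explains why the drum samples were padded to 1 second: that length yields exactly one 96-by-64 patch per sample, so each training file maps to one spectrogram.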

REFERENCES
[1] M. Rustagi, “Guide to YAMNet : Sound Event Classifier,” Analytics India Magazine, Jun. 08, 2021.
https://analyticsindiamag.com/guide-to-yamnet-sound-event-classifier/ (accessed Nov. 14, 2022).

