
Preprocessing

1. Collect audio datasets of cough, mucus, and asthma sounds.

2. Convert each audio file to an image by plotting its spectrogram.

A spectrogram represents the frequency content of audio as colors in an image. The frequency content of successive millisecond-scale chunks is strung together as colored vertical bars, so a spectrogram is essentially a two-dimensional graph with a third dimension encoded as color.

Time runs from left (oldest) to right (newest) along the horizontal axis.

The vertical axis represents frequency, with the lowest frequencies at the bottom and the highest frequencies at the top.

The amplitude (or energy, or "loudness") of a particular frequency at a particular time is represented by the third dimension, color: dark blues correspond to low amplitudes, and brighter colors up through red correspond to progressively stronger (louder) amplitudes.


How this will be done:

a) Divide the audio file into millisecond-scale chunks.
b) Compute the Short-Time Fourier Transform (STFT) of each chunk.
c) Plot each chunk as a colored vertical line in the spectrogram.
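The steps above can be sketched in Python with SciPy and Matplotlib. The sample rate, chunk size, and synthetic test signal here are illustrative assumptions, not values from this document:

```python
# Sketch of steps (a)-(c): chunk the audio, compute the STFT of each
# chunk, and plot the magnitudes as a spectrogram image.
# Sample rate and chunk size are illustrative assumptions.
import numpy as np
from scipy.signal import stft
import matplotlib
matplotlib.use("Agg")  # render off-screen so no display is needed
import matplotlib.pyplot as plt

sample_rate = 16000  # Hz (assumed)
duration = 2.0       # seconds of synthetic audio standing in for a real clip
t = np.linspace(0.0, duration, int(sample_rate * duration), endpoint=False)
audio = np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 880 * t)

# nperseg=512 samples at 16 kHz gives 32 ms chunks (step a);
# stft computes the Fourier transform of each chunk (step b).
freqs, times, Zxx = stft(audio, fs=sample_rate, nperseg=512)
magnitude = np.abs(Zxx)  # one column per chunk: the vertical lines of step (c)

# Plot on a decibel scale so quieter frequencies stay visible, then save.
plt.pcolormesh(times, freqs, 20 * np.log10(magnitude + 1e-10), shading="auto")
plt.xlabel("Time (s)")
plt.ylabel("Frequency (Hz)")
plt.savefig("spectrogram.png")
```

For a real dataset, the synthetic signal would be replaced by the samples loaded from each audio file.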

Training

3. Train the CNN on these spectrogram images to classify the audio clips into
asthma, hypothorax, and other diseases. This will be a supervised training
process, since a label is available for each audio clip. The labels will be
stored in a CSV file containing the path of each spectrogram image and its
corresponding label.
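The label file described above can be written and read with Python's standard csv module. The file name and column names below are assumptions for illustration, not part of the document:

```python
# Write and read a labels CSV of the form described above: each row
# holds a spectrogram image path and its disease label.
# The file name and column names are illustrative assumptions.
import csv

rows = [
    ("spectrograms/clip_001.png", "asthma"),
    ("spectrograms/clip_002.png", "cough"),
]

with open("labels.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["image_path", "label"])  # header row
    writer.writerows(rows)

# Read the image paths and labels back for training.
with open("labels.csv", newline="") as f:
    reader = csv.DictReader(f)
    paths, labels = zip(*[(r["image_path"], r["label"]) for r in reader])
```

During training, each path would be used to load the spectrogram image while the label supplies the supervised target.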

The CNN will have the following layers:

2 convolution layers with the same kernel size (to be decided)

1 max pooling layer with a 2×2 pooling size

1 dropout layer

1 flattening layer

2 dense (fully connected) layers at the end

We will use Keras to train the network after the preprocessing step.
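A minimal Keras sketch of the layer stack listed above. The input image size, filter counts, kernel size, dropout rate, and number of classes are placeholder assumptions, since the document leaves these undecided:

```python
# Minimal sketch of the described architecture in Keras.
# Input size, filter counts, kernel size, dropout rate, and the number
# of classes are placeholder assumptions, not values from the document.
from tensorflow import keras
from tensorflow.keras import layers

num_classes = 3  # e.g. asthma, hypothorax, other (assumed)

model = keras.Sequential([
    layers.Input(shape=(128, 128, 1)),             # spectrogram image (assumed size)
    layers.Conv2D(32, (3, 3), activation="relu"),  # convolution layer 1
    layers.Conv2D(32, (3, 3), activation="relu"),  # convolution layer 2, same kernel size
    layers.MaxPooling2D(pool_size=(2, 2)),         # 2×2 max pooling layer
    layers.Dropout(0.25),                          # dropout layer
    layers.Flatten(),                              # flattening layer
    layers.Dense(64, activation="relu"),           # dense layer 1
    layers.Dense(num_classes, activation="softmax"),  # dense layer 2: class scores
])

model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
```

Training would then call `model.fit` on the spectrogram images and their one-hot encoded labels from the CSV file.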

Testing

4. Audio clips recorded using microphones will be used to test the model.
