Professional Documents
Culture Documents
Workflow
Workflow
Spectrograms represent the frequency content in the audio as colors in an image. Frequency content
of milliseconds chunks is stringed together as colored vertical bars. Spectrograms are basically two-
dimensional graphs, with a third dimension represented by colors.
Time runs from left (oldest) to right (youngest) along the horizontal axis.
The vertical axis represents frequency, with the lowest frequencies at the bottom and the
highest frequencies at the top.
Example:
Training
3. Training the CNN on these spectrogram images to classify these audio-file images into
asthma, hypothorax and other diseases. It will be a supervised training process, as labels will
be available for each audio clip, which will be stored in a csv file. The file will contain the
path of each audio spectrogram image, and its corresponding label.
1 Dropout Layer
1 Flattening Layer
We will be using keras to train the network after the preprocessing part.
Testing
4. Audio clips recorded using microphones will be used to test the model.