Welcome to Scribd!

Audio Fingerprint Generation: Andre Mosley Po T Wang John Broadway Yu-Heng Lee

Uploaded by

0% found this document useful (0 votes)

14 views2 pages

This document summarizes an audio fingerprinting method that represents audio signals in a more compact way while retaining distinguishing information. It works by dividing audio into time frames, taking the fast Fourier transform to examine frequency content, dividing the spectrum into 25 critical frequency bands, calculating the energy in each band, and storing the differences in energy between bands to create a fingerprint of 1s and -1s indicating energy fluctuations over time. This fingerprint can then be used to identify and compare audio files.

Original Description:

Original Title

m14232

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

14 views2 pages

Audio Fingerprint Generation: Andre Mosley Po T Wang John Broadway Yu-Heng Lee

Uploaded by

Lionel Marcos Aguilar Olivarez

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 2

Search inside document

Connexions module: m14232

Audio Fingerprint Generation

Andre Mosley Po T Wang John Broadway Yu-Heng Lee

This work is produced by The Connexions Project and licensed under the Creative Commons Attribution License

Abstract Often times, there is a need to represent a long audio signal in a way that it is smaller but yet retains enough information to distinguish it from another audio le. This is useful when implementing a system that compares audio les in order to determine their identity. This module explains how we used a particular audio ngerprinting method in our audio recognition system.

In order to provide a more compact representation of our audio signals, we researched methods of audio ngerprinting. The method we used, which appeared to be very eective, was one that compared the amount of energy in dierent frequency bands. This method takes advantage of the fact that the human ear can only distinguish audio frequencies that are very dierent from one another. There are 25 audible frequency bands, also known as critical bands, that vary in width and range from 0 to 20kHz. The human ear can be modeled as a series of 25 band pass lters. As long as two frequencies fall within the same critical band, they are generally indistinguishable. Therefore rather than keep the entire spectrum for each one of our audio signals we can examine the patterns of energy variations from one critical band to the next, which only requires retaining one value, (the total energy), for each critical band. To implement this system in Matlab, we rst divided the signal into time frames using Hanning windows of about 37 ms. The purpose of using Hanning windows is to prevent extraneous oscillations from occurring in the spectrum. We also overlapped the Hanning windows in order to prevent the amplitude modulating eect of consecutive Hanning window.
Version
1.2: Dec 22, 2006 9:45 am US/Central

http://creativecommons.org/licenses/by/2.0/

http://cnx.org/content/m14232/1.2/

Connexions module: m14232

Figure 1

Figure 1: Audio Fingerprint Generation scheme. Each time window goes through this process. The Fast Fourier Transform (FFT) is taken for each windowed timeframe and the generated spectrum is divided into the 25 critical bands. The energy is calculated for each critical band and energy dierence between two consecutive bands is stored into an array of length 24. This process is done for each timeframe which results in a matrix of 24 rows and one column for each timeframe. Within each row of the matrix, consecutive values are compared. If the value increases from one column index to the next, the value in the prior column is replaced with a 1. Otherwise it becomes a -1. This column to column, or timeframe to timeframe, comparison examines energy uctuations similar to the way we compared energy dierences between consecutive critical bands. This overall scheme results in a matrix of dimensions 24 x (#timeframes -1). This set of 1s and -1s is the audio signal's ngerprint. The only thing left to do once is compare audio ngerprints to nd a match and return song information.

http://cnx.org/content/m14232/1.2/

Speech Recognition Seminar Report
Document32 pages
Speech Recognition Seminar Report
Suraj Gaikwad
86% (96)
Porter's Five Forces Model Cadbury
Document14 pages
Porter's Five Forces Model Cadbury
Yogita Ghag Gaikwad
75% (4)
Simulation of Digital Communication Systems Using Matlab
From Everand
Simulation of Digital Communication Systems Using Matlab
Mathuranathan Viswanathan
Rating: 3.5 out of 5 stars
3.5/5 (22)
VERIZON COMMUNICATIONS Case Study
Document2 pages
VERIZON COMMUNICATIONS Case Study
VIPIN SHARMA
100% (1)
Syllabus Formwork in Construction
Document2 pages
Syllabus Formwork in Construction
Allan Cunningham
No ratings yet
Design of SDR-based HD Video Communication: Prof. Mithun Mondal From BITS Pilani Hyderabad Campus
Document46 pages
Design of SDR-based HD Video Communication: Prof. Mithun Mondal From BITS Pilani Hyderabad Campus
Apurva Singh
No ratings yet
Audio Compression Using Daubechie Wavelet
Document4 pages
Audio Compression Using Daubechie Wavelet
IOSRjournal
No ratings yet
ةىة
Document7 pages
ةىة
Wiam Khettab
No ratings yet
Cdma Tutorial
Document19 pages
Cdma Tutorial
ShankarPrasai
No ratings yet
Study On Noise Reduction
Document14 pages
Study On Noise Reduction
Md. Sharif Uddin Shajib
No ratings yet
Audio Fingerprinting: Combining Computer Vision & Data Stream Processing Shumeet Baluja & Michele Covell Google, Inc. 1600 Amphitheatre Parkway, Mountain View, CA. 94043
Document4 pages
Audio Fingerprinting: Combining Computer Vision & Data Stream Processing Shumeet Baluja & Michele Covell Google, Inc. 1600 Amphitheatre Parkway, Mountain View, CA. 94043
petardsc
No ratings yet
Perceptual Evaluation of Speech Quality (Pesq) - A New Method For Speech Quality Assessment of Telephone Networks and Codecs
Document4 pages
Perceptual Evaluation of Speech Quality (Pesq) - A New Method For Speech Quality Assessment of Telephone Networks and Codecs
Imran Al Noor
No ratings yet
Sree Narayana Mangalam Institute of Management and Technology Maliankara.P.O., North Paravoor Ernakulam District, Pin: 683516
Document6 pages
Sree Narayana Mangalam Institute of Management and Technology Maliankara.P.O., North Paravoor Ernakulam District, Pin: 683516
motturosh
No ratings yet
Music Genre Classification Project Repor
Document19 pages
Music Genre Classification Project Repor
Ananth HN
No ratings yet
LPC, Which Has Mathematically Tractable and Well-Understood Model. This Model Is
Document14 pages
LPC, Which Has Mathematically Tractable and Well-Understood Model. This Model Is
Leonardo Orosco
No ratings yet
BER Analysis of DSSS-CDMA Using MATLAB
Document7 pages
BER Analysis of DSSS-CDMA Using MATLAB
Anonymous lPvvgiQjR
No ratings yet
Performance Analysis of Multicarrier DS-CDMA System Using BPSK Modulation
Document7 pages
Performance Analysis of Multicarrier DS-CDMA System Using BPSK Modulation
iaetsdiaetsd
No ratings yet
1568004850339-TCT - 2 - PDH Principles
Document61 pages
1568004850339-TCT - 2 - PDH Principles
gurjant singh rai
No ratings yet
Application of Microphone Array For Speech Coding in Noisy Environment
Document5 pages
Application of Microphone Array For Speech Coding in Noisy Environment
scribd1235207
No ratings yet
Bhgyui
Document5 pages
Bhgyui
Rup Joshi
No ratings yet
ST Final Report TOMMOROW 4-4-2011 Report
Document57 pages
ST Final Report TOMMOROW 4-4-2011 Report
sureshkumar_scool
No ratings yet
Comparative Analysis Between DWT and WPD Techniques of Speech Compression
Document9 pages
Comparative Analysis Between DWT and WPD Techniques of Speech Compression
IOSRJEN : hard copy, certificates, Call for Papers 2013, publishing of journal
No ratings yet
Radio Frequency Detection System: "Multiuser Detectors": Bouhia Ahmed and Bouhlal Sofia, Mobile Networks, INPT
Document5 pages
Radio Frequency Detection System: "Multiuser Detectors": Bouhia Ahmed and Bouhlal Sofia, Mobile Networks, INPT
AhMed BushiDo
No ratings yet
Sahil Dcs
Document28 pages
Sahil Dcs
Prasun Sinha
No ratings yet
Design and Analysis of A Digital Active Noise Control System For Headphones Implemented in An Arduino Compatible Microcontroller
Document5 pages
Design and Analysis of A Digital Active Noise Control System For Headphones Implemented in An Arduino Compatible Microcontroller
Leul Seyfu
No ratings yet
Lecture 20 Direct Sequence Spread Spectrum
Document13 pages
Lecture 20 Direct Sequence Spread Spectrum
n4vrxcry
No ratings yet
Lin21d Interspeech
Document5 pages
Lin21d Interspeech
Prasad Nizampatnam
No ratings yet
ELEC4123 Lab2 Report Floyd DSouza and Bryan Loh
Document15 pages
ELEC4123 Lab2 Report Floyd DSouza and Bryan Loh
Bryan Loh
No ratings yet
Multiple Xing
Document40 pages
Multiple Xing
Arun Gopinath
No ratings yet
Digital Audio Broadcasting Full Seminar Report
Document23 pages
Digital Audio Broadcasting Full Seminar Report
bindug1
67% (3)
Comparative Analysis of Advanced Thresholding Methods For Speech-Signal Denoising
Document5 pages
Comparative Analysis of Advanced Thresholding Methods For Speech-Signal Denoising
Purushotham Prasad K
No ratings yet
Digital Signal Processing System of Ultrasonic Sig
Document7 pages
Digital Signal Processing System of Ultrasonic Sig
keltoma.bouta
No ratings yet
59-65-Speech Signal Enhancement Using Wavelet Threshold Methods
Document7 pages
59-65-Speech Signal Enhancement Using Wavelet Threshold Methods
DeepakIJ
No ratings yet
Acoustic Trap Display
Document4 pages
Acoustic Trap Display
Krishnaprasad
No ratings yet
Fast Fourier Transform of Frequency Hopping Spread Spectrum in Noisy Environment
Document5 pages
Fast Fourier Transform of Frequency Hopping Spread Spectrum in Noisy Environment
Naveed Ramzan
No ratings yet
TransDetector: A Transformer-Based Detector For Underwater Acoustic Differential OFDM Communications
Document28 pages
TransDetector: A Transformer-Based Detector For Underwater Acoustic Differential OFDM Communications
Anderson Rufino
No ratings yet
A Systematic Approach To Peak-to-Average Power Ratio in OFDM
Document11 pages
A Systematic Approach To Peak-to-Average Power Ratio in OFDM
Waled Fouad Gaber
No ratings yet
Report On Cdma Tech
Document47 pages
Report On Cdma Tech
Subhalaxmi Nayak
No ratings yet
Speech Recognition Using AANN
Document4 pages
Speech Recognition Using AANN
test
No ratings yet
PC 2 PC Laser Communication System
Document3 pages
PC 2 PC Laser Communication System
Mamun Limon
No ratings yet
Anti Ali A
Document17 pages
Anti Ali A
saske969
No ratings yet
Micka El de Meuleneire, Herv e Taddei, Olivier de Z Elicourt, Dominique Pastor, Peter Jax
Document4 pages
Micka El de Meuleneire, Herv e Taddei, Olivier de Z Elicourt, Dominique Pastor, Peter Jax
pavithramasi
No ratings yet
Spectral Energy Based Voice Activity Detection For Real-Time Voice Interface
Document17 pages
Spectral Energy Based Voice Activity Detection For Real-Time Voice Interface
Harshal Patil
No ratings yet
HIGH PRECISION FM DSP
Document20 pages
HIGH PRECISION FM DSP
Tamanatao Hadji Ali
No ratings yet
Comparisons of Adaptive Median Filter Based On Homogeneity Level Information and The New Generation Filters
Document5 pages
Comparisons of Adaptive Median Filter Based On Homogeneity Level Information and The New Generation Filters
International Organization of Scientific Research (IOSR)
No ratings yet
Laboratory6 ELECTIVE2
Document5 pages
Laboratory6 ELECTIVE2
Ed Lozada
No ratings yet
Laser Based Voice and Data Communication
Document41 pages
Laser Based Voice and Data Communication
anshu_ranjan1
80% (5)
Lec 17
Document28 pages
Lec 17
david lyod
No ratings yet
Art:10.1186/1687 1499 2013 159
Document11 pages
Art:10.1186/1687 1499 2013 159
radhakodirekka8732
No ratings yet
Digital Signal Processing "Speech Recognition": Paper Presentation On
Document12 pages
Digital Signal Processing "Speech Recognition": Paper Presentation On
Siri Sreeja
No ratings yet
Ultrasonic Signal De-Noising Using Dual Filtering Algorithm
Document8 pages
Ultrasonic Signal De-Noising Using Dual Filtering Algorithm
vhito619
No ratings yet
Title: Comparison of Short-Time Fourier Transform and Wavelet Transform
Document9 pages
Title: Comparison of Short-Time Fourier Transform and Wavelet Transform
Shaza Hadi
No ratings yet
A Novel Technique For Advanced Ultrasonic Testing of Concrete by Using Signal Conditioning Methods and A Scanning Laser Vibrometer
Document5 pages
A Novel Technique For Advanced Ultrasonic Testing of Concrete by Using Signal Conditioning Methods and A Scanning Laser Vibrometer
srgoku
No ratings yet
Analysis and Synthesis of Speech Using Matlab
Document10 pages
Analysis and Synthesis of Speech Using Matlab
ps1188
No ratings yet
MM Ass2
Document6 pages
MM Ass2
Waguma Leticia
No ratings yet
PCM & Mux
Document52 pages
PCM & Mux
ABC
No ratings yet
Ma Kale
Document3 pages
Ma Kale
salihyucael
No ratings yet
Design of Low-Cost Noise Measurement Sensor Network Sensor Function Design PDF
Document8 pages
Design of Low-Cost Noise Measurement Sensor Network Sensor Function Design PDF
Frew Frew
No ratings yet
A Comparative Study of Different Entropies For Spectrum Sensing Techniques
Document15 pages
A Comparative Study of Different Entropies For Spectrum Sensing Techniques
suchi87
No ratings yet
BB 06 SpreadSpectrum PDF
Document59 pages
BB 06 SpreadSpectrum PDF
jm
No ratings yet
Frame Based Adaptive Compressed Sensing of Speech Signal
Document60 pages
Frame Based Adaptive Compressed Sensing of Speech Signal
Ganesh Garigapati
No ratings yet
Led Spectrum Analyzer: Conference Paper
Document8 pages
Led Spectrum Analyzer: Conference Paper
Niltoncientista
No ratings yet
Error-Correction on Non-Standard Communication Channels
From Everand
Error-Correction on Non-Standard Communication Channels
Edward A. Ratzer
No ratings yet
Software Radio: Sampling Rate Selection, Design and Synchronization
From Everand
Software Radio: Sampling Rate Selection, Design and Synchronization
Elettra Venosa
No ratings yet
Getting Started Guide HP 1910 Gigabit Ethernet Switch Series
Document52 pages
Getting Started Guide HP 1910 Gigabit Ethernet Switch Series
cakspam
No ratings yet
TX/RX Ant (TX/RXA) : Internal Use
Document3 pages
TX/RX Ant (TX/RXA) : Internal Use
Lionel Marcos Aguilar Olivarez
No ratings yet
098-00106-000-Rev A
Document124 pages
098-00106-000-Rev A
Lionel Marcos Aguilar Olivarez
No ratings yet
2G & 3G - Flexi Multi-Radio BTS-MoP-Comms and Integ - Concurrent Mode-Cell C - ZA - V1.3
Document61 pages
2G & 3G - Flexi Multi-Radio BTS-MoP-Comms and Integ - Concurrent Mode-Cell C - ZA - V1.3
Lionel Marcos Aguilar Olivarez
86% (7)
098-00106-000-Rev A
Document124 pages
098-00106-000-Rev A
Lionel Marcos Aguilar Olivarez
No ratings yet
2G & 3G - Flexi Multi-Radio BTS-MoP-Comms and Integ - Concurrent Mode-Cell C - ZA - V1.3
Document61 pages
2G & 3G - Flexi Multi-Radio BTS-MoP-Comms and Integ - Concurrent Mode-Cell C - ZA - V1.3
Lionel Marcos Aguilar Olivarez
86% (7)
Antenas
Document2 pages
Antenas
Lionel Marcos Aguilar Olivarez
No ratings yet
Group3 Safelite
Document11 pages
Group3 Safelite
Veeranjaneyulu Kacherla
No ratings yet
Algebra Test No. 5
Document1 page
Algebra Test No. 5
AMIN BUHARI ABDUL KHADER
No ratings yet
DLL Q2 Use of Search Engine
Document3 pages
DLL Q2 Use of Search Engine
ariane galeno
100% (1)
City of Waco Landfill, Parts I-II General Application Requirement (Admin Complete 09-14-18)
Document264 pages
City of Waco Landfill, Parts I-II General Application Requirement (Admin Complete 09-14-18)
KCEN Channel 6
No ratings yet
Audition Form Eng
Document2 pages
Audition Form Eng
Jessie Isabel
No ratings yet
Spirulina Growing System
Document2 pages
Spirulina Growing System
kingofcool192122
No ratings yet
Fineotex Chemical Ltd.
Document239 pages
Fineotex Chemical Ltd.
Ganesh
No ratings yet
Physics Practical CL XII
Document11 pages
Physics Practical CL XII
Keshav Kundan
No ratings yet
CHAPTER 1 - Quantity-Surveying-Introduction
Document8 pages
CHAPTER 1 - Quantity-Surveying-Introduction
Ahmed Amedi
No ratings yet
Applying GAMP 5 To Validate An ERP System
Document8 pages
Applying GAMP 5 To Validate An ERP System
Tahir Zia
No ratings yet
Making Strategies Work NOTES at MBA
Document25 pages
Making Strategies Work NOTES at MBA
Babasab Patil (Karrisatte)
No ratings yet
Dislocations and Plastic Deformation
Document6 pages
Dislocations and Plastic Deformation
Prashanth Vantimitta
No ratings yet
How Are The Working Hours at TTEC
Document1 page
How Are The Working Hours at TTEC
Teletech
No ratings yet
Medidores Rotativos Série FMR
Document3 pages
Medidores Rotativos Série FMR
mateu
No ratings yet
Chapter - III Financial System and Non-Banking Financial Companies - The Structure and Status Profile
Document55 pages
Chapter - III Financial System and Non-Banking Financial Companies - The Structure and Status Profile
chirag10pn
No ratings yet
Tokyo Keiso - FS-100 Datasheet
Document8 pages
Tokyo Keiso - FS-100 Datasheet
Aizat Romaino
No ratings yet
FirmUpSWL3300 AP
Document3 pages
FirmUpSWL3300 AP
Draon Linux Linux
No ratings yet
Department of First Year Engineering Programming and Problem Solving
Document20 pages
Department of First Year Engineering Programming and Problem Solving
sameer
No ratings yet
Engine Valve Lash Inspect/Adjust: Pantalla Anterior
Document6 pages
Engine Valve Lash Inspect/Adjust: Pantalla Anterior
Edison Pfoccori Barrionuevo
No ratings yet
TLC 2543
Document40 pages
TLC 2543
Hoii Clark
No ratings yet
Naplan 2013 Final Test Numeracy Year 9 (Calculator)
Document13 pages
Naplan 2013 Final Test Numeracy Year 9 (Calculator)
Mohammad Ali
No ratings yet
Maximising Naphtha Through Hydrocracking Refinery of The Future
Document25 pages
Maximising Naphtha Through Hydrocracking Refinery of The Future
Mustafa Ahsan
No ratings yet
Cap-Max-System-Server Ds Ap 0516 PDF
Document2 pages
Cap-Max-System-Server Ds Ap 0516 PDF
Bao Quoc Mai
No ratings yet
Comparative Study of Hindustan and Dainikjagran Hindi Daily Newspaper
Document67 pages
Comparative Study of Hindustan and Dainikjagran Hindi Daily Newspaper
Aditya Parmar
No ratings yet
Shreya Singhal Vs Union of India
Document3 pages
Shreya Singhal Vs Union of India
Shivang Sharma
No ratings yet
Wet and Dry Contacts
Document7 pages
Wet and Dry Contacts
Khairul Ashraf
No ratings yet