Professional Documents
Culture Documents
Formant Analys
Formant Analys
Abstract
Analyze an input speech sample and return the vowels that are present through formant analysis.
1 Abstract
In the past few years, Voice Recognition software applied to word processing and other computer tasks has received much media hype. One of the methods employed by this software is the use of formants, harmonics found in vowels due to resonance in the vocal tract, that are inherent in speech. To limit the amount of variables, only ve fundamental vowel sounds in a group member's speech were recorded and analyzed to serve as a database for vowel detection. The frequency values of the formants and their magnitudes were logged. Simple words with these fundamental vowel sounds were recorded and analyzed to detect the vowels. The rst detection method employed was a Cartesian distance method. The distance between an ideal vowel's formants and the possible vowel's formants in the word were calculated. The shortest distance, which was also below a dened threshold, signied the vowel our input contained. When this proved inaccurate with certain input vowels, a second and ultimately more eective method was implemented. This method employed systematically eliminating the vowels that the input vowel could not be. This was done by comparing the rst and second formants with those in the database. With this method, it is possible that an input vowel could be detected as two or more vowels. However, with proper threshold leveling, we found this could be largely circumvented.
1.1 Motivation
Our motivation to work on this project was to gain insight on the nature of speech recognition systems. We now realize how dicult it is to build such a system reliably.
Version
http://creativecommons.org/licenses/by/1.0
http://cnx.org/content/m11731/1.4/