Professional Documents
Culture Documents
Sample Rate Conversion
Sample Rate Conversion
The top graphics panel is used to display a waveform plot of the original speech waveform or its long time spectrum,
and the bottom graphics panel is used to display the sampling rate converted waveform or its long time spectrum. The
title box displays information about the file that is sampling rate converted, and the functionality of the 13 buttons is:
of the processing is not the signal itself but a parametric representation of the speech signal which can be well represented at the converted rate.
2
12. a pushbutton to save the converted speech in the file designated by an earlier button, in a user designated direc-
tory,
2. hit the pushbutton labeled ’Speech Directory’and click ’OK’ on the pre-selected directory ’speech files’,
3. select the speech file ’test 16k.wav’ from the list of files in the popupmenu button,
5. hit the ’Plot Original Signal Waveform’ button to plot the original speech signal in the top graphics panel
(displacing any previous plot),
6. hit the ’Plot Original Signal Long Time Spectrum’ button to plot the original signal long time spectrum in the
top graphics panel (displacing any previous plot),
7. using the defaults for the sampling rate converted speech filename (’Output File Conv’), and for the sampling
rate of the converted file (10,000 samples/second), hit the ’Convert Sampling Rate’ button to convert the signal
to the new sampling rate,
8. hit the ’Play Converted Speech Signal’ button to play the converted speech signal,
9. hit the ’Plot Converted Speech Signal’ button to plot the converted speech signal in the bottom graphics panel
(again displacing any existing plot),
10. hit the ’Plot Converted Speech Long Time Spectrum’ button to plot the long time log magnitude spectrum of
the converted speech in the lower graphics panel (again displacing any previous plot),
11. hit the ’Save Converted Speech in File’ button to save the converted speech file in a user-selected directory,
using the filename in the designated button above,
12. experiment with different speech files, and different sampling rate conversion parameter settings to see the effect
of changes in the sampling rate.
The resulting graphical display for converting the speech file ’test 16k.wav’ (originally having a sampling rate of
16,000 samples/second) to a converted file with a sampling rate of 10,000 samples/second, is shown in Figure 1.
3
Figure 1: Example of graphical output of program ’SRC GUI25.m’ for the speech file ’test 16k.wav’
• can you hear the difference between the original speech file (with a sampling rate of fs = 16, 000 samples
per second), and the sampling rate converted signal (with a sampling rate of fs = 10, 000 samples per
second)?
• can you see the difference between the original speech file waveform display and the waveform display of
the converted signal?
• can you see the difference between the original speech file long time spectrum and the long time spectrum
of the converted signal?
• what sounds of this speech utterance are most affected by the sampling rate conversion process? Can you
see the differences in the waveform plots?
2. change the sampling rate of the converted file from fs = 16, 000 samples per second to fs = 8000 samples per
second; what effect does the reduced sampling rate have on the speech signal?
3. change the sampling rate of the converted file from fs = 16, 000 samples per second to fs = 6000 samples per
second; what effect does the reduced sampling rate have on the speech signal naturalness and intelligibility?
4. change the sampling rate of the converted file from fs = 16, 000 samples per second to fs = 20, 000 samples
per second; what effect does this up-sampling have on the speech signal intelligibility or naturalness?