Arabic Phonetic Database
In experiment 1, a color video camera (Sony Model: DXC-637P) was used for the side view and another one (Sony Model: BVP-7P) was used for the front view. The recording was made by a video cassette recorder (Model: BVW-75P BETACAM SP). In experiments 2, 3 and 4, a Rhino-Laryngeal stroboscope (Model: RLS 9100) was used (Figure 1). In experiment 5, an aerophone (Model: Aerophone-II 6800) was used to collect and record the data (Figure 2). In experiment 6, a Palatometer (Model: 6300) was used (Figure 3). In experiment 7, an ElectroGlottoGraph was used. In experiment 8, a Nasometer (Model: 6200-3) was used. In all the above experiments, CSL (Computerized Speech Laboratory, Model: 4300B) was used to record the acoustic signal of the subjects' utterances. Most of the equipment used in these experiments carries the name of Kaytelemetrics. However, some of it was modified to cover the needs of the experiments and the Arabic sounds (for example, Figure 3). In experiment 9, a computer program was written to give the subjects the choice of selecting the target sound that they had heard, its position in the token and its adjacent vowel.

The results of experiment 1 are under directory C in KAPD. They include facial images of the 7 subjects during the production of Arabic sounds (Figure 4), NSP files with tags showing the target sound onset and offset, and WAV files of the same tokens. The NSP files contain the acoustic data plus other data such as linguapalatal contact and EGG. NSP files can be opened by a CSL or Multi-Speech program, both produced by Kaytelemetrics. WAV files can be opened by many programs including Windows and CSL. The results of experiment 2 are under directory V. They include images of the velopharyngeal port (Figure 5), NSP and WAV files. The results of experiment 3 are under directory E. They include images of the pharyngeal cavity (Figure 5), NSP and WAV files. The results of experiment 4 are under directory X. They include images of the glottis (Figure 5), NSP and WAV files. The results of experiment 5 are under directory A. They include NSP and WAV files. The NSP files have the aerodynamic data (Figure 6). The results of experiment 6 are under directory P. They include NSP (Figure 7) and WAV files. The results of experiment 7 are under directory G. They include NSP (Figure 8) and WAV files. The results of experiment 8 are under directory N. They include NSP (Figure 9) and WAV files. The results of experiment 9 are in the file DB1. They are the responses of the 10 informants who listened to the tokens produced by the subjects in the first 8 experiments and made their judgments on the target sounds they had heard.
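
The directory layout described above could be navigated programmatically. The following Python sketch is illustrative only: the directory letters come from the text, but the root path, the helper names wav_files and wav_duration_seconds, and the assumption that the WAV files are standard PCM files readable by Python's wave module are not part of the database description.

```python
from pathlib import Path
import wave

# Directory letters as listed in the text; experiment 9 is stored
# in the file DB1 rather than in a directory, so it is not mapped here.
EXPERIMENT_DIRS = {
    1: "C",  # facial images, NSP and WAV files
    2: "V",  # velopharyngeal port images, NSP and WAV files
    3: "E",  # pharyngeal cavity images, NSP and WAV files
    4: "X",  # glottis images, NSP and WAV files
    5: "A",  # aerodynamic data (NSP) and WAV files
    6: "P",  # NSP and WAV files (Palatometer experiment)
    7: "G",  # NSP and WAV files (ElectroGlottoGraph experiment)
    8: "N",  # NSP and WAV files (Nasometer experiment)
}


def wav_files(root, experiment):
    """Return the sorted WAV paths found under one experiment's directory.

    `root` is a hypothetical path to a local copy of the database.
    """
    folder = Path(root) / EXPERIMENT_DIRS[experiment]
    return sorted(p for p in folder.rglob("*") if p.suffix.lower() == ".wav")


def wav_duration_seconds(path):
    """Duration of a PCM WAV file, computed from its header fields."""
    with wave.open(str(path), "rb") as w:
        return w.getnframes() / w.getframerate()
```

For example, wav_files("KAPD", 6) would list the WAV tokens recorded in experiment 6, assuming the database has been copied to a local folder named KAPD. The NSP files are not handled here, since the text states only that they are opened with CSL or Multi-Speech.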
4. CONCLUSIONS
This paper introduces the reader to a database on Arabic
sounds produced by Saudi subjects. The database includes
articulatory, acoustic and perceptual information. The
articulatory data consist of images of the face,
velopharyngeal port, pharyngeal cavity and glottis. They
also include linguapalatal contacts and aerodynamic data.
The acoustic data consist of the audio recording of the
tokens made by the subjects in all experiments. The
perceptual data show the responses of 10 informants to the
Arabic sounds in the acoustic data.
Figure 7. Window 1 shows the waveform of /zaXXaz/, window 2 shows the linguapalatal contact during the production of the target sound /X/ (experiment 6).

KAPD can be utilized in various areas. To mention some, in speech-to-text experiments, the acoustic part of the database can be used to train and test the system, since it contains recordings of 7 native Arabic speakers producing Arabic sounds 8 times in different token positions. KAPD can be used in text-to-speech systems, especially when using the concatenation method. It can also be used in vocal tract modeling and facial animation during speech. It would be of value for researchers in phonetics, speech therapy and forensic phonetics.
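
As an illustration of the train-and-test use mentioned above, the sketch below holds out whole speakers for testing. This is only one possible partitioning scheme and is not prescribed by the database. The speaker labels (S1 to S7), the repetition labels and the function name split_by_speaker are hypothetical.

```python
def split_by_speaker(tokens, test_speakers):
    """Partition (speaker, item) pairs so that no speaker is in both sets."""
    held_out = set(test_speakers)
    train = [t for t in tokens if t[0] not in held_out]
    test = [t for t in tokens if t[0] in held_out]
    return train, test


# Hypothetical token list: 7 speakers x 8 repetitions of one target sound.
tokens = [(f"S{s}", f"rep{r}") for s in range(1, 8) for r in range(1, 9)]

# Hold out one speaker for testing; train on the remaining six.
train, test = split_by_speaker(tokens, ["S7"])
```

Holding out speakers rather than individual repetitions keeps the test set free of voices seen in training, which is a common choice when speaker-independent performance is of interest.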
5. ACKNOWLEDGEMENT
This paper and the database project are sponsored by King Abdulaziz City for Science and Technology. The database team consists of: Mansour Alghamdi, Ashraf Alkhairi, Abdullah Azzamil, Muhammad Alfarraj, Muhammad Aljabr, Muhammad Basalamah, Sayed Hussain, Surraya Addawsari, and Waqayyan Addawsari.

Figure 8. Window 1 shows the waveform of /zakkaz/, window 2 shows the laryngeal activities and window 3 magnifies a portion of window 2 (experiment 7).