Homma 1981
1. Introduction
Japanese is a mora-counting language: the phonological length of an utterance depends on
the number of moras. For example, kan ("a can") and kana ("syllabary") are both two-mora
words and linguistically have the same duration. The ratio of the duration of gaka
("painter") to that of gakka ("lesson") corresponds to the ratio of their mora counts,
namely 2 : 3, because the first part of the geminated stop /kk/ is counted as one mora.
An examination of the durational relationship between Japanese stops and vowels in several
contexts, measuring closure duration, voice onset time (VOT) and vowel duration, might
reveal acoustic evidence for the linguistic uses of segmental duration in Japanese. This
paper will contrast the results of experiments in Japanese, a mora-counting language, with
those in English, a stress-counting language.
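The mora counts in the examples above can be sketched in a few lines. This is only an illustration under a simplified romanization (one mora per vowel, per moraic nasal, and per first half of a geminate); the function name `count_moras` and its rules are assumptions for this sketch, not part of the study.

```python
VOWELS = set("aeiou")


def count_moras(word: str) -> int:
    """Count moras in a simplified romanized Japanese word (illustrative rules only)."""
    moras = 0
    for i, ch in enumerate(word):
        nxt = word[i + 1] if i + 1 < len(word) else ""
        if ch in VOWELS:
            moras += 1  # every vowel carries one mora
        elif ch == "n" and nxt not in VOWELS and nxt != "y":
            moras += 1  # moraic nasal, as in kan
        elif ch == nxt:
            moras += 1  # first half of a geminated stop, as in gakka
    return moras


for w in ("kan", "kana", "gaka", "gakka"):
    print(w, count_moras(w))
```

Under these rules gaka and gakka come out as 2 and 3 moras, which is the 2 : 3 ratio discussed above.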
Many investigators have conducted experiments on closure duration, VOT, and vowel
duration in English. They have found that voiceless stops have greater closure duration
than voiced stops. According to Lisker (1957), the average closure duration for /p/ was
about 120 ms and the average value for /b/ was 75 ms in trochee words. Most investigators
agree that, when other factors are constant, closure duration for labials is longer than
for alveolars and velars (Lehiste, 1970).
VOT increases as the place of closure moves toward the back (Lisker & Abramson, 1964).
VOT is longer in stressed syllables than in unstressed syllables (Lisker & Abramson, 1967).
VOT is closely related to the quality of the following vowel (Klatt, 1975; Port & Rotunno,
1979).
Vowel duration in English varies greatly according to environment. Vowels are
longest before pauses. Stressed vowels are longer than unstressed ones (Klatt, 1976). Vowels
before voiced consonants are longer than vowels before voiceless consonants (House &
Fairbanks, 1953; Denes, 1955; House, 1961; Peterson & Lehiste, 1960). The number of
syllables (Klatt, 1973; Harris & Umeda, 1974) and speaking rate (Lehiste, 1970; Port, 1976)
are also important factors which influence vowel duration.
In Japanese, some experiments on closure duration and vowel duration have been reported.
In closure duration, voiceless stops are longer than voiced ones (Han, 1962; Okada, 1969,
1971); /p/ is longer than /t/ and /k/ (Han, 1962). Vowel duration is longer in voiced
environments (Okada, 1969; Homma, 1973). Pitch accent does not have a significant influence
on vowel duration (Homma, 1973). Temporal compensation works within a word (Homma,
1973; Maeda, 1979). VOT in Japanese, however, has been given the least attention (Homma,
1980).
The specific objectives of this paper are (1) to confirm the results of the previous experi-
ments in Japanese; (2) to discuss VOT; and (3) to study the durational relationship between
stops and vowels.
2. Experiment
The purpose of this experiment was to make acoustic measurements of closure duration,
VOT, and vowel duration in two- and three-mora words with single and geminated stops.
2.1 Methods
Twenty-four real and nonsense test words were prepared. The words contained the vowel /a/
and the voiceless and voiced stops /p, t, k, b, d, g/ in initial and in medial position.
Test words
        (A) two moras    (B) three moras
/p/     papa             pappa
        paba             pabba
/b/     bapa             bappa
        baba             babba
/t/     tata             tatta
        tada             tadda
/d/     data             datta
        dada             dadda
/k/     kaka             kakka
        kaga             kagga
/g/     gaka             gakka
        gaga             gagga
These words were placed in the following sentence frame.
Kore wa ___ desu. (This is ___.)
The sentences with the test words were randomly arranged on a list. The list was read
three times by four Japanese speakers, three males and one female, at a natural speed with
the accent on the first syllable. A total of 288 sentences were recorded, and wide-band
spectrograms were made of each utterance with a Kay-Sonagraph (6061B) in the Phonetic
Laboratory of Indiana University.
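The word list and the sentence count follow from the design: three places of articulation, voiceless or voiced stop in each of the two positions, and single versus geminated medial stop. A minimal sketch of that enumeration is given here; the names `PLACES` and `build_test_words` are illustrative assumptions, not taken from the paper.

```python
from itertools import product

# Voiceless/voiced stop pair at each place of articulation.
PLACES = [("p", "b"), ("t", "d"), ("k", "g")]


def build_test_words():
    """Enumerate the C1-a-C2-a test words with single and geminated medial stops."""
    words = []
    for pair in PLACES:
        for c1, c2 in product(pair, repeat=2):
            words.append(c1 + "a" + c2 + "a")       # two moras, e.g. papa
            words.append(c1 + "a" + c2 + c2 + "a")  # three moras, e.g. pappa
    return words


words = build_test_words()
REPETITIONS, SPEAKERS = 3, 4
total_sentences = len(words) * REPETITIONS * SPEAKERS
print(len(words), total_sentences)
```

The enumeration gives 24 words, and 24 words read three times by four speakers gives the 288 recorded sentences.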
2.2.1. Closure duration. The closure duration of the medial stop was measured between the
termination of the first vowel's formant transition and the burst of the medial stop.
2.2.2. Voice onset time. The release of a plosive consonant, especially a voiceless stop,
involves three phases: explosion, frication, and aspiration. Frication and aspiration
together make up voice onset time (VOT) (Lisker & Abramson, 1964). VOT is one of the most
important cues separating voiced and voiceless stops. Klatt (1975) measured frication and
aspiration separately, but in this experiment the whole VOT was measured by marking off the
interval between the release of the stop and the onset of glottal vibration for voice. The
onset of glottal vibration appears on the spectrogram as the beginning of the regularly
spaced vertical striations.
2.2.3. Vowel duration. The duration of the first and second vowels was measured from the
onset of glottal vibration to the closure for the following stop shown by the abrupt cessation
of energy in all the formants.
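The three measures described above are all differences between landmarks read off the spectrogram. The sketch below makes that explicit for a C1-V1-C2-V2 test word; the `Landmarks` fields and the example times are hypothetical and chosen only for illustration, not measured values from this experiment.

```python
from dataclasses import dataclass


@dataclass
class Landmarks:
    """Hypothetical landmark times (ms) for one C1-V1-C2-V2 test word."""
    release1: int  # burst of the initial stop
    voicing1: int  # onset of glottal vibration for the first vowel
    closure2: int  # end of the first vowel (closure of the medial stop)
    burst2: int    # burst of the medial stop
    voicing2: int  # onset of glottal vibration for the second vowel
    end2: int      # closure for the stop following the second vowel


def segment_durations(m: Landmarks) -> dict:
    """Derive VOT, vowel and closure durations as landmark differences."""
    return {
        "initial_vot": m.voicing1 - m.release1,
        "vowel1": m.closure2 - m.voicing1,
        "closure": m.burst2 - m.closure2,
        "medial_vot": m.voicing2 - m.burst2,
        "vowel2": m.end2 - m.voicing2,
    }


example = Landmarks(release1=0, voicing1=20, closure2=90,
                    burst2=170, voicing2=180, end2=270)
print(segment_durations(example))
```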
Table I Average closure duration of the medial stops for each speaker in
milliseconds
        S1     S2     S3     S4     Mean
/p/     90     64     72     80     77
/b/     50     59     56     55     55
/t/     87     56     42     62     62
/d/     37     41     22     39     35
/k/     75     56     51     62     61
/g/     33     49     41     ?      41
/pp/    184    180    199    170    183
/bb/    144    161    181    148    159
/tt/    190    178    154    159    170
/dd/    130    148    156    141    144
/kk/    169    179    174    176    175
/gg/    109    124    151    151    134
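The means in Table I can be recomputed from the per-speaker values, and the same means yield the two kinds of ratio discussed for closure duration (voiced relative to voiceless, and geminated relative to single). The following is a sketch only: the helper names are assumptions, rounding half up is assumed because it reproduces the printed means, and the illegible S4 value for /g/ is left as missing rather than guessed.

```python
import math

# Table I: per-speaker closure durations (ms) and the reported mean.
# None marks the value that is illegible in the source table.
TABLE_I = {
    "p": ([90, 64, 72, 80], 77),
    "b": ([50, 59, 56, 55], 55),
    "t": ([87, 56, 42, 62], 62),
    "d": ([37, 41, 22, 39], 35),
    "k": ([75, 56, 51, 62], 61),
    "g": ([33, 49, 41, None], 41),
    "pp": ([184, 180, 199, 170], 183),
    "bb": ([144, 161, 181, 148], 159),
    "tt": ([190, 178, 154, 159], 170),
    "dd": ([130, 148, 156, 141], 144),
    "kk": ([169, 179, 174, 176], 175),
    "gg": ([109, 124, 151, 151], 134),
}


def round_half_up(x: float) -> int:
    """Round to the nearest integer, with .5 going up (assumed table convention)."""
    return math.floor(x + 0.5)


def recomputed_mean(values):
    """Mean over speakers, or None when a value is missing."""
    if None in values:
        return None
    return round_half_up(sum(values) / len(values))


def voiced_percent(voiceless: str, voiced: str) -> float:
    """Voiced closure duration as a percentage of the voiceless one."""
    return 100 * TABLE_I[voiced][1] / TABLE_I[voiceless][1]


def geminate_ratio(stop: str) -> float:
    """Geminated closure duration divided by the single closure duration."""
    return TABLE_I[stop * 2][1] / TABLE_I[stop][1]


for stop, (values, reported) in TABLE_I.items():
    print(stop, recomputed_mean(values), reported)
for voiceless, voiced in [("p", "b"), ("t", "d"), ("k", "g")]:
    print(voiceless, voiced, round(voiced_percent(voiceless, voiced)))
for stop in "pbtdkg":
    print(stop, round(geminate_ratio(stop), 2))
```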
Table II (A) Average closure duration (ms); (B) closure duration ratio between voiced and
voiceless stops (%); (C) closure duration ratio between single and geminated stops (%).
Each part has one column for single and one for geminated stops.
stops but also for voiced ones in geminated stops, but seldom in single stops, a phenomenon
we will discuss later.
The voicing effects on closure duration and the ratio between single and geminated stop
closure by and large agreed with Han's measurements (1962). She reported that "/p, t, k, c, s/
show approximately 20 to 40% increase in duration over their corresponding voiced ones"
and that "the duration of short and long consonants is, on the average, in the ratio of 1.0
to 2.6 and often 1.0 to 3.0."
Table III Voice onset time of the initial and medial stops for each speaker
in milliseconds
(A) VOT of the initial stops              (B) VOT of the medial stops
         S1   S2   S3   S4   Mean                 S1   S2   S3   S4   Mean
/p-p/    18   34   14   29   24           /p/      3    7    3   13    7
/p-b/    24   43   19   31   29           /t/      8   16   18   23   16
/t-t/    26   42   26   34   32           /k/     22   26   21   25   24
/t-d/    16   53   24   33   32           /pp/     0   16    7   20   11
/k-k/    48   51   35   46   45           /tt/     1   14   10   28   13
/k-g/    63   62   57   60   61           /kk/    19   37   20   36   28
/g-k/    14   20   10   12   14           /bb/     0    4    1    2    2
/g-g/     8   19   17   13   14           /dd/     0    7   11   12    8
                                          /gg/    25   25   14   23   22
indicates that English voiced stops are not truly voiced in initial position. Although
comparing VOT in English and Japanese is not a simple task, it may be safe to say that
English has longer VOT than Japanese (Homma, 1980).
Lisker & Abramson (1967) found that although the effects of stress on VOT were rather
limited, in English "stress and VOT are not strictly independent of one another." They
reported that the difference between mean values for /p, t, k/ in isolated words as against
sentences was about 25 ms, and that the difference between stressed and unstressed /p, t, k/
was about 6 ms in sentences. In Japanese, accent and VOT were clearly related. The
difference between accented and unaccented /p, t, k/ was about 21 ms.
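The size of the accent effect can be approximated from the mean columns of Table III. The sketch below assumes that "accented" corresponds to the word-initial stops (the accent was on the first syllable) and "unaccented" to the medial stops; the text does not state whether the geminated medial stops were included in the average, so both variants are computed.

```python
# Mean VOT (ms) from Table III for voiceless stops.
INITIAL_VOT = {"p-p": 24, "p-b": 29, "t-t": 32, "t-d": 32, "k-k": 45, "k-g": 61}
MEDIAL_SINGLE_VOT = {"p": 7, "t": 16, "k": 24}
MEDIAL_GEMINATE_VOT = {"pp": 11, "tt": 13, "kk": 28}


def mean(values):
    values = list(values)
    return sum(values) / len(values)


accented = mean(INITIAL_VOT.values())
unaccented_single = mean(MEDIAL_SINGLE_VOT.values())
unaccented_all = mean(list(MEDIAL_SINGLE_VOT.values())
                      + list(MEDIAL_GEMINATE_VOT.values()))

# Assumption: accented = initial position, unaccented = medial position.
diff_single = accented - unaccented_single
diff_all = accented - unaccented_all
print(round(diff_single, 1), round(diff_all, 1))
```

Both variants land within about a millisecond of the 21 ms figure quoted above.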
2.3.3. Vowel duration. Tables IV and V show the average duration of the first vowel /a/ and
the second vowel /a/ respectively.
Table IV Average vowel duration of the first /a/ for each speaker in
milliseconds
         S1     S2     S3     S4     Mean
/p-p/    90     63     59     69     70
/p-b/    101    59     86     78     81
/b-p/    103    93     81     96     93
/b-b/    104    97     90     108    100
/t-t/    94     49     71     62     69
/t-d/    113    63     85     77     85
/d-t/    111    98     99     100    102
/d-d/    124    105    97     108    109
/k-k/    74     64     85     65     72
/k-g/    102    82     75     84     86
/g-k/    107    111    113    108    110
/g-g/    126    108    107    114    114
Table V Average vowel duration of the second /a/ for each speaker in
milliseconds
         S1     S2     S3     S4     Mean
consonant had a slightly stronger influence, while Homma (1973) recognized a stronger
effect of the preceding consonant, and Maeda (1979) found the same results as Homma.
Table VI shows the average vowel duration in milliseconds and in percentage under the
influence of (A) the preceding stop, and (B) the following stop. Table VI was based on
Table IV. For all the stops, vowel duration in voiced environments was taken to be 100%.
Comparing (A) and (B), we find that the preceding stop has more influence than the
following stop. In (A) vowel duration is reduced by approximately 25% and in (B) by 10%.
Therefore, we can conclude that in Japanese the preceding consonant has a stronger effect
on vowel duration.
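The comparison between the preceding and the following stop can be retraced from the mean column of Table IV. This is a sketch of one plausible way to derive the percentages, pooling all three places of articulation; it is an assumption about the calculation, not a reproduction of Table VI.

```python
# Mean duration (ms) of the first /a/ from Table IV, keyed by
# (preceding stop, following stop).
FIRST_A = {
    ("p", "p"): 70, ("p", "b"): 81, ("b", "p"): 93, ("b", "b"): 100,
    ("t", "t"): 69, ("t", "d"): 85, ("d", "t"): 102, ("d", "d"): 109,
    ("k", "k"): 72, ("k", "g"): 86, ("g", "k"): 110, ("g", "g"): 114,
}
VOICELESS = set("ptk")


def mean(values):
    return sum(values) / len(values)


def voiceless_percent(position: int) -> float:
    """Vowel duration next to a voiceless stop as % of that next to a voiced stop.

    position 0 looks at the preceding stop, position 1 at the following stop.
    """
    voiceless = [v for k, v in FIRST_A.items() if k[position] in VOICELESS]
    voiced = [v for k, v in FIRST_A.items() if k[position] not in VOICELESS]
    return 100 * mean(voiceless) / mean(voiced)


preceding = voiceless_percent(0)
following = voiceless_percent(1)
print(round(100 - preceding, 1), round(100 - following, 1))
```

Pooled this way, the reduction is roughly 26% for the preceding stop and roughly 10% for the following stop, in line with the approximate figures given above.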
(2) As seen in Table VII, vowel duration of the first /a/ slightly increased as the place
of closure of the adjacent stops moved toward the back.
Table VII Average vowel duration (ms) by place of closure
          First /a/    Second /a/
Labial    86           98
Apical    91           96
Velar     96           91
Mean      91           95
This trend parallels that of VOT, which also increased toward the back. However, in the
second /a/, vowel duration decreased in the same direction.
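The first-vowel column of Table VII can be rebuilt by grouping the Table IV means by place of closure. The sketch assumes rounding half up, which reproduces the printed values; the grouping helper is illustrative only.

```python
import math

# Mean duration (ms) of the first /a/ from Table IV, keyed by stop context.
FIRST_A = {
    "p-p": 70, "p-b": 81, "b-p": 93, "b-b": 100,
    "t-t": 69, "t-d": 85, "d-t": 102, "d-d": 109,
    "k-k": 72, "k-g": 86, "g-k": 110, "g-g": 114,
}
PLACE_OF = {"p": "labial", "b": "labial", "t": "apical",
            "d": "apical", "k": "velar", "g": "velar"}


def mean_by_place() -> dict:
    """Average the first-vowel means over the four contexts at each place."""
    groups = {}
    for context, value in FIRST_A.items():
        groups.setdefault(PLACE_OF[context[0]], []).append(value)
    # floor(x + 0.5) rounds half up (assumed table convention).
    return {place: math.floor(sum(v) / len(v) + 0.5) for place, v in groups.items()}


by_place = mean_by_place()
print(by_place)
```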
(3) In duration there was no significant difference between accented and unaccented
vowels. The second unaccented vowels were somewhat longer as in Homma (1973). This is
very different from English.
(4) Gemination of the stop had a slight influence on the following vowel. Vowels were a
little shorter after geminated stops. The average difference was about 8 ms.
Contextual effects on vowel duration in Japanese are very different from those in English.
First, in English, voicing of the following consonant has a much stronger influence on vowel
duration (Peterson & Lehiste, 1960; Chen, 1970) than in Japanese.
Secondly, in English, stress-accented vowels are much longer than unstressed vowels.
Klatt (1976) reported that the average duration of stressed vowels is approximately 130 ms,
and that the average duration of unstressed vowels is approximately 70 ms in connected
discourse. In Japanese the mean of the first /a/ in accented syllables was 91 ms, and that of
the second /a/ was 95 ms.
2.3.4. Durational relationship. Table VIII shows the average word duration in milliseconds
and ratio. That the word-duration ratio by acoustic measurements fits well into the linguistic
mora ratio 2 : 3 is very interesting.
Japanese stops and vowels 279
Although Han wrote that a unit of duration in Japanese is associated with a syllable, it
may be more appropriate to say that the domain of the durational pattern is not a syllable,
but a word. The following examples illustrate this.
Segment durations (ms)
          VOT    Vowel    Closure    Vowel    Word
/papa/    18     67       77         98       260
/gaga/    13     109      40         105      267
Here /papa/ and /gaga/ are two mora words which have almost the same word duration.
However, when we compare these two words, we find a big difference in vowel duration
and in closure duration. In /papa/ vowels are shorter and closure duration is longer because
of the voiceless environment, while in /gaga/ vowels are longer and closure duration is
shorter. Therefore, the duration of the first syllable of /papa/ is 85 ms, and that of /gaga/
is 122 ms. In spite of the big difference in the first syllable duration, the two words are
almost the same in the word duration. This means that temporal compensation works
within a word, not within a syllable or mora. The previous experiments by Homma (1973)
and Maeda (1979) support this observation.
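The compensation argument is simple arithmetic on the segment durations given for /papa/ and /gaga/. The sketch below recomputes the first-syllable and word durations from those figures; the dictionary layout and function names are assumptions for illustration.

```python
# Segment durations (ms) for the two example words, in temporal order.
SEGMENTS = {
    "papa": {"vot": 18, "vowel1": 67, "closure": 77, "vowel2": 98},
    "gaga": {"vot": 13, "vowel1": 109, "closure": 40, "vowel2": 105},
}


def first_syllable(seg: dict) -> int:
    """Initial VOT plus first vowel."""
    return seg["vot"] + seg["vowel1"]


def word_duration(seg: dict) -> int:
    """Sum of all four measured intervals."""
    return sum(seg.values())


for word, seg in SEGMENTS.items():
    print(word, first_syllable(seg), word_duration(seg))

syllable_gap = first_syllable(SEGMENTS["gaga"]) - first_syllable(SEGMENTS["papa"])
word_gap = word_duration(SEGMENTS["gaga"]) - word_duration(SEGMENTS["papa"])
print(syllable_gap, word_gap)
```

The first syllables differ by 37 ms while the whole words differ by only 7 ms, which is the pattern the text attributes to compensation at the word level.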
3. Conclusions
In this durational study of Japanese, the following points were observed.
References
Chen, M. (1970). Vowel length variation as a function of the voicing of the consonant environment.
Phonetica, 22, 129-159.
Denes, P. (1955). Effect of duration on the perception of voicing. Journal of the Acoustical Society of
America, 27, 761-764.
Han, S. M. (1962). The feature of duration in Japanese. Tokyo: Phonetic Society of Japan, Study of
Sounds, 10, 65-75.
Harris, M. S. & N. Umeda (1974). Effect of speaking mode on temporal factors in speech: vowel duration.
Journal of the Acoustical Society of America, 56, 1016-1018.
Homma, Y. (1973). An acoustic study of Japanese vowels: their quality, pitch, amplitude, and duration.
Study of Sounds, 16, 347-368.
Homma, Y. (1978). Vowel duration in English. Osaka: Osaka Gakuin University, Gaikokugo Ronshu, 6,
51-67 (in Japanese).
Homma, Y. (1980). Voice onset time in Japanese stops. Bulletin of the Phonetic Society of Japan, 163,
7-9.
House, A. S. (1961). On vowel duration in English. Journal of the Acoustical Society of America, 33,
1174-1178.
House, A. S. & G. Fairbanks (1953). The influence of consonant environment upon the secondary
acoustical characteristics of vowels. Journal of the Acoustical Society of America, 25, 105-113.
Klatt, D. H. (1973). Interaction between two factors that influence vowel duration. Journal of the
Acoustical Society of America, 54, 1102-1104.
Klatt, D. H. (1975). Voice onset time, frication and aspiration in word-initial consonant clusters. Journal
of Speech and Hearing Research, 18, 686-705.
Klatt, D. H. (1976). Linguistic uses of segmental duration in English: acoustic and perceptional evidence.
Journal of the Acoustical Society of America, 59, 1208-1221.
Lehiste, I. (1970). Suprasegmentals. Cambridge, Massachusetts; London: M.I.T. Press.
Lisker, L. (1957). Closure duration and the intervocalic voiced-voiceless distinction in English. Language,
33, 42-49.
Lisker, L. & A. S. Abramson (1964). A cross-language study of voicing in initial stops: acoustical
measurements. Word, 20, 384-422.
Lisker, L. & A. S. Abramson (1967). Some effects of context on voice onset time in English stops.
Language and Speech, 10, 1-28.
Maeda, S. (1979). Timing control in Japanese speech production. Nara: Tenri University, Tenri Daigaku
Gakuho, 121, 1-21.
Okada, T. (1969). The influence of voiced or voiceless consonants on vowel duration. Kyoto: Literary
Association of Doshisha University, Jimbungaku, 115, 68-84 (in Japanese).
Okada, T. (1971). A spectrographic study of the duration correlation between some Japanese vowels and
consonants. Literary Association of Doshisha University, Doshisha Studies in English, 1, 32-49 (in
Japanese).
Peterson, G. E. & I. Lehiste (1960). Duration of syllable nuclei in English. Journal of the Acoustical
Society of America, 32, 693-703.
Port, R. F. (1976). Influence of Speaking Tempo on the Duration of Stressed Vowel and Medial Stop in
English Trochee Words. Bloomington: Indiana University Linguistics Club.
Port, R. F. & R. Rotunno (1979). Relation between voice-onset time and vowel duration. Journal of the
Acoustical Society of America, 66, 654-662.