Professional Documents
Culture Documents
Theory Phonetics 1
Theory Phonetics 1
1. COMMUNICATION
1.1. SPEECH
One of the chief characteristics of the human beings is his ability to communicate complicated messages
concerning every aspect of his activity. A man possessing the normal human faculties achieves this exchange
of information mainly by two types of stimulation, auditory and visual. The child will learn from a very early
age to respond to the sounds and tunes which his elders habitually use in talking to him. In due course, from
a need to communicate, he will himself begin to imitate the recurrent sound patterns with which he has
become familiar. In other words he becomes to make use of speech. His constant exposure to the spoken
form of his own language, together with his need to convey increasingly subtle types of information, leads to
a rapid acquisition of the framework of his spoken language. Nevertheless, with all the conditions in his
favour, a number or years will pass before he has mastered not only the sound system used in his community,
but also has at his disposal a vocabulary of any extent or is entirely familiar with the syntactical arrangements
in force in his language system.
It is no wonder, therefore, that the learning of another language later in life, acquired artificially in brief and
sporadic spells of activity and without the stimulus arising from an immediate need for communication, will
tend to be tedious and rarely more than partially successful.
In addition, the more firmly consolidated the basis of a first language becomes or, in other words, the later in
life that a second language is begun, the more the learner will be subject to resistance and prejudices deriving
from the framework of his original language. It may be said that, as we grow older, the acquisition of a new
language entails a great deal of conscious, analytical effort, instead of the child’s ready and facile imitation.
1.2. WRITING
Later in life the child will be taught the conventional visual representation of speech: he will learn to use
writing. To-day, in considering those languages which have long possessed a written form, we are apt to
forget that the written form is originally an attempt at reflecting the spoken language and that the latter
precedes the former for both the individual and the community. Indeed, in many languages, so parallel are
the two forms felt to be that the written form may be responsible for changes in the pronunciation or may at
least tend to impose restraints upon its development. In the case of English, this sense of parallelism, rather
than of derivation, may be encouraged by the obvious lack of consistent relationship between sound and
spelling. A written form of English based on the Latin alphabet, has existed for more than 1,000 years and ,
though the pronunciation of English has been constantly changing during this time, few basic changing of
spelling has been made during the 15th century.
The result is that written English is often an inadequate misleading representation of the spoken language of
to-day. Clearly it would be unwise, to say the least, to base our judgments concerning the spoken language on
prejudices derived from the orthography. Moreover, if we are to examine the essence of the English language,
we must make our approach through the spoken rather than the written form. Our primary concern will be
the production, transmission and reception of the sounds of English—in other words, the phonetics of
English.
1.3 LANGUAGE
From the moment that we abandon orthography as our starting point, it is clear that the analysis of the
spoken form of English is by no means simple. Each of us uses an infinite number of different speech sounds
when we speak English. Indeed, it is true to say that it is difficult to produce two sounds which are precisely
identical from the point of view of instrumental measurement: two utterances by the same person of the
word ‘cat’ may well show quite marked differences when measured instrumentally. Yet we are likely to say
that the same sound sequence has been repeated. In fact we may hear clear and considerable differences of
quality in the vowel of cat as for instance, in the London and Manchester pronunciations of the word; yet,
though we recognize differences of vowel quality, we are likely to feel that we are dealing with a ‘variant’ of
the ‘same’ vowel. It seems, then, that we are concerned with two kinds of reality: the concrete, measurable
reality of the sound uttered, and another kind of reality, an abstraction made in our minds, which appears to
reduce this infinite number of different sounds to a ‘manageable’ number of categories. In the first, concrete,
approach, we are dealing with sound in relation to the speech; at the second, abstract, level, our concern is
1
the behaviour of sound in a particular language. A language is a system of conventional signals used
for communication by a whole community.
This pattern of conventions covers a system of significant sound units: the phonemes, the inflexion and
arrangement of words, and the association of meaning with words. An utterance, an act of speech, is a single
concrete manifestation of the system at work.
As we have seen, several utterances which are plainly different on the concrete, phonetic level may fulfil the
same function, i.e. are the ‘same,’ on the systematic language level. It is important in any analysis of the
spoken language to keep this distinction in mind and we shall later be considering in some detail how this
dual approach to the utterance is to be made. It is not, however, always possible or desirable to keep the two
levels of analysis entirely separate: thus, as we shall see, we will draw the linguistically significant units to
help us in determining how the speech continuum shall be divided up on the concrete, phonetic level; and
again, our classification of the linguistic units will be helped by our knowledge of their phonetic features.
1.4 REDUNDANCY
Finally, it is well to remember that, although the sound system of our spoken languages serves us primarily
as a medium of communication, its efficiency as such an instrument of communication does not depend
upon the perfect production and perception of every single element of speech. A speaker will, in almost any
utterance, provide the listener with far more cues than he needs for easy comprehension.
In the first place, the situation or context will itself delimit the purpose of an utterance. Thus, in
any discussion about a zoo, involving a statement such as ‘We saw lions and tigers’ , we are predisposed by
the context to understand lions, even if the ‘n’ is omitted and the word actually said is liars.
Or again, we are conditioned by grammatical probabilities, so that a particular sound may lose much
of its significance, e.g. in the phrase ’These men are working’, the quality of the vowel in ‘men’ is not as vitally
important for deciding whether it is a question of men or man as it if the word were said in isolation, since
here the plurality is determined by the demonstrative adjective preceding men and the verb form following.
The again, there are particular probabilities in every language as to the different combination
of sounds which will occur. Thus in English, if we hear an initial ‘th’ sound /θ/we expect a vowel to
follow, and of the vowels some are much more likely than the others. We distinguish such sequences as –gl
and –dl in final positions, e.g. in beagle and beadle, but this distinction is not relevant initially, so that even if
dloves is said, we understand gloves. Or again, the total rhythmic shape of a word may provide an
important cue to its recognition: thus, in a word such as become, the general rhythmic pattern may be
said to contribute as much for the recognition of the word as the precise quality of the vowel in the first,
weakly accented syllable. Indeed, we may come to doubt the relative importance of vowel as a help to
intelligibility, since we can replace our 20 English vowels by a single vowel (schwa) in any utterance and still,
if the rhythmic pattern is kept, retain a high degree of intelligibility.
An utterance, therefore, will provide a large complex of cues for the listener to interpret, but a great deal of
this information will be unnecessary, or redundant, as far as the listener’s needs are concerned. On the other
hand, such an over proliferation of cues will serve to offset any disturbances, such as noise, or to counteract
the sound quality divergences which may exist between speakers of two dialects of the same language. But to
insist, for instance, upon exaggerated articulation in order to achieve clarity may well be to go beyond the
requirements of speech as a means of communication. Indeed, certain obscuration of quality are, and have
been for centuries, characteristic of English.
LINGUISTICS
1-PHONOLOGY: The concrete phonetic characteristics (articulatory, auditory, acoustic) of the sounds
used in the language. The concrete phonetic level is often separated from the more abstract phonological
level which analyses the patterning of the sounds in language and includes the functional, phonemic
behaviour of these sounds for distinctive purposes; the combinatorial possibilities (syllabic structure or
phonotactics) of the phonemes; the nature and use of such prosodic features as pitch, stress and length. A
study of the phonic substance of the language may be accompanied by a description of the written form of the
language (graphology)
2
2-LEXIS: The total number of word forms which exist.
3-MORPHOLOGY: The structure of words.
4- SYNTAX: The system of rules governing the structure of phrases, clauses and sentences consisting of
words contained in the lexicon.
(Statements may be made which combine phonemic and morphemic features (morphophonemics or
morphophonology) , e.g. the phonemes /s/ and /z/ in cats and dogs are exponents of the same morpheme of
plurality which might be symbolized as |s|.
5-SEMANTICS: The relation of the meaning to the signs and symbols of the language.
Other aspects of the language which would require investigation include the variation of the same language
in different regions and social classes (dialectology); the influence of context and style upon the form and
substance of the language and the society in which it is spoken (sociolinguistics)
Finally, it is clear that the phonology, lexis, syntax and semantics of a language are always undergoing change
in time. The state of a language at any (synchronic) moment must be seen against a background of its
historical (diachronic) evolution.
3
be brought together or parted be the rotation of the arytenoid cartilages (attached at the posterior end of the
folds) through muscular action. The inner edge of these folds has a length of 23mm. in adult men and about
18 mm. in women. The opening between the vocal folds is known as the glottis. Biologically, the vocal folds
act as a valve which is able to prevent the entry into the trachea and lungs of any foreign body or which may
have the effect of enclosing the air stream within the lungs to assist in muscular effort on the part of the arms
or the abdomen. In using the vocal folds for speech, the human being has adapted and elaborated upon this
original open-shut function in the following ways:
a. Tightly closed: The glottis may be held tightly closed, with the lung air pent up below it. This glottal stop
[?] frequently occurs in English, e.g. when it precedes the energetic articulation of a vowel or when it
reinforces or even replaces /p/ t/ k/. It may also be heard in defective speech, such as the arising from the
cleft palate, when [?] may be substituted for the stop consonants, which, because of the nasal air escape,
cannot be articulated with proper compression in the mouth cavity. We must take into account that the
glottal stop is an alternative. It is not a significant sound because it doesn’t change the meaning of the
words
b. Wide open The glottis may be held open as for normal breathing, characteristic position to produce
voiceless sounds and whispering sounds.
c. Loosely together: The action of the vocal folds which is most characteristically a function of speech
consists in their role of vibrator set in motion by lung air—the production of voice or phonation. This
vocal fold vibration is a normal feature of all vowels or of such a consonant /z/ compared with voiceless
/s/.
In order to achieve the effect of voice, the vocal folds are brought sufficiently close together that they vibrate
when subjected to air pressure from the lungs. This vibration, of a somewhat ondulatory character, is caused
by the pressure of the air stream and the activation of the sub-glottal nerves. The resultant reduced air
pressure permits the elastic folds to come together once more. The vibratory effect may easily be felt by
touching the neck in the region of the larynx when saying ah... or z.
In the typical speaking voice of a man, this opening and closing action is likely to be repeated between 100
and 150 times in a second, i.e. there are that number of vibrations cycles per second (cps). In the case of a
woman’s voice, this frequency of vibration might well be between 200 and 325 cps. We are able, within this
limits, to vary the speed of vibration of our vocal folds or, in other words, are able consciously to change the
pitch of the voice produced in the larynx; the more rapid the rate of vibration, the higher id the pitch.
Moreover, we are able, by means of variations in pressure from the lungs, to modify the size of the puffs of air
which escapes at each vibration of the vocal folds; in other words, we can alter the amplitude of the vibration,
with a corresponding change of loudness of the sound heard by a listener.
The normal human being soon learns to manipulate his glottal mechanism so that most delicate changes of
pitch and loudness are achieved. Control of this mechanism is, however, very largely exercised by the ear, so
that such variations are exceedingly difficult to teach to those who are born deaf; and a derangement of pitch
and loudness control is liable to occur among those who become totally deaf later in life
4
accompanied by friction between the rear part of the soft palate and the pharyngeal wall.
c. The soft palate may be held at its raised position, eliminating the action of the nasopharynx, so that the
air escape is only through the mouth. All normal English sounds, with the exception of the nasal
consonants mentioned, have this oral escape. Moreover, if for any reason the lowering of the soft
palate cannot be effected, or if there is an enlargement of the organs enclosing the nasopharynx or a
blockage brought about by mucus, it is often difficult to articulate either nasalized vowels or nasal
consonants. In such a speech, typical of adenoid enlargement or obstruction caused by a cold, the
English word morning would have its nasal consonants replaced by [b, d, g]
On the other hand, the inability to make an effective closure by means of the raising of the soft palate – either
because the soft palate is defective or because an abnormal opening of the roof of the mouth gives access to
the nasal cavity—will result in the general nasalization of vowels and the failure to articulate such oral stop
consonants as [b, d, g] This excessive nasalization (or hypernasality) is typical of a such condition as cleft
palate.
5
the pronunciation of [f,v], a light contact being made between the lower lip and the upper teeth.
b. Of all the movable organs within the mouth, the tongue is by far the most flexible and is capable of
assuming a great variety of positions in the articulation of both vowels and consonants. The tongue is a
complex muscular structure which does not show obvious sections yet, since its position must often be
described in considerable detail, certain arbitrary divisions are made. When the tongue is at rest, with its
tip lying behind the lower teeth, that part that lies opposite the hard palate is called the front and that
which is opposite the soft palate is called the back. The region where the front and back meet is known as
the centre. This whole upper area is called dorsum. The tapering section facing the teeth ridge is called the
blade and its extremity the tip. The tip and blade region is sometimes known as the apex. The edges of the
tongue are known as the rims.
Generally in the articulation of vowels, the tongue tip remains low behind the lower teeth. The body of the
tongue may, however, be “bunched up” in different ways, e.g. the front may be the highest part as when we
say the vowel of he, or the back may be more prominent as in the case of the vowel in who, or the whole
surface may be relatively low and flat as in the case of the vowel in ah. Such changes in shape can easily be
felt if the above words are said in succession. These changes, moreover, together with the variations in lip
position, have the effect of modifying very considerably the size of the mouth cavity and of dividing this
chamber into two parts, that cavity which is in the forward part of the mouth behind the lips and that
which is in the rear in the region of the pharynx.
The various parts of the mouth can also come in contact with the roof of the mouth. Thus, the tip, blade
and rims may articulate the teeth as for the sounds in English, or with the upper alveolar ridge in the case
of [t, d, s, z, n] or the apical contact may be only partial as in the case of [l] where the tip\ blade makes firm
contact while the rims make none (or intermittent as in a rolled [r]).
The front of the tongue may articulate against or near to the hard palate. Such a raising of the front of the
tongue towards the palate (palatalization) is an essential part of the
or,
again, there may merely be a narrowing between the soft palate and the back of the tongue, so that friction
of the type occurring in the Scottish loch is heard. And finally, the uvula may vibrate against the back of the
tongue, or there may be a narrowing in this region which causes uvular friction.
2.3 Articulatory description
We have now reviewed briefly the complex modifications which are made to the original air stream by a
mechanism which extends from the lungs to the mouth and nose. The description of any sound necessitates
the provision of certain basic information
1. The nature of the air stream, usually, this will be expelled by direct action of the lungs, but we shall later
consider cases where this is not so. It may be also relevant to assess the force of exhalation.
2. The action of the vocal folds, in particular, whether they are closed, wide apart or vibrating.
3. The position of the soft palate, which will decide whether or not the sound has nasal resonances.
4. The disposition of the various movable organs of the mouth, i.e. the shape of the lips and tongue, in
order to determine the nature of the related oral and upper pharyngeal cavities.
In addition, it may be necessary to provide other information concerning, for instance, a particular
secondary stricture or tenseness which may accompany the primary articulation, or again, when it is a
question of a sound with no steady state to describe, an indication of the kind of movement which is taking
place.
6
consonant have been defined phonetically, the criterion of distinction has generally been one of stricture, i.e.
the articulation of vowels is not accompanied by any closing or narrowing in the speech tract which should
prevent the escape of the air stream through the mouth or give rise to audible friction; all other sounds
(necessitating a closure or a narrowing which involves friction) are considered consonants. Such a definition
entails difficulty with such sounds as Southern British [l, r], which are traditionally consonants but which are
more characteristically vowel-like in the terms of the definition. The difficulty has sometimes been avoided
by calling them semi-vowels or semi-consonants. From the practical phonetic standpoint, it is convenient to
distinguish two types of speech sound, simply because the majority if sounds may be described and classified
most appropriately according to one of two techniques:
1. The type of sound which is most easily described in terms of articulation, since we can generally feel the
contacts and movements involved. Such sounds may be produced with or without vocal fold vibration (voice)
and very often have a ‘noise’ component in the acoustic sense; these sounds fall generally into the traditional
category of consonants and will be known as the consonantal type.
2. the type of sound, depending largely on very slight variations of tongue position, which is more easily
described in terms of auditory relationships, since there are no contacts or strictures which we can feel with
any precision Such sounds are generally voiced, having no noise component but rather a characteristic
patterning of formants. These sounds generally fall into the traditional category of vowels and will be known
here as the vowel type.
We shall be able to assign the sounds of English to one or other of these phonetic classes but, the linguistic
categorization of sounds units will not always correspond to the phonetic one.
7
VELAR. - The back of the tongue articulates with the soft palate, e.g. [k/g]
UVULAR. - The back of the tongue articulates with the uvula [R]
GLOTTAL. - An obstruction or narrowing causing friction but not vibration, between the vocal folds, e.g.
[?]. In the case of some consonantal sounds, there may be a secondary place of articulation in addition to
the primary. Thus, in the o called ‘dark’ [] in addition to the partial alveolar contact, there is an essential
raising of the back of the tongue towards the velum (velarization).
The place of primary articulation is that of the greatest stricture, that gives rise to the greatest obstruction to
the airflow. The secondary articulation exhibits a stricture of lesser rank. Where there are two-co-extensive
strictures of equal rank an example of double articulation results.
Fricative: Two organs approximate to such an extent that the air-stream passes between them with
friction, e.g.
3.3.4 APPROXIMANTS
All the consonants have a noise component, produced by the articulation of two organs together.
Nevertheless, approximants or frictionless continuants, have neither the closure nor the noise component
characteristic of consonantal articulations. They are, however, frequently variants of consonantal types, a
well a having the functional status of consonants and may therefore be included under this heading.
[w] and [j] (also referred to as semivowels) are usually included in the consonantal category on functional
grounds, but from the point of view of phonetic description they are more properly treated as vowel glides.
8
The essential factors to be included in any classificatory chart refer to:
1- The place of articulation.
2- The manner of articulation.
3- The presence or absence of voice.
4- The position of the soft palate.
The customary chart is based on a vertical axis showing manner of articulation; a horizontal axis showing
place of articulation, a pairing of consonantal types to show the voiceless (or fortis) variety on the left and the
voiced (or lenis) variety on the right. An extra column under ‘manner’ to include the relatively few
consonantal types which require the lower position of the soft palate