Professional Documents
Culture Documents
CUPELT GuideToListeningSkills
CUPELT GuideToListeningSkills
Listening Skills
Table of contents
Key points and recommendations 3
5. Listening strategies 11
This paper is © Cambridge University Press. We provide it free of charge to Better Learning network members,
but it is not to be shared outside your organisation or published in any form – printed or online – without the
written permission of Cambridge University Press.
August 2017.
Cambridge ELT Research Papers for Better Learning network members
John Field is Reader in Cognition in Language Learning at the CRELLA Research Institute, where he undertakes
research and consultancy on the testing of listening. He formerly taught Psycholinguistics and Child Language
Development at the University of Reading UK and Cognitive aspects of L2 learning at Cambridge University. His
main interest is in second language listening, which formed the topic of his PhD at Cambridge, and on which he
has written and researched widely. His 'Listening in the Language Classroom' (CUP 2008) has become a standard
text in teacher training and MA courses. In another life, he was an ELT teacher trainer and materials writer and a
language school inspector. He wrote national coursebooks for Saudi Arabia and skills materials for Hong Kong
schools. He also designed two English-teaching series for the BBC World Service and TV programmes for the Open
University of Mainland China.
3 PERCEPTUAL 2 MEANING-BASED
decoding the input (speech sounds) putting parsed information into context and
interpreting meaning
matching groups of sounds to words
linking pieces of information to construct a line
parsing words into grammatical patterns
of argument
Within these operations, a skilled listener commands a range of processes, e.g. word recognition includes: using
syllables as cues to word identity, identifying where one word ends and the next begins, accessing all possible
senses of a word. These processes can be practised individually via small-scale tasks, some involving transcription.
Classroom practice
Teacher training should ensure that listening instructors:
use conventional comprehension classes to diagnose where learners’ problems lie; then provide small-scale
remedial tasks to address the problems identified;
prepare for a class by anticipating which parts of a passage are likely to prove challenging to the learners;
replay demanding sections of the recording several times and get learners to discuss them and even to
transcribe short clips.
Successful handling of the comprehension approach depends heavily upon the skills of the individual teacher.
Thoroughly checking comprehension requires an understanding of the problems that listening poses, the nature
of the listening process and the ways in which learners respond to the demands of listening in an L2. It is not
enough simply to elicit check answers to questions, then to move on. A competent teacher has to anticipate
which parts of a passage are likely (for a range of reasons) to prove challenging to the learners – highlight them in
the transcript – replay them – get learners to discuss them – even get learners to transcribe short clips.
‘Sub-skills’
From the early 1980s, handbooks for L2 teachers presented reading as a componential activity. They proposed
programmes of small-scale localised exercises based upon what were termed sub-skills. Curiously, the approach
was little applied to the teaching of listening. Richards (1983) put together an extensive list of listening sub-skills,
for both general and academic contexts; but formats for practising them were slow to emerge.
The sub-skills approach had its limitations:
The taxonomies proposed were based entirely upon intuition.
They were very miscellaneous: mixing basic perception of sounds and words, listening for gist, recognising
the logic of a text and strategies such as guessing word meanings.
They proved difficult to grade in relation to the proficiency levels of learners.
Strategies
Distinct from sub-skills but sometimes confused with them were the strategies that less experienced L2 listeners,
speakers, readers and writers use in order to compensate for their limited mastery of the skill and their lack of
language knowledge. Listening programmes were developed that supported the comprehension approach by
training learners to use such techniques. They became particularly popular in the USA, where much local teaching
of L2 listening focused (and still focuses) upon target lists of strategies.
The strategy approach has its limitations too:
Commentators do not always distinguish between two types of strategy: those that compensate for gaps in
understanding and those that help learners to acquire language knowledge.
They do not distinguish between
• processes (e.g. how to identify words in connected speech): which all listeners have to use in
order to make sense of what they hear;
• strategies: short-term expedients which enable early-stage learners to make sense of a text that
they have understood incompletely.
The available lists of strategies largely reflect the impressions of commentators; and although a lot of
research has been conducted, much of it adopts the assumption that certain strategies exist and asks
learners if they do or do not use them.
Using them to define tasks ignores the fact that more than one type is used in many real-world listening
events. It depends upon the material that is being listened to, the context (presentation vs conversation) and
the listener’s goals.
Listening processes
These days a more rigorous approach is available. Extensive research into L1 listening has enabled psycholinguists
to construct detailed models of what the skill entails. This information potentially equips L2 instructors to
understand more clearly what it is that constitutes expert listening – and thus to define their goals more clearly
when planning classroom practice and designing tests of the skill. There are five fundamental operations which
make up competent listening.
By way of explanation:
input decoding: the sounds reaching the ear of the listener are converted into the sounds of speech;
lexical search: a group of sounds is matched to a word; information about the word, including its range of
meanings, is then retrieved;
parsing: a group of words is built up in the listener’s mind, until such time as a grammatical pattern can be
traced in it;
meaning construction: the information from parsing is placed in a wider context, which might involve the
ongoing topic, world knowledge or the listener’s knowledge of the speaker. Information is inferred that the
speaker has not explicitly expressed;
discourse construction: pieces of information are linked to each other, their relative importance is considered
and a line of argument is built up.
These five headings provide teachers with a systematic way of approaching listening. The first three can be
thought of as perceptual (or lower-level) operations, where the listener handles units of language. See Section 3.
The last two are conceptual (higher level) operations, where the listener interprets the ideas that have been
extracted. See Section 4.
Within these operations, there are many different processes (see Appendix A) which provide a basis for possible
small-scale practice exercises. The term process is preferable to sub-skill. Whereas sub-skills were intuitive
suggestions by methodologists, the processes referred to here are based on research evidence over many years,
which indicates that they are psychologically real.
Up to B1+ level, teachers thus do well to focus on perceptual processes - in particular, on the features of
connected English speech that cause problems. They also have to address the reality that early L2 listeners will be
unable to decode all they hear. Teachers need to provide training in the use of compensatory strategies such as
ignoring unknown words or inferring their meaning from context – identifying key words marked out by
intonation – deciding the likelihood of a recognised word – checking guesses against later information. These are
critical in order to support the morale of early-stage listeners and to make them feel that progress is being made.
In short, a double programme is needed:
1. training in specific processes central to listening in the L2 (Sections 3-4);
2. training in strategies that enable learners to compensate, short term, for problems of understanding.
(Section 5)
Younger learners
Young learners represent a special case. In determining what they can handle in terms of L2 listening, we have to
take account of more than just the need to feature the topics and lexical fields with which they are familiar. We
also need to consider their cognitive development. From research into their comprehension of read-aloud
passages, we know that, until the age of about 12, YLs have problems with higher-level processes such as
interpreting pronouns, making inferences and monitoring their own understanding for inconsistencies.
Whatever the target language level, teachers thus need to focus largely on low-level processes. We also know
that YLs are likely to have difficulty with the type of decision-making activity that is involved in using strategies.
Strategy-based practice of the type recommended for adults may not be appropriate.
Phoneme variation
The input that reaches the listener’s ear takes the form of acoustic sensations, which the listener has to match to
the sounds of the language. That is not as easy as it might seem. Phonemes do not possess a standard form; they
vary considerably according to the phonemes before and afterwards. Speech scientists discovered long ago that
phonemes spliced from connected speech cannot be accurately identified by L1 listeners and that there is no
standard set of cues for any single phoneme.
We therefore have to assume either that we store many versions of every phoneme in our minds in order to
recognise the sounds we hear, or (perhaps more likely) that the smallest unit of perception for the listener is the
Word variation
Words also vary in form under the influence of the words that come before and after them.
1. Resyllabification. English speech has a tendency to follow a strong-weak rhythm. This may lead to some
syllables being redistributed: Lizbe/camea/star.
2. When articulating a group of words, a speaker tends to adopt the easiest possible route, avoiding
complicated consonant sequences. The result may change the form of the words:
• Elision: the final sound of a word may be omitted: night club [naɪ klʌb]
• Assimilation: the final sound of a word may anticipate the word that comes next [tem pounds]
3. Liaison. For similar reasons, speakers often insert an extra consonant to assist the transition between
vowels: to (w)eat, the (y)end
4. Reduction. When producing a familiar chunk of language, a speaker might omit important parts or reduce
them to [ə]. Do you know what I mean? might become [narp mean].
Speaker variation
Speakers’ voices vary enormously in fundamental pitch and in range.
Speech rate varies from one speaker to another and one encounter to another.
Speakers’ voices vary in terms of regional or class accent.
Listeners often find themselves having to adjust (normalise) to the voice of a new speaker or to the rate at which
a known speaker is speaking. While this occurs very rapidly in an L1, it is inevitably more complicated in an L2.
Word boundaries
There are occasional short pauses where speakers plan what to say next; but there are no consistent gaps
between words in connected speech as there are in writing. A listener has to engage in lexical segmentation,
working out the points where one word ends and the next begins. In English, an important cue used by
experienced listeners is provided by syllables bearing lexical stress. They provide a relatively reliable indicator of
where a new content word is likely to begin because the majority of content words in running speech in English
are either monosyllabic or start with a stressed syllable.
This indicates the value of simple transcription exercises, where learners listen, perhaps four or five times, to a
short section of authentic speech of up to 15 seconds. They write down the words they recognise, compare their
efforts in pairs and add to them on each new hearing.
Relative prominence
While lexical stress assists L2 learners in recognising content words, they still have difficulty in identifying some of
the most frequent words in the language (i.e. function words) in connected speech. The words are short and
often include the weak vowel schwa; and their duration gets reduced within an intonation group. A breakthrough
in decoding these words occurs around CEFR level B1+; it appears to be connected with the listener’s growing
ability to recognise common chunks of language.
By contrast, the stressed word in an intonation group is more prominent and longer in duration than the others
and is therefore easier to recognise. Identifying the focal word provides an important strategic prop for early-
stage learners.
There is a tendency for items in listening tests to focus on facts and to neglect these more sophisticated
operations. This is unfortunate as it is not difficult to design tasks that elicit them. There is no reason why, in the
classroom, conventional comprehension questions cannot target areas such as speaker attitude, inference or
logical connections:
Did the speaker like the film? If so, why? If not, why not?
Compare the attitudes of these two speakers to mobile phones.
How likely is the speaker to vote for the new motorway?
Do you think the couple really enjoyed their holiday or not?
What happened first: the driving ban or the accident?
What caused the accident? What was the result?
How did the speaker’s phone lead to him missing the plane?
Why is palm oil relevant in this talk on global warming?
Similarly, various forms of note-taking can demonstrate a learner’s ability to report argument structure. One easy
format is to ask learners to complete a skeleton Table of Contents showing the major and minor points of a talk
(listed as 1…. 1.1 …. 1.2…. 2…. 2.1…. 2.2…. 2.3…. 3.... etc.) – with perhaps one or two answers included as points
of reference.
5. Listening strategies
L2 listeners in the earlier stages of development (up to about B1+) often have to focus heavily on the language of
the speaker because they are unable to recognise every word that is said. Inability to follow L2 speech can be
demotivating to learners, especially nowadays with so much spoken English available on the internet and through
broadcast media. Training learners in compensatory strategies is sometimes said to be useful for higher-level
listeners; in fact, it is those at lower levels who benefit most. The two strands of a listening programme identified
so far (practice in the processes used by experienced listener and getting used to the features of connected
speech) should thus be supplemented with tasks which train these learners to handle listening passages that they
only partly understand.
Types of strategy:
Avoidance strategies: Listener learns to ignore a piece of text that seems unimportant or to accept a general
impression of what was said;
Achievement strategies at clause level: Listener uses context or topic to infer what was said, listens for key
words or uses prominent words to guess clause meaning
Achievement strategies at word level: Listener uses context or analogy with other words in L2 or in L1 to
guess the meaning of new or unrecognised words. Listener extracts a general sense (oak = a type of tree).
Evidence suggests that many listeners deal with unknown words by making approximate matches and then
trying to fit them into what the speaker says.
Repair strategies: Listener learns formulaic expressions used in interactive situations to get clarification or
repetition.
Pro-active or metacognitive strategies: Ways of predicting in advance what a speaker will say or of ensuring
that different points are committed to memory. Many of these are useful in the classroom and in listening
tests but not relevant to real-world interaction.
There is some disagreement about methodology. A ‘direct’ approach to strategy training widely favoured in the
US introduces, demonstrates and practises individual strategies. Its problems are that:
strategic behaviour varies considerably from one individual to another. Some learners are risk avoiders (often
for cultural reasons); others are risk takers.
much strategy use follows a problem-solution pattern, and thus varies according to the specifics of the
problem. So it may be better for learners to become aware of appropriate strategies during a more general
listening task
An alternative to direct teaching is therefore a task-based strategy approach, where learners
1. listen to a recording (or a short clip)
2. share ideas on how they managed to make sense of parts they did not fully follow
3. report to the class
4. listen again
Strategy practice of this kind entails using recordings that are at or slightly beyond the language level of the
learners (useful for demonstrating how to deal with unknown words). Even at quite low levels, these recordings
should include simple clips of authentic speech.
A recognition that listening difficulties may derive from the recorded material and not simply from the
language of the transcript;
An understanding of how listeners process speech, turning sounds into ideas;
The ability to diagnose perceptual and conceptual problems as and when they occur;
The ability to design local tests of listening which are not heavily dependent upon conventional
comprehension questions or test formats.
Field, J. (1998) Skills and strategies: towards a new methodology for listening. ELT Journal 54/2: 110-
118
Field, J. (2012) Listening instruction. In A. Burns. A. & J.C. Richards (eds.) Pedagogy and Practice in
Second Language Teaching. Cambridge: Cambridge University Press, pp.207-217.
Field, J. (1999) Key concepts in ELT: bottom up and top down. ELT Journal, 53/4: 338-339.
Field, J. (2000) ‘Not waving but drowning’: a reply to Tony Ridgway. ELT Journal, 54/2: 186-195. Also:
ELT Journal, 54/3: 307.
Field, J. (2003) Promoting perception: lexical segmentation in L2 listening. ELT Journal, 57/4: 325-334.
Field, J. (2004) An insight into listeners problems: too much bottom-up or too much top-down?
System, 32: 363–77.
Richards, J. (1983) Listening comprehension: approach, design, procedure. TESOL Quarterly, 17: 219-
39.
Field, J. (2014) Myth: Pronunciation teaching needs to fix in the minds of learners a set of distinct
consonant and vowel sounds. In L. Grant (ed.) Pronunciation Myths. Ann Arbor: U. of Michigan
Press, pp. 80-106.
Cohen, A. (1998) Strategies of Language Learning and Language Use. Harlow: Longman.
Dornyei, Z. & Scott, M.L. (1997) Communication strategies in a second language: Definitions and
taxonomies. Language Learning, 47/1: 173-210.
Faerch, G. & Kasper, G. (1983) Plans and strategies in foreign language communication, In G. Faerch &
G. Kasper (eds.) Strategies in Interlanguage Communication. London: Longman, pp.20-60.
Field, J. (2000) ‘Not waving but drowning’: a reply to Tony Ridgway. ELT Journal, 54/2: 186-195. Also:
ELT Journal, 54/3: 307.
Macaro, E., Graham, S. & Vanderplank, R. (2008). A review of listening strategies: focus on sources of
knowledge and on success. In A.D. Cohen & E. Macaro (eds.): Language Learner Strategies.
Oxford: Oxford University Press.
Decoding processes
1. Phoneme level
1.1 Phoneme recognition in a range of contexts
1.2 Discriminating consonants
1.3 Discriminating vowels
1.4 Recognising consonant clusters
1.5 Extrapolating spellings from sounds
2. Syllable level
2.1 Recognising syllable structure
2.2 Recognising syllable stress
2.3 Treating stressed syllables as more reliable
2.4 Using stressed syllables to recognise words
2.5 Using weak syllables to locate function words
3. Word level
3.1 Lexical segmentation
3.1.1 Rhythm based strategies
3.1.2 Using prefixes and suffixes as boundary markers
3.1.3 Using fixed stress (where appropriate)
3.2 Recognising variant forms of words
3.2.1 Allowing for cliticisation
3.2.2 Allowing for resyllabification
3.2.3 Recognising weak forms of function words
3.2.4 Recognising assimilated words
3.2.5 Allowing for elision
3.2.6 Recognising reduced words within intonation groups
3.3 Recognising complete formulaic chunks
3.4 Using awareness of word frequency
3.5 Current activation
3.6 Spreading activation (word networks in the mind)
3.7 Distinguishing known and unknown words
3.8 Dealing with unknown words: infer – generalise – ignore
3.9 Automatic lexical access
4. Syntactic parsing
4.1 Building syntactic structures during pauses and fillers
4.2 Using planning pauses to demarcate syntactic structures
4.3 Distinguishing planning and hesitation pauses
4.4 Using intonation groups to demarcate syntactic structures.
4.5 Building a syntactic structure on-line
4.5.1 Testing hypotheses
4.5.2 Using probability
4.5.3 Recognising syntactic chunks
4.5.4 Recognising the sentence pattern associated with the verb
4.5.5 Recognising primary L2 cues to syntactic organisation
4.6 Understanding pragmatic intentions
4.7 Drawing inferences based on syntax
5. Intonation
5.1 Relating intonation groups to syntactic structure
5.2 Forming and testing decoding hypotheses within an intonation group
5.3 Identifying focally stressed syllables
5.4 Treating focally stressed syllables as central to the message
5.5 Guessing words of low prominence in the intonation group
5.6 Allowing for variations in intonation between varieties of English
1. Word meaning
1.1 Narrowing word sense to fit context
1.2 Dealing with word ambiguity
1.3 Inferring meaning of unknown words
2. Syntactic meaning
2.1 Relating syntax to context
2.2 Interpreting speaker’s functional intentions
2.3 Forming inferences from syntactic information
3. Intonation meaning
3.1 Recognising given / new relationships
3.2 Distinguishing given/new and contrastive and emphatic stress
3.3 Relating contrastive and emphatic focal stress to context
3.4 Recognising finality
3.5 Recognising the end of a speaker turn
3.6 Using intonation to identify questions in statement form
3.7 Distinguishing a confirmation request from a more open question
3.8 Distinguishing echoes and challenges
3.9 Distinguishing neutral – emotive – withdrawn intonations
7. Using inference
7.1 Inferring information the speaker has left unsaid
7.2 Inferring connections between pieces of information that were not made explicitly