Professional Documents
Culture Documents
Bucher Niemann 2012 283.full With Cover Page v2
Bucher Niemann 2012 283.full With Cover Page v2
Bucher Niemann 2012 283.full With Cover Page v2
Bucher-Niemann-2012-283
Hans-Jürgen Bucher
PowerPoint : A t ree of good and evil; Ident ifying t heoret ical frameworks for invest igat ing Pow…
Arash (Ali) Farhadipour
Published by:
http://www.sagepublications.com
Additional services and information for Visual Communication can be found at:
Subscriptions: http://vcj.sagepub.com/subscriptions
Reprints: http://www.sagepub.com/journalsReprints.nav
Permissions: http://www.sagepub.com/journalsPermissions.nav
What is This?
visual communication
ARTICLE
Visualizing science:
the reception of
powerpoint presentations
ABSTRACT
Presentations with PowerPoint, made possible by digitalization, are another
step towards more visualization in the history of science communication.
This new genre of communicating scientific knowledge to an audience com-
bines several semiotic sign systems and therefore can be analysed as a
form of multimodal discourse that integrates pictures, text, design, etc. on
the slides, as well as spoken language, gestures, acts of pointing, etc. by
the speaker. This study approaches the problem of multimodal discourse
– how meaning is constituted by the different modes – empirically from a
recipient’s perspective. To reconstruct the meaning-making process, the
authors apply eye tracking and other methods of reception research in real-
life scenarios as well as in laboratory settings.
KEYWORDS
audience research • eye tracking • multimodality • PowerPoint • science
communication
SAGE Publications (Los Angeles, London, New Delhi, Singapore and Washington DC:
http://vcj.sagepub.com) Copyright © The Author(s), 2012.
Reprints and permissions: http://www.sagepub.co.uk/journalspermissions.nav/
Vol 11(3): 283–306 DOI 10.1177/1470357212446409
Scenario I
The first principle is that reception research on scientific presentations has
to be done in real-life settings (scenario I) to make the ‘naturalized engage-
ment’ (Kress, 2010: 170) of recipients visible. Only under this condition it
is possible to track interaction between the orchestration of a scientific pre-
sentation – the sign-making process of the presenter – and the audience’s
reception – the meaning-making process of the addressee. In the study pre-
sented here, the eye movements of about 60 persons were recorded in con-
ference presentations of various scientific disciplines, mainly in the third
and fourth quarters of 2008. More precisely, the gaze motions of an expert
in the discipline of each documented presentation were tracked using the
video-based SMI mobile eye tracker ‘iView X RED’ (50 Hz) (Figure 1).
Additionally, a questionnaire was distributed to the audience focusing on
socio-demographics, professional expertise, the degree of experience with
PowerPoint, and the individual evaluation of each presentation in differ-
ent dimensions. Furthermore, all these presentations were documented by
video, and all PowerPoint-slides were archived. In sum, more than 24 hours
SCENARIO IV
Because of technical limitations in the live scenario – only one head-
mounted eye-tracking device was available – the data of only one recipi-
ent exist for each presentation. To overcome idiosyncratic assumptions
about understanding multimodal communication, as a third principle a
control scenario was developed (scenario IV) that permits reproducing the
same presentation life size as often as is required on a 3-by-2m wall screen
(Figure 2). The staging of the videotaped presentation was already guided
by hypotheses derived from the former scenarios – for example, on the rel-
evance of pointing gestures by the presenter or hypotheses on reception
patterns of slide types. Using this procedure, reception data from 23 per-
sons attending the same presentation could be collected. Similar to scenario
I, eye movements of the recipients were tracked (SMI, ‘iView X HED’, 50
Hz) and the participants completed a questionnaire. As in scenarios II and
III, there was a subsequent knowledge test and, in 10 cases, retrospective
interviews.
A fourth methodological principle, derived from reception theory, led
to a research design that permitted tracking the influence of recipient fea-
tures on the understanding of multimodal presentations. Therefore, in sce-
narios II, III and IV, two populations of test subjects were recruited: experts
and novices. As they differed in terms of prior knowledge and competence
regarding the topic of the presentations, the influence of these features could
be measured.
These four methodological conclusions, i.e. using a natural setting,
manipulating modal density, providing a control scenario, and isolating recip-
ient features, guaranteed rich and well-matched reception data and therefore
permitted reconstruction of the different layers of the process of appropriat-
ing multimodal presentations.
Qualitative methods
In addition to eye tracking, the think-aloud method or retrospective ad hoc
commenting, interviews and knowledge tests provide process data on how
the recipients piece together the different modes of scientific presentations
(especially think aloud), and they yield information about interpretations
and knowledge acquisition (post-hoc methods like interviews and knowledge
tests). Moreover, the post-hoc methods allow for the interpretation of direct
reception data generated by eye tracking and the think-aloud method.
EMPIRICAL RESULTS
The presentation of the empirical results of our reception study addresses
the following main questions: What is the influence of the types of slide of a
PowerPoint presentation on the reception? What is the impact of modal den-
sity? How relevant are referring actions? And, finally, how important are indi-
vidual characteristics of the recipients, especially their prior knowledge, for
the understanding of multimodal presentations?
Types of slide
In order to analyse reception processes under a multimodal perspective, slides
may be usefully classified into three types: text-only slides, pictorial-only slides,
which can contain figurative or graphical elements, and mixed slides, which can
be composed of textual and all types of pictorial elements. This classification
of slides permits inferences about the basic types of intersemiotic relations,
namely relations between speech and text, speech and pictures, and speech
and multimodal slides. The distribution of the three types varies in the differ-
ent scientific disciplines (see Table 2). In economics, for example, the greatest
number of the slides are textual (48%), the most ‘pictorial’ presentations come
from the natural sciences, where nearly two thirds (65%) of all slides contain
pictorial material. As for pictorial slides, the humanities occupy a middle posi-
tion with 28 per cent figurative and 24 per cent mixed slides. Apart from the
differences between the disciplines, the distribution of the types of slide indi-
cates a strong overall tendency towards a visualization of scientific conference
presentations in all disciplines.
Although the eye-tracking data show fundamental differences con-
cerning the three types of slides, they indicate an overall feature of all mean-
ing-making processes of scientific presentations. The recipients do not add the
meaning of the speech and the meaning of the slides mode by mode towards
an overall meaning of a presentation, but they combine and integrate the dif-
ferent semiotic resources from the beginning. However, the different types of
slides activate different patterns of meaning-making, which are indicated by
typical sequences of eye movements. One can interpret these patterns as traces
of the recipients’ strategies to construct coherence between what is said and
what is shown.
Text slides
Text slides in scientific presentations are not only words on a screen but well-
designed spatial arrangements of information units, which we can call visual
text (Bucher, 2007). Therefore, understanding a text slide not only means
comprehending what is written, but also grasping the spatial constellation
of the text units (i.e. the text design of the slide), which can, for example,
express hierarchical relations or a relevance structure. Text slides appear in
Pictorial-only slides
Pictorial-only slides in scientific presentations occur in many variants, for
instance as drawings covering only parts of the slide, as full-size photographs,
or as photo collages. Independent of size and number of the pictorial ele-
ments, these slides typically fulfill one or more of the following functions:
They demonstrate, visualize, provide backing for, or illustrate the topic of a
presentation. Throughout our reception study, pictorial-only slides in differ-
ent variants elicited the same three-part pattern of reception, indicated by the
following sequence of eye movements. The appearance of the slide initially
attracts the participants’ entire visual attention for some seconds (A); subse-
quently, they avert their eyes from the slide, focusing on the speaker or other
elements in the room (B); and, finally, they briefly re-fixate the pictorial slide
(C) (Figure 5). The eye movements reveal a linear block-by-block reception
comparable in a way to the static text slide. As with the text slide, this pattern
was confirmed in the control scenario IV with 15 participants. This interactive
pattern, and especially the re-fixations, are strong arguments for a ‘generative
theory’ of knowledge acquisition, which means that ‘cognitive processes …
are applied segment by segment rather than to the entire message as a whole’
(Mayer, 2005: 41).
files without any audio mode only reached 1.76 points on average (Figure 6).
These results are less ambivalent than in Wiebe et al. (2007), who used com-
parable settings in a science learning context and concluded that ‘PowerPoint
with voiceover (scenario II in our study) does slightly more in terms of reten-
tion of science content than does a PowerPoint with no voiceover, but not
enough to show significance’ (pp. 338ff).3
The comparison of knowledge acquisition in scenarios with different
modal density shows that a higher number of modes in scientific presenta-
tions does not constrain the transfer of knowledge (e.g. due to cognitive over-
load), but rather that multiple modes may gainfully support one another (see
also Mayer, 2005).
on he verbally referred to the map next to him by saying: ‘And in the back-
ground the … information graphics.’ The visualization of the participants’
eye-tracking data (Figure 8) reveals that the reference with the laser pointer
instantly directs the audience to the anchor on the slide. The verbal reference
also has an impact on the allocation of visual attention, but a less distinct one.
During and after the reference, measurably more participants focus on the
map next to the anchor.
This example stresses the overall importance of referential actions to
the process of meaning-making. Nevertheless, the realization of their commu-
nicative functions presupposes the existence of several fundamental factors:
Figure 8 Visualization of the gaze motions of 15 participants in scenario IV during the reception process of the slide shown in Figure 7.
Figure 9 Mixed slide from a veterinary medical presentation.
On the left hand side (laser pointer on the left part of the PET scans)
you see two pictures of a naive rat (laser pointer follows the explana-
tions). Here in this area are the eyes, here you see the lung, and this
is the image of the brain. And you see quite well, that in the naive rat
Verapamil is not absorbed by the brain.
Figure 10 Visualization of the gaze motions of an expert and a novice in scenario II during the reception process of the slide shown in Figure 9.
Experts vs novices: scenario II
To evaluate the influence of the recipients’ prior knowledge on the allocation
of attention, we compared an expert and a novice in scenario II, where the
performative mode of the speaker (his facial expressions, the gestural deixis,
etc.) was cut off. Figure 10 illustrates the gaze motion of the two recipients
while the presenter gives the explanations documented in the excerpt above.
Obviously, the expert looks at the relevant areas in the picture showing the
PET scans (the lung and the brain of the rat on the left). The novice, however,
quite often gazes at the wrong rat (the one on the right), his scan path can be
described as a search pattern.7
Apparently, the expert’s higher level of prior knowledge allows him to
interpret the spoken references of the presenter in the intended way, so there is
no need for pointing actions with a laser pointer to direct his attention to the
relevant areas on the slide. For the novice, the elimination of the performative
mode of the presenter seems to be a major problem, which may be related to
a lack of knowledge of the anatomy of a rat, a lack of experience in analysing
PET scans, or deficits in using relevant technical terms (e.g. ‘naive rat’).
This interpretation is backed by a statement the expert made during
the retrospective interview, commenting: ‘Yes, due to the fact that the brain
looks like that … Therefore, I put this in a chronological sequence, because
he explained it like that.’ This reveals that the expert is familiar with the ana-
tomical structure of a rat’s brain as well as with the PET imaging technique.
Therefore, he is able to allocate his attention adequately on the basis of the
presenter’s verbal expressions.
The novice’s problems allow us to draw the conclusion that higher
modal density can assist meaning-making. With the additional performative
mode of reference actions it would have been much easier for the novice to
understand what is relevant on the slide at this specific point of the discourse.
The importance of the recipients’ prior knowledge for the process of
reception is also highlighted by the results of the knowledge test conducted
in scenario IV. When looking at the average score of the test participants
(n = 20), the experts scored considerably higher than the novices, reaching 1.92
points on average out of 4 possible points, compared to 1.6 points for novices.
LIMITATIONS
Two limitations of the findings must be mentioned. In contrast to eye track-
ing, no technical device exists for ear tracking. Therefore, audio attention,
which works simultaneously with visual attention, can only be deduced from
secondary data such as utterances of the recipients or reception results such as
knowledge tests. Because of technical limitations in the live scenario, we could
only collect data with one eye-tracking device. We tried to compensate for
this shortcoming with the help of a control scenario with a life-size presenta-
tion on a wall screen. Of course, it would be necessary to repeat the study in
a real-life scenario at different presentations with more than one eye-tracking
ACKNOWLEDGEMENTS
The authors thank Jochen Adam, Julia Harrer, Lisa Keimburg, Martin Krieg,
Christian Lehberger and the HumTec team at RWTH Aachen University, for
their support during data collection and preparation. We would also like to
thank Gerd Fritz and David Hudson for useful suggestions and corrections
to earlier versions of this article. Last but not least we would like to thank the
‘Volkwagen Stiftung’ for funding the research project ‘Interactive Science’.
NOTES
1. Nevertheless, it should be noted that, when evaluating the success of
comprehension in a reception process, eye-tracking data in themselves
are an important, but insufficient variable.
2. Due to temporal limitations at the conferences and symposia, where
the data of the real-life scenario were recorded, a knowledge test could
not be administered in scenario I.
3. Using the same Mann-Whitney U analysis applied in Wiebe et al.
(2007), the differences between the mean scores of scenario II and
scenario III prove to be statistically significant (0.014).
4. A status epilepticus is an emergency situation for animals and for
humans as well.
5. PET stands for Positron Emission Tomography which is an imaging
technique used in nuclear medicine.
6. The slides from the veterinary medical presentation we refer to were
presented by and used with the kind permission of Jens Bankstahl
(jens.bankstahl@tiho-hannover.de).
7. The scan path of a second novice on the same slide in scenario II
showed a comparable search pattern.
REFERENCES
Baldry, A. and Thibault, P.J. (2005) Multimodal Transcription and Text Analysis:
A Multimedia Toolkit and Coursebook with Associated On-line Course.
London: Equinox.
Bucher, H.-J. (2007) ‘Textdesign und Multimodalität: Zur Semantik
und Pragmatik medialer Gestaltungsformen’, in K.S. Roth and J.
Spitzmüller (eds) Textdesign und Textwirkung in der massenmedialen
Kommunikation, pp. 49–76. Konstanz: UVK.
BIOGRAPHICAL NOTE
HANS-JÜRGEN BUCHER is Professor at the Department of Media Studies,
University of Trier, Germany, with a background in discourse analysis and
practical journalism. His research interests include multimodal media com-
munication, journalism, Internet research, audience research and science
communication.
Address: Department of Media Studies, University of Trier, 54286 Trier,
Germany. [email: bucher@uni-trier.de]