Professional Documents
Culture Documents
Eye-Tracking: A Comprehensive Guide To Methods, Paradigms and Measures
Eye-Tracking: A Comprehensive Guide To Methods, Paradigms and Measures
net/publication/323779800
CITATIONS READS
63 11,211
2 authors, including:
Kenneth Holmqvist
Lund University
177 PUBLICATIONS 6,387 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Kenneth Holmqvist on 15 March 2018.
Eye tracking
—
a comprehensive guide to methods, paradigms
and measures
Kenneth Holmqvist (Regensburg University, Germany, Masaryk University, Brno, Czech Republic,
and NWU Vaal, South Africa)
Richard Andersson (Tobii AB, Stockholm, Sweden. Formerly Lund University, Sweden)
The moral rights of the author have been asserted through RightsLink and in personal
communication.
This 2nd edition was first published in 2017. The first edition was published by Oxford University
Press in 2011.
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or
transmitted, in any form or by any means, without the prior permission in writing of the main
author, or as expressly permitted by law, or under terms agreed with the appropriate reprographics
rights organisation. Enquiries concerning reproduction outside the scope of the above should be sent
to the main author.
You may not circulate this book in any other binding or cover and you must impose this same
condition on any acquirer.
ISBN-13: 978-1979484893
ISBN-10: 1979484899
almost any study, while other measures are tightly linked to a specific practice. Antisaccade latencies
do not exist outside of a particular paradigm, for instance, and neither do diversion durations and
regression scanpaths. These measures require a task and a stimulus of a specific kind to make any
sense. Paradigms are often well-known by the researchers in a field where they are used, as they
form the core of the experimental trial, but remain unknown outside of the field they originated from.
Knowledge of paradigms simplifies and strengthens experimental designs, adding value to research.
Users of eye-trackers often lack proficient training because there is little or no teaching com-
munity to rely on. As a result people are often self-taught, or depend on second-hand knowledge
which may be out of date or even incorrect. When they participate in our eye tracking methodology
courses, we find that many new users are very focused on their research questions, but are surprised
how much time they need to invest in order to master eye tracking properly. Often people attend-
ing have just purchased an eye-tracker to compliment their research, or for use in their company to
tackle ergonomic and marketing related questions. Our aim for this book is to make learning to use
eye-trackers a much easier process for these readers. If you have a solid background in experimental
psychology, computer science, or mathematics you will often find it straightforward to embrace the
technologies and workflow surrounding eye tracking. Whatever your background, you should be able
to achieve the same level of knowledge and understanding from this book as you would from training
on eye tracking in-house in a fully competent laboratory.
More specifically, this book has been written to be a support when:
1. Evaluating or acquiring a commercial eye-tracker,
2. Planning an experiment where eye tracking is used as a tool,
3. About to record eye movement data,
4. Planning how to process and interpret the recorded data, before carrying out statistical tests on
it,
5. Reading or reviewing eye movement research.
In our e↵orts to classify eye tracking methods, measures, and paradigms, combined with useful
practical hints and tips, we hope to provide the reader with a thoroughly updated second edition of
our comprehensive textbook on methodology. While helpful as an introduction for new users of eye
tracking, this book also caters to the advanced researcher.
4. 23 engineering students and 21 students of the humanities were asked to solve 43 mathematical
problems of three di↵erent kinds, while their eye movements were monitored using the SMI
HiSpeed 1250 Hz system. Data are described in Holmqvist et al. (2011).
5. 24 ninth-graders browsed the web for 15 minutes using the SMI RED 4, 60 Hz system, with a
set of 20 starting links to choose from, but with an instruction to browse freely. Data described
in Sandberg, Gidlöf, and Holmberg (2011).
Ideally, we would have liked to have collected our examples from all 20+ manufacturers of eye-
trackers, but this has proven to be practically impossible. The vast majority of examples in this book
have been collected with LC Technology, DPI, EyeTribe, SMI, Tobii, and SR Research systems in
our lab, or in the labs of colleagues. We use the examples we have, because we think they are very
general to many eye-trackers, and not specific to a single system or manufacturer. Software used and
described are versions used during 2006–2017, the period during which we wrote the first and second
editions of the book, and are included to describe principles. We refer to manufacturer manuals for
current versions.
By definition, a book on eye tracking methodology will have to contain many examples of what
works well and what can go wrong in eye tracking research. Thus, during a few years, we have
collected many examples of successes and mishaps in our own laboratory and the labs of colleagues,
that we have used in the book as warnings and eye-openers. Our examples are not considered as an
endorsement or critique of a particular eye-tracker, but the illustrated property should be critically
evaluated against any eye-tracker you consider using.
Acknowledgements
A great thanks to everyone who has helped us writing this book. Firstly, Halszka Jarodzka has been
a very appreciated co-author both in the first and during the preparations of the second edition. Ellen
Kok updated the Chapters 13, 14, 15, and 16 for the second edition of this book. They have both done
an excellent job throughout and fully deserve co-authorship. We are also very thankful to Richard
Dewhurst for starting the paradigm chapter, and for his valuable work on the first edition, and wish
him the best in his new position and with his new family.
The eye tracking seminar in Lund has debated previous manuscripts of both editions many times
over. In particular, the authors would like to thank Diederick Niehorster, Raimondas Zemblys, Ig-
nace Hooge, Alexander Strukelj, Kerstin Gidlöf, Paulina Lindström, Nils Holmberg, Roger Johans-
son, Roy Hessels, Jana Holsanova, Janna Spanne, Lenisa Brandão, Linnea Larsson, Johan Pihel, and
Philip Pärnamets and the many seminar guests.
Well over five hundred students have read and given feedback on earlier drafts while taking the
7.5 ECTS eye tracking courses given in Lund since 2000, and at the LETA crash workshops in eye
tracking methods that we have organized since 2008.
We would also like to thank the many colleagues who have contributed by reviewing draft chap-
ters for this edition: Koos van Geel for excellent feedback on the introduction, William Schmidt and
Sam Hutton for critical opinions on Chapter 2 and many other parts of the book. Ignace Hooge gave
us invaluable help with the experimental design and paradigms chapters. Je↵ Pelz for his in-depth
review of Chapter 4 and other chapters, and Dan Witzner Hansen, Warren Ward, Jan Ober, Anders
Kingbäck, Carlos Morimoto, Meike Mischko, and Diederick Niehorster for very specific reviews of
this and other chapters. Tom Foulsham and Sam Hutton for their reviews of Chapter 5 and other parts
of the book. We are also very grateful to Pieter Blignaut, Daniel Jacobus Wium and Jan Drewes for
their very thorough reviews of Chapter 6, Oleg Komogortsev for a fresh and initiated view on Chapter
7, Jacob Lund Orquin for carefully checking Chapter 8, and Parag Mital and Andrew Duchowsky for
suggesting excellent improvements in Chapter 9. Olivier Le Meur for the many good suggestions for
Chapter 10. Walter Bishof, Sandra Starke and Nicola Anderson have kindly reviewed parts of the
measure chapters. Many experts and nonexperts have provided invaluable comments and corrections
to our attempt at paradigm overviews: Chrystalina Antoniades, Sara Farshchi, Filip Dechterenko,
Elena Eriksson, Andreas Falck, Sam Hutton, Roger Johansson, Amir Kheradmand, Alan Kingstone,
Jorge Otero-Milan, Jan Theeuwes, Jeremy Wolfe, Ronald Rensink, Michele Rucci, William Schmidt,
and Neal Snape.
Thousands of participants in ours and other studies have contributed by providing data and eye-
images that we have selected. Although anonymous, their participation has provided us with the
examples in many of the chapters, and we extend our thanks to them also.
Thanks to Nils Holmberg for setting up and managing our own Subversion server while we wrote
both editions. Not having to bother with who works with what version made writing this book so very
much easier.
Michael Cutter has done a great job proofreading an earlier version of this book. Our best thanks
for countless improvements on the flow of the text, usage of correct British grammar, spelling and
hyphenation.
This book has been written in the same spirit as the first edition. Our decision to change publisher
has come about for reasons unrelated to our collaboration with OUP. Publication was simplified by
OUPs generous decision to let us reuse the style of content and cover.
Finally, without electronic access to journal papers, this book would have been much more dif-
ficult to write. The authors want to thank all the libraries and scholars who have given (or sold our
universities) access to thousands of peer-reviewed journal papers on eye-tracking-based research.
Contents
1 Introduction 3
1.1 Success stories in eye tracking 4
1.2 Your first few eye tracking studies—step-by-step 5
4 Eye-tracker Hardware 64
4.1 A brief history of eye-tracking technologies 64
4.2 Sampling of raw data 83
4.3 Feature detection 86
4.4 Calibration 88
4.5 Sampling frequency: what speed do you need? 90
4.6 Types of eye-trackers and the properties of their set-up 95
4.7 Manufacturers and customers 108
4.8 How to set up an eye-tracking laboratory 115
4.9 Summary 119
12 Paradigms 389
12.1 The extended fixation paradigm 390
12.2 Prosaccade paradigms 391
12.3 Perisaccadic perception paradigms 395
12.4 Multiple target paradigms 396
12.5 Memory-guided saccade paradigms 398
12.6 The multiple object tracking paradigm (MOT) 399
12.7 Electrical stimulation paradigms 400
12.8 The antisaccade paradigm 400
12.9 The oculomotor capture paradigm 403
12.10 Vestibulo-ocular reflex (VOR) paradigms 403
12.11 Optokinetic nystagmus (OKN) paradigms 405
12.12 Vergence paradigms 406
12.13 Smooth pursuit paradigms 407
12.14 Visual search paradigm(s) 409
12.15 The preferential-looking paradigm 413
12.16 Spatial cueing paradigms 414
12.17 Social interaction paradigms 416
12.18 Visual world paradigm 417
12.19 Paradigms of looking at nothing 419
xii | CONTENTS
17 What are Eye-Movement Measures and How can they be Harnessed? 609
17.1 Eye movement measures: plentiful but poorly accessible 609
17.2 Measure concepts and operationalizing them 611
17.3 Proposed model of eye-tracking measures 613
17.4 Measures and paradigms 617
17.5 Classification of eye movement measures 619
17.6 How to construct even more measures 621
| xiii
References 627
Index 713
xiv | ABOUT THE AUTHORS
Part I introduces the three central areas of competence required for running an eye tracking study: the
hardware, the experimental design, and the actual recording of the data from participants. Mastering
these skills is the key to recording high-quality eye movement data. Each of the three benefit from
understanding the biology of the eye, the neurology behind visual perception, fundamental vision
science, and the psychological theories and models of attention and eye movements.
1 Introduction
Eye tracking as a research tool is more accessible than ever, and is growing in popularity amongst
researchers from a whole host of di↵erent disciplines. Usability analysts, sports scientists, cogni-
tive psychologists, reading researchers, psycholinguists, neurophysiologists, electrical engineers, and
many others all have a vested interest in eye tracking for di↵erent reasons. There is no doubt that it is
useful to record eye movements, and that it advances science and leads to technological innovations.
At the same time, the growth of eye tracking in recent years presents a variety of challenges, the
most pressing of which is how to support the rapidly increasing number of people using eye-trackers.
To address this concern this book follows the process of empirical investigation using eye tracking
from beginning to end, providing detailed advice and discussion of the issues that can be encountered
en route.
Whether a researcher or not, most beginning users of eye tracking equipment today buy both
hardware and analysis software from manufacturers. This includes all of the technical properties of
the eye-tracker, as well as algorithms for recording, filtering, and analysing data, and the provided
default settings. The user initially trusts the system, because they are completely dependent on it.
However, as their understanding grows they find an increasing number of oddities. For instance, they
get an average first fixation duration that di↵ers too much from that reported in the classic literature,
and they are very uncertain why: Is it the hardware with its sampling frequency and resolution, or
possibly a bug in the software? It could perhaps also be the settings, the algorithm, or the study itself
that made the di↵erence. Furthermore, there are strangely short fixations of only 1-2 ms in the data.
What does that mean? It is never reported in the literature; not once. The manual says that they should
be removed, but could they have caused the shorter first fixation durations? Calling the manufacturers
of the system does not really help: They explain the architecture of the system and the algorithms,
but they only implemented an algorithm from a journal paper, and do not really know how good it is
or what precise settings to use for the data in the study.
There are in fact a whole range of real issues in eye tracking methodology that have never been
written down and published. For example, how do you best calculate fixation durations from gaze-
overlaid videos? What angles from the camera to the eye allows you to record data for the entire mon-
itor? Are participants with contact lenses a problem? Can you fix poor data quality post-recording?
What are the sources of latencies in eye tracking data? The tricks of the trade are learnt only from the
experience of recording lots of data from many people in a variety of di↵erent set-ups, and looking
at the data that comes out of the system. The manufacturer manuals do not describe the experiences
gathered by eye tracking researchers, because manufacturers are seldom users of their own systems.
The established eye movement researchers themselves, irrespective of research field, are so focused
on publishing their theoretically important results that they often forget or omit any description of
hardware, settings, or their procedure that the rest of us could learn from. Many journal editors are
also inclined to not want to publish that methodological information.
Occasionally there is a methodological point that is made clear in a result-oriented paper, but
methods papers that are specifically written for eye tracking researchers are only rarely published
(journals such as Behaviour Research Methods provide excellent exceptions). Even when such meth-
ods papers appear they reach very few readers, because the users of eye-trackers are so fragmented
into their own research fields, traditions, terminology, and methods that they are not likely to read a
methods paper published in one of the other traditions. We wrote this book to compile as much of the
in-house knowledge from the best eye tracking groups into one single book, available for all.
4 | INTRODUCTION
Clinical neuropsychology
The use of eye tracking to study schizophrenia started only a decade after the first eye-trackers were
constructed (Diefendorf & Dodge, 1908). Today we know that eye movements—specifically smooth
pursuit gain (p. 604) and antisaccades (p. 400)—allow clinicians to diagnose the illness in passive
phases, as well as in relatives who carry the genes but who are not directly a↵ected. The antisaccade
task has become a reliable tool for studying prefrontal executive control, including its development
and deviations. O’Driscoll and Callahan (2008) stated that ‘Average e↵ect sizes and confidence limits
for global measures of pursuit and for maintenance of gain place these measures alongside the very
strongest neurocognitive measures in the literature.’ As if this were not enough, inspection of eye
movements is the standard method for diagnosing issues with the balance system in any hospital you
visit.
Reading
Reading studies were first conducted very early in eye tracking research. Erdmann and Dodge (1898)
conducted a systematic enquiry into reading, showing that we do not look at every letter in the words
we read, and investigated saccade amplitudes and fixation durations during typical reading. Since
then, thousands of eye movement studies of reading have been conducted and published. In the 1970s,
it was found that readers only need to see text in an area around the point of fixation, referred to as
‘the visual span’. When you change words outside of the visual span to gibberish, people are still able
to read unhindered. Eye tracking research into reading continues to thrive partly because text can be
varied in so many ways, not least into di↵erent languages with di↵erent writing systems, and also
because text is used in so many situations. Since the 1990s, computational models have been able
to successfully simulate eye movements during reading, using word properties as the input (Reichle,
Rayner, & Pollatsek, 2004).
identify points in time when your participant should be engaged in the particular behaviour in which
you are interested (e.g. search behaviour), and then extract the selected measures during just these
points. This is more powerful than extracting all instances of this measure during the whole trial,
where the particular behaviour of interest is mixed with many other forms of gaze behaviour, which
essentially just contribute noise to your data. Check that:
⇤ I have a concrete hypothesis or prediction.
⇤ I have drawn the eye movements I expect from my theory, previous research, and other predic-
tions onto the stimuli images that I plan to show.
⇤ I have then carefully studied the lines and blobs in my drawings, and figured out what kind of
measures could demonstrate a contradiction vs. a validation of my hypothesis.
⇤ I have imagined that I recorded data from 20 participants, and that I have all the records of their
movements. With this, I made an imaginary chart that illustrates the pattern of results I expect
in my measures.
⇤ I searched for and identified other functionally equivalent measures for my research question.
Are you interested in mental workload for example? Then find out what other measures are
used to investigate this, for instance using the index of this book. Perhaps some of the alterna-
tive measures are better and completely missed by you and others using your paradigm.
⇤ I have formulated the consent form, the participant information letter, and the data management
plan.
⇤ I have applied for and been granted approval from the ethics committee.
⇤ I have decided what kind of statistics I will use to analyse my data.
Chapter 3 describes experimental design in detail, with a particular emphasis on eye-tracking
research.
Processing of data
When all of your data are recorded and backed-up, your day-to-day work will change to the pro-
cessing of your data. This can appear very simple, as manufacturer supplied software provides many
functions for the advanced export of values for measures calculated from your data. However, a lot
of things happen under the hood, and decisions you make about your data depend on them. Many
researchers prefer to program their entire analysis themselves, to control it as they want to, and to
know what happens. The data processing step is one where it is easy to get lost. Getting through this
stage is easier if you decided during your experimental design which measures and what statistics to
use, and even easier if you already piloted it. Part II describes the most common types of processing.
⇤ I have plotted my fixations next to my raw data, as in Figure 7.31 on p. 233, and selected a
setting so that all of the fixations I see in the raw data appear in the fixation plot.
⇤ I have added a note to my manuscript specifying the algorithm and the detection parameters
that I have used.
If you cannot get the algorithm to produce the fixations you see in the raw data, you could examine
the distribution of events in your measure at di↵erent settings (look at the histograms), before you
decide which setting to use. Make parallel analyses with several settings, and see how this a↵ects
your results (see Green, 2006, who does this).
Beware of smooth pursuit and movements which appear to be smooth pursuit in your data file.
This is likely to be part of your data if you use dynamic stimuli or a head-mounted eye-tracker. Cur-
rent manufacturer algorithms have been designed to analyse only data recorded from static stimuli;
if you use animation, look for algorithms designed for such data. Specific algorithms will also be
needed if you have recorded very noisy data and are yet to use it, or if you want to de-saccade your
data—the practice of removing saccades from smooth pursuit data and interpolating—in order to
calculate gain and phase.
Beware that some implementations divide events (such as a fixation) that cross a trial boundary
into two parts. This may lead to artificially low first fixation durations for trials; one option is to
exclude such partial events from the analysis. Clearly define the events that you use in your article.
For example, do ‘fixations’ refer to implicitly detected intersaccadic intervals or explicitly detected
oculomotor periods of stillness? If saccade endings or fixation onsets are important, what did you do
with post-saccadic oscillations?
Manually coding dwells and transitions from gaze-overlaid videos is not overly difficult for lim-
ited amounts of data, but coding hours of recordings with data from head-mounted eye-trackers is
very time-consuming.
Using scanpaths
Scanpath visualizations are excellent for first inspections of data, answering questions such as: is
the data quality good, did the fixation detection algorithm do a good job, and is this recording in
line with my hypothesis? Do not put scanpath visualizations in your papers just as decoration. Ask
yourself why you have put it there, and see to it that the scanpath visualization aligns well with your
YOUR FIRST STUDIES | 11
hypothesis, operationalizations, and results. There are a whole number of scanpath events ready to be
used in statistical analyses, such as returns, regressions, look-aheads and sweeps, and many more that
could be defined. In order to attribute meaningful interpretations to individual scanpaths, you need to
disambiguate the data using a tight experimental design, and verbal data or other complementary data
recordings. All of the scanpath representations used in particular measures reduce the level of detail
in the scanpaths, in terms of both spatial and temporal accuracy. Other properties such as fixation
duration are sometimes ignored completely.
Be sure to use a scanpath representation that retains the properties that you want to measure. If
you are using measures that utilize scanpath representations, be aware that raw data quality, event
detection algorithms and their settings, and all issues around AOI identification may all introduce
noise into the values you get from the measure. Always see to it that you have a baseline similarity to
compare against your measured similarity. Scanpath events and representations are at the top of the
hierarchy, and results may depend upon choices that you made in earlier steps of the analysis.
In Chapter 10, we present in detail the scanpath concept, the many usages of scanpaths, common
scanpath events, and the methods for comparing scanpaths.
Statistical analysis
In a study with a simple experimental design, statistical analysis can be done in almost no time.
In more complex studies, statistical analysis often requires a series of explorative investigations of
statistical distributions of your measures (and the intercorrelations between them) before you can
actually calculate e↵ect sizes and p-values. Participants may have to be removed, and this can skew
your balanced experimental design so much that you may decide to go back and recruit some more
participants.
When the statistical analysis is done, you just need to write up and publish. If you have prepared
texts from each of the steps above, a lot of the writing will be done already. In many cases, however,
writing up involves as many complicated considerations and as much work as conducting the actual
study.