Professional Documents
Culture Documents
Previewpdf
Previewpdf
Analysis of Variance
Typeset in Times by
J&L Composition Ltd, Filey, North Yorkshire
~ To my parents
0 ~
Riccardo Russo
c: VI c:
(J) VI (J)
VI
Max Roberts
~
I.. :::::l I..
I..
(J)
(Ij
0... c:::: (Ij
0...
..0 >.. 0 >..
0 ""0
c:::: E I..
(Ij
E
u
(Ij ~ ~
>< u
I: i:2
This page intentionally left blank
Contents
List of figures X
1 Introduction 1
Notes 258
References 260
Index 261
Figures
refresher for graduate students, and cover useful ground for any
researcher in related disciplines who is comparing groups in order
to find out whether they differ.
The book is structured so that readers are introduced to topics
fairly gently, and we have kept advanced material to a minimum.
However, we do assume that all readers have already successfully
completed an introductory statistics or research methods course.
We have tried to keep the notation under control, and have made
full use of figures where appropriate. Before introducing Analysis
of Variance, we begin with two revision chapters. Their aim is to
identify the key topics in previous teaching and outline their
importance. In our experience, this is useful to students for at least
two reasons. The long vacation gives ample opportunity to forget
material, and introductory courses quite often shower students
with a very diverse range of topics, not all of which are relevant
to Analysis of Variance (ANOVA). In subsequent chapters, we
cover single- and multi-factor ANOVA, and have extensive cover-
age on the meaning of interaction and how to make sense of
factorial designs. Although this book contains many formulae, it
is neither a cookbook nor a workbook of exercises. However, we
do believe that performing ANOVA on small data sets with small
numbers assists the learning of the underlying principles. Small is
the key word here, and we also believe that there is little to be
gained in analysing real data sets by hand. In addition, all of our
examples have been devised with our likely readership in mind
wherever possible.
Given that we expect most students to perform ANOVA by
computer, the reader may be surprised that we have not linked
this text to any of the currently available computer packages. There
are several reasons for this, not least that it prevents the book from
quickly going out of date. In addition, linking the book to a
particular package will deter instructors who do not wish to use
that package for their teaching purposes, immediately making
many pages at best irrelevant and at worst confusing. We are
also concerned that a statistics course that is too heavily based
on computer exercises can substitute a knowledge of procedures
for the understanding of fundamental principles, so that the out-
come is highly proficient computer operators rather than compe-
tent statisticians. For example, the psychologist who understands
why unbalanced factorial design should be analysed by using the
unweighted means approach is far more likely to bother to check
Preface xix
We would like to thank the people who have read various versions
of this manuscript and made suggestions and corrections. These
include David Clark-Carter, Margot Roberts, Linda Murdoch,
Elizabeth Newton and Christopher Douce. Many improvements
were made as a result of their feedback. In addition, staff and
students at various universities have brought to our attention
numerous corrections and suggestions for the course handouts
upon which this book is based. We are very grateful for all the
help that we have received from the named sources and others,
although, of course, any remaining errors are the authors' alone.
The Studentized Range Statistic tables are based upon those
devised by L.H. Harter and are reproduced with the permission
of the Institute of Mathematical Statistics.
Chapter I
Introduction
Experiment
Variables
A variable has the property that it can take different values. In
other words, it varies. All psychologists are looking for relation-
ships between variables. In experiments there are two types of
variable. The independent variable is manipulated by the experi-
menter. This is also known as the treatment variable. Independent
variables have at least two different levels; these are the experi-
mental conditions. The dependent variable is measured by the
experimenter in order to investigate the effects of manipulating
the independent variable, hence it depends upon the independent
variable for its value. Good dependent variables are readily obser-
Introduction 3
Experimental design
For a between-subjects design, also called an independent subjects
design, two or more separate, completely independent groups of
subjects are tested. Each group is assigned to a different experi-
mental condition. Each subject contributes one single score to the
final analysis. These designs have the advantage that you do not
need to worry about fatigue or practice effects due to subjects'
having to perform lots of different tasks. They have the disadvan-
tage that they are less powerful than the alternative, so that many
more subjects are required (see below). It is also possible that, due
to bad luck, groups may be assembled that differ even before you
test them. In order to avoid this problem, plenty of subjects need to
be run, and they must be allocated to groups randomly or as
unsystematically as possible, so that it is unlikely that groups are
being created that differ.
For a within-subjects design, sometimes called a repeated meas-
ures design, one single group of subjects is tested once with each of
the experimental conditions, and there must be at least two of
these. Hence, each subject contributes at least two scores to the
final analysis. Alternatively, similar subjects such as identical twins
can be given one task each, but are then analysed as matched pairs
using a within-subjects design. Within-subjects designs have the
advantage that they are more powerful than between-subjects
4 Introduction
morning and for another in the afternoon. Unless you are careless,
these errors should not be a problem.
In addition,jloor effects and ceiling effects can bias the results of
an experiment. A floor effect occurs when most subjects are per-
forming so badly that their performance cannot get any worse. A
ceiling effect occurs when performance is so good that it cannot get
any better.
Statistical errors
Statistical testing is based upon confidence and not certainty.
You could be extremely confident that the means of two groups
differ, or you could lack confidence that the means of two
groups differ. Without certainty, however, there is always the poss-
ibility that there might be a statistical error. A Type I Error occurs
when the null hypothesis is rejected by mistake. You conclude that
there is a difference between the means of two groups when in fact
the null hypothesis is true for the population from which the
sample was taken. In other words, you have concluded that the
independent variable influences performance, when really it is not
related to performance in the general population. A Type II Error
occurs when there is a failure to reject the null hypothesis by
mistake. You conclude that there is no evidence for a difference
between the means of the two groups when, in fact, the null
hypothesis is false for the population from which the sample was
taken. In other words, you have concluded that there is no evi-
dence that the independent variable influences performance, when
really it does influence performance in the general population.
Significance levels
It is possible to calculate the probability that the difference
between a pair of means arose due to chance. If the probability
is unlikely enough, the null hypothesis is rejected and therefore the
independent variable must have been responsible for the difference.
Psychologists use an arbitrary cut-off point in order to decide
whether to reject or fail to reject the null hypothesis. They thus
decide upon a significance level. If the probability that the results
arose due to chance is 0.05 (or 1120, or 5%) or less, then you can
say that p < 0.05 and the null hypothesis can be rejected. Thus, the
Introduction 7
Table of Contents
Cohen, J. (1988) Statistical Power Analysis for the Behavioral Sciences (2nd
edn.), Hillsdale: Erlbaum.
Gaito, J. (1965) 'Unequal intervals and unequal n in trend analyses',
Psychological Bulletin 63: 125-127.
Greenhouse, S. Wand Geisser, S. (1991) 'On methods in the analysis of
profile data', Psychometrika 24: 95-112.
Howell, D. C. (1997) Statistical Methods for Psychology (4th edn.),
Belmont, CA: Duxbury Press.
Keppel, G., Saufley, W H., Jr. and Tokunaga, H. (1992) Introduction to
Design & Analysis: A Student's Handbook (2nd edn.), New York:
Freeman.
Kirk, R. E. (1982) Experimental Design: Procedures for the Behavioral
Science (2nd edn.), Pacific Grove, CA: Brooks/Cole.
Maxwell, S. E. and Delaney, H. D. (1990) Designing Experiments and
Analyzing Data, Pacific Grove, CA: Brooks/Cole.
Neave, H. R. and Worthington, P. L. (1988) Distribution-Free Tests,
London: Unwin Hyman.
Rosenthal, R. and Rosnow, R. L. (1991) Essential of Behavioral Research:
Methods and Data Analysis (2nd edn.), New York: McGraw-Hill.
Siegel, S. and Castellan, N. J., Jr. (1988) Nonparametric Statistics for the
Behavioral Sciences (2nd edn.), New York: McGraw-Hill.
Wickens, T. D. (1995) The Geometry of Multivariate Statistics, Hillsdale:
Erlbaum.