Professional Documents
Culture Documents
Statistics For Business: Analysis of Variance
Statistics For Business: Analysis of Variance
Analysis of Variance
Deepak Mathivathanan
Learning Objectives
One-Way Two-Way
ANOVA ANOVA
F-test Interaction
Effects
Tukey-
Kramer
test
General ANOVA Setting
Assumptions
Populations are normally distributed
Populations have equal variances
Samples are randomly and independently drawn
Hypotheses: One-Way ANOVA
All population means are equal
i.e., no treatment effect (no variation in means among groups)
At least one population mean is different
i.e., there is a treatment (groups) effect
Does not mean that all population means are different (at
least one of the means is different from the others)
Hypotheses: One-Way ANOVA
or
Partitioning the Variation
Where:
SST = Total sum of squares
c = number of groups
nj = number of values in group j
Xij = ith value from group j
X = grand mean (mean of all data values)
The Total Sum of Squares
Among-Group Variation
SST = SSA + SSW
Where:
SSA = Sum of squares among groups
c = number of groups
nj = sample size from group j
Xj = sample mean from group j
X = grand mean (mean of all data values)
Among-Group Variation
µ1 µ2 µc
Within-Group Variation
SST = SSA + SSW
Where:
SSW = Sum of squares within groups
c = number of groups
nj = sample size from group j
Xj = sample mean from group j
Xij = ith value in group j
Within-Group Variation
Obtaining the Mean Squares
c = number of groups
n = sum of the sample sizes from all groups
df = degrees of freedom
One-Way ANOVA
Test Statistic
H0: μ1= μ2 = … = μc
H1: At least two population means are different
Test statistic
Decision Rule:
Reject H0 if F > FU, = .05
otherwise do not reject H0
0 Do not Reject H0
reject H0
FU
One-Way ANOVA
Example
Critical
Value:
FU = 3.89
= .05
0 Do not Reject H0
reject H0 F = 25.275
FU = 3.89
One-Way ANOVA
Example
H0: μ1 = μ2 = μ3
H1: μj not all equal
= .05
df1= 2 df2 = 12
Within
1119.6 12 93.3
Groups
Total 5836.0 14
The Tukey-Kramer Procedure
Tells which population means are significantly different
e.g.: μ1 = μ2 ≠ μ3
Done after rejection of equal means in ANOVA
Allows pair-wise comparisons
Compare absolute mean differences with critical
range
μ1= μ2 μ3 x
Tukey-Kramer Critical Range
where:
QU = Value from Studentized Range Distribution with c
and n - c degrees of freedom for the desired level of (see
appendix E.9 table)
MSW = Mean Square Within
nj and nj’ = Sample sizes from groups j and j’
The Tukey-Kramer Procedure
1. Compute absolute mean
Club 1 Club 2 Club 3 differences:
254 234 200
263 218 222
241 235 197
237 227 206
251 216 204
2. Find the QU value from the table in appendix E.9 with
c = 3 and (n – c) = (15 – 3) = 12 degrees of freedom for
the desired level of ( = .05 used here):
The Tukey-Kramer Procedure
3. Compute Critical Range:
4. Compare:
Assumptions
Total Variation:
Factor A Variation:
Factor B Variation:
Two-Way ANOVA
Equations
Interaction Variation:
MSA MSA/
Factor A r–1 SSA
= SSA /(r – 1) MSE
MSB MSB/
Factor B c–1 SSB
= SSB /(c – 1) MSE
AB MSAB MSAB/
(r – 1)(c – 1) SSAB
(Interaction) = SSAB / (r – 1)(c – 1) MSE
MSE
Error rc(n’ – 1) SSE
= SSE/rc(n’ – 1)
Total n–1 SST
Two-Way ANOVA:
Features
Degrees of freedom always add up
n-1 = rc(n/-1) + (r-1) + (c-1) + (r-1)(c-1)
Total = error + factor A + factor B + interaction
The denominator of the F Test is always the same
but the numerator is different
The sums of squares always add up
SST = SSE + SSA + SSB + SSAB
Total = error + factor A + factor B + interaction
Two-Way ANOVA:
Interaction
No Significant Interaction: Interaction is present:
Factor B Level 1
Mean Response
Mean Response
Factor B Level 1
Factor B Level 3
Factor B Level 2
Factor B Level 2
Factor B Level 3
• There are two types of chi-square tests. Both use the chi-square statistic and
distribution for different purposes:
A chi-square goodness of fit test determines if sample data matches a population.
A chi-square test for independence compares two variables in a contingency table to see if
they are related. In a more general sense, it tests to see whether distributions of categorical
variables differ from each another
• A very small chi square test statistic means that your observed data fits your
expected data extremely well. In other words, there is a relationship.
• A very large chi square test statistic means that the data does not fit very well. In
other words, there isn’t a relationship.
Chi Square P-Values
A chi square test will give you a p-value. The p-value will tell you if your test results
are significant or not. In order to perform a chi square test and get the p-value, you
need two pieces of information:
https://www.youtube.com/watch?v=epTeSAmCCM8 https://www.youtube.com/watch?v=vsrj647Tb3g
Thank You