Professional Documents
Culture Documents
Analysis of Variance
Analysis of Variance
Analysis of Variance
Analysis of variance (ANOVA) - a test of means for two or more populations. The null hypothesis is- all means are equal. ANOVA must have a dependent variable that is metric (measured using an interval or ratio scale). One or more independent variables which should be categorical (non-metric). Categorical independent variables are also called factors.
Questions ???? Do the various segments differ in terms of their volume of product consumption? Do the brand evaluations of groups exposed to different commercials vary? What is the effect of consumers' familiarity with the store (measured as high, medium, and low) on preference for the store? Null Hypothesis when comparing three groups H0: m1 = m2 = m3
F statistic. The null hypothesis that the category means are equal in the population is tested by an F statistic based on the ratio of mean square related to X and mean square related to error. Mean square. This is the sum of squares divided by the appropriate degrees of freedom. eta2 (2). The strength of the effects of X (independent variable or factor) on Y (dependent variable) is measured by eta2 ( 2). The value of 2 varies between 0 and 1.
F-Ratio
Sum of Squares
The total sum of squares or Sst is computed by squaring the deviation of each score from grand mean and summing these squares
SStotal = ( X ij X ) 2
i = 1 j =1
X ij = individual scores, i.e., the ith observation or pi test unit in the jth group X pi = grand mean n = number of all observations or test units in a group c = number of jth groups (or columns)
Sswithin the observed variability within each group is computed by squaring the deviation of each score from group mean and summing these scores
SS within = ( X ij X j ) 2
i = 1 j =1
p X iji = individual scores = group mean for jth group n = number of all observations or test units in a group c = number of jth groups (or columns)
Ssbetween, the variability of the group mean about the grand mean is calculated by squaring the deviation of group mean from grand mean, multiplying by the number of items, and summing these scores
SS between =
X
nj(X
j =1
X )2
Category Mean
X1 Y1 Y2 : : Yn Y1
X2 Y1 Y2 Yn Y2
Categories X3 Xc Y1 Y1 Y2 Y2
Yn Y3 Yn Yc
MSwithin
SS within = N k
SS between = n j ( X j X ) 2
j =1
(X
i = 1 j =1
ij
X j )2
Step 6: Calculate the number of degrees of freedom in the numerator (k-1) and in the denominator (N-k) Step 7: Calculate mean square of between group (Msbetween) and mean square of within group (Mswithin)
SS between MSbetween = k 1
MSwithin
SS within = N k
CALCULATION F RATIO
Df = k-1 (3-1=2) numerator Df= N-k (30-3=27) denominator SStotal = SSbetween + SSwithin 185.867 = 106.067 +79.80 F = Ssbetween/ (k-1) 106.067/2 (Msbetween ) Sswithin/ (N-k) 79.80/27 (Mswithin) F = Msbetween Mswithin 2.956 = 17.94 with df 2, 27 the critical value of F is 3.35 for = .05 . 53.033
ETA2
(2).
eta2 (2). The strength of the effects of X (independent variable or factor) on Y (dependent variable) is measured by eta2 ( 2). The value of 2 varies between 0 and 1. The strength of the effects of X on Y are measured as follows: 2 = Ssbetween/Sstotal = 106.067/185.867 = 0.571 57.1% of the variation in sales (Y) is accounted for by in-store promotion (X), indicating a modest effect.
SPSS EXAMPLE
Demo sav. The variable Level of education [ed] divides employees into five independent groups by level of education. Use the OneWay ANOVA procedure to test whether Household income in thousands [income] means for the five groups are significantly different.
2-WAY ANOVA
Do educational levels (graduate, post graduate, PhD) and age (less than 35, 35-55, more than 55) affect consumption of a brand? What is the effect of consumers' familiarity with a department store (high, medium, and low) and store image (positive, neutral, and negative) on preference for the store? How do advertising levels (high, medium, and low) interact with price levels (high, medium, and low) to influence a brand's sale?
ANOVA
Consider the simple case of two variables X1 and X2 having categories k1 and k2. The total variation in this case is partitioned as follows: SStotal = SS due to X1 + SS due to X2 + SS due to interaction of X1 and X2 + SSwithin or
SS
t
= SS
x1
+ SS
x2
+ SS
x 1x 2
+ SS
erro r
F =
S S err o r / d f d
S S x /d f n
2
MSx 1 MSerror
MSx 2 MSerror
SSx1x2 = SSt-(SSx1+SSx2+SSerror)
ETA SQUARE
The strength of the joint effect of two factors, called the overall effect, or multiple 2, is measured as follows: multiple 2 = SSx1+SSx2+SSx1x2/ SSt
ANOVA
Source X1 X2 X1X2 Error Total SS SSx1 SSx2 df k1- 1 k2 -1 MS SSx1/ k1- 1 SSx2/ k2 -1 SSx1x2/(k1-1) (k2-1) SSwithin/ N-k1k2 F MSX1/MSerror MSX2/MSerror MSx1X2/ MSerror P<.05 Eta SSx1/SSt SSx2/SSt SSx1+ SSx2+ SSx1x2/SSt
SPSS EXAMPLE
Examine the effect of in-store-promotion and couponing on store sales. In-store-promotion (high, medium, low) Couponing (distributed, not distributed) 3X2 factorial design grocery_1month.sav. This hypothetical data file is the grocery_coupons.sav data file with the weekly purchases "rolled-up" so that each case corresponds to a separate customer. Some of the variables that changed weekly disappear as a result, and the amount spent recorded is now the sum of the amounts spent during the four weeks of the study.
SS
106.067 53.33 162.67 23.00 185.87
df
2 1 2 24 29
MS
53.03 53.33 1.63 0.97 6.41
F
54.86** 55.172** 1.690
P<.05
.00 .00 .206
2
.821 697 .123
Interpretation: the main effects of in-store promotion (F 2, 24 = 54.86; p<.05) and couponing (F 1, 24 = 55.17; p<.05) are found to be significant. These results indicates that the three levels of in-store promotion are significantly different for sale. The interaction between in-store promotion and couponing is not significant (F 2, 24 = 1.69; p<.12).
MULTIVARIATE ANOVA
Example: In an experiment designed to measure the effect of gender and frequency of travel on preference and satisfaction for the foreign travel a 2x3 between subject factorial was used. Five respondents were assigned to each cell for a total sample size of 30. Preference and satisfaction level was measure on a 9-point scale. Conduct a 2x3 ANOVA.