Professional Documents
Culture Documents
MA Notes
MA Notes
Hypothesis Non-Hypothesis
Chi-Square EFA (fact Analysis)
T-test Cluster Analysis
1. One Sample
2. Two Sample
3. Paired Sample
Predictive/Forecasting (Regression) Discriminant Analysis
Regression
1. Simple Linear
2. Multiple
Multi-Dimensional
mapping
In factor analysis all the variable under the investigation is analysed together to
extract the underlined factors.
It is very useful method to reduce a large number of data (attributes) to the
few manageable factors. These factors explain most part of the variation of the
original set of data.
Example - A market researcher might have collected data on 50 attributes of
the product which may be difficult to analyse. Therefore, he plans to apply FA
(Factor Analysis) which reduces the 50 attributes into 5-6 manageable factors.
Interpretation
The value of KMO and Barlett’s test should be more than 0.5 in case to accept
factor analysis.
The total sum of the variance percentage should be more than 50%.
Component Matrixa
Component
1 2
X1 -.182 .756
X2 .507 .127
X3 .341 -.692
X4 .312 .126
X5 .765 -.198
X6 .562 .645
X7 .781 .076
Extraction Method: Principal
Component Analysis.
a. 2 components extracted.
Upon looking to the case processing Summary Table, we can see that, valid
cases= 100 which is also signifies the sample size, therefore we can say that N =
100, We can also see Excluded is = 0 which means 100% uploaded data is
calculated.
Upon looking to the reliability static table we can clearly see that the
Cronbach’s alpha is .401 which Means that the Calculated value of Cronbach’s
Alpha is below the Minimum desired value i.e. 0.5. We can also say that the
Data can be Dropped among the various items of the data. Further we can also
see “N of items” = 10 which means that the test has been performed on 10
only.
Investment behaviour of employees= 1/ risk awareness
Note:
Items/Attributes
Items or Attributes are basically the factors which plays an important role in
decision making of a consumer.
CASE – 2
A study was conducted to determine the factor responsible for measuring the
satisfaction level among consumer of aerated drink. A survey conducted with a
sample size of 100 consumers of soft drinks from different age and income
groups, the respondent were a mix of male and female, some of the question
asked in a survey is as followed.
X1 = Aerated soft drinks are refreshing
X2 = are bad for health 1 = Strongly disagree
X3 = are very convenient to serve 2 = Disagree
X4 = should be avoided with age 3 = Neither disagree nor
agree
X5 = are very tasty 4=
Agree
X6 = are not good for children 5 = Strongly agree
X7 = should be consumed occasionally
X8 = should not be taken in large quantity
X9 = are not as good as energy drinks
X10 = are better than fruit juices
Interpretation
Upon looking to the KMO& Bartlett’s Test table, we can see that the sample
adequacy value= 0.722 this indicates the significance of correlation matrix
which means that correlation coefficient is significant as indicated by P-value
(0.000): where alpha = 0.5 corresponding to the chi square statics. This may be
noted that KMO value is equal to or above 0.5 will be only considered.
Upon looking to the total variance table, we can clearly see that 10 items/
Attributes have been reduced into 3 factors. The value of %Variance mentioned
in the second last column= 28.572+16.231+13.172 = 57.975. The sum of all 3
factor’s percentage of Variance (%variance) must be equal or above 50.
Type of Segmentation:
1. Demographic segmentation
Age, Income, Education level, Qualification level, Gender
2. Geographic segmentation
It based upon area, topography, region
3. Behavioural segmentation/Benefit Sort
What are the different benefits that a customer is getting, added value,
Consumer actions, Decision making patterns, Market Data
4. Psychographic segmentation (Lifestyle)
When a product is sold on the basis of your lifestyle, passion, personality,
opinions
5. Firmographic
When the organization employee are segmentate on the basis of above
segments
Positioning
It is the ranking that you give to a brand’s product or to a product in your mind.
What is Differentiation?
1. Unique Attribute
2. Cutting Edge Technology
3. One of a kind Packaging
4. Price
5. Service
T-Test
Types of T-Test
1. One sample
2. Two sample – Two sample T-test are about the difference of opinion
between two different population. Ex – Male and female.
The test value for one sample T-test is fixed at 3. (Parameter to measure
satisfaction level)
Testing i.e. all types of T-test are hypothesis testing, which are used to
measure/manage satisfaction or dissatisfaction level.
One sample T-test
Case: Ban of Plastic Bag
Objectives:
1. To identify the parameters of plastic bags in which consumers have
favorable opinion (Dissatisfaction/ satisfaction level) or to understand
the satisfaction and the dissatisfaction level of the respondent w.r.t the
ban of plastic bags.
2. To examine whether the view of male and female respondent are same
or to examine the difference in the perception of male and female
respondent.
3. To Understand the difference in the perception of the plastics bags
before and after the ban.
4. To analyze the satisfaction level of the respondent w.r.t age.
Note:
Chi-square test is used for demographic aspect of the data i.e. age.
One-Sample Statistics
N Mean Std. Deviation Std. Error Mean
x12_a 44 2.80 1.374 .207
x12_b 44 1.41 .871 .131
x12_c 44 3.50 1.151 .174
x12_d 44 2.77 1.138 .172
x12_e 44 1.80 .930 .140
x12_f 44 3.16 1.328 .200
One-Sample Test
Test Value = 3
95% Confidence Interval of the
Difference
t Df Sig. (2-tailed) Mean Difference Lower Upper
x12_a -.988 43 .329 -.205 -.62 .21
x12_b -12.113 43 .000 -1.591 -1.86 -1.33
x12_c 2.881 43 .006 .500 .15 .85
x12_d -1.324 43 .192 -.227 -.57 .12
x12_e -8.595 43 .000 -1.205 -1.49 -.92
x12_f .794 43 .431 .159 -.24 .56
INTERPRETATION OF ONE SAMPLE TEST
Upon looking to the one sample statistics table, we can clearly see that the
N= 44 which signifies that all the respondents have been under consideration
and none of the respondents have been left out.
Upon looking One Sample Test table, we can clearly see that Test Value= 3. This
is because the respondent respond’s is measured on 5-point scale. Since the
scale is capturing the perception which cannot be measured in decimals,
therefore we cannot assume the perception value as 2.5 (provided test value is
always the mid value). Therefore, we assume 3 as a mid-value which is neutral.
Upon looking at the third column sig (2 tailed), we can clearly see that X12_B,
X12_C, X12_E are the only significant values (p is less than alpha which is
incase of X1B- 0.000 is less than 0.05 respectively. Alpha is the assumed level of
significance which is 0.5). The values of X1A, X12_D, X12_F are insignificant
because p is greater than alpha.
After filtering out the significant value which were B,C,E; we will again look
back to One Sample Statistics table. In the One Sample Statistics table, we can
clearly see the MEAN value of X12_B, X12_C, X12_E; we found that only X12_C
is above the Test Value while X12_B, X12_E is below the Test Value. This means
that only X12_C attribute results into satisfaction level of the respondents. In
other words we can say that, respondents are strongly satisfied with the
statement that they do not wish to quit using plastic bags. Incase of X12-B.
X12_E; we can conclude that respondents are strongly dissatisfied with the
statement that plastic bags are harmful for the environment and Plastic bags
ban are not enforced properly.
2 Sample T-test
Output
Group Statistics
Gender N Mean Std. Deviation Std. Error Mean
x12_a 1 31 2.90 1.491 .268
2 13 2.54 1.050 .291
x12_b 1 31 1.42 .958 .172
2 13 1.38 .650 .180
x12_c 1 31 3.35 1.170 .210
2 13 3.85 1.068 .296
x12_d 1 31 2.84 1.241 .223
2 13 2.62 .870 .241
x12_e 1 31 1.87 .957 .172
2 13 1.62 .870 .241
x12_f 1 31 3.29 1.346 .242
2 13 2.85 1.281 .355
Independent Samples Test
Levene's Test for
Equality of Variances t-test for Equality of Means
95% Confidence
Interval of the
Sig. (2- Mean Std. Error Difference
F Sig. t Df tailed) Difference Difference Lower Upper
x12_a Equal variances 5.327 .026 .800 42 .000 .365 .456 -.555 1.285
assumed
Equal variances .922 31.787 .000 .365 .396 -.441 1.171
not assumed
x12_b Equal variances .331 .568 .119 42 .906 .035 .291 -.553 .622
assumed
Equal variances .139 32.888 .890 .035 .249 -.473 .542
not assumed
x12_c Equal variances .192 .663 - 42 .001 -.491 .377 -1.253 .270
assumed 1.302
Equal variances - 24.628 .001 -.491 .363 -1.240 .257
not assumed 1.352
x12_d Equal variances 3.195 .081 .589 42 .559 .223 .379 -.542 .988
assumed
Equal variances .680 31.926 .501 .223 .328 -.446 .892
not assumed
x12_e Equal variances .123 .727 .829 42 .412 .256 .308 -.367 .878
assumed
Equal variances .863 24.733 .397 .256 .296 -.355 .866
not assumed
x12_f Equal variances .841 .364 1.012 42 .003 .444 .439 -.441 1.330
assumed
Equal variances 1.033 23.663 .003 .444 .430 -.444 1.332
not assumed
Interpretation
Independent T Test is majorly used to understand the difference in the
perception of the 2 different population (Mainly male and female in our case),
Upon the application of the independent sample T- test we get two major table.
Upon Looking to the group statistics table, we can clearly see that end value is =
44 (1. Is 31 + 2. Is 13 = 44).
Moving on to independent sample T-test table, we can clearly see that sig. two
tailed of any of the variable (x1 -x5) is not less than alpha (where Alpha =
assumed level of significance = .05 ; p is calculated level of significance)
therefore we can clearly say that H02 is accepted and H2 is rejected.
Upon looking at the independent sample Test table, we can clearly see that the
Sig. two tailed value of the attribute x12a, x12c, x12f are significant having
values 0.000, 0.001, 0.003 respectively (P < alpha ; H02 is rejected and H2 is
accepted, Therefore we will move forward by only considering the attributes
x12a, x12c, x12f.
Upon Looking to the 1st column of paired sample test table i.e. means values,
we can clearly see that the mean value of pair 1,2 are negative i.e. (-1.667 & -
1.422 respectively). While the mean value of pair 5,7,8,10 is positive i.e. (
0.800, 0.566, 0.467, 0.689 respectively). The negative value of pair 1 & pair 2
means Y(Dhaba Food ) is better than (>) X(Mess food). While in case of
positive value, we can clearly assume that X(Mess food) > than Y(Dhaba food).
T3
CLUSTER ANALYSIS
Cluster Analysis is also referred to as a classification technique wherein grouping
can be done Objects, Individuals and Entities.
In Cluster Analysis, the whole population sample is undifferentiated and trhe
attempt to access similarity in response to variable and the grouping happens post
the clustering. Cluster Analysis is the best classification technique when multiple
factors are involved in data collection.
Agglomeration Schedule
Cluster Combined Stage Cluster First Appears
Stage Cluster 1 Cluster 2 Coefficients Cluster 1 Cluster 2 Next Stage
1 5 9 .000 0 0 7
2 3 7 .444 0 0 5
3 4 10 .548 0 0 7
4 6 8 .558 0 0 5
5 3 6 .609 2 4 6
6 2 3 .810 0 5 8
7 4 5 .811 3 1 9
8 1 2 1.037 0 6 9
9 1 4 1.850 8 7 0
Agglomeration Schedule
Cluster Combined Stage Cluster First Appears
Stage Cluster 1 Cluster 2 Coefficients Cluster 1 Cluster 2 Next Stage
1 5 9 .000 0 0 7
2 3 7 .444 0 0 5
3 4 10 .548 0 0 7
4 6 8 .558 0 0 5
5 3 6 .609 2 4 6
6 2 3 .810 0 5 8
7 4 5 .811 3 1 9
8 1 2 1.037 0 6 9
9 1 4 1.850 8 7 0
INTERPRETATION FOR CLUSTER ANALYSIS (Milk Supplementary case)
Upon looking at the agglomeration schedule, the first cluster consumes more of
Bournvita, Milo and Horlicks. Thus, we name them as the cluster which is MILK
ADDITIVE cluster. The second cluster is Chawanprash consuming cluster and we name
them as MILK ACCOMPANIMENT AYURVEDIC FOCUSED Cluster. The third
cluster only consumes Protinex, Horlicks and Complain. Thus, we name them as MILK
ADDITIVE NUTRITION FOCUSED cluster. The number of cases in each cluster is
can be seen in “Cluster Summary”.
REGRESSION ANALYSIS
Model Summary
Mode R R Adjusted R Std. Error of
l Square Square the Estimate
a
1 .928 .860 .849 .699
a. Predictors: (Constant), p_qlty, nutrition, taste
ANOVAa
Model Sum of df Mean F Sig.
Squares Square
1 Regression 108.375 3 36.125 73.891 .000b
Residual 17.600 36 .489
Total 125.975 39
a. Dependent Variable: pref
b. Predictors: (Constant), p_qlty, nutrition, taste
Coefficientsa
Model Unstandardized Coefficients Standardized t Sig.
Coefficients
B Std. Error Beta
1 (Constant) .733 .301 2.436 .020
nutrition .295 .103 .284 2.865 .007
taste .170 .103 .198 1.655 .107
p_qlty .548 .118 .522 4.660 .000
a. Dependent Variable: pref
Impact, Cause and effect, Prediction, Forecast are the keyword to identify
Regression
INTERPRETATION OF REGRESSION
As per the case of regression of MRP biscuit preference is assumed to be
dependent variable (DV) while Nutrition, Taste and Preservation Quality is
assumed to be Independent Variable (IV).
Upon looking to the “Model Summary Table”, we can clearly see that the value
of R2 and Adjusted R2. We will consider Adjusted R2 for further interpretation.
The adjusted R square value comes out to be 0.849 which means that 84.9% of
the predictive model is being defined by this analysis. The minimum value of
Adjusted R2 which has to be considered as 0.5. Since the obtained value which
is 0.849 is greater than 0.5, therefore we will consider the current value of
Adjusted R2.
ANOVA TABLE
From the Anova table, we can clearly see that the sig significant value of the
entire predictive model comes out to be 0.000 which is significant, therefore,
can be considered for the further analysis.
Nutrition
With reference to the Objective 1 and H0 and H1, when we look upon to the
coefficient table, we can easily make out that sig value of nutrition is 0.007
which is significant (P is less than alpha; 0.007<0.05. Therefore, H0 is rejected,
H1 is accepted). The unstandardized coefficient beta value of Nutrition is 0.295
which means that almost 29.5% of the regression predictive model is explained
by Nutrition. From the positive beta value of Nutrition we can also infer that
Nutrition is directly proportional to the preference of choice.
Nutrition = Preference of choice.
Above formula clearly interprets that if everything else remains constant then
preference of choice increases by 0.295 when nutrition increases by 1 unit.
Taste
From the significant value of taste, we can clearly see that 0.107 which is
insignificant (p is greater than alpha therefore, H0 is accepted, H1 is rejected)
Preservation of Quality
In this case we can clearly see, the sig value is 0.00 which is significant (p is
less than alpha therefore H0(3) is rejected, H1(3) is accepted). Upon looking at
the unstandardized coefficient beta value, we can clearly see that preservation
quality explains the entire model by 54.8%. The positive value of Preservation
Quality explains that preservation quality is directly proportional to preference
of choice.
PREDICITIVE MODEL
Preservation
Quality
Nutrition
Preference of choice
HYPOTHETICAL QUESTION AND ITS EXAMPLE (Values changed)
Model Summary
Mode R R Adjusted R Std. Error of
l Square Square the Estimate
a
1 .928 .860 .849 .699
a. Predictors: (Constant), p_qlty, nutrition, taste
Coefficientsa
Model Unstandardized Coefficients Standardized t Sig.
Coefficients
B Std. Error Beta
1 (Constant) .733 .301 2.436 .020
nutrition -.295 .103 .284 2.865 .007
taste .170 .103 .198 1.655 .107
p_qlty .548 .118 .522 4.660 .000
a. Dependent Variable: pref
DV = Preference of choice
IV = Nutrition, Taste, Preservation Quality
Interpretation
The value of Nutrition is assumed to be -0.295.
From the above “coefficient table”, we can clearly see that sig value of
Nutrition comes out to be 0.007 which is less than the assumed level of
significance (P is less than alpha; 0.007 is less than 0.05; H0(1) rejected, H1(1)
accepted. Upon looking to the “unstandardized coefficient beta value”,we can
clearly see that the value comes out to be 0.295 which means that nutrition is
contributing 29.5% of the total regression predictive model. The negative sign
of nutrition indicates that nutrition is inversely proportional to preference of
choice.
Nutrition = 1/Preference of Choice.
The above formula means that if everything else remains constant the value of
nutrition decreases by 1 unit when preference of choice increases by 0.295.
Taste: From the above analysis, we can confirm that Taste [IV(2)] is
insignificant. Therefore, it doesn’t have any role/relationship/impact while
choosing MRP biscuit.