RM Lab File Harpreet Kaur Combined

GURU GOBIND SINGH INDRAPRASTHA UNIVERSITY
MAHARAJA SURAJMAL INSTITUTE

NAME – HARPREET KAUR
COURSE – BBA (GEN)
SECTION – A
SHIFT – EVENING
SUBJECT – RESEARCH METHODOLOGY LAB
S.NO MODULE PAGE NO
NAME
1. Introduction to SPSS 3-15
2. Manage data in SPSS 16-24
3. Coding and Recoding 25-32
4. Selecting, Sorting 33-39
and Analyzing
5. Finding Missing 40-53
Values
6. Box Plots 54-67
7. Correcting data 68-85
problems
8. One Sample T test 86-90
9. Independent Sample 91-95
T test
10. One way ANOVA 96-104
11. Correlation 105-112
12. Chi Square test 113-122
MODULE – 1
(Introduction to SPSS)
The SPSS software platform offers advanced
statistical analysis, a vast library of machine learning
algorithms, text analysis, open source extensibility,
integration with big data and seamless deployment
into applications. Its ease of use, flexibility and
scalability make SPSS accessible to users of all skill
levels.
SPSS Statistics is a software package used

for interactive, or batched, statistical analysis. Long
produced by SPSS Inc., it was acquired by IBM in
2009. Current versions (post 2015) have the brand
name: IBM SPSS Statistics.
LOGO:
1.1 Launching SPSS

To start IBM SPSS go to
Start--- Programs--- IBM SPSS
A dialog box will open listing several options to
choose from. The following options are
available:
 Run the tutorial
 Type in data
 Run an existing query
 Open an existing data source
 Open another type of file
SPSS has two working windows:
 SPSS Data Editor

 SPSS Output Viewer
1.1 Inserting data in IBM SPSS

1. Insert data in variable view
2. Enter data in data view.

1.2 Recoding into different variables
1. Go to transform tab, click on “recode into different variables”.
3. Enter the variable as “Percentage of first semester”. Add output

variables as shown. Then click on “change”.
Then click on “old and new values”.
4. Enter all the old and new values. Then click on “continue”.
Output:
Variable view:
Data View:
1.3 Descriptive Statistics

1.3.1 Frequencies
1. Go to Analyze tab, click descriptive statistics, then click on
frequencies.
2. Put “Gender” as variable.
Then press “OK”.
Output:
1.3.2 Cross Tabs
1. Go to Analyze tab, click descriptive statistics, then click on cross tabs.
2. Put variable in row as “Gender” and in column as “age of

respondant”.
Then click “OK”.
Output:
1.3.3 Descriptives
1. Go to Analyze tab, click descriptive statistics, then click on
descriptive.
2. Enter two variables and then click on “Options”.
3. Tick on the boxes as shown below. Then click on “continue”.

4.Click on “OK”.
1.4 Graphs
1.4.1 Bar Graph
1. Go to Graphs tab, click “ legacy dialogs” and then click on “Bar”.
2. Select “simple” and “Values of individual cases”. Then click
“Define”.
3. Enter bars represent as “percentage in second semester” and variable

as “ student’s name”. Then enter “OK”.
Output:
1.4.2 Pie
1. . Go to Graphs tab, click “ legacy dialogs” and then click on “pie”.
2. click on “summaries for groups of cases”. Then click on “define”.
3. Now enter “Gender” in define slices by.

Then press “OK”.
Output:
MODULE-2
(Manage data in SPSS)
2.1 Finding out the case summary
To understand the nature of data, one will use case summary.
2.1.1 On the basis of Gender
1. Go to analyze tab, then click on reports and then click on case

summaries.
2. Then put variables as “percentage in first semester”

and grouping variable as “Gender”.
Limit cases to first “100”
3. Then click on statistics and put cell statistics as
“number of cases” and “mean”. Then click on “continue”.
4. Then click on “OK”.

Output:
2.1.2 On the basis of age group:
1. Go to analyze tab, then click on reports and then click on case
summaries.
2. Then put variables as “percentage in first semester”

and grouping variable as “age of respondant”.
Limit cases to first “100”
3. Then click on statistics and put cell statistics as

“number of cases” and “mean”. Then click on “continue”.
4. Then click “OK”.

Output:
2.2 Computing new variable
1. Go to the transform tab, click “compute variable”.
2. Put Target Variable as “FYmarks”.

Click function group to “all”.
Put functions and special variable as “Mean”.
3. Enter numeric expression as “ Mean(F.S,S.S)”. Then
press “OK”.
Output:
A new entry is shown in variable view.
MODULE-3
(CODING AND RECODING)
3.1 Coding and Recoding in SPSS.

3.1.1 Recode into same variable
1. Enter the row naming “CGPA” in the variable view as
shown below
2. Enter the values for CGPA.

3. Click on transform tab and then click on record into
same variables.
4. Put CGPA as a variable.

5. Now click on “Old and New Values”.
6. Enter the range and the new values for it. Then click on
“Continue”.
7. Now press “OK”.
OUTPUT:
3.1.2 Recode into different variables.
1. Enter the data in variable view and data view.
2. Then click on the Transform tab and then click on
“recode onto different variables”.
2. Put numeric variable as “FYmarks”.

3. In the Output Variable column, add name and Label as
“Grade”. Then enter “Change”.
6. Enter on “Old and new values”. Click on the “Output

variables are strings”.
Give the values as shown below. Enter “Continue”.
7. Press “OK”.
OUTPUT:
MODULE-4
(Selecting, Sorting and Analyzing)
4.1 Selecting, Sorting and analyzing the data in SPSS.
1. Go to data tab and click on “Select Cases”.
2. Click on “if conditions are satisfied”. And then click on “IF”.

3. Put variable as gender and click “=” and put 2 as shown
below. Click on “continue”.
6. Click on “OK”.
OUTPUT:
4.2 Case Summaries
1. Go to Analyze tab and click on “reports”. Click on “Case
summaries”.
2. Put variable as “FYmarks” and grouping variable as “gender”.

Then click on Statistics.
3. Put cell statistics as “mean”. Then press “Continue”.
4. Then press “OK”.

OUTPUT:
4.3 Sort Cases
1. Go to Data tab and click on “Sort Cases”.
2. Sort by “age of the respondant” and put sort order as

“ascending”. Then press “OK”.
OUTPUT:
MODULE-5
(Finding Missing Values)
5.1 Finding out the missing values and recoding the same
variables after filling the missing values according to
group splitting files.
5.1.1 Find the missing value.
1. Find the missing value in your entered data.
2. Go to Analyze tab, then click on descriptive statistics

and click on descriptive.
3. Put the variable as “percentage in first semester”.
Then click on “options”.
4. Select “Mean”. Click on “continue”.

5. Press “OK”.
OUTPUT:
5.2 Recode into same variables.
1. Go to transfer tab. Then click on Recode in same
variables.
2. Put in the variable as “percentage in first semester”.

Then click on “Old and new values”.
3. Click on “system missing”. Put the new value as mean
(76.88) as we obtained earlier. Click on continue.
4. Click on “OK”.
OUTPUT:
5.3 Spilt Files

1. Go to data tab and click on “spilt file”.
2. Click on “compare groups”. Put “grades” in “groups
based on”. Then click on “OK”.
OUTPUT:
5.4 Calculation of mean as grouped as grades.
1. Remove the mean value filled in the missing place.
2. Go to analyze tab, click on “descriptive statistics”, and

then click on “frequencies”.
3. Put the variable as “percentage in first semester”.
Then click on “Statistics”.
4. Select “Mean”. Click “continue”.

5. Click “OK”.
OUTPUT:
5.5 Finding the missing value on the basis of grades.
1. Go to transform tab, click on recode into same
variables.
2. Put the variable as “percentage in first semester” and

click on “IF”.
3. Click on “Include If case statistics conditions” and write
Grade=”C” and press continue.
4. Now click on “old and new values”.

5. Click on system missing and enter new value as the
mean derived for the grade “c” which is 73.44
Click on “add” and press “continue”.
6. Press “OK”.
OUTPUT:
MODULE-6
(Box Plots)
6) BOXPLOTS
6.1) FINDING NORMAL DISTRIBUTION ACCORDING TO THE GIVEN CASE

STUDY.
1. GO TO ANALYZE , THEN DESCRIPTIVE STATISTICS, CLICK ON EXPLORE.
2. SELECT “BEHAVIOUR OF THE OWNER “ AS THE DEPENDENT LIST .

3. GO TO PLOTS, CLICK ON HISTOGRAM. THEN CLICK CONTINUE.PRESS
OK.
OUTPUT:
6.2)
1. GO TO GRAPHS, CLICK ON LEGACY DIALOGS, THEN CLICK ON BOX

PLOTS.
2. CHOOSE SIMPLE. CLICK DEFINE.
3. SELECT “INCOME”AS THE VARIABLE. THEN CLICK ON OPTIONS.

4. CLICK ON “DISPLAY GROUPS DEFINED BY MISSING VALUES”.THEN
CLICK ON CONTINUE.
5. SELECT “GENDER” AS THE CATEGORY AXIS. CLICK OK.

OUTPUT:
6.3)
1. GO TO ANALYZE , THEN DESCRIPTIVE STATISTICS, CLICK ON EXPLORE.

2. SELECT “BEHAVIOUR OF THE OWNER” AS THE DEPENDENT LIST AND
“MARITAL STATUS” AS THE FACTOR LIST.
3. CLICK ON PLOTS, CLICK ON HISTOGRAM. THEN CLICK CONTINUE.

PRESS OK.
OUTPUT:
6.4) CHECKING NORMALITY AND HOMOGENITY.
6.4.1) NORMALITY
1. GO TO ANALYZE , THEN DESCRIPTIVE STATISTICS , CLICK ON EXPLORE.

2. SELECT “INCOME” AS THE DEPENDENT LIST AND “GENDER “ AS THE
FACTOR LIST.
3. UNCHECK STEM AND LEAF. CLICK ON CHECK NORMALITY PLOTS

WITH TESTS.
OUTPUT:
6.4.2) HOMOGENITY
1. GO TO ANALYZE , THEN DESCRIPTIVE STATISTICS , CLICK ON EXPLORE.
2.CLICK ON PLOTS. THEN CLICK ON “NORMALITY PLOTS WITH TESTS”.

3. CLICK ON UNTRANSFORMED. THEN CLICK CONTINUE.PRESS OK.
OUTPUT:
MODULE - 7
(CORRECTING DATA PROBLEMS)
There are three methods to correct data problems
7.1 LOG TRANSFORMATION - IT REDUCES POSITIVE

SKEWNESS
Log 10 (X). Example LOG (DAY 2 +1)
1. Go to Transform" and choose "Compute Variable" from the drop

down menu
2. Compute Variable dialog box will appear. Type "Log" under Target
and Variable column and then select Type & Lable, in appeared dialog
box again write "Log" under label and choose numeric as "type and
press continue.
3. For the column of numeric expression, choose "Lg10" from the list of function and
special variables' then select "Hygiene condition of day 2"; then do +1 in the bracket.
Press "OK".
4. Then go to Analyze choose "Explore" from drop down menu. Under dialog box named
"Explore" choose Log' under dependent list and Gender in factor list. Select histogram
from plots option and then press 'ok
5. Another new column is added in "editor sheet" named "Log"
6. In output viewer, Histogram of Log is shown.
7.2 RECIPROCAL TRANSFORMATION
Dividing one by each score also reduces the impact of large scores This transformation reverses
the scores.
1. Go to Transform tab, choose "Compute Variable from the drop-down menu. In the dialog
box, type "reci" for Reciprocal in Target and Variables. In numeric expression type "1" and
select "Hygiene condition for Day 2' from the list: press "ok"
2. Then go to Analyse choose "Explore" from drop down menu. Under dialog box named
"Explore" choose reci' under dependent list and Gender in factor list. Select histogram from
plots option and then press ok'
3. In output viewer, Histogram of 'reci’ is shown.

7.3 SQUARE ROOT
It reduces positive skewness and useful in stabilizing variance.
1. Go to Transform tab, choose Compute Variable from the drop-down menu.
In the dialog box, type "SQRT" for Square root in Target and Variables. For the column of
numeric expression, choose "SQRT" from the list of function and special variables'. Then select
"Hygiene condition of day 2" in the bracket. Press "OK".
2. In "Date editor" a new column named "SQRT" is shown.
3. Go to explore and select log in dependent variable.

4.In output viewer, Histogram of 'SQRT is shown with other graphs.
5.Then go to Analyze choose "Explore" from drop down menu. Under
dialog box named "Explore" choose Log' under dependent list and
Gender in factor list. Select histogram from plots option and then
press 'ok
6. Another new column is added in "editor sheet" named "Log"
7.In output viewer, Histogram of Log is shown.

7.2 RECIPROCAL TRANSFORMATION
Dividing one by each score also reduces the impact of large scores
This transformation reverses the scores.
2.Go to Transform tab, choose "Compute Variable from the drop-

down menu. In the dialog box, type "reci" for Reciprocal in Target
and Variables. In numeric expression type "1" and select "Hygiene
condition for Day 2' from the list: press "ok"
3. Then go to Analyse choose "Explore" from drop down menu.
Under dialog box named "Explore" choose reci' under dependent
list and Gender in factor list. Select histogram from plots option
and then press ok'
4. In output viewer, Histogram of 'reci’ is shown.
7.3 SQUARE ROOT
It reduces positive skewness and useful in stabilizing variance.
1. Go to Transform tab, choose Compute Variable from the drop-

down menu.
In the dialog box, type "SQRT" for Square root in Target and
Variables. For the column of numeric expression, choose "SQRT"
from the list of function and special variables'. Then select "Hygiene
condition of day 2" in the bracket. Press "OK".
4. In "Date editor" a new column named "SQRT" is shown.
3. Go to explore and select log in dependent variable.

3. In output viewer, Histogram of 'SQRT is shown with other
graphs.
MODULE 8
(ONE SAMPLE T TEST)
The marketing researcher has agreed to help the management
of Deli Depot to determine whether customers think the price
of menu is competitive.
The Respondent have answered this question using a 10 point
scale. On which 1= “Not Competitive” and 10= “Very
Competitive”.
The management thinks the customers consider the prices of
food at deli depot to be average.
This means the responses to the question on competitive prices
will have a mean of 5.5
( Approximately half way between 1 and 10 on the response
scale)
The Scale is assumed to be interval.
The Variable we took is X2- Competitive prices
NULL HYPOTHESIS:
H0: The mean of the X2- Competitive prices will not be
significantly different from 5.5
ALTERNATIVE HYPOTHESIS:
H1: The mean of the answers X2- competitive prices will not be
5.5. There is in fact a true difference between the sample mean
and the mean we think it is(5.5).
SIGNIFICANCE LEVEL:
The significance level will be set at .05
Click through sequence foe testing the responses in the Deli
Depot database
STEP1: (SELECTION OF VARIABLES)
 We will choose variable X2- Competitive Prices from the
database of Deli Depot.
STEP2: (USING ANALYZE OPTION)

 Click “ANALYZE” option.
 Click on “Compare Means”.
 Click on “One sample t test”.
STEP3: (USING ONE SAMPLE T TEST)
 When you get to the dialog box, Click on X2- Competitive
Prices to highlight it and drag it to the Test variable box.
 In the box labeled test value, enter the number 5.5
 Click on the “Options” box and enter 95 in the confidence
interval box.
 Then Click on the “Continue” button and “OK” to execute
the program.
OUTPUT:
Table 1: (One- Sample Statistics)
N Mean Std. Std Error
deviation Mean
X2- 50 2.22 1.148 .162
Competitive
Prices
Table2: (One Sample Test)

TEST VALUE=5.5
T df Sig. (2 Mean 95%
tailed) difference Confidence
Interval of the
difference
Lower Upper
X2- -20.203 49 .000 -3.280 -3.61 -2.95
Competitive
Prices
First Table – One Sample statistics, gives the summary and basic
statistics.
Second table- One Sample table, gives the results of the t-test
for the null hypothesis that the average response to X2 to 5.5
RESULT:
The t-test statistic is -20.203 and significance level is .000
This means that the null hypothesis can be rejected and
alternative hypothesis accepted with a high level of confidence.
INFERENCE:
From Practical standpoint, in terms of Deli Depot, the results of
uni variate hypothesis test mean that the respondents felt that
the prices of menus items were not competitive at all.
MODULE-9
(Independent Sample T Test)
In this module, we shall discuss the various steps
regarding the use of independent sample t-test via SPSS
software in statistical analysis:
Step 1: (SELECTION OF REQUIRED VARIABLES)
 In this step, we have selected the following
variables:
a) Here, we have two samples, i.e., Satisfaction
level(X9) and gender(X7).
Null Hypothesis: There is no significant difference
between the satisfaction level of males and females
customers of Deli Depot.
b)Here, we have to find the difference between the
satisfaction level of males and female customers in
Deli Depot.
 Here is the design view of the selected variables.
 The label in the given data are as follows:
Step 2: (USING ANALYZE OPTION)
 In this step, following things are done:
a) Go to the “ANALYZE” option.
b)Click on “COMPARE MEANS” option.
c) Further, click on “INDEPENDENT SAMPLE T-test”
option.
Step 3: (USING INDEPENDENT t-TEST OPTION)

a) Under the independent sample t-test option, choose
“SATISFACTION LEVEL(X9)” for “TEST VARIABLE(S)”.
b)Further, choose “GROUP(0,1)” for the “GROUPING
VARIABLES”.
Step4: (DEFINING GROUPS)

a) Under the defining group option, select the option
“use unspecified values”.
b) Further, assign 1 to Group 1.
c) Assign 0 to Group 2.
OUTPUT:
Inference:
Hence, the sig. value for the groups is <0.5. Hence, we
infer that there are not enough evidences to support
null hypothesis. Thus, null hypothesis is rejected.
MODULE-10
(ANNOVA ANALYSIS)
 Here we have the following data,

The management of Deli Depot wants to know
 Whether the customers who come to
restaurant from greater distances
Differ from the customers who live nearby in
their willingness to recommend the restaurant
to a friend.
They might want to know whether that
difference in willingness, if any, is influenced by
the sex of the customers.
The database variables are:
 X8- Recommend to a friend, measured on a
seven point scale.
With 1 = “Definitely not recommend”
With 7 = “Definitely Recommend”
 X11- Market Area
1= “Came from within 1 mile”
2= “Came from 1-3 miles”
3= “Came from more than 3 miles”
 X7- Gender
1= Male
0= Female
Step 2: (Using Analysis Option)
 It consists of the following steps
On the basis of informal comments from the
customers, the Deli Depot Management thinks
that the customers who came from more than
 3 miles will be more likely to recommend the
restaurant to the friends
 And that female customer will be more likely to
recommend the Deli to a friend than the males.
The Null Hypothesis are:
 H0: The mean scores for recommendation likely
hood will not be different for customers who
have travelled different distances to get to Deli.
 H1: That there will be no difference between the
male and female customers willingness to
recommend the restaurant.
The best way to analyze the Deli Depot data to
answer the manager’s questions is:
To use Factorial Model
A Factorial Model is a type of anova in which the
individual effects of each independent variable on
the dependent variable are considered separately
and then
The combined effects (interaction) of the
independent variables on the dependent variable
are analyzed.
The Click through sequence is:
Step 1: Analyze
Step 2: General Linear Model.

Step 3: UniVariate
Step 4:Highlight the dependent variable X8-

Recommend to a friend – and move it to the
dependent variable box.
Step 5:Highlight X11- Distance driven, and X7-
Gender- and move them to the fixed factor box.
Step 6: Click OK.
OUTPUT:
BOX1: Between Subjects Factor

Value Label N
Distance 1 22
Driven
2 20
3 8
Gender 0 Female 30
1 Male 20
BOX 2: Tests of Between-Subjects Effects:
Dependent Variable: RECOMMEND TO FRIEND
Source Type III df Mean F Sig
Sum of Squares
Squares
Corrected 20.727(a 5 4.145 6.974 .000
Model )
Intercept 565.335 1 556.335 935.995 .000
X11 14.149 2 7.074 11.902 .000
X7 .022 1 .022 .837 .000
X11*X7 .723 2 .361 .808 .848
Error 25.153 44 .504 .549
Total 1142.000 50
Corrected 45.850 49
Total
RESULT:
The purpose of ANOVA Analysis is to see if the
differences that do exist are big enough to be
statistically significant
We use F-ratio to determine the mean difference
for independent variable comparison
F= Variance Between the groups
Variance with in Groups
In Deli Depot case there are three groups of
respondents who have travelled:
 Less than a mile
 1-3 miles
 More than 3 miles
To come to Deli Depot.
INTERPRETATION:
 The bigger the F-ratio, the bigger the difference
among the means of the various groups wit
respect to their likelihood of recommending the
restaurant to a friend.
 The tests of Between-Subjects effects table that
tests of Between- Subjects effects table(BOX 2)
shows that the F-ratio X11- Distance Driven is
11.092, which is statistically significant at the .
000 level significant at the .000 level.
 This implies that customers who live farther
away are more likely to recommend Deli Depot
to a friend.
MODULE- 11
TESTING OF ASSOCIATION(CORREALTION ANALYSIS)
CASE: DELI DEPOT
Management anticipates that the relationship between satisfaction with
restaurant and likelihood to recommend the restaurant would be positive
and significant.
Variables:
X 9 – SATISFACTION LEVEL
X 8 – RECOMMEND TO A FRIEND
Click through sequence is:
 ANALYZE
 COORELATE
 BIVARIATE
And this leads to DIALOGUE BOX – Transfer variables X 8 and X 9

into Variable boxes.
Use the THREE DEFAULT options given in dialogue box shown
below:
 PEARSON CORREALTION
 TWO TAILED TEST OF SIGNIFICANCE
 FLAG SIGNIFICANT CORRELATIONS
 Next go to OPTIONS box.

 Click on MEAN, STANDARD DEVIATIONS.
 Click CONTINUE
 Finally click OK
OUTPUT:
Descriptive Statistics
Mean Standard deviation N
RECOMMEND TO 4.68 .978 50
FRIEND
SATISFACTION 4.78 .954 50
LEVEL
RECOMMEND SATISFACTION
TO FRIEND
RECOMMEND PEARSON 1 .601(**)
TO CORRELATION
FRIEND SIG.(2-TAILED) . .000
N 50 50
SATISFACTION PEARSON .601(**) 1
LEVEL CORRELATION
SIG.(2- TAILED) .000
N 50 50
Correlations
** CORRELATION IS SIGNIFICANT AT THE 0.01 LEVEL

(2-TAILED).
R^2 = .36
If one or both the variables have been measured using ordinal

scales, SPEARMAN RANK ORDER
CORRELATIONCOEFFICIENT rather than PEARSON
CORRELATION is used.
In Deli Depot example, CUSTOMER SURVEY collected data
that ranked Six Restaurant selection factors
These are represented by Variables X 12 to X 17
The Management is interested in knowing whether “FOOD
QUALITY” is significantly better than “FOOD VARIETY”
SINCE THESE ARE ORDINAL (RANKING) DATA, THE
PEARSON CORREALTIONS IS NOT SUFFICIENT.
SPEARSMAN RANK ORDER CORREALTION is to be used.
Variables for ANALYSIS are:
X15- FOOD QUALITY
X16 – FOOD VARIETY
CLICK THROUGH SEQUENCE IS:
*ANALYZE
*CORRELATE
*BI- VARIATE
This leads to DIALOGUE BOX:
Transfer Variables X15 and X16 are used.
Default shows PEARSON CORREALTION.
Unclick PEARSON CORRELATION.
Then click on SPEARMAN
Click OK
The OUTPUT is shown in box below:
The CORRELATION is -.495 and the SIGNIFICANCE
VALUE is .01
It is negative which tells us that a CUSTOMER who RANKS
Food Quality HIGH in importance tends to RANK Food Variety
significantly LOWER.
CORRELATIONS
Food Food Quality
Variety Rank
Rank
Spearsman’ Food Correlatio 1.000 -.495(**)
s rho Variety n
Rank Coefficient
Sig.(2- . .000
tailed)
N 50 50
Food Correlatio -.495(**) 1.000

Quality n
Rank Coefficient
Sig.(2- .000 .
tailed)
N 50 50
** CORRELATION IS SIGNIFICANT AT 0.01 LEVEL (2 –
Tailed)
MODULE 12
(NEW CROSSTABS USING CHI SQUARE TESTS)
CASE- DELI DEPOT
The Manager of Deli Depot might believe that
there is no difference in the percentage of men
and women who travel various distances to
visit the restaurant.
NULL HYPOTHESIS
H0: The null hypothesis would be that
percentage of men and women customers are
the same who travel less than one mile to eat
at Deli Depot, those who travel one to three
miles, and those who travel more than three
miles.
Step 1: (Selection of required data)
 For this module, we have taken the
following data:
Step 2: (Using Analyze option)
 Click on “Analyze” Option.
 Choose “DESCRIPTIVE STATISTICS”
option.
 Further, click on “CROSSTABS” option.
Step3: (Selection of variables for rows and
columns in crosstabs)
 In this step, followings things are done:
 Click on the variable to be selected for
“Row(s)” option.
 Here, we have taken “Distance Driven”
as the required variable.
 Now, select the variable for “Column(s)”
option.
 Here, we have taken “Gender” as the
required variable.
Step 4: (Using of Cell-Display option)
 In this step followings things are done:
 Further, in crosstabs, click on “OPTIONS”.
 In this option, choose “EXPECTED” and
“OBSERVED” in the “COUNT” option.
 Further, choose “ROWS” and “COULMN” in
the “PERCENTAGE” option.
Step5: (Using the Crosstabs Statistics)
 It includes the following steps:
a) In the crosstab option, click on the
‘STATISTICS’ option.
b) In the statistics option, select the Chi
Square test.
Then click on the ‘continue’ option.
OUTPUT:
The expected frequencies are calculated on the

basis of the proportion of the sample
represented by a particular group. For example,
The total sanples is 50 and 20 are males and 30
are females. This means that 40% of the sample
is male and 60% is the female. When we look I
the gender table for the category at the top
labeled 1 we see that there are 22 respondents.
Therefore the expected frequencies are 40%
males=8.8 and 60% females=31.2
The first table shows the number of cases
processed
Cases
Valid Missin Tota Percent
g l
N Percent N Percent N 100.0%
Distance 50 100.0% 0 .0% 50
Driven
Second table shows the number of responses
for male and females for each of the categories
Gende Total
r
Female Male
Distanc 1 Count 9 13 23
e
Driven
Expected 13.2 8.8 22.0
Count
2 Count 14 6 20
Expected 12.0 8.0 20.0
Count
3 Count 7 1 8
Expected 4.8 3.2 8.0
Count
Total Count 30 20 50
Expected 30.0 20.0 50.0
Count
Gende Total
r
Chi-Square Test
Value Df Asymp.
Sig.(2-
sided)
Pearson 6.695 2 .035
Chi-square
Likelihood 7.071 2 .029
Ratio
Linear-by- 6.413 1 .011
Linear
Association
N of valid 50
Cases
The above table shows the Chi-square value- it
is 6.695 at 0.35 level.
Since, the level significance is less than .05, we
may reject the null hypothesis with a high
degree of confidence.
If we look at the observe frequencies in the
table-2 above, it appears that female
customers of Deli Depot tend to drive farther
you get to Deli Depot that do male customers.
That is there are more female customers than
would be expected who drive farther to get to
restaurant, and fewer female customers than
would be expected who visit the restaurant
from nearby 9 looking at the observed and
expected frequencies in each cell.

RM Lab File Harpreet Kaur Combined

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

RM Lab File Harpreet Kaur Combined

Uploaded by

Copyright:

Available Formats

GURU GOBIND SINGH INDRAPRASTHA UNIVERSITY

MAHARAJA SURAJMAL INSTITUTE

SPSS Statistics is a software package used

1.1 Launching SPSS

SPSS has two working windows:

 SPSS Data Editor

1.1 Inserting data in IBM SPSS

2. Enter data in data view.

3. Enter the variable as “Percentage of first semester”. Add output

1.3 Descriptive Statistics

2. Put variable in row as “Gender” and in column as “age of

3. Tick on the boxes as shown below. Then click on “continue”.

3. Enter bars represent as “percentage in second semester” and variable

3. Now enter “Gender” in define slices by.

To understand the nature of data, one will use case summary.

2.1.1 On the basis of Gender

1. Go to analyze tab, then click on reports and then click on case

2. Then put variables as “percentage in first semester”

4. Then click on “OK”.

2. Then put variables as “percentage in first semester”

3. Then click on statistics and put cell statistics as

4. Then click “OK”.

2. Put Target Variable as “FYmarks”.

3.1 Coding and Recoding in SPSS.

2. Enter the values for CGPA.

4. Put CGPA as a variable.

2. Put numeric variable as “FYmarks”.

6. Enter on “Old and new values”. Click on the “Output

2. Click on “if conditions are satisfied”. And then click on “IF”.

2. Put variable as “FYmarks” and grouping variable as “gender”.

4. Then press “OK”.

2. Sort by “age of the respondant” and put sort order as

2. Go to Analyze tab, then click on descriptive statistics

4. Select “Mean”. Click on “continue”.

2. Put in the variable as “percentage in first semester”.

5.3 Spilt Files

2. Go to analyze tab, click on “descriptive statistics”, and

4. Select “Mean”. Click “continue”.

2. Put the variable as “percentage in first semester” and

4. Now click on “old and new values”.

6.1) FINDING NORMAL DISTRIBUTION ACCORDING TO THE GIVEN CASE

1. GO TO ANALYZE , THEN DESCRIPTIVE STATISTICS, CLICK ON EXPLORE.

2. SELECT “BEHAVIOUR OF THE OWNER “ AS THE DEPENDENT LIST .

1. GO TO GRAPHS, CLICK ON LEGACY DIALOGS, THEN CLICK ON BOX

3. SELECT “INCOME”AS THE VARIABLE. THEN CLICK ON OPTIONS.

5. SELECT “GENDER” AS THE CATEGORY AXIS. CLICK OK.

1. GO TO ANALYZE , THEN DESCRIPTIVE STATISTICS, CLICK ON EXPLORE.

3. CLICK ON PLOTS, CLICK ON HISTOGRAM. THEN CLICK CONTINUE.

1. GO TO ANALYZE , THEN DESCRIPTIVE STATISTICS , CLICK ON EXPLORE.

3. UNCHECK STEM AND LEAF. CLICK ON CHECK NORMALITY PLOTS

1. GO TO ANALYZE , THEN DESCRIPTIVE STATISTICS , CLICK ON EXPLORE.

2.CLICK ON PLOTS. THEN CLICK ON “NORMALITY PLOTS WITH TESTS”.

(CORRECTING DATA PROBLEMS)

There are three methods to correct data problems

7.1 LOG TRANSFORMATION - IT REDUCES POSITIVE

Log 10 (X). Example LOG (DAY 2 +1)

1. Go to Transform" and choose "Compute Variable" from the drop

3. In output viewer, Histogram of 'reci’ is shown.

It reduces positive skewness and useful in stabilizing variance.

1. Go to Transform tab, choose Compute Variable from the drop-down menu.

3. Go to explore and select log in dependent variable.

7.In output viewer, Histogram of Log is shown.