Professional Documents
Culture Documents
Wesley (2021) SPSS For Beginners
Wesley (2021) SPSS For Beginners
FOR BEGINNERS
IN 45 MINUTES
A 2021 Quick Reference Guide to Research Methods, Data Analysis
and Interpretation of Statistical Data
Bill Wesley
Copyright
Copyright©2021 Bill Wesley
All rights reserved. No part of this book may be reproduced or used in any manner without the
prior written permission of the copyright owner, except for the use of brief quotations in a book
review.
While the advice and information in this book are believed to be true and accurate at the date of
publication, neither the authors nor the editors nor the publisher can accept any legal
responsibility for any errors or omissions that may be made. The publisher makes no warranty,
express or implied, with respect to the material contained herein.
Printed on acid-free paper.
Introduction to SPSS
One of the best statistical software in the world is IBM SPSS. This statistical software offers a
wide variety of statistical calculations, and it is very easy to learn. This is because the software is
user-friendly and free from numerous sophistications like other statistical software. The SPSS
software uses an advanced statistical procedure to ensure high accuracy and quality decision-
making. Also, it can be used for all facets of the analytics lifecycle, such as data preparation, data
management, data analysis, and reporting of data.
History of SPSS
SPSS is a computer software owned and managed by IBM Inc. for the analysis of data. It is
important to note that IBM did not initially create the program; it was created by three social
scientists Normal H. Nie, C. Hadai Hull, and Dale H. Bent, in 1968. These three creators were
fresh graduates when they invented the software to analyze large amounts of data obtained from
different research methods quickly and efficiently. At first, the program was used at the
University of Chicago, after which it gained popularity, and other American universities adopted
the software usage. The software right was later acquired by IBM in 2009 and has thus received
the brand name IBM SPSS Statistics.
SPSS MENU
There are several menu options in the software. These menu bars are located at the top screen for
easy finding. Here is the menu and what they contain;
This menu is meant for a large corporation that is into market research. It consists of several
features that help you analyze your customers or contacts to help improve your results. With this
menu, you can do the following;
● Identify the right contacts and improve campaign ROI
● Easily uncover customer groups
● Get an all-in-one marketing analysis
● Connect to Salesforce.com for further insight
● Access to a range of features
Other Menu
Other menus that can be found in the SPSS menu bar include;
● Utilities Menu
● Add Ons menu
● Window Menu
● Help Menu
CHAPTER TWO
Getting familiar with the SPSS Data Editor
Environment
SPSS is a unique software that is used for statistical calculations and analysis. This software has
become essential in different fields, especially the Social Sciences, due to its multiple uses. It can
be used for statistical calculations, descriptive and inferential analysis, graphs, correlations, and
time series. Research companies also make use of SPSS to analyze their data efficiently. All
professionals from different areas who need to apply statistical analysis and data analysis make
use of SPSS.
More to it is that University students with prior knowledge of statistics use the SPSS to fulfill
their assignments or end-of-degree and end-of-master projects. Therefore, the use of SPSS
software facilitates the collection and organization of data, makes it possible to know if the work
hypotheses have been fulfilled, facilitates decision-making, and allows the best strategy to be
adopted.
When starting SPSS for the first time, the window will ask you what you want to do? Thus, if
you're going to analyze a new data set, check the Enter data option.
Suppose you choose to analyze a new data set, then the SPSS Data Editor will be displayed. The
SPSS Data Editor is the initial framework used to enter data and select the appropriate procedure
for analysis. This window is made up of:
● The Menu Bar which contains the SPSS Main Menu with all its options. Each of these
options has different procedures that are displayed by clicking on each of them.
● The Toolbar, which is made up of different icons that allow direct access to the most
common procedures. The bar of each of these icons, shown by positioning the mouse
over the icon, from left to right.
The SPSS Data Editor, which is made up of cells. Each row represents an element of the data set,
and each column represents a variable and is displayed when the box is enabled Data Views in
ShowBar. Checking each of these variables and pressing the right mouse button shows the
following options;
● Insert variables
● Sort ascendingly
● Sort the data in descending order
Similarly, you can add cases to the variables or columns if you forget something. To insert cases,
you need to click the secondary button of the mouse and select the option to add a case.
In the Variables view, each row corresponds to a variable, and each column determines its
characteristic. It has the following features;
● Name: The name of the variable is entered.
● Type: The type of variable is chosen from the possibilities offered by clicking on Type.
● Numeric: A variable whose values are numbers. Values are shown in standard numeric
format (Width and Decimal places are set).
● Comma: A numeric variable where commas determine the thousands.
● Point: A numerical variable where the points determine the thousands.
● Scientific notation: A numeric variable whose values are shown with an embedded E
and an exponential sign that represents a power of base ten.
● Date: A numeric variable whose values are to be displayed.
● Custom Currency: A numeric variable whose values are displayed in one of the various
custom currency formats.
● String: Variables whose values are not numeric. They are also known as alphanumeric
variables.
● Width: Determines the width of the column.
● Decimals: Determines the number of decimal places that appear on the screen.
● Label: Variables can be labeled so that this label appears in subsequent investigations.
● Values: This allows you to enter the modalities of the string type variables.
● Missing Values: SPSS will enable you to encode missing values discreetly or in a
specific range.
● Columns: This will enable you to enter the width of the column. It can also be changed
in Data View by clicking and dragging the edges of the column.
● Alignment: This will enable you to choose between aligning the entered data to the Left,
Right, or Centered.
● Measure: It will help to define the variable either as Ordinal or Nominal.
● Ordinal: It is used to represent data with an intrinsic value (Ex: great, medium, small;
failing, passing, notable, outstanding).
● Nominal: It is used to represent data without intrinsic value (Ex: red, yellow, green).
● Role: Functions that can be assigned to variables for analysis.
● Input: The variable is used as an input (for example, predictor, independent variable).
● Purpose: The variable is used as output or destination (for example, dependent variable).
● Both: The variable is used as input and output.
● None: The variable does not have a role assignment.
● Partition: The variable is used to divide the data into separate samples.
● Segment: Variables with this role are not used as variables in a segmented file.
Frequencies
The Frequencies procedure provides statistical and graphical representations that are useful for
describing different types of variables. It allows obtaining a variable description from the
frequency tables, histograms, bar graphs, percentiles, central tendency indices, and dispersion
indices. Here is how to access frequency;
● Proceed to the Main menu
● Ensure that you select the Analyze menu
● Immediately to select the Descriptive Statistics option
● Click on the Frequencies
In the dialogue box of frequencies, the variable or variables (categorical or quantitative) to be
analyzed are introduced. In this window, there are four command buttons:
I. Statistics: It is used to obtain descriptive statistics for quantitative variables
II. Charts: It is used to make bar graphs, pie graphs, and histograms
III. Format: It is used to choose the order in which you want the result to show
IV. Style: It is used to enhance style sampling. In addition, the option Show frequency
tables in their default display.
Click on Statistics, and a dialogue box will show up. This dialogue is where sets of descriptive
measures are displayed grouped in: Percentile values, Central tendency, Dispersion, and
Distribution. In this window, the descriptive statistics you want to study are marked and click
on Continue.
Click on Charts, and a dialogue box will show up to choose the Chart Type and the Chart
Values. In this window, select the graph you want to make (bar graphs, pie graphs, and
histograms) and click Continue.
Click on format, and a dialogue box will show up to choose the frequency tables and how you
want them to be ordered. You can order them according to the values of the variable or according
to the observed frequencies. In addition, you can choose to delete variables with a large number
of different values or those with more than n categories of groups.
If you click on Style, you will be taken to a dialogue box, which allows you to perform style
sampling.
Descriptive
The Descriptive procedure calculates central tendency statistics, dispersion, and distribution for
several variables, displaying them in a single table and calculating standardized values (z scores).
To access this procedure, you have to follow the steps;
● Proceed to the Main menu
If you select Save Standardized Values as Variables, the z scores are saved and added to the data
in the Data Editor for further analysis. The z- score transformations allow the comparison of
variables that are recorded in different units of measurements. Here are some of the options that
will be displayed when you click on the descriptive statistics;
i. Reset allows you to reset all the selected options of the system and move them back
to default mode
ii. Cancel will enable you to ignore all variables selected
iii. Paste sends the syntax of the procedure to the syntax window
iv. Accept, having chosen the specifications, press the Accept button to obtain the
results of the procedure.
Also, suppose you click on the option. In that case, you will find characteristics of central
tendency, dispersion, distribution, and order the variables by the size of their means (in
ascending or descending order), alphabetically, or by the order in which the variables are
selected (the default value). Click on the continue button to generate the result of the descriptive
statistics.
Explore
The Explore procedure generates summary statistics and graphical representations such as Box
Plots, Stem-and-Leaf Plots, Histograms, Normality Plots, Level Scatter Plots, etc. To explore the
data with this feature,
● Choose Analyze in the main menu
● Select Descriptive statistics
● Choose Explore from the drop menu.
A dialogue box will be displayed where one or more dependent variables are selected (dependent
list). Also, you can choose one or more variable factors (Factors list). You can also choose
values to define groups of cases or select an identification variable to label cases (Label cases
by).
Also, it would help if you chose the confidence interval for the mean. After you have selected the
necessary option, you can click on the continue option.
Suppose you click on plots; you get the following dialogue box;
i. Boxplot
ii. Descriptive
● Stem-and-leaf
● Histogram
The Box Plot is a form of graphical representation to summarize the distribution of the values of
a variable. In this representation, instead of displaying the individual values, basic statistics of
the distribution are represented: the median, the 25th percentile, the 75th percentile, and the
extremes of the distribution. This graphic representation is based on the 5th statistical measure.
Information that can obtain from this type of graph:
● The position of the median determines the central tendency
● The width of the box gives an idea of the variability of the observations. If the median is
not in the center of the box, you can deduce that the distribution is asymmetric (if it is
close to the lower limit of the box, positive asymmetric, and if it is close to the upper
limit, negative asymmetric)
● These graphs are handy for comparing the distribution of values between different groups.
A stem and leaf diagram is a technique used to observe the shape of the frequency distribution
table. The stem-and-leaf plot is a graphical representation in which the data is placed in two
levels so that you can visualize the shape of the distribution. A stem-and-leaf plot consists of a
series of horizontal rows of numbers.
The first column, known as stems, consists of a vertical line drawn, and to its right are the
corresponding leaves in each row. The number used to designate a row is its stem; the rest of the
numbers in the row are called leaves. The stem is the largest portion of the number. The leaves
give secondary information about the number.
Graphs with normality test: This procedure checks whether the data come from a normal
population, and for this, it uses two graphs and an analytical contrast. (This procedure will be
used in more advanced practices).
Graphs
In addition to the graphs produced by the previous descriptive procedures, SPSS has a menu
specifically dedicated to obtaining graphical results. Here is how to use the graph
● Proceed to the graphs from the Main Menu
● The following window will open up displaying;
i. Chart builder
ii. Graphboard Template chooser
iii. Legacy dialogue
To proceed with making some interactive graphs, select the chart builder in the Main Menu and a
new window will be shown for you to make selections. Suppose you select the legacy dialogue;
then you can choose the type of graph that you want to make. Here you can choose to make a
Simple, Grouped, or Stacked Bar Chart. If you choose Grouped, click on Define, and the next
window is displayed.
To create a grouped bar chart, you must select a category variable and a grouping variable. In
this way, once the OK button is clicked, a bar chart of the chosen categories is generated,
grouped by the chosen grouping variable.
The standard procedure for generating graphs begins with choosing from the Main Menu of the
desired type of graph. After this choice, the program requests more information about the
characteristics of the desired graph. This is usually done through the Legacy dialogue. Once the
definition of the graph has been confirmed by clicking the OK button, the created graph appears
in the Results Viewer. Selecting it with the mouse and clicking the right button shows the
following figure where you choose Edit content.
CHAPTER FOUR
REGRESSION AND CORRELATION
Regression theory searches for a function that best expresses the relationship between two or
more variables. This practice only studies the situation of two variables. One of the most
interesting applications of regression is that it is Predicting, that is, knowing the value of one of
the variables, estimating the value that the other variable related to it will present.
Correlation theory studies the degree of dependence between the variables; its objective is to
measure the degree of fit between the theoretical function (fitted function) and the point cloud.
When the functional relationship linking the variables X and Y is a line, the regression and
correlation are called Linear Regression and Linear Correlation. Pearson's Linear Correlation
Coefficient gives a measure of Linear Correlation.
Where y are the unknown parameters that we will estimate and εi and yi are random
variables. εi is called a random error or disturbance. When starting a simple linear regression
study, the first step for the researcher is to plot the observations of both variables on a graph
called a scatter plot or point cloud. From this representation, the researcher determines if there
● Select or drag the ScatterDot option you want to represent (simple scatter, matrix scatter,
simple dot, overlay scatter, and 3-D scatter).
● Proceed to the variable menu to represent the X and Y-axis of the graph
● Click OK, and the scatter plot is displayed
● The graph will show the possible adequacy of the linear model and its increasing trend.
● A new will display for you to move the various variables x and y to their corresponding
field
● To carry out further settings, you need to click on the statistics option
● You will be taken to a new window where you need to select from several options
● Click Continue.
Normal
You can perform the normality analysis graphically using histogram, normal probability graph,
and Kolmogorov-Smirnov test
Histogram
You can superimpose a normal curve into a histogram graph. If the residuals follow a normal
distribution, the histogram bars should represent an appearance similar to that of said curve. Here
is how to go about it;
● Proceed to Graph
● Select chat builder
● Proceed to select histogram.
● Select the element properties menu, and you will see a pop-up window, select the variable
that represents the standardized residuals and mark the option Show normal curve
● Press Continue and OK, and the histogram is displayed with the normal curve
superimposed.
If you observe that the data does not reasonably approximate a normal curve, the sample size
considered is very small. That is why this kind of representation is not advisable in small sample
sizes.
Homoscedasticity
Here is how to go about Homoscedasticity;
● Select Analyze
● Proceed to Regression
● Select Linear
● Press the Graphs button and select the variable in the corresponding dialog box* ZRESID
for the Y-axis (this variable represents the standardized residuals) and the variable *
ZPRED (a variable that represents the standardized predicted values) for the X-axis
● Press Continue and Accept, and the graph will be displayed
It is important to note that this graph is handy to detect inadequacy of the proposed model to the
data, possible deviations from the linearity hypothesis. If you observe non-random behavior
trajectories, this indicates that the proposed model does not adequately describe the data.
It is important to note that SPSS provides the value of the Durbin-Watson statistic but does not
show the associated p-value, so the corresponding tables must be used. The Durbin-Watson
statistic measures the degree of autocorrelation between the residual corresponding to each
observation and the previous one.
● If X = Top face of a coin. If the top face of the coin takes the values X = {1, 0} with
probabilities P (X) = {1/2, 1/2}. Thus, the probability of X if it takes;
● The value 1, which is denoted by P [X = 1], will be 1/2 (P [X = 1] = 1/2)
● In the dialogue box, proceed to Function group, select PDF & Noncenteral PDF.
● In functions and special variables, the corresponding distribution can be selected, which
is the pdf binomial. Ensure that you double-click it so that it can display in the numeric
expression box (PDF.BINOM(quart, n, prob).
● Input the values for the different variables in the numeric expression box.
● Ensure that you type a name for the target variable and click on the OK button below.
Distribution Function
The Distribution Function of the random variable X is denoted as F {X}. It is defined as the
probability that the path X takes, whose value is less than or equal to x. To obtain values of the
distribution function of a specific model, in SPSS, the option FDA and FDA not centered is
selected in the Function group. You must know the value of the Variable and the parameters
that determine the model. You can find the various model in functions and variables:
● CDF.BERNOUILLI (c, prob): Numeric. Returns result from the cumulative
probability that a value of the Bernoulli distribution, with the given probability
parameter, is less than or equal to c. The probability that the variable X is less than or
equal to c, P [ X ≤ c ], where X is a random variable.
● CDF.BINOM (c, n, prob): Numeric. Returns result from the cumulative probability
that the number of successes in n trials, with the probability of success p in each of
them, is less than or equal to c. That is, the probability that the variable X is less than or
equal to c, P [ X ≤ cant ], where X is a random variable with Binomial distribution of
parameters n and prob. When n is 1, the value is the same as that of CDF.
● CDF.POISSON (c, mean): Numeric. Returns the cumulative probability that a value
from the Poisson distribution, with the specified mean or rate parameter, is less
than or equal to c. That is the probability that the variable X is less than or equal to c, P
[ X ≤ cant ], where X is a random variable with the Poisson distribution of the Mean
parameter.
● CDF.NORMAL (c, mean, typical_dev): Numeric. Returns the cumulative probability
that a value from the Normal distribution, with the specified mean and standard
deviation, is less than or equal to c. That is, the probability that the variable X is less
than or equal to c, P [ X ≤ qty ], where X is a random variable with Normal distribution
of mean parameters and typical_dev.
● A dialogue box will be displayed where the variable to be analyzed is passed to the
Dependents window.
● By default, SPSS calculates the confidence interval at a level of 95%. To modify this
level, press Statistics and choose your desired level of the confidence interval.
● A dialogue box will be displayed where the variable is passed to the Contrast variables
window.
● The confidence level can be modified by pressing the Options button.
● To obtain the interval, press Continue and then OK.
Practical Question
In a sample of 9 tomato juice preparations, the following data has been obtained on vitamin C
content in mg / 100 cc: 21.60; 19.72; 18.92; 23.01; 17.98; 22.06; 25.01; 21.98; 20.80. Assumed
that the vitamin C content of tomato juice is normally distributed:
1. Estimate the mean vitamin C content of tomato juice.
2. Calculate a 95% confidence interval for said quantity.
Solution Procedure
1. Estimate the average vitamin C content of tomato juice.
The mean is obtained by adding the respective vitamin content and dividing by the number of
observations. This will give you 21.23 as the answer.
Practical Question
Two laboratories A and B carry out nicotine determinations in 4 tobacco units, with the
following results: Lab. A: 16, 14, 13, 17. Lab. B: 18, 21, 18, 19
Assuming that the two populations examined are normal and independent with equal variance,
estimate the difference in the mean nicotine content of tobacco at a 95% confidence level.
Solution Procedure
Here is the step of how to solve the problem;
● Ensure to enter the data correctly in the editor text.
● In the main menu, select Analyze.
● Choose the option Compare means and select T-test for independent samples.
● A dialogue box will be displayed for you to enter in the Contrast variables field: Lab A.
You will also need to enter the Grouping Variable field: Lab B.
● The next step is to define the group by clicking on the Define Groups button.
● Enter the values assigned to each sample and press Continue.
● Select Accept to display the output or result.
The p-value is equal to 0.356, which is greater than the significance level of 0.05. This result
indicates that the equality of variances should not be rejected with a confidence level of 95%.
From this result, it can be deduced that the mean nicotine content differs from one laboratory to
another. The nicotine content is higher in laboratory B than in laboratory A.
Confidence Intervals for the difference of means in related samples
In this case, the observations are entered so that each sample is in a column of the SPSS Data
Editor. Here is how to perform a confidence interval for the difference of means in related
samples using SPSS;
● In the main menu, select Analyze
● Proceed to select compare means
● Select T-test for related samples, and a new dialogue will be displayed
● The pairs of variables to be compared are selected simultaneously and passed to Related
variables
● In Options, you can change the confidence level
● Click Continue and OK.
CHAPTER SEVEN
Contrasts of Hypothesis
Hypothesis testing is a statistical process by which an investigation is carried out to verify a
claim of an observed result. All hypothesis testing is based on two mutually exclusive
propositions:
The hypothesis H0 consists of a specific statement about the probability distribution or the value
of the distribution parameters. The name null means without value, effect, or consequence. It
means that H0 represents the hypothesis that will remain unchanged unless the data presented is
false. In contrast, Hypothesis H1 negates the null, and it includes everything that H0 excludes.
Therefore, the researcher's interest should be focused on H1.
Types of contrasts
Parametric tests: Once a path with a certain distribution is known, statements are made about
the parameters of that distribution.
Non-parametric tests: It is the opposite of parametric tests because the distribution of the
observations is unknown; therefore, statements are not made.
One-sided test: If the hypothesis gives rise to a critical region "on one side of the parameter
value," we will say that the test is one-sided or one-tailed.
● In the corresponding output, one or more quantitative variables are selected to contrast
them with the same assumed value
● By pressing Options, you can choose the confidence level.
● Click Continue and OK. A statistical summary is obtained for the sample, and the output
of the procedure
Practical Question
An experiment is conducted to study the level (in minutes) required for a desert lizard's body
temperature to reach 45º from its normal body temperature while in the shade. The following
observations were obtained: 10.1; 12.5; 12.2; 10.2; 12.8; 12.1; 11.2; 11.4; 10.7; 14.9; 13.9; 13.3.
Suppose that the variable X: Time to reach 45º follows a Normal law
1. Can it be concluded that the mean time required to reach the lethal dose is 15
minutes?
2. Can it be concluded that the mean time required to reach the lethal dose is less than
13 minutes?
Solution Procedure
1. The following hypothesis test is carried out:
● A new window will be displayed to select one or more quantitative variables, and a
different T-Test is calculated for each variable.
● Next, select a single grouping variable and click Define Groups to specify the codes of the
groups that you want to compare. Selecting Define Groups, a new screen is displayed
where the number of groups to be compared is specified.
● Press Continue and then Accept. The statistical summary for the two samples and the
output of the procedure will be displayed.
Practical Question 2
Let say you are asked to compare two populations of isolated frogs. The length of the two
samples of populations is measured and expressed in millimeters as follows;
Population 1: 20.1; 22.5; 22.2; 30.2; 22.8; 22.1; 21.2; 21.4; 20.7; 24.9; 23.9; 23.3
Population 2: 25.3; 31.2; 22.4; 23.1; 26.4; 28.2, 21.3, 31.1, 26.2, 21.4
Test the hypothesis of equality of means at a significance level of 1%. (Assuming the length is a
normal distribution).
Solution Procedure
Here is how to solve the problem
● To perform a contrast of independent samples, the data must be entered in the SPSS
Editor as follows:
● Proceed to the main menu and select Analyze.
● Ensure that you select Compare means and click T-test for independent samples, and a
new window will be displayed
● In the test variable window, ensure that you input length as the variable. While in the
grouping variable, select frog as group variable.
● Click on Define Groups and input 1 in group 1 and 2 in group 2
● Press Continue and select Options. The box for the percentage of the confidence interval
is filled with 99.
● Press Continue and Accept, and the result will be displayed.
● A new window will be displayed for you to select the variable that you want to compare.
It is important to note that the test can be performed simultaneously for more than one
pair of variables.
● Click Continue and then the OK button to provide the statistical summary for the two
samples.
Practical Question
A study was carried out, in which ten individuals participated, to investigate the effect of
physical exercise on the level of cholesterol in plasma. Before exercise, blood samples were
taken to determine the cholesterol level of each individual. Afterward, the participants were put
through an exercise program. Blood samples were taken again at the end of the exercises, and a
second cholesterol level reading was obtained. The results are shown below.
First Cholesterol level: 182; 230; 160; 200; 160; 240; 260; 480; 263; 240
Second Cholesterol level: 190; 220; 166; 150; 140; 220; 156; 312; 240; 250
Check to know if physical exercise has lowered your cholesterol level to a 95% confidence level.
Solution Procedure
Here is the procedure of how to solve the problem presented above;
● To perform a paired sample contrast, the data must be entered in the SPSS Editor as
follows:
● Ensure that you choose Compare means and tap paired samples T-test
● A new window will be displayed for you to select the variables that you want to compare,
such as the first cholesterol level and the second cholesterol level
● Click Continue and then the OK button to provide the statistical summary for the two
samples.
● Proceed to select the Options button to choose any from the following options; descriptive
statistics, quartiles, and control the treatment of data
● Press the Continue button and choose Accept. The outputs will be displayed
Practical Question
Suppose you roll a die 720 times and get the results shown in the table.
Solution Procedure
Here is how to tackle the problem using SPSS software;
● The first step required is that you need to enter the data in the SPSS editor
● The next step is to weight the cases of the values. You can do this by proceeding to data
and selecting weight cases. You can choose to weight cases by displaying the variable
name. Click on the OK button
● The next step is to proceed to the main menu bar and select Analyze.
● Ensure that you select Non-parametric tests.
● Click legacy dialog boxes and select Chi-square.
● In the corresponding window, select frequency into the test variable window.
● Proceed to select the Options button to choose any from the following options;
descriptive statistics, quartiles, and control the treatment of data.
● Press the Continue button and choose Accept. The outputs will be displayed.
● The experimental value of the Chi-square test statistic is equal to 0.683, and the associated
p-value is 0.984 (greater than 0.05); therefore, the null hypothesis is not rejected.
Consequently, the die is well constructed.
CHAPTER EIGHT
Statistical Design of Experiments
The statistical design of experiments includes sets of techniques or statistical construction
methods that allow researchers to carry out the complete process of planning an experiment to
obtain appropriate data and valid or objective conclusions. Different statistical models can be
used in an experiment, but we will consider only CRD and RCBD. These two models are the
most used experimental model in statistics.
● Also, in the Fixed factors field, enter the Treatments factor and the Firs block.
● To indicate that it is a model without interaction between the treatments and the blocks,
click on model and indicate that it is an additive model in the corresponding output.
● By default, SPSS has a full Factorial model marked, so Custom should be noted.
● Let say you are studying only the main effects of two factors; therefore, select Type:
Main effects and the two factors, Treatments and Firs, into the Model field
● You will observe no distinction between the two factors.
How to Import a Data Source from Excel to
SPSS
Importing an excel file to SPSS is very easy, especially when importing the Excel file as a data
source. Here is how to go about it;
● Select the File menu, go to Open and then click data.
● From the drop-down menu at the bottom of this window, select the file type (for example,
.xls, .xlsx, or .xlsm format) and navigate to the Excel file you want to import.
● Click Open
● The Opening Excel Data Source dialog should appear, allowing you to select an Excel
spreadsheet to import. Check the box Read variable names from the first row of data only
if the Excel spreadsheet you have selected has variable names in the first row. When you
are satisfied with your selection, click OK
Bill Wesley is a researcher, writer, and Teacher with over thirteen years of experience. He started
using the IBM SPSS Statistics software as far back as 2009. He had conducted several kinds of
research and postulated models using this great software. Bill graduated from the University of
California, where he studied Economics and Statistics. He furthered his studies by studying
Applied Statistics at Master's level.
Bill is married and the father of three children. His hobbies are research, singing, writing, and
meditating. He is a statistics teacher by profession, and he has spent over thirteen years teaching
high school students. He loves research and has spent half of his whole life studying different
concepts and how they affect human lives. Bill has written several books as well as research
publications and he has won diverse awards for his contributions to humanity.