Professional Documents
Culture Documents
Chapter 1
Chapter 1
Chapter 1
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 1
Objectives
In this chapter you learn:
■ To understand issues that arise when defining
variables.
■ How to define variables.
■ To understand the different measurement scales.
■ How to collect data.
■ To identify different ways to collect a sample.
■ To understand the issues involved in data
preparation.
■ To understand the types of survey errors.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 2
Classifying Variables By Type
DCOVA
▪ Categorical (qualitative) variables take categories as
their values such as “yes”, “no”, or “blue”, “brown”,
“green”.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 3
Examples of Types of Variables
DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 4
Measurement Scales
DCOVA
A nominal scale classifies data into distinct
categories in which no ranking is implied.
Do you have a
Facebook profile? Yes, No
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 5
Measurement Scales (con’t.)
DCOVA
An ordinal scale classifies data into distinct
categories in which ranking is implied.
Categorical Variable Ordered Categories
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 6
Measurement Scales (con’t.)
DCOVA
▪ An interval scale is an ordered scale in which the
difference between measurements is a meaningful
quantity but the measurements do not have a true
zero point.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 7
Interval and Ratio Scales
DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 8
Types of Variables
DCOVA
Variables
Categorical Numerical
Continuou
Nominal Ordinal Discrete
s
Examples: Examples: Ratings Examples: Examples:
■ Marital Status ■ Good, Better, Best ■ Number of Children ■ Weight
■ Political Party ■ Low, Med, High ■ Defects per hour ■ Voltage
■ Eye Color (Ordered Categories) (Counted items) (Measured
(Defined Categories) characteristics)
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 9
Data Is Collected From Either A
Population or A Sample
DCOVA
POPULATION
A population contains all of the items or
individuals of interest that you seek to study.
SAMPLE
A sample contains only a portion of a
population of interest.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 10
Population vs. Sample DCOVA
Population Sample
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 11
Collecting Data Via Sampling Is Used
When Doing So Is DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 12
A Sample Is Analyzed To Estimate
Characteristics Of An Entire Population
DCOVA
■ A population parameter summarizes the value
of a specific variable for a population.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 13
Sources Of Data Arise From
The Following Activities DCOVA
■ Capturing data generated by ongoing business
activities.
■ Distributing data compiled by an organization or
individual.
■ Compiling the responses from a survey.
■ Conducting a designed experiment and
recording the outcomes.
■ Conducting an observational study and
recording the results.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 14
Examples of Data Collected From
Ongoing Business Activities
DCOVA
■ A bank studies years of financial transactions to
help them identify patterns of fraud.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 15
Examples Of Data Distributed
By An Organization or Individual
DCOVA
■ Financial data on a company provided by
investment services.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 16
Examples of Survey Data
DCOVA
■ A survey asking people which laundry detergent
has the best stain-removing abilities.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 17
Examples of Data From A
Designed Experiment
DCOVA
■ Consumer testing of different versions of a
product to help determine which product should
be pursued further.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 20
Sources of Data DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 21
A Sampling Process Begins With A
Sampling Frame
DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 22
Types of Samples DCOVA
Samples
Simple Stratified
Random
Judgment Convenience
Systematic Cluster
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 23
Types of Samples:
Nonprobability Sample DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 24
Types of Samples:
Probability Sample DCOVA
Simple
Systematic Stratified Cluster
Random
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 25
Probability Sample:
Simple Random Sample DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 27
Probability Sample:
Systematic Sample DCOVA
■ Decide on sample size: n
■ Divide frame of N individuals into groups of k
individuals: k=N/n
■ Randomly select one individual from the 1st
group
■ Select every kth individual thereafter
First Group
N = 40
n=4
k = 10
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 28
Probability Sample:
Stratified Sample DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 29
Probability Sample
Cluster Sample DCOVA
■ All items in the selected clusters can be used, or items can be chosen
from a cluster using another probability sampling technique.
Population
divided into
16 clusters. Randomly selected
clusters for sample
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 30
Probability Sample:
Comparing Sampling Methods
DCOVA
■ Simple random sample and Systematic sample:
■ Simple to use.
■ May not be a good representation of the population’s
underlying characteristics.
■ Stratified sample:
■ Ensures representation of individuals across the
entire population.
■ Cluster sample:
■ More cost effective.
■ Less efficient (need larger sample to acquire the
same level of precision).
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 31
Data Cleaning Is An Important Data
Preprocessing Task Prior To Analysis DCOVA
Data cleaning corrects irregularities in the data:
■ Invalid variable values, including:
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 32
Cleaning Invalid Variable Values
Can Be Semi-Automated DCOVA
■ Invalid variable values can be identified by
simple scanning techniques, for example:
■ Non-numeric entries for numerical variables.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 33
Examples Of Coding Errors
DCOVA
Copy-and-paste or data import can result in poor
recording or entry of data.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 34
Data Integration Errors From Combining
Two Different Computerized Data Sources
DCOVA
■ Data integration errors often requires
time-consuming manual effort.
■ Some examples:
■ Variable names or definitions may differ.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 35
Data Can Be Formatted and / or
Encoded In More Than One Way
DCOVA
■ Some electronic formats are more readily
usable than others.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 36
Missing Values Are Values Not
Collected For A Variable
DCOVA
■ Survey data may include answers for which no
response was given by the survey taker.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 37
Data Cleaning Cannot Be A Fully
Automated Process DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 38
Other Data Preprocessing Tasks
DCOVA
■ Data Formatting
■ Rearranging data structure or changing electronic encoding of
the data or both.
■ Stacking and Unstacking Data
■ Analysis of a numerical variable may require subdividing that
data into two or more groups.
■ Unstacking involves creating separate numerical variables for
the different groups.
■ Stacking involves pairing the one numerical variable with a
second categorical variable.
■ Recoding Variables
■ Redefining categories for a categorical variable.
■ Transforming a numerical variable into a categorical variable.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 39
Recoding Of Variables
DCOVA
■ Recoding a variable can either supplement or replace
the original variable.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 40
Evaluating Survey Worthiness
DCOVA
■ What is the purpose of the survey?
■ Is the survey based on a probability sample?
■ Coverage error – appropriate frame?
■ Nonresponse error – follow up.
■ Measurement error – good questions elicit good
responses.
■ Sampling error – always exists.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 41
Types of Survey Errors DCOVA
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 42
Types of Survey Errors (continued)
DCOVA
■ Coverage error
Excluded from
frame
■ Nonresponse error
Follow up on
nonresponses
■ Sampling error
Random
differences from
sample to sample
■ Measurement error
Bad or leading
question
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 43
Ethical Issues About Surveys
DCOVA
■ Coverage error and nonresponse error can be
leveraged by survey designers to purposely
bias survey results.
■ Sampling error can be an ethical issue if the
findings are purposely not reported with the
associated margin of error.
■ Measurement error can be an ethical issue:
■ Survey sponsor chooses leading questions.
■ Interviewer purposely leads respondents in a
particular direction.
■ Respondent(s) willfully provide false information.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 44
Chapter Summary
In this chapter we have discussed:
■ Understanding issues that arise when defining
variables.
■ How to define variables.
■ Understanding the different measurement scales.
■ How to collect data.
■ Identifying different ways to collect a sample.
■ Understanding the issues involved in data
preparation.
■ Understanding the types of survey errors.
ALWAYS LEARNING Copyright © 2021, 2017, 2014 Pearson Education, Inc. Slide 45