Professional Documents
Culture Documents
Stat609 Sp23 LCN Unit2
Stat609 Sp23 LCN Unit2
Stat609 Sp23 LCN Unit2
STATISTICAL INFERENCE
CHAPTER 7: Sampling and Sampling
Distributions
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7
Sampling and Sampling Distributions
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
7-1 Introduction
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
7-3a Simple Random Sampling
(slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 7.1: Sampling Families to Analyze
Annual Incomes (slide 1 of 2)
Objective: Using SPSS to illustrate how the random number
function, RAND, can be used to generate simple random samples.
The file name: Annual Incomes.
Solution: Consider the frame of 40 families with annual incomes.
Choose a simple random sample of size 10 from this frame.
SPSS Steps:
From the menus choose:
Data Select Cases Random Sample of Cases.
The ‘Random Sample of Cases’, selects a random subset of cases
based on a pseudo-random number generated by SPSS.
Enter Exactly 10 cases from the first 40 cases.
To obtain more random samples of size 10 (for comparison), you
would need to go through this process repeatedly.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 7.1: Sampling Families to Analyze
Annual Incomes (slide 2 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
7-3b Systematic Sampling (slide 1 of 2)
Example 7.2:
Suppose you are asked to select a random sample of 250 names
from a large company’s directory of employees.
There are 55,000 names listed in alphabetical order in the
directory.
First, you divide the population size by the sample size:
55,000/250 = 220.
Next, you use a random mechanism to choose a number
between 1 and 220. Suppose this number is 131.
Then you choose the 131st name and every 220th name
thereafter.
So, you would choose name 131, name 351, name 571, and so
on. The result is a systematic sample of size n = 250.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
7-3c Stratified Sampling (slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sources of Estimation Error
(slide 2 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Finite Population Correction (slide 3 of 3)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
7-4d The Central Limit Theorem (slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 7.5: Average Winnings From A
Wheel of Fortune (slide 1 of 2)
Objective: To illustrate the central limit theorem by a simulation of
winnings in a game of chance.
Solution: The population is the set of all outcomes you could obtain
from a single spin of the wheel—that is, all dollar values from $0 to
$1000.
Each spin results in one randomly sampled dollar value from this
population.
Each replication of the experiment simulates n spins of the wheel and
calculates the average—that is, the winnings—from these n spins.
A histogram of winnings is formed, for any value of n, where n is the
number of spins.
As the number of spins increases, the histogram starts to take on more
and more of a bell shape.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 7.5: Average Winnings From A
Wheel of Fortune (slide 2 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
7-4e Sample Size Selection (slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
7-4e Sample Size Selection (slide 2 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-1 Introduction
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-2b Other Sampling Distributions
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-3 Confidence Interval for a Mean
(slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.1: IQ Tests (slide 2 of 2)
[100.5, 111.5]
Does not contain 100
99% confidence interval:
[96.95, 115.05]
Contains 100
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.2: Fuel Usage of “Ultra-Green” Cars
(slide 1 of 4)
Use the sample data to estimate the mean MPG of all ultra-green cars
with 90% confidence.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.2: Fuel Usage of “Ultra-Green” Cars
(slide 2 of 4)
Summary Statistics:
and
Assume that MPG follows a normal distribution
Construct the 90% confidence interval for the population mean
For a 90% confidence interval:
With 90% confidence, the average MPG of all ultra-green cars is between
92.86 MPG and 100.18 MPG
The manufacturers claim that the ultra-green car will average 100 MPG
cannot be rejected since 100 falls within the interval.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.2: Fuel Usage of “Ultra-Green”
Cars (slide 3 of 4)
SPSS Steps:
From the menus choose:
Analyze Compare Means One-Sample T Test..
Click MPG and move it onto the Test Variable(s) field.
Click Options and enter 90% for confidence interval
percentage.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.2: Fuel Usage of “Ultra-Green”
Cars (slide 4 of 4)
SPSS Output:
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-4 Confidence Interval for a Total
(slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.3: Estimating Total Tax
Refunds
(slide 1 of 2)
Based on these calculations, the IRS can be 95% confident that it will need
to pay out somewhere between about 244 and 346 million dollars to these
1,000,000 taxpayers.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-5 Confidence Interval for a Proportion
(slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Confidence Interval for a Proportion
(slide 2 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.4: Estimating the Response to a
New Sandwich (slide 1 of 4)
Objective: To illustrate the procedure for finding a
confidence interval for the proportion of customers
who rate the new sandwich at least 6 on a 10-point
scale.
The file name: Satisfaction Ratings
Solution: A sample of 40 customers who ordered a
new sandwich were surveyed. Each was asked to
rate the sandwich on a scale of 1 to 10.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.4: Estimating the Response to a
New Sandwich (slide 2 of 4)
,
Sample proportion,
Std Error of proportion
For 95% confidence level, 960
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.4: Estimating the Response to a
New Sandwich (slide 3 of 4)
SPSS Steps:
From the menus choose:
Analyze Compare Means One- Sample Proportions.
Select Satisfaction and move it onto the Test Variable(s) field.
Select Values under Define Success criteria and enter 6 7 8 9 10
(Multiple values must be separated by spaces)
Click on Confidence Intervals and select Wald.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.4: Estimating the Response to a
New Sandwich (slide 4 of 4)
SPSS Steps & Output:
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.5: Auditing for Price Errors (slide 1 of
2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.5: Auditing for Price Errors (slide 2 of
2)
,
Sample proportion,
Std Error of proportion
645
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-7 Confidence Interval for the
Difference Between Means
One of the most important applications of statistical
inference is the comparison of two population
means.
There are many applications to business.
For statistical reasons, independent samples must
be distinguished from paired samples.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-7a Independent Samples
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.6: Reliability of Treadmill
Motors at SureStep (slide 1 of 3)
Objective: To find a confidence interval for the
difference between mean lifetimes of motors, and
to see how this confidence interval can help
SureStep choose the better supplier. The file name:
Treadmill Motors
Solution: SureStep Company installs motors from
supplier A on 30 of its treadmills and motors from
supplier B on another 30 of its treadmills.
It then runs these treadmills and records the
number of hours until the motor fails.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.6: Reliability of Treadmill
Motors at SureStep (slide 2 of 3)
SPSS Steps:
From the menus choose:
Analyze Compare Means Independent- Samples T Test.
Select lifetimes and move it onto the Test Variable(s) field.
Select Suppliers and transfer it to the box labeled Grouping Variable.
Click on the <Define Groups> button. In the window displayed, enter
values 1 and 2 for Suppliers 1 and 2 respectively.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.6: Reliability of Treadmill
Motors at SureStep (slide 3 of 3)
SPSS Output:
With 95% confidence, the confidence interval for the difference
between means extends from -47.549 to 233.815.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.7: consumer advocate (slide 1 of 3)
A consumer advocate analyzes the nicotine content in two brands
of cigarettes.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.7: consumer advocate (slide 2 of 3)
The 95% confidence interval for the difference between the two
means ranges from -0.41 to -0.13.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.7: consumer advocate (slide 3 of 3)
SPSS Steps:
From the menus choose:
Analyze Compare Means Summary Independent-
Samples T Test.
Complete the dialog box as shown. SPSS Output:
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Equal-Variance Assumption
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-7b Paired Samples
then click on the arrow button to move the selection pair into
the Paired Variables box.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.8: Husband and Wife Reactions
to Presentations (slide 3 of 3)
The sample mean Husband minus Wife difference is 1.629 and a
95% confidence interval for this difference extends from 1.057 to
2.200
SPSS Output:
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-9 Sample Size Selection
(slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-9a Sample Size Selection for Estimation
of the Mean
The appropriate sample size for estimation of the mean can be
calculated from the formula for the confidence interval for the
mean, by setting and solving for n:
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.9: Sample Size Selection for
Estimating Reaction
Objective: To find the sample size of customers required to
achieve a sufficiently narrow confidence interval for the mean
rating of the new sandwich.
Solution: The fast-food manager in Example 8.1 surveyed 40
customers, each of whom rated a new sandwich on a scale of 1 to
10.
Based on the data, a 95% confidence interval for the mean rating
of all potential customers extended from 5.739 to 6.761, with a
half-length (Margin of error) of 0.511.
Find how large a sample would be needed to reduce this half-
length to approximately 0.3, s=1.597 .
Using the sample size formula yields the following, rounded up
to 109.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
8-9b Sample Size Selection for
Estimation of Other Parameters
The sample-size analysis for the mean carries over
with very few changes to other parameters.
Sample size formula for estimating a proportion:
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.10: Sample Size Selection for
Estimating the Proportion
Objective: To find the sample size of customers required to
achieve a sufficiently narrow confidence interval for the
proportion of customers who have tried the new sandwich.
Solution: The data set is the same as in Examples 8.1 and 8.9.
Now the fast-food manager wants to estimate the proportion of
customers who have tried its new sandwich.
She wants a 90% confidence interval for this proportion to have
half-length 0.05.
If she is fairly sure that the proportion who have tried the new
sandwich is around 0.3, she can use pest = 0.3.
Round up to 228.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 8.11: Sample Size Selection for
Analyzing Customer Complaints
Objective: To see how many employees in each experimental group must be
sampled to achieve a sufficiently narrow confidence interval for the difference
between the mean numbers of complaints.
Solution: A customer service center has two types of employees: those who
have had a recent course in dealing with customers (but little actual experience)
and those with a lot of experience dealing with customers (but no formal
course).
The company wants to estimate the difference between the two types of
employees in terms of the average number of customer complaints regarding
poor service in the last six months.
The company plans to obtain information on a randomly selected sample of
each type of employee, using equal sample sizes.
How many employees should be in each sample to achieve a 95% confidence
interval with approximate half-length 2 ? Assume =5
Round up to 49.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.