Professional Documents
Culture Documents
Sessions 8 & 9 - Sample and Sampling Distributions
Sessions 8 & 9 - Sample and Sampling Distributions
Distributions
Sessions- 8 & 9
BUSINESS STATISTICS
Introduction to Sampling Process
• Sample Unit, Sample Frame and target population: list of the items of
the (target) population from which a sample is to be obtained
• Characteristics of a Population ~ Parameter
• Characteristics of a Sample ~ Statistic
• Sampling Error: Selection Bias
• Sample size [n] vs cost: a trade-off
• Non-Sampling Error: Measurement Bias
• Defects: observational error
• Clerical errors: Recording, Copying, editing
• No-response: Missing Values
Sampling Distribution of x
σ
μX μ
and
σX
n
Using E[X] and Var (X)
Properties of Moments
Sample Mean Sampling Distribution: Standard Error of the Mean
.2 .2
.1 .1
0
X
0
18 19 20 21 22 23 24
_
18 20 22 24 X
A B C D
Intuitive representation of effect of n on standard error
Z-value for Sampling Distribution
of the Mean
• Z-value for the sampling distribution of : X
( X μX ) ( X μ)
Z
σX σ
n
•
μx μ Distribution
μ x
(i.e. x is unbiased ) Normal Sampling
Distribution
(has the same mean)
μx
x
Sampling Distribution Properties
Larger
sample size
Smaller
sample size
μ x
Sample Mean Sampling Distribution:
If the Population is not Normal
μx μ σ
and
σx
n
Central Limit Theorem
the sampling
As the n↑ distribution of
sample the sample
size gets mean becomes
large almost normal
enough… regardless of
shape of
population
x
Sample Mean Sampling Distribution:
If the Population is not Normal
Population Distribution
Sampling distribution
properties:
Central Tendency
μx μ
μ x
Variation Sampling Distribution
σ (becomes normal as n increases)
σx Larger
n Smaller
sample size
sample
size
μx x
Visualizing CLT
How do we define a large sample?
• Approximated by a
Sampling Distribution
normal distribution if: P( ps)
.3
• nπ 5 .2
.1
0
and
0 .2 .4 .6 8 1 p
n(1 π ) 5
where
π (1 π )
μp π and σp
n
(where π = population proportion)
Z-Value for Proportions
Standardize p to a Z value with the formula:
p p
Z
σp (1 )
n
Example
• If the true proportion of bankers who support M & A
of PSU Banks is π = 0.4, what is the probability that
a sample of 200 managers yields a sample
proportion between 0.40 and 0.45?
(1 ) 0.4(1 0.4)
Find σ p : σ p 0.03464
n 200
Standardized
Sampling Distribution Normal Distribution
0.4251
Standardize
N n
fpc
N 1
Using The fpc In Calculating Standard Errors
Standard Error of the Mean for Finite Populations
N n
X
n N 1
(1 ) N n
p
n N 1
Using The fpc Reduces The Standard Error
• Resulting in more precise estimates of population parameters
40 1000 100
X 3.8
100 1000 1
Sampling Distribution of p
• Standard Deviation of p
Finite Population Infinite Population
N n p (1 p) p(1 p)
p p
N 1 n n
• Sampling distributions
• The sampling distribution of the mean
• For normal populations
• For non-normal populations: Using the Central Limit Theorem
• The sampling distribution of a proportion
• Calculating probabilities using sampling distributions
• To know when finite population corrections are needed
• To know how to utilize finite population correction factors in calculating
standard errors-
• FINAL EXAMPLE
Sampling Distribution of x
Example: St. Andrew’s College
What is the probability that a simple random
sample of 30 applicants will provide an estimate of
the population mean SAT score that is within +/-10
of the actual population mean ?
In other words, what is the probability that x will
be between 1687 and 1707?
Sampling Distribution of x
Example: St. Andrew’s College
Sampling
Distribution
87.4
of x x 15.96
n 30
for SAT
Scores
x
E( x ) 1697
Sampling Distribution of x
Example: St. Andrew’s College
Step 1: Calculate the z-value at the upper endpoint of
the interval.
z = (1707 - 1697)/15.96= .63
Step 2: Find the area under the curve to the left of the
upper endpoint.
P(z < .63) = .7357
Sampling Distribution of x
Example: St. Andrew’s College
Cumulative Probabilities for
the Standard Normal Distribution
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
. . . . . . . . . . .
.5 .6915 .6950 .6985 .7019 .7054 .7088 .7123 .7157 .7190 .7224
.6 .7257 .7291 .7324 .7357 .7389 .7422 .7454 .7486 .7517 .7549
.7 .7580 .7611 .7642 .7673 .7704 .7734 .7764 .7794 .7823 .7852
.8 .7881 .7910 .7939 .7967 .7995 .8023 .8051 .8078 .8106 .8133
.9 .8159 .8186 .8212 .8238 .8264 .8289 .8315 .8340 .8365 .8389
. . . . . . . . . . .
Sampling Distribution of x
Example: St. Andrew’s College
Sampling
Distribution x 15.96
of x
for SAT
Scores
Area = .7357
x
1697 1707
Sampling Distribution of x
Example: St. Andrew’s College
Step 3: Calculate the z-value at the lower endpoint of
the interval.
z = (1687 - 1697)/15.96= - .63
Step 4: Find the area under the curve to the left of the
lower endpoint.
P(z < -.63) = .2643
Sampling Distribution of x SAT Scores
for
Example: St. Andrew’s College
Sampling
Distribution x 15.96
of x
for SAT
Scores
Area = .2643
x
1687 1697
Sampling Distribution of x SAT Scores
for
Example: St. Andrew’s College
Step 5: Calculate the area under the curve between
the lower and upper endpoints of the interval.
P(-.68 < z < .68) = P(z < .68) - P(z < -.68)
= .7357 - .2643
= .4714
The probability that the sample mean SAT score will
be between 1687 and 1707 is:
Sampling
Distribution x 15.96
of x
for SAT
Scores
Area = .4714
x
1687 1697 1707
Relationship Between the Sample Size
and the Sampling Distribution of x
Example: St. Andrew’s College
• Suppose we select a simple random sample of 100 applicants
instead of the 30 originally considered from a total of 900 applicants .
• E( x) = m regardless of the sample size. In our
example, E(x) remains at 1697.
• Whenever the sample size is increased, the standard
error of the mean x is decreased. With the increase
in the sample size to n = 100, the standard error of
the mean is decreased from 15.96 to:
N n 900 100 87.4
x .94333(8.74) 8.2
N 1 n 900 1 100
Relationship Between the Sample Size
and the Sampling Distribution of x
Example: contd…*applying concept of sampling from finite population
With n* = 100,
x 8.2
With n = 30,
x 15.96
x
E( x ) 1697
Relationship Between the Sample Size
and the Sampling Distribution of x
Example: St. Andrew’s College
• Recall that when n = 30, P(1687 < x < 1707) = .4714.
• We follow the same steps to solve for P(1687 < x
< 1707) when n = 100 as we showed earlier when
n = 30.
• Now, with n = 100, P(1687 < x < 1707) = .7776.
• Because the sampling distribution with n = 100 has a
smaller standard error, the values of x have less
variability and tend to be closer to the population
mean than the values of x with n = 30.
Relationship Between the Sample Size
and the Sampling Distribution of x
Example: St. Andrew’s College
Sampling x 8.2
Distribution
of x
for SAT
Scores
Area = .7776
x
1687 1697 1707