Professional Documents
Culture Documents
Lec 10-13
Lec 10-13
Lec 10-13
Mean x
Standard
Deviation s
Variance s2 2
Binomial ^
p p
Proportion
Sampling Distribution
2.5 x 2.5
5.2
Properties of Sampling
Distributions:
Unbiasedness and
Minimum Variance
Point Estimator
Probability Distribution
x 0 2 3
1 1 1
p(x)
3 3 3
= 1.667 2 = 1.556
= 1.247
Example
Sampling Distribution of x for n = 2
(3 possible samples, each with a sample mean)
3 5
x 1
2 2
1 1 1
p(x)
3 3 3
Dispersion
m = 50 x
x
n
Sampling with Sampling Distribution
replacement n=4 n =16
x = 5 x = 2.5
mx- = 50 x
Standardizing the
Sampling Distribution of x
x x x
z
x
Sampling n Standardized Normal
Distribution Distribution
sx s=1
mx x m =0 z
Thinking Challenge
You’re an operations
analyst for AT&T. Long-
distance telephone calls
are normally distributed
with = 8 min. and = 2
min. If you select random
samples of 25 calls, what
percentage of the sample
means would be between
7.8 & 8.2 minutes?
© 1984-1994 T/Maker Co.
Sampling Distribution
Solution*
x 7.8 8
z .50
2
n 25
x 8.2 8
z .50
Sampling 2 Standardized Normal
Distribution n 25 Distribution
s`x = .4 s=1
.3830
.1915 .1915
Dispersion
m = 50
x x
n
Sampling with Sampling Distribution
replacement n=4 n =30
x = 5 x = 1.8
mx- = 50 x
Central Limit Theorem
Consider a random sample of n observations
selected from a population (any probability
distribution) with mean μ and standard deviation .
Then, when n is sufficiently large, the sampling
distribution of x will be approximately a normal
distribution with mean x and standard
deviation x n . The larger the sample size,
the better will be the normal approximation to the
sampling distribution of x .
Central Limit Theorem
As sample x
n
size gets
sampling
large
distribution
enough
becomes almost
(n 30) ...
normal.
x x
Central Limit Theorem
Example
The amount of soda in cans
of a particular brand has a
mean of 12 oz and a standard
deviation of .2 oz. If you
select random samples of 50 SODA
cans, what percentage of the
sample means would be less
than 11.95 oz?
Central Limit Theorem
Solution*
x 11.95 12
z 1.77
.2
Sampling n 50 Standardized Normal
Distribution Distribution
s`x = .03 s=1
.0384
.4616
11.95 12 `x –1.77 0 z
Shaded area exaggerated
5.4
pˆ p (1 p ) / n .
3. For large samples, the sampling distribution is
approximately normal. (A sample is considered
large if npˆ 15 and n(1 pˆ ) 15.)
Thinking Challenge
Suppose you’re
interested in the
average amount of
money that students
in this class (the
population) have on
them. How would
you find out?
Statistical Methods
Statistical
Methods
Descriptive Inferential
Statistics Statistics
Hypothesis
Estimation
Testing
6.1
Estimation
Point Interval
Estimation Estimation
Target Parameter
p Proportion; percentage
fraction; rate Qualitative
Point Estimator
A point estimator of a population parameter is a
rule or formula that tells us how to use the sample
data to calculate a single number that can be used
as an estimate of the target parameter.
Point Estimation
Confidence Confidence
limit (lower) limit (upper)
x z /2 x z /2
n n
.05 .05
1.99 1.645 1.99 1.645
100 100
1.982 1.998
6.3
Standard
Normal
Bell-Shaped
t (df = 13)
Symmetric
‘Fatter’ Tails t (df = 5)
z
t
0
t - Table
t-value
If we want the t-value with an area of .025 to its
right and 4 df, we look in the table under the
column t.025 for the entry in the row corresponding
to 4 df. This entry is t.025 = 2.776. The
corresponding standard normal z-score is z.025 =
1.96.
Small-Sample
Confidence Interval for µ
s
x t 2
n
x = 3.7
s = 3.8987
• n = 6, df = n – 1 = 6 – 1 = 5
• t.05 = 2.015
.38987 .38987
3.7 2.015 3.7 2.015
6 6
.492 6.908
6.4
Large-Sample Confidence
Interval for a Population
Proportion
Sampling Distribution of p̂
1. The mean of the sampling distribution of p̂ is p;
that is, p̂ is an unbiased estimator of p.
ˆˆ
pq ˆˆ
pq 32
pˆ Z /2 p pˆ Z /2 pˆ 0.08
n n 400
.053 p .107
Thinking Challenge
You’re a production
manager for a newspaper.
You want to find the %
defective. Of 200
newspapers, 35 had
defects. What is the 90%
confidence interval estimate
of the population
proportion defective?
Confidence Interval
Solution*
pˆ qˆ pˆ qˆ
pˆ z /2 p pˆ z /2
n n
.175(.825) .175(.825)
.175 1.645 p .175 1.645
200 200
.1308 p .2192
Adjusted (1 – )100%
Confidence Interval for a
Population Proportion, p
p1 p
p z 2
n4
x2
p
where n 4 is the adjusted sample proportion
of observations with the characteristic of interest, x
is the number of successes in the sample, and n is
the sample size.
6.5
1.645 45
2 2
(z 2 )
2 2
n 219.2 220
(SE) 2 5
2
Sample Size Determination
for 100(1 – ) %
Confidence Interval for p
In order to estimate p with a sampling error SE and
with 100(1 – )% confidence, the required sample
size is found by solving the following equation for
n: pq
z 2 SE
n
The solution for n can be written as follows:
z pq
2
2
Note: Always round n
n up to the nearest
SE 2
integer value.
Sample Size Example
(Z 2 ) pq
2
1.645 .5 .5
2
n 3006.69 3007
(SE) 2 .015 2
Thinking Challenge
You work in Human
Resources at Merrill Lynch.
You plan to survey employees
to find their average medical
expenses. You want to be
95% confident that the
sample mean is within ± $50.
(z 2 )2 2
n
(SE)2
1.96 400
2 2
50
2
245.86 246
6.6
s N n
̂ x
n N
p̂(1 p̂) N n
̂ p̂
n N
Since
n 60 .086
N 700
is greater than .05 use the finite correction
factor
Finite Population Correction
Factor Example
You want to estimate a population mean, μ, where
x =115, s =18, N =700, and n = 60. Find an
approximate 95% confidence interval for μ.
s N n 18 700 60
x 2 115 2
n N 60 700
115 4.4
110.6, 119.4
6.7
n 1 s 2
2
n 1 s 2
2 21
2 2
2
349.874 253.912
57.30 78.95
2
7.1
The Elements of
a Test of Hypothesis
Hypothesis Testing
I believe the
population mean Reject
age is 50 Reject
hypothesis!
hypothesis!
Population (hypothesis). Not
Notclose.
close.
Random
sample
Mean
X = 20
What’s a Hypothesis?
7. Conclusion:
7. Conclusion:
20 m = 50 Sample Means
H0
Rejection Region
(One-Tail Test)
Sampling Distribution Level of Confidence
Rejection
Region
1–
a
Fail to Reject
Region
Ho Sample Statistic
Critical Value
Value
Rejection Regions
(Two-Tailed Test)
Sampling Distribution Level of Confidence
Rejection Rejection
Region Region
1–
1/2 a 1/2 a
Fail to Reject
Region
Ho Sample Statistic
Critical Value Critical
Value Value
Rejection Regions
Alternative Hypotheses
0 1.50 z
z value of sample
statistic (observed)
Two-Tailed Z Test
p-Value Solution
1/2 p-Value 1/2 p-Value .5000
– .4332
.0668
.4332
–1.50 0 1.50 z
From z table: z value of sample
lookup 1.50 statistic (observed)
Two-Tailed z Test
p-Value Solution
–1.50 0 1.50 z
Two-Tailed z Test
p-Value Solution
p-Value = .1336 = .05
Do not reject H0.
1/2 p-Value = .0668 1/2 p-Value = .0668
Reject H0 Reject H0
1/2 = .025 1/2 = .025
–1.50 0 1.50 z
Test statistic is in ‘Do not reject’ region
One-Tailed z Test
p-Value Example
Does an average box of
cereal contain more than
368 grams of cereal? A
random sample of 25
boxes showed x = 372.5.
The company has specified
to be 15 grams. Find the
p-value. How does it
368 gm.
compare to = .05?
One-Tailed z Test
p-Value Solution
x 372.5 368
z 1.50
15
n 25
0 1.50 z
z value of sample
statistic
One-Tailed z Test
p-Value Solution
Use p-Value
alternative .5000
hypothesis – .4332
to find .4332 .0668
direction
0 1.50 z
From z table:
z value of sample
lookup 1.50 statistic
One-Tailed z Test
p-Value Solution
p-Value
Use
.0668 .5000
alternative
hypothesis – .4332
to find .4332 .0668
direction
0 1.50 z
From z table:
z value of sample
lookup 1.50 statistic
One-Tailed z Test
p-Value Solution
(p-Value = .0668) ( = .05).
Do not reject H0.
p-Value = .0668
Reject H0
= .05
0 1.50 z
Test statistic is in ‘Do not reject’ region
p-Value
Thinking Challenge
You’re an analyst for Ford. You
want to find out if the average
miles per gallon of Escorts is
less than 32 mpg. Similar
models have a standard
deviation of 3.8 mpg. You take
a sample of 60 Escorts &
compute a sample mean of
30.7 mpg. What is the p-
value? How does it compare to
= .01?
p-Value
Solution*
p-Value is P(z -2.65) = .004.
p-Value < ( = .01). Reject H0.
p-Value
Use .5000
alternative
.004 – .4960
hypothesis .0040
to find .4960
direction
–2.65 0 z
z value of sample From z table:
statistic lookup 2.65
Converting a Two-Tailed
p-Value from a Printout to a
One-Tailed p-Value
Reported p-value if Ha is of the form > and z is
p
2 positive
or Ha is of the form < and z is
negative
Reported p-value
p 1
2
if Ha is of the form > and z is negative
Ha is of the form < and z is positive
7.4
No evidence average is
0 1.645 z more than 368
One-Tailed z Test Thinking
Challenge
You’re an analyst for Ford. You
want to find out if the average
miles per gallon of Escorts is at
least 32 mpg. Similar models
have a standard deviation of 3.8
mpg. You take a sample of 60
Escorts & compute a sample
mean of 30.7 mpg. At the .01
level of significance, is there
evidence that the miles per
gallon is less than 32?
One-Tailed z Test Solution*
H0: = 32 Test Statistic:
Ha: < 32 x 30.7 32
z 2.65
= .01 3.8
n= 60 n 60
Critical Value(s): Decision:
Reject Reject at = .01
Conclusion:
.01
Two-Tailed Test
H0: µ = µ0
Ha: µ ≠ µ0
x
Test statistic: t
s n
There is no evidence
0 1.833 t average is more than 5
7.6
Large-Sample Test of
Hypothesis about a Population
Proportion
Large-Sample Test of
Hypothesis about p
One-Tailed Test
H0: p = p0
Ha: p < p0 (or Ha: p > p0)
Test statistic:
Rejection region:
z < –z(or z > z when Ha: p > p0)
Note: p0 is the symbol for the numerical value of p
assigned in the null hypothesis
Large-Sample Test of
Hypothesis about p
Two-Tailed Test
H0: p = p0
Ha: p ≠ p0
p̂ p0
Test statistic: z where p̂ p0 q0 n
p̂
q0 1 p0
Rejection region: |z| < z
There is evidence
-1.96 0 1.96 z proportion is not 4%
7.7
One-Tailed Test
H0: = 0
Ha: < 0(or Ha: > 0)
2 n 1 s 2
Test statistic: 02
Rejection region: 2
2
1
(or > when Ha: > 0)
where 0 is the hypothesized variance and the
distribution of is based on (n – 1) degrees of
freedom.
Test of a Hypothesis about 2
Two-Tailed Test
H0: = 0
Ha: ≠ 0
2 n 1s 2
Test statistic: 02
Rejection region: 2
2
1 2 or 2
2
2
Example
Is the variation in boxes
of cereal, measured by
the variance, equal to 15
grams? A random
sample of 25 boxes had
a standard deviation of
17.7 grams. Test at
the .05 level of
significance.
Chi-Square (2) Test
Solution
H0: 2 = 15
Test Statistic:
Ha: 2 15
(25 1) 17.7
2
(n 1) s 2
= .05
2
25 – 1 = 24
2
0 152
df =
= 33.42
Critical Value(s): Decision:
/2 = .025 Do not reject at = .05
Conclusion:
There is no evidence
0 12.401 39.364 2 2 is not 15
7.8
s
Lower-tailed test: x0 0 z x 0 z
n
s
x0, U 0 z 2 x 0 z 2
n
Steps for Calculating for a
Large-Sample Test about µ
2. Specify the value of µa in the alternative
hypothesis for which the value of is to
be calculated. Then convert the border
value(s) of x0 to z-value(s) using the
alternative distribution with mean µa. The
general formula for the z-value is
x0 a
z
x
Steps for Calculating for a
Large-Sample Test about µ
Equal to 1 –
Used in determining test adequacy
Affected by
• True value of population parameter
• Significance level
• Standard deviation & sample size n
Two-Tailed z Test Example
Does an average box of
cereal contain 368 grams of
cereal? A random sample
of 25 boxes had x = 372.5.
The company has specified
to be 15 grams. Test at
the .05 level of significance.
368 gm.
Finding Power
Step 1
Reject H0
Hypothesis:
15
n
Do Not
H0: 0 368 Draw
25 Reject H0
Ha: 0 < 368 = .05
0 = 368 x
Finding Power
Steps 2 & 3
Reject H0
Hypothesis:
15
n
Do Not
H0: 0 368 Draw
25 Reject H0
Ha: 0 < 368 = .05
0 = 368 x
‘True’ Situation:
a = 360 (Ha)
Draw
1–
Specify
a = 360 x
Finding Power
Step 4
Reject H0
Hypothesis:
15
n
Do Not
H0: 0 368 Draw
25 Reject H0
Ha: 0 < 368 = .05
0 = 368 x
15
‘True’ Situation:
a = 360 (Ha)
xL 0 z
n
368 1.64
25
Draw
363.065
1–
Specify
a = 360 363.065
x
Finding Power
Step 5
Reject H0
Hypothesis:
15
n
Do Not
H0: 0 368 Draw
25 Reject H0
Ha: 0 < 368 = .05
0 = 368 x
‘True’ Situation:
a = 360 (Ha)
xL 0 z
368 1.64
15
Draw n 25
= .154
363.065
Specify 1– =.846
z Table
a = 360 363.065 x
Properties of
and Power
1. For fixed n and ,
the value of
decreases, and the
power increases as
the distance
between the
specified null value
µ0 and the specified
alternative value µa
increases.
Properties
of and
Power
2. For fixed n and
values of µ0
and µa, the
value of
increases, and
the power
decreases as
the value of
is decreased.
Properties of and Power
3. For fixed and values of µ0 and µa, the value of
decreases, and the power increases as the
sample size n is increased.