Professional Documents
Culture Documents
Week05 Confidence Intervals
Week05 Confidence Intervals
Week05 Confidence Intervals
Confidence intervals
Review and Preview
Sampling
The Central Limit Theorem
Estimating a Population Proportion
Estimating a Population Mean
Estimating a Population Standard Deviation or
Variance
Formula 1
Distribution determined by
fixed values of mean and
Figure 1 standard deviation
In previous sessions, we used descriptive
statistics when we summarized data using
tools such as graphs and statistics such as
the mean and standard deviation.
We also looked at critical values:
◦ zα denotes the z score with an area of α to its
right.
◦ If α = 0.025, the critical value is z0.025 = 1.96.
That is, the critical value z0.025 = 1.96 has an area
of 0.025 to its right.
Steps
◦ Divide the population into a large number of
clusters, such as city blocks.
Steps
◦ Divide the population into separate groups, called
strata, based on some attribute or characteristic
about the subjects.
x
The standard deviation of sample mean
x
n
(often called the standard error of the mean)
As we proceed from
n = 1 to n = 50, we
see that the
distribution of
sample means is
approaching the
shape of a normal
distribution.
As we proceed from
n = 1 to n = 50, we
see that the
distribution of
sample means is
approaching the
shape of a normal
distribution.
As we proceed from
n = 1 to n = 50, we
see that the
distribution of
sample means is
approaching the
shape of a normal
distribution.
Converting to z:
156.25 182.9
z 2.61
10.2
ˆˆ
pq
E z 2
n
SIS 1037Y 2020/2021 52
p = population proportion
p̑ = sample proportion
n = number of sample values
E = margin of error
zα/2 = z score separating an area of α/2 in
the right tail of the standard normal
distribution.
E z 2
ˆˆ
pq
1.96
0.85 0.15
n 1007
E 0.0220545
SIS 1037Y 2020/2021 59
b) The 95% confidence
interval:
p̑-E <p < p̑+E
0.85 – 0.0220545 < p < 0.85 + 0.0220545
0.828 < p < 0.872
957.839
958
SIS 1037Y 2020/2021 68
Example - Continued
b) Use 0.05 so z 2 1.96
E 0.03
To be 95% confident
that our sample
z
2
2 0.25 percentage is within
n
E2 three percentage points
of the true percentage
1.96 0.25
2
Margin of Error:
= (upper confidence limit) — (lower confidence limit)
21.0
E t 2 2.009 6.027
n 49
α = 0.05
α/2 = 0.025
zα/2= 1.96 With a simple random sample of only
E= 3 97 statistics students, we will be 95%
confident that the sample mean is
σ = 15 within 3 IQ points of the true
population mean μ.
or x E,x E
or n > 30
or n > 30
10
SIS 1037Y 2020/2021 0
2
n 1 s 2
2
where
n = sample size
s2 = sample variance
σ2 = population variance
df = n – 1 degrees of freedom
10
SIS 1037Y 2020/2021 1
1. The chi-square distribution is not symmetric,
unlike the normal and Student t distributions. As the
number of degrees of freedom increases, the
distribution becomes more symmetric.
10
SIS 1037Y 2020/2021 3
A simple random sample of 22 IQ scores is
obtained. Construction of a confidence interval
for the population standard deviation σ requires
the left and right critical values of χ2
corresponding to a confidence level of 95% and
a sample size of n = 22.
10
SIS 1037Y 2020/2021 4
10
SIS 1037Y 2020/2021 5
Critical Values of the Chi-Square Distribution
10
SIS 1037Y 2020/2021 6
The sample variance s2 is the best point
estimate of the population variance σ2.
The sample standard deviation s is a
commonly used point estimate of σ (even
though it is a biased estimate).
10
SIS 1037Y 2020/2021 7
σ = population standard deviation
σ2 = population variance
s = sample standard deviation
s2 = sample variance
n = number of sample values
E = margin of error
Χ2L = left-tailed critical value of χ2
χ2R = right-tailed critical value of χ2
10
SIS 1037Y 2020/2021 8
Requirements:
◦ 1. The sample is a simple random sample.
◦ 2. The population must have normally distributed
values (even if the sample is large).
Confidence Interval for the Population
Variance σ2
n 1 s2
2
n 1 s 2
2
R 2
L
For the Population Standard Deviation σ :
n 1 s2
n 1 s2
2
R
L2
10
SIS 1037Y 2020/2021 9
1. Verify that the required assumptions are
satisfied.
2. Using n – 1 degrees of freedom, refer to
Table or use technology to find the critical
values χ2R and Χ2L that correspond to the
desired confidence level.
3. Evaluate the upper and lower confidence
interval limits using this format of the
confidence interval: n 1s 2 2 n 1s 2
2
R
L2
11
SIS 1037Y 2020/2021 0
4. If a confidence interval estimate of σ is
desired, take the square root of the upper and
lower confidence interval limits and change σ2 to
σ.
5. Round the resulting confidence level limits.
If using the original set of data to construct a
confidence interval, round the confidence
interval limits to one more decimal place than
is used for the original set of data. If using the
sample standard deviation or variance, round
the confidence interval limits to the same
number of decimal places.
11
SIS 1037Y 2020/2021 1
Confidence intervals can be used informally
to compare the variation in different data
sets, but the overlapping of confidence
intervals should not be used for making
formal and final conclusions about equality of
variances or standard deviations.
11
SIS 1037Y 2020/2021 2
A group of 22 subjects took an IQ test during
part of a study. The subjects had a standard
deviation IQ score of 14.3.
11
SIS 1037Y 2020/2021 3
We must first verify the requirements are met.
1. We can treat the sample as a simple random sample.
2. The following display shows a histogram of the data and the
normality assumption can be reasonably met.
11
SIS 1037Y 2020/2021 4
n = 22 so df = 22 – 1 = 21
Use table to find:
n 1 s 2 2
n 1 s 2
R2 L2
22 114.3 22 114.3
2 2
2
35.479 10.283
11
SIS 1037Y 2020/2021 5
Evaluation of the preceding expression yields:
121.0 < σ2 < 417.6
Finding the square root of each part (before
rounding), then rounding to two decimal
places, yields this 95% confidence interval
estimate of the population standard
deviation: 11.0 < σ < 20.4
Based on this result, we have 95%
confidence that the limits of 11.0 and 20.4
contain the true value of σ.
11
SIS 1037Y 2020/2021 6
The procedures for finding the sample size
necessary to estimate σ2 are much more
complex than the procedures given earlier
for means and proportions. Instead of using
very complicated procedures, we will use
the following Table (see next slide).
11
SIS 1037Y 2020/2021 7
11
SIS 1037Y 2020/2021 8
We want to estimate the standard deviation σ
of all voltage levels in a home. We want to be
95% confident that our estimate is within 20%
of the true value of σ.
How large should the sample be? Assume
that the population is normally distributed.
11
SIS 1037Y 2020/2021 9
From our Table, we can see that 95%
confidence and an error of 20% for σ
correspond to a sample of size 48.
12
SIS 1037Y 2020/2021 0
Sampling techniques
Central limit theorem
We have seen a number of methods for
estimating a Population Proportion, a
Population Mean, and a Population Standard
Deviation or Variance.
When to apply which technique
12
SIS 1037Y 2020/2021 1
Comments?