Professional Documents
Culture Documents
Topic 05c - Inference About Population Variance
Topic 05c - Inference About Population Variance
DECISION SCIENCE ‐ 1
PGP ‐ 1
Notice: This material contains material that is protected by copyright. You have been granted permission to use this material only for
the course Decision Science ‐ 1 being offered Term 1 (Aug‐Oct 2020) at Indian Institute of Management, Bangalore. This presentation
may not be copied or distributed in whole or in part without permission from Ananth Krishnamurthy.
Outline
• By selecting a sample, we can compute the sample variance. If the sample variance is high,
it may indicate over‐performance or under‐performance, even if the average is same
• We start with the sampling distribution for the variance, and use that information to
develop confidence intervals and conduct hypothesis tests
Variance,𝜎
• When we sample the population, we obtain a we obtain a
specific value of sample variance, 𝑆 , denoted by 𝑠
– For example, if we get 𝑠 =2.5. This is our estimate of 𝜎
Distribution
number
𝑠
• If we had an interval estimate, we would have been able to 𝑠
𝑠
say 𝜎 lies in some interval, say (1, 5)
Sample Variance (𝑆 )
Decision Science ‐1 ‐ Course Material
∑ ̅
• Recall that: 𝑠
– If the random variables Xi ~ N(0,1) for 1 ≤ i ≤ n, are independent, then the random variable Y = X12 + X22
+…+ Xn2 is said to have a Chi‐Square distribution with n degrees of freedom.
Chi‐Square Distribution
• If the random variables Xi ~ N(0,1) for 1 ≤ i ≤ n, are independent, then the random variable Y
= X12 + X22 +…+ Xn2 is said to have a Chi‐Square distribution with n degrees of freedom.
• It is denoted by Y ~ χ2n and has an expectation of n and a variance of 2n.
With 2 degrees
f(x) of freedom
With 5 degrees
of freedom
With 10 degrees
of freedom
Chi‐Square Distribution
𝑃 𝑋 𝜒 , 𝛼
• We will use the notation 𝜒 , to
denote the critical points of a Chi‐ 𝛼
Square distribution with 𝜐 degrees of
freedom
𝜒
0 𝜒 ,
𝑃 𝑋 𝜒 , 𝛼
𝜒
• Substituting for for the 𝜒 we get:
0 𝜒 ,
𝑛 1 𝑠
𝜒 𝜒
, 𝜎 ,
.025
• Performing algebraic manipulations, we get: .025
95% of the
𝑛 1 𝑠 𝑛 1 𝑠 possible 𝜒 , values
𝜎 𝜒
𝜒 𝜒
, ,
0 𝜒 . , 𝜒 . ,
𝛼
𝑛 1 𝑠 𝑛 1 𝑠
𝜎
𝜒 𝜒
, , 𝜒
• And…
0 𝜒 ,
𝑛 1 𝑠 𝑛 1 𝑠
𝜎
𝜒 𝜒
, ,
.025
.025
95% of the
• The 𝜒 values are based on a Chi‐Square distribution possible 𝜒 , values
𝜒
with 𝑛 1 degrees of freedom
• The term (1 𝛼) is the confidence coefficient 0 𝜒 . , 𝜒 . ,
10
.025 8 68.6
.025 9 67.9
95% of the
10 67.2
possible 𝜒 , values
𝜒 Sample std. dev: 𝒔 0.84
𝟐
0 𝜒 . , 2.70 𝜒 19.02 Sample Variance: 𝒔 0.70
. ,
11
12
𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎
𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎
13
• Note: The lack of symmetry in the 𝜒 distribution changes the critical value criteria
14
15
𝛼 0.10
Specify the Level of Significance (𝛼)
𝑛 10, 𝑑𝑓 9, 𝑠 0.70
Collect sample data and compute 𝑛 1 𝑠 9 ∗ 0.70
the value of the Test Statistic 𝜒 12.60
𝜎 0.5
16
17
Outline
18
The F‐Distribution
• The ratio of two independent Chi‐Square random variables
that have been divided by their respective degrees of
freedom is defined as the 𝐹 ‐distribution. This ratio is
written as:
𝜒 ⁄𝜐
𝐹 , ~
𝜒 ⁄𝜐
Sir Ronald Aylmer Fisher (1890 –1962)
• The 𝐹 ‐distribution is defined on the positive state space was a British statistician and geneticist.
For his work in statistics, he has been
(i.e. the random variable takes only positive values) described as "a genius who almost
single‐handedly created the foundations
for modern statistical science” and "the
single most important figure in 20th
• It has an expectation close to one and its variance century statistics".
19
The F‐Distribution
• We will use the notation 𝐹 , , to denote the critical points of an F‐distribution
𝜒 ⁄𝜐
𝐹 , ~
𝜒 ⁄𝜐
𝑃 𝑋 𝐹 , , 𝛼
x
0 𝐹 , ,
20
• The Scenario:
Population 1 Population 2
21
• Suppose we have two independent random samples from two Normally distributed
populations
• Then these samples will give rise to two random variables, 𝑆 and 𝑆 where:
22
• Then the ratio of these two random variables 𝑆 and 𝑆 will be the random variable
,
where: ∗ ∗
,
, ⁄
• When the population variances are equal, i.e. 𝜎 𝜎 , then ⁄
which is the
,
ratio of to Chi‐Square random variables divided by their own degrees of freedom
, ⁄
• Then 𝐹 ⁄
is a random variable with an 𝐹 distribution with 𝑛 1 and
,
𝑛 1 degrees of freedom
Decision Science ‐1 ‐ Course Material
23
• The test statistic used to investigate the ratio of population variances of two normally
distributed populations is:
𝑆 𝜒 , ⁄𝑛 1
𝐹
𝑆 𝜒 , ⁄𝑛 1
• For a specific sampling experiment, this random variable 𝐹, takes value , just like we got a
value 𝑥̅ for the random variable 𝑋
24
𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎
𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎
Upper‐tail Two‐tailed
• We usually denote the population providing the larger sample variance as population 1
• Since the 𝐹‐test statistic is constructed with the larger sample variance, 𝑠 in the numerator,
the value of the test statistic will be in the upper tail of the F distribution (greater than 1)
• Therefore we do not consider Lower tail tests and tables do not list lower tail distributions
25
• Summary of approaches:
Assuming sampling from Normal population and 𝑠 𝑠
Upper Tail Test Two‐Tailed Test
Hypotheses 𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎
𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎
Test Statistic 𝑆 𝜒 , ⁄𝑛 1 𝑆 𝜒 , ⁄𝑛 1
𝐹 𝐹
𝑆 𝜒 , ⁄𝑛 1 𝑆 𝜒 , ⁄𝑛 1
Rejection Rule (p‐value) 𝑅𝑒𝑗𝑒𝑐𝑡 𝐻 𝑅𝑒𝑗𝑒𝑐𝑡 𝐻
𝑖𝑓 p value 𝛼 𝑖𝑓 p value 𝛼
Rejection Rule (critical ‐value) 𝑅𝑒𝑗𝑒𝑐𝑡 𝐻 𝑅𝑒𝑗𝑒𝑐𝑡 𝐻
𝑖𝑓 𝐹 𝐹 𝑖𝑓 𝐹 𝐹 /
26
27
28
29
• Summary of approaches:
Assuming sampling from Normal population and 𝑠 𝑠
Upper Tail Test Two‐Tailed Test
Hypotheses 𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎
𝐻 :𝜎 𝜎 𝐻 :𝜎 𝜎
Test Statistic 𝑆 𝜒 , ⁄𝑛 1 𝑆 𝜒 , ⁄𝑛 1
𝐹 𝐹
𝑆 𝜒 , ⁄𝑛 1 𝑆 𝜒 , ⁄𝑛 1
Rejection Rule (p‐value) 𝑅𝑒𝑗𝑒𝑐𝑡 𝐻 𝑅𝑒𝑗𝑒𝑐𝑡 𝐻
𝑖𝑓 p value 𝛼 𝑖𝑓 p value 𝛼
Rejection Rule (critical ‐value) 𝑅𝑒𝑗𝑒𝑐𝑡 𝐻 𝑅𝑒𝑗𝑒𝑐𝑡 𝐻
𝑖𝑓 𝐹 𝐹 𝑖𝑓 𝐹 𝐹 /
30
Questions?
Acknowledgments: The authors are thankful for permission to use cartoons and graphics in this
presentation. They are solely intended to improve educational experience of students.
31