Professional Documents
Culture Documents
STATS 2 Part 1 Rev 2.0 With Exercise Slides Ao
STATS 2 Part 1 Rev 2.0 With Exercise Slides Ao
Part 1
Speaker Name
ST Context2
• The main goals of STatS are review, rationalize and improve the effectiveness
of Statistical methodology in general.
• To drive and support the deployment and correct application of the Statistics
Manuals in all ST manufacturing plants
S.P.C.
Statistics Learning
A Awareness
W Statistics Level 1 S.P.C.
A (STAT 1) Operators
R
M.S.A.
E Awareness
N
Statistics Level 2
E S.P.C.
Part 1 Engineers
S
D.O.E.
S Awareness Part 2
4
Training purpose
• To understand the basics of inferential Statistics
• To form, use and interpret confidence intervals
• Perform and interpret statistical test: null and alternative hypothesis, test statistics, p-value
• To understand and use the concept of correlation between variables
• To use both parametric and non-parametric Statistics
• To focus on the importance of the methods for outlier detection
• To know about the existence of bootstrap methodology
5
Benefits
6
Let’s get to know each other…
Round table:
• Name
• Organization
• Are you already using statistical methodology?
• If so, what are the main applications?
• Expectations from the course
7
Pre-test
• Complete the questionnaire to the best of your
knowledge
8
Structure of the Course
Structure of the course
10
Module 1 : Introduction
Objective: Duration ~ TBD hrs
• Recall from STATS1 : Population vs Sample , Descriptive vs Inferential Statistics.
• Point and interval estimation of parameters.
• Hypothesis Testing Procedures.
• Inferential error.
FAQ
#1 : To collect input from participants.
12
Module 1
Inferential Error
Descriptive statistics
• Collecting, summarizing, and processing data to transform data into information
Inferential statistics
• provide the basis for predictions, forecasts, and estimates that are used to transform
information into knowledge
13
Module 1
Inferential Error
Descriptive
Module 1 Key Learning
•Collect data
• e.g., Survey
•Present data
• e.g., Tables and Graphs
•Summarize data
σ 𝑋𝑖
ത
• e.g., Sample average = 𝑋 =
𝑛
14
Module 1
Inferential Error
•Hypothesis testing
• e.g., Test the claim that the population
mean weight is 120 pounds
Inference is the process of drawing conclusions or making
decisions about a population based on sample results
15
Module 1
Inferential Error
Hypothesis testing
• e.g., Test the claim that the population mean weight is 120
pounds
16
Module 1
Introduction To Inference
Inferential Error
17
Module 1
Introduction To Inference
Inferential Error
Point Estimate
Lower Upper
Confidence Confidence
Limit Limit
Confidence Interval
18
Module 1
Introduction To Inference
Inferential Error
Estimation
• e.g., Estimate the population mean weight using the
sample mean weight
Hypothesis testing
e.g., Test the claim that the population mean weight is
120 pounds
19
Module 1
Introduction To Inference
Inferential Error
• A hypothesis is a claim (assumption) about an aspect of the
Module 1 Key Learning
population under investigation
• Other
20
Module 1
Introduction To Inference
21
Module 1
Introduction To Inference
Inferential Error
22
Module 1
Inferential Error
“Represents adequately” ≠ “Represents perfectly”
Module 1 Key Learning
23
Module 1
Introduction To Inference
Inferential Error
WHY in interval estimation problems and hypothesis testing
Module 1 Key Learning
procedures we cannot be 100% sure to take the right decision
(cannot be 100% confident)?
24
Module 1
Introduction To Inference
If we use a higher confidence level (same data), say 99%, we must accept a
wider confidence interval → less precise estimator.
- Confidence limits: LCL=50, UCL=64 (width =14)
- Confidence Level: 0.99 (or 99%)
- Population mean point estimate: ഥ = 𝟓𝟕
𝑿
Introduction To Inference
• Hypothesis Testing
• Inferential Error
• Key definitions:
• Population vs. Sample
• Point vs. Interval Estimation
• Null Hypothesis vs. Alternative Hypothesis
• Inferential Error
26
Module 2 : - The Decision Process
- The Central Limit Theorem
Objective: Duration ~ TBD hrs
• Understand the Concept of Central Limit Theorem
FAQ
#1 : To collect input from participants.
28
Module 2
Inferential Statistics
Knowledge
STATS 2
Statistical Decision
29
Module 2
30
Module 2
The CLT provides useful information about the distribution of the sample average. In fact, the
theorem demonstrates that the distribution of 𝑿ഥ approaches normality regardless of the shape
of the distribution of the individual 𝑿𝒊 .
𝟐
1. 𝑿~𝑵 𝝁𝑿 , 𝝈𝟐𝑿 ഥ ~𝑵 𝝁𝑿ഥ = 𝝁𝑿 , 𝝈𝟐ഥ = 𝝈𝑿
→𝑿 𝑿 𝒏
𝟐
2. 𝑿~𝑾𝒉𝒂𝒕𝒆𝒗𝒆𝒓 (𝒖𝒏𝒌𝒏𝒐𝒘𝒏) 𝒘𝒊𝒕𝒉 𝒑𝒂𝒓𝒂𝒎𝒆𝒕𝒆𝒓𝒔 𝝁𝑿 , 𝝈𝟐𝑿 ഥ ~𝑵 𝝁𝑿ഥ = 𝝁𝑿 , 𝝈𝟐ഥ = 𝝈𝑿
→𝑿 𝑿 𝒏
when 𝒏 → ∞ (typically, when n>30)
31
Module 2
Exercise 1.1
1. Click on Sheet Pop 1.
2. Copy Pop 1 data into JMP table.
3. Establish Data Distribution for Pop 1.
Exercise 1.2
4. Create Sampling Distribution for Mean & Variance for sample size, n=5.
5. Copy into JMP table.
6. Repeat Step 4 and 5 for n = 10 , 15 , 20 , 25 & 30.
Exercise 1.3
7. Establish Data Distribution for Sampling Distribution of Mean (for all n).
32
Module 2
Exercise 1.1
1. Click on Sheet Pop 1.
2. Copy Pop 1 data into JMP table.
3. Establish Data Distribution for Pop 1.
33
Module 2
Exercise 1.1
1. Click on Sheet Pop 1.
2. Copy Pop 1 sheet Population data column into JMP table (Edit > Paste With Column Names)
3. Establish Data Distribution for Pop 1.
3.1 Click on Analyze > Distribution
Cast Population column into Y
35
Module 2
Exercise 1.2
4. Create Sampling Distribution for Mean & Variance for sample size, n=5.
5. Copy into JMP table.
6. Repeat Step 4 and 5 for n = 10 , 15 , 20 , 25 & 30.
36
Module 2
37
Module 2
6.3 Draw 1000 groups of data with size n=10, copy to JMP
4.26 21.94
3.91 38.86
7.15 31.04
5.00 6.48
5.95 20.37
3.04 15.32
2.14 26.31
4.50 22.12
7.15 32.26
5.44 24.84
38
Module 2
Exercise 1.3
7. Establish Data Distribution for Sampling Distribution of Mean (for all n).
Go to Analyze > Distribution, select all Mean columns
39
Module 2
t-Distribution n=5
Module 2 Key Learning
n = 10
Did you notice the Sampling Distribution of Mean:
1. Normally distributed.
n = 30
40
Module 2
t-Distribution
𝝈
𝑵 𝝁,
Module 2 Key Learning 𝒏
𝑛=5
4.99
𝑁 5.02, ⇒ 𝑁(5.02,2.24)
5
𝑛 = 25
4.99
𝑁 5.02, ⇒ 𝑁(5.02,1.00)
25
41
Module 2
Exercise
1. Click on Sheet Pop 2.
2. Copy Pop 2 data into JMP table.
3. Establish Data Distribution for Pop 2.
Exercise Con’t
4. Create Sampling Distribution for Mean & Variance for sample size, n=5.
5. Copy into JMP table.
6. Repeat Step 4 and 5 for n = 15 & 30.
Exercise Con’t
7. Establish Data Distribution for Sampling Distribution of Mean (for all n).
42
Module 2
n = 10
𝑁(4.00,0.18)
n = 15
𝝈 𝑁(4.00,0.15)
𝑵 𝝁,
𝒏
n = 20
𝑁(4.00,0.13)
n = 25
𝑁(4.00,0.12)
n = 30
𝑁(4.00,0.11)
43
Module 2
Exercise
1. Click on Sheet Pop 3.
2. Copy Pop 3 data into JMP table.
3. Establish Data Distribution for Pop 3.
Exercise Con’t
4. Create Sampling Distribution for Mean & Variance for sample size, n=5.
5. Copy into JMP table.
6. Repeat Step 4 and 5 for n = 15 & 30.
Exercise Con’t
7. Establish Data Distribution for Sampling Distribution of Mean (for all n).
44
Module 2
n = 10
𝑁(6.06,1.36)
n = 15
𝝈 𝑁(6.06,1.11)
𝑵 𝝁,
𝒏
n = 20
𝑁(6.06,0.96)
n = 25
𝑁(6.06,0.86)
n = 30
𝑁(6.06,0.78)
45
Module 2
t-Distribution
Any Population Distribution Sampling Distribution of Mean
Module 2 Key Learning
𝜎
𝑛
46
Module 2
Population
The Decision-Making Process
t-Distribution
n=5
n=10
n=15
n=20
n=25
n=30
47
Module 2
t-Distribution
48
Module 2
𝜎
𝑛
𝜎
𝑛
𝝈
𝑵 𝝁,
𝒏
Central Limit Theorem However, most likely we will not know the
Numerical simulations & examples value of the population stdev.
t-Distribution
As such, we cannot form the Sampling
Module 2 Key Learning Distribution for Mean in this situation. Sampling Ex : Inferential
Distribution of About Population Mean
Mean
Sample with size, n
𝝈
𝑰𝒇 𝝈 𝒌𝒏𝒐𝒘𝒏 𝑵 𝝁,
𝒏
Ex:
ഥ
𝑴𝒆𝒂𝒏 ∶ 𝒙 Confidence Interval
𝑺𝒕𝒅𝒆𝒗 ∶ 𝒔 Ex:
Hypothesis Testing
𝒔
𝑰𝒇 𝝈 𝒖𝒏𝒌𝒏𝒐𝒘𝒏 𝑵
t 𝝁,
𝒏
• Machine setup.
• Produce n setup units. You want to know from the setup you did,
• 𝑆𝑎𝑚𝑝𝑙𝑒 𝑀𝑒𝑎𝑛, 𝑥.ҧ Real cases We replace σ with s.
mainly and infer about the population (as in
• 𝑆𝑎𝑚𝑝𝑙𝑒 𝑆𝑡𝑑𝑒𝑣, 𝑠. production).
belong to This results in a
this group distribution that follow
a Then you make decision if you need to
t-distribution. re-setup or release the machine for
production.
51
Module 2
t-Distribution
Central Limit Theorem This situation can be used when sample size is large
Numerical simulations & examples enough.
t-Distribution Reason
Module 2 Key Learning t ➔ Normal.
In other words, as if we know about Pop Stdev.
Sampling Inference
Distribution of About Population Mean
Mean
Sample with size, n
𝝈
𝑰𝒇 𝝈 𝒌𝒏𝒐𝒘𝒏 𝑵 𝝁,
𝒏 Ex:
ഥ
𝑴𝒆𝒂𝒏 ∶ 𝒙 Confidence Interval
𝑺𝒕𝒅𝒆𝒗 ∶ 𝒔 Ex:
Hypothesis Testing
𝒔
𝑰𝒇 𝝈 𝒖𝒏𝒌𝒏𝒐𝒘𝒏 𝑵
t 𝝁,
𝒏
To understand this concept, let’s look into how it is being applied and go through practical examples.
53
Module 2
• What is t-distribution.
54
Module 3 : Confidence Interval
Objective: Duration ~ TBD hrs
• Know which are the main properties of point estimators.
• Form Confidence Interval for the population parameters.
FAQ
#1 : To collect input from participants.
56
Module 3
Estimation
Point Estimation
Properties of Point Estimators
GOAL
Interval estimation
Parameters of 1 normal population Estimation of unknown population parameters (e.g. “the
Parameters of 2 normal populations
Relevant terms:
o ESTIMATOR → sample statistic used to estimate the unknown
population parameter (e.g. average of sample
ത
weights 𝑋)
57
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
58
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Notation:
• The unknown parameters to estimate are indicated by Greek letters
(e.g. μ, σ, θ, ρ, λ …)
ത s, r …)
• The estimators are indicated by Latin letters (e.g. 𝑋,
ത read “the
• The symbol ^ (hat) is used to indicate estimation (e.g. 𝜇Ƹ = 𝑋,
ത
point estimator of μ is the sample average 𝑋“)
59
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population Usually, the unknown parameters that we need to estimate are:
Parameters of 2 normal populations
• The population mean (μ)
Module 3 Key Learning
• The population variance (σ2) or the population standard deviation (σ)
• The population proportion (θ)
Or, we might be interested in estimating other unknown population parameters like:
• The coefficients of a Regression Model (the β’s)
• The correlation coefficient (ρ)
• Other...
60
Module 3
Estimation
Point Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population EXAMPLE
Parameters of 2 normal populations
Let µ be an unknown parameter that we want to estimate. Here, µ is the mean lifetime of a
Module 3 Key Learning
certain type of batteries.
A random sample of n = 30 batteries might yield the following observed lifetimes (hours):
x1= 6.1, x2= 5.3, …, x30= 5.9
61
Module 3
Estimation
Point Estimation
Point Estimator Properties
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Point estimators can be classified according to some
Module 3 Key Learning
desirable properties.
• Unbiased.
• Consistent.
• Most efficient
NOTE
For more details, se also the Manual of Statistical Methodology (8482919 ver.2).
62
Module 3
Estimation
Point Estimation
Unbiasedness
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations • A point estimator 𝜃መ is said to be an unbiased estimator of the
Module 3 Key Learning
parameter if E(𝜽),
the expected value, or mean, of the sampling
መ is .
distribution of 𝜃,
• Examples:
• The sample mean is an unbiased estimator of μ
• The sample variance is an unbiased estimator of σ2
• The sample proportion is an unbiased estimator of 𝜋
63
Module 3
Estimation
Expected Value
Point Estimation
Properties of Point Estimators መ
Expected value, or mean, of the sampling distribution of 𝜃:
Interval estimation
1. The unknown population parameter that we want to estimate is θ
Parameters of 1 normal population
Parameters of 2 normal populations 2. The population contains a very large (infinite) number of items
Module 3 Key Learning 3. From that population, a very large (infinite) number (k) of samples is drawn
4. From each sample the estimate of θ is calculated
5. The expected value of the sampling distribution of 𝜃 is the average of all the estimates
calculated at the previous point 4
Population
θ̂1 θ̂ 2 ... θ̂ k
i (i = 1,…, k), k→ ∞, is the Expected Value (or Mean)
The average of all the 𝜽
of the sampling distribution of 𝜃 and it is indicated by E(𝜽)
64
Module 3
Estimation
Point Estimation
Unbiasedness
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations መ
𝑓(𝜃)
Module 3 Key Learning Sampling distribution of θ 1
Sampling distribution of θ 2
Bias in 𝜃መ 2
መ let’s define bias = E(𝜽)
For a any estimator of 𝜃, say 𝜃, - 𝜽.
1) = 0, 𝜽
Since the bias (𝜽 1 is an unbiased estimator of 𝜽.
Conversely, since the bias (𝜽 2) > 0, 𝜽
2 is a biased estimator of 𝜽.
65
Module 3
Estimation
Point Estimation
Bias
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population • Let 𝜃መ be an estimator of
Parameters of 2 normal populations
= 𝑬(𝜽)
Bias(𝜽) −𝛉
66
Module 3
Estimation
Point Estimation
Consistency
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population • Let 𝜃መ be an estimator of
Parameters of 2 normal populations
67
Module 3
Estimation
Point Estimation
Maximum Efficiency
Properties of Point Estimators
Interval estimation
68
Module 3
Estimation
Point Estimation
Maximum Efficiency
Properties of Point Estimators
𝟏 ) < Var(𝜽
Var(𝜽 𝟐)
𝟏)
Var(𝜽
Relative Efficiency =
𝟐)
Var(𝜽
69
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
70
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
• How much uncertainty is associated with a point estimate of a
Module 3 Key Learning
population parameter? We cannot know, but,…
71
Module 3
Estimation
Point Estimation
Interval Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Intervals are commonly chosen such that the unknown parameter falls within
with a certain (percent) probability. This probability, determined a-priori, typically
is set between 90% to 99% and it is called confidence level. Hence, the
intervals are called confidence intervals;
the end points of such an interval are called upper and lower confidence limits.
72
Module 3
Estimation
Point Estimation
Interval Estimation
Properties of Point Estimators
Interval estimation
EXAMPLE
Parameters of 1 normal population
Parameters of 2 normal populations Consider again the batteries lifetime example.
Module 3 Key Learning Instead of calculating a point estimator of the mean lifetime of the batteries, now we want to go
further and form a confidence interval at the 95% confidence level, for the unknown mean lifetime
of the batteries.
Using the methods presented in the next Modules, it will be possible to form such an interval. For
example, a possible outcome is the interval (5.23, 6.31)
The values 5.23 and 6.31 are the lower and upper confidence limits respectively
It is reasonable to state that – with a confidence equal to 95% - the true unknown mean of the
batteries lifetime is contained in the 5.23 to 6.31 interval
73
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations The interval containing a population parameter is established by calculating
Module 3 Key Learning
that statistic from values measured on a random sample taken from the
population and by applying the knowledge of the fidelity with which the
properties of a sample represent those of the entire population.
The probability tells what percentage of the time the assignment of the
interval will be correct but not what the chances are that it is true for any
given sample.
74
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
An interval gives a range of values:
Module 3 Key Learning
75
Module 3
Estimation
Point Estimation
Properties of Point Estimators Let be an unknown population parameter that we want to estimate. If
Interval estimation P(a < < b) = 1 - then the interval from a to b is called 100(1 - )%
Parameters of 1 normal population confidence interval of .
Parameters of 2 normal populations
The quantity (1 - ) is called confidence level of the interval (0 < < 1).
o In repeated samples, the true value of the parameter would be contained in 100(1 - )%
of the intervals.
o The confidence interval calculated in this manner is written as a < < b with 100(1 - )%
confidence
76
Module 3
Estimation
Point Estimation
Frequency Interpretation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
❑ Suppose confidence level = 95%
❑ A specific interval either will contain or will not contain the true parameter
▪ No probability involved in a specific interval
77
Module 3
X
μx = μ
Xሜ 1
Xሜ 2
In repeated samples, 100(1-)% of
intervals contain μ, 100()% do not.
78
Module 3
Estimation
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Confidence intervals for parameters of one normal population
Module 3 Key Learning
Confidence Intervals
σ2 Known σ2 Unknown
79
Module 3
Estimation
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Confidence intervals for parameters of one normal population
Module 3 Key Learning
Confidence Intervals
σ2 Known σ2 Unknown
80
Module 3
Estimation
Point Estimation
Confidence Intervals
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
❑ The value of the reliability factor depends on the desired level of confidence
𝜎
❑ The standard error is equal to , where σ is the population std. dev. and n is the sample size
𝑛
81
Confidence Interval for μ (σ2 known)
Module 3
Estimation
Point Estimation
Properties of Point Estimators
- Assumptions
Interval estimation
• Population is normally distributed (if population is not normal, use large sample)
Parameters of 1 normal population
Parameters of 2 normal populations
- Context
Module 3 Key Learning
• Population variance σ2 is known
𝝈 𝝈
ഥ − 𝒁𝜶
𝑿 ഥ + 𝒁𝜶
<𝝁<𝑿
𝟐 𝒏 𝟐 𝒏
Where,
▪ 𝑋ത - point estimator of the population mean, it is the sample average
▪ z/2 – percentage point of the N(0,1) distribution such that: 𝑃 𝑧 ≥ 𝑍𝛼/2 = 𝛼/2
▪ σ – population standard deviation (known)
▪ n – sample size
For more references on method see also the Manual of Statistical Methodology, Ch. 6 and Annex 4 (DMS 8482919_A)
82
Module 3
Estimation
Point Estimation
Confidence Interval for μ (σ2 known)
Properties of Point Estimators
Interval estimation
The confidence interval can be written also as follows:
Point Estimate (Reliability Factor)*(Standard Error)
Parameters of 1 normal population
Parameters of 2 normal populations
𝝈
ഥ ± 𝒁𝜶
𝑿
𝟐 𝒏
or as
ഥ ±ME
𝑿
𝝈
where ME is called the Margin of Error, 𝑴𝑬 = 𝒁𝜶
𝟐 𝒏
The interval width, W, is equal to twice the margin of error: W = 2*ME
83
Module 3
Estimation
Point Estimation
Reducing the Margin of Error
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Q. Why it is desirable to reduce the ME?
Parameters of 2 normal populations 𝝈
Module 3 Key Learning
𝑴𝑬 = 𝒁𝜶 A. Because W, the width of the interval, given by W=2*ME is
𝟐 𝒏 inversely proportional to the precision of an interval estimator.
NOTE
𝝈
Fixed a value for W and a confidence level, one can solve for n the equation 𝑾 = 𝟐𝒁𝜶 to obtain the
𝟐 𝒏
minimum sample size required to guarantee a confidence (1 − 𝛼) and a precision W
84
Module 3
Estimation
Reducing the Margin of Error
Point Estimation
Properties of Point Estimators
𝛼 𝛼
Interval estimation = 0.05 = 0.05
2 2 Narrower CI. More precise.
Parameters of 1 normal population (1 − 𝛼) = 0.90 Less likely to include the
Parameters of 2 normal populations 𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝐿𝑒𝑣𝑒𝑙 = 90%
true population Mean.
(𝛼 = 0.10)
Module 3 Key Learning
3.18 CI @ 90% 6.03
𝛼 𝛼
= 0.025 = 0.025
2 2
(1 − 𝛼) = 0.95
𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝐿𝑒𝑣𝑒𝑙 = 95%
(𝛼 = 0.05)
2.89 CI @ 95% 6.31
𝛼 𝛼
= 0.005 = 0.005
2 2 Wider CI. Less precise.
(1 − 𝛼) = 0.99 More likely to include the
𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝐿𝑒𝑣𝑒𝑙 = 99%
true Population Mean.
(𝛼 = 0.01)
2.30 CI @ 99% 6.91
85
Module 3
Estimation
Point Estimation
Finding the Reliability Factor
Properties of Point Estimators
α α
= 0.025 = 0.025
2 1−𝛼Z= 0.95 2
Estimation
The standard normal distribution table
Point Estimation
Properties of Point Estimators
.06
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
α
= 0.025
2
α
= 0.025 1-α = 0.95
2
-1.96
87
Module 3
Estimation
Point Estimation
Common Levels of Confidence
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations Commonly used confidence levels are 90%, 95%, and 99%
Module 3 Key Learning
88
Module 3
Estimation
Point Estimation
Advanced Explanation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
How did we arrive to the previous formula for the
Module 3 Key Learning confidence interval?
We need to focus on the following:
2
𝜎𝑋
1. ത
The Central Limit Theorem (CLT) tells us that if 𝑋~𝑁(𝜇𝑋 , 𝜎𝑋2 ) ⇒ 𝑋~𝑁 𝜇𝑋ത = 𝜇𝑋 , 𝜎𝑋2ത = . In
𝑛
ത the
words, if X is a normally distributed random variable with parameters 𝜇𝑋 and 𝜎𝑋2 , then 𝑋,
𝜎2
sample average, is also a normal RV with parameters 𝜇𝑋ത = 𝜇𝑋 and 𝜎𝑋2ത = 𝑋 , where n is the sample
𝑛
size. If X is NOT normally distributed, still, for large values of n (at least, n>30), the previous
distributional property holds (statisticians say that this property holds “asymptotically”).
89
Module 3
Estimation
Point Estimation
Advanced Explanation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
To form a confidence interval for the population mean 𝜇𝑋 of a normal RV with known
Module 3 Key Learning variance 𝜎𝑋2 , we need 2 critical values, say CV1 and CV2, that include 100 1 − 𝛼 %
of the population (thus, leaving in each tail an area equal to 𝛼/2.
𝛼/2 (1 − 𝛼) 𝛼/2
𝜇𝑋 X
𝐶𝑉1 𝐶𝑉2
Due to the symmetrical feature of the normal distribution, it can be noticed that:
𝑪𝑽𝟐 − 𝝁𝑿 = 𝝁𝑿 − 𝑪𝑽𝟏
90
Module 3
Estimation
Advanced Explanation
Point Estimation
Properties of Point Estimators
Interval estimation
We standardize the value of 𝑋ത to form the confidence interval for 𝜇𝑋 :
Parameters of 1 normal population
Parameters of 2 normal populations
ത 𝑋
𝑋−𝜇 ഥ ത 𝑋
𝑋−𝜇
Module 3 Key Learning 𝑍𝑋ത = =𝜎
𝜎𝑋
ഥ 𝑋/ 𝑛
According to the desired confidence level, we may obtain the different critical values that
one can find in the table of the standard normal distribution.
𝛼
Let CV2, the upper critical value, be 𝐶𝑉2 = 𝑍𝑋ത ( 2 ). Considering the property of symmetry
𝛼
of the normal distribution, 𝐶𝑉1 = −𝐶𝑉2 = −𝑍𝑋ത ( 2 ) holds.
For example, for α = 0.05 ⇒ α/2 = 0.025, from the table of the Standard Normal
Distribution we obtain 𝐶𝑉1 = −1.96 and 𝐶𝑉2 = 1.96.
91
Module 3
Estimation
Point Estimation
Advanced Explanation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
To form the confidence interval for 𝜇𝑋 we consider the following inequality:
Module 3 Key Learning
𝛼 𝑋ത − 𝜇𝑋 𝛼
−𝑍( ) ≤ ≤ 𝑍( )
2 𝜎𝑋 / 𝑛 2
𝛼 𝜎𝑋 𝛼 𝜎
And we solve it for 𝜇𝑋 ⇒ −Z( 2 ) ≤ 𝑋ത − 𝜇𝑋 ≤ 𝑍 ( 2 ) 𝑋𝑛 .
𝑛
After solving the system of inequalities, we obtain the desired confidence interval:
𝜶 𝝈𝑿 𝜶 𝝈𝑿
ഥ−𝒁
𝑿 ഥ
≤ 𝝁𝑿 ≤ 𝑿 + 𝒁( )
𝟐 𝒏 𝟐 𝒏
92
Module 3
Estimation
Point Estimation
Confidence Intervals
Properties of Point Estimators
• A sample of 11 circuits from a large normal population has a mean resistance of 2.26 ohms.
We know from past testing that the population standard deviation is 0.35 ohms.
• The available data: {2.25, 2.48, 1.40, 2.31, 2.43, 2.47, 2.52, 2.56, 2.33, 1.48, 2.61}
• Determine & interpret a 95% confidence interval for the true mean resistance of the
population.
93
Module 3
𝟐. 𝟎𝟓 < 𝝁 < 𝟐. 𝟒𝟕
• We are 95% confident that the true mean resistance is between 2.05 and 2.47 ohms
• Although the true mean may or may not be in this interval, 95% of intervals formed in
this manner will contain the true mean
NOTE: From the statistical tables of the Standard Normal Distribution, we obtain that 𝑍𝛼 = 1.96.
2
However, the confidence interval is provided by the software and not calculated by hand. 94
Module 3
Estimation
Point Estimation
Confidence Intervals
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Confidence intervals for parameters of one population
Module 3 Key Learning
Confidence Intervals
σ2 Known σ2 Unknown
95
Module 3
Estimation
Point Estimation
Confidence Interval for μ (σ2 Unknown)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
96
Module 3
Estimation
Point Estimation
Confidence Interval for μ (σ2 Unknown)
Properties of Point Estimators
Interval estimation
• Assumptions
Parameters of 1 normal population • Population is normally distributed (if population is not normal, use large sample)
Parameters of 2 normal populations
𝒔 𝒔
ഥ − 𝒕𝒏−𝟏,𝜶/𝟐
𝑿 ഥ + 𝒕𝒏−𝟏,𝜶/𝟐
<𝝁<𝑿
𝒏 𝒏
where tn-1,α/2 is the critical value of the t distribution with n-1 degrees of freedom
and an area of α/2 in each tail:
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
d.f. = n - 1
98
Module 3
Estimation
Point Estimation
Student’s t Distribution
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations f(Z), f(t)
Module 3 Key Learning
Standard Normal
(t with d.f. = ∞)
t (d.f. = 13)
00 t Z, t
Estimation
Point Estimation
Student’s t Distribution
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
With comparison to the Z value (Z=Standard Normal, N(μ=0, σ2=1))
Module 3 Key Learning
Confidence t t t t t Z
Level (10 d.f.) (20 d.f.) (30 d.f.) (60 d.f.) (120 d.f.)
Notes
• d.f.=n-1 ⇒ from the table above, we notice that as n increases the interval width
decreases (and the precision increases)
• as the number of d.f.→ ∞ ⇒ t → Z
• with d.f. > 60, approximately t = Z 100
Module 3
Estimation
Point Estimation
Use of the Student’s t Distribution
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
EXAMPLE
Module 3 Key Learning
confidence interval for the mean of a normal population (variance unknown)
• Consider the data from the previous example. Sample size n=11, but here the true
population standard deviation is unknown and must be estimated.
• From sample data, we obtain s = 0.42
• Determine a 95% confidence interval for μ, the true population mean.
101
Module 3
Estimation
Point Estimation
Use of the Student’s t Distribution
Properties of Point Estimators
• ഥ = 2.260
𝑿
• 𝑡10, 𝛼 = 2.228 (α = 0.05 → α/2 = 0.025 )
2
• 𝒔 = 0.420 𝒔 𝒔
ഥ − 𝒕𝒏−𝟏,𝜶/𝟐
𝑿 ഥ + 𝒕𝒏−𝟏,𝜶/𝟐
<𝝁<𝑿
• n = 11 → n-1 =10 𝒏 𝒏
𝟎. 𝟒𝟐𝟎 𝟎. 𝟒𝟐𝟎
𝟐. 𝟐𝟔𝟎 − 𝟐. 𝟐𝟐𝟖 < 𝝁 < 𝟐. 𝟐𝟔𝟎 + 𝟐. 𝟐𝟐𝟖
𝟏𝟏 𝟏𝟏
NOTES:
• From the Statistical Tables of the t Distribution (see next slide), we obtain that 𝑡10,0.025 = 2.228.
• The interpretation of this result is the same as for the previous example. The only difference is in the formula used
for the calculation.
102
Module 3
Estimation
Point Estimation
Use of the Student’s t Distribution
Properties of Point Estimators
Table of critical values for Student’s t distributions
Interval estimation df α = 0.1 0.05 0.025 0.01 0.005 0.001 0.0005
Parameters of 1 normal population 1 3.078 6.314 12.706 31.821 63.656 318.289 636.578
2 1.886 2.920 4.303 6.965 9.925 22.328 31.600
Parameters of 2 normal populations 3 1.638 2.353 3.182 4.541 5.841 10.214 12.924
4 1.533 2.132 2.776 3.747 4.604 7.173 8.610
Module 3 Key Learning 5 1.476 2.015 2.571 3.365 4.032 5.894 6.869
6 1.440 1.943 2.447 3.143 3.707 5.208 5.959
7 1.415 1.895 2.365 2.998 3.499 4.785 5.408
8 1.397 1.860 2.306 2.896 3.355 4.501 5.041
9 1.383 1.833 2.262 2.821 3.250 4.297 4.781
10 1.372 1.812 2.228 2.764 3.169 4.144 4.587
11 1.363 1.796 2.201 2.718 3.106 4.025 4.437
12 1.356 1.782 2.179 2.681 3.055 3.930 4.318
13 1.350 1.771 2.160 2.650 3.012 3.852 4.221
14 1.345 1.761 2.145 2.624 2.977 3.787 4.140
15 1.341 1.753 2.131 2.602 2.947 3.733 4.073
16 1.337 1.746 2.120 2.583 2.921 3.686 4.015
17 1.333 1.740 2.110 2.567 2.898 3.646 3.965
18 1.330 1.734 2.101 2.552 2.878 3.610 3.922
19 1.328 1.729 2.093 2.539 2.861 3.579 3.883
20 1.325 1.725 2.086 2.528 2.845 3.552 3.850
21 1.323 1.721 2.080 2.518 2.831 3.527 3.819
22 1.321 1.717 2.074 2.508 2.819 3.505 3.792
23 1.319 1.714 2.069 2.500 2.807 3.485 3.768
24 1.318 1.711 2.064 2.492 2.797 3.467 3.745
25 1.316 1.708 2.060 2.485 2.787 3.450 3.725
26 1.315 1.706 2.056 2.479 2.779 3.435 3.707
27 1.314 1.703 2.052 2.473 2.771 3.421 3.689
28 1.313 1.701 2.048 2.467 2.763 3.408 3.674
29 1.311 1.699 2.045 2.462 2.756 3.396 3.660
30 1.310 1.697 2.042 2.457 2.750 3.385 3.646
60 1.296 1.671 2.000 2.390 2.660 3.232 3.460
120 1.289 1.658 1.980 2.358 2.617 3.160 3.373
∞ 1.282 1.645 1.960 2.326 2.576 3.091 3.291
103
Module 3
Estimation
Point Estimation
Confidence Interval
Properties of Point Estimators
Interval estimation
Sampling
Parameters of 1 normal population Sample with size, n Distribution Inference
Parameters of 2 normal populations About Population Mean
of Mean
Module 3 Key Learning
𝝈
𝑰𝒇 𝝈 𝒌𝒏𝒐𝒘𝒏 𝑵 𝝁,
𝒏
ഥ
Average ∶ 𝒙
𝑺𝒕𝒅𝒆𝒗 ∶ 𝒔 Confidence Interval
𝒔
𝑰𝒇 𝝈 𝒖𝒏𝒌𝒏𝒐𝒘𝒏 𝑵
t 𝝁,
𝒏
Estimation
Exercise #4
Point Estimation
Properties of Point Estimators
Exercise File:
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
2. Subsequent slide is using this set of data for explanation on how Confidence Interval
is being calculated.
105
Module 3
Estimation
Exercise #4
Point Estimation
Properties of Point Estimators
Exercise File:
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations 1. Open the file…data is in row format
Module 3 Key Learning
Estimation
Exercise #4
Point Estimation
Properties of Point Estimators
Exercise File:
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations 3. Make a distribution of the sample data
Module 3 Key Learning Go to Analyze > Distribution. Cast Row column into Y, then hit Ok
JMP generated CI
107
Module 3
Estimation
Exercise #4
Point Estimation
Properties of Point Estimators
Exercise File:
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations To generate confidence interval for both Mean and Stdev:
Module 3 Key Learning Go to distribution hotspot > Confidence Inter
JMP generated CI
108
Module 3
Estimation
Point Estimation
Confidence Interval
Properties of Point Estimators Step 1 Convert Data into Information.
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Descriptive
Module 3 Key Learning Statistics
𝒔 𝟒. 𝟓𝟖
𝑰𝒇 𝝈 𝒖𝒏𝒌𝒏𝒐𝒘𝒏 𝑵
t 𝝁, 𝑵t 𝟒. 𝟔𝟎,
𝒏 𝟑𝟎 Sampling
Distribution of Mean
inference on the
Inferential
t Distribution Population Mean.
Statistics
𝟒. 𝟔𝟎 109
Module 3
Estimation
Point Estimation
Confidence Interval
Properties of Point Estimators Step 3 User to Define Significance Level, 𝜶.
Interval estimation
t Distribution
Parameters of 1 normal population
Significance Level, 𝜶 Confidence Level, (1-𝜶)
Parameters of 2 normal populations
𝛼 𝛼 0.01 99%
= 0.025 = 0.025
Module 3 Key Learning 2 2
1 − 𝛼 = 0.95 0.05 95%
0.10 90%
𝐿𝐶𝐿 𝟒. 𝟔𝟎 𝑈𝐶𝐿
LCL = Lower Confidence Limit Typically we choose 𝛼 = 0.05.
UCL = Upper Confidence Limit
Estimation
Point Estimation
Step 4 Establishing the Confidence Interval Confidence Interval
Properties of Point Estimators
t Distribution
Interval estimation
Significance Level, 𝜶 Confidence Level, (1-𝜶)
Parameters of 1 normal population
Parameters of 2 normal populations 𝛼 𝛼 0.01 99%
= 0.025 = 0.025
2 2
Module 3 Key Learning 1 − 𝛼 = 0.95 0.05 95%
0.10 90%
𝐿𝐶L 𝟒. 𝟔𝟎 𝑈𝐶𝐿
2.89 6.31
In the example:
Estimation
Point Estimation
Confidence Interval
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Let’s say specs target is 0+/-50um Let’s say specs target is 5+/-50um
Estimation
Point Estimation
Confidence Interval
Properties of Point Estimators FAQ
Interval estimation
Why typically we choose 𝛼 = 0.05 or
Parameters of 1 normal population Confidence Level = 95%?
Parameters of 2 normal populations
99%
95%
2.30 2.89 6.31 6.91
CI @ 95%
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population Frequency Interpretation – a numerical simulation
Parameters of 2 normal populations
2. Create 100 groups of sample with size, n=30 using sheet Pop 1.
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
Interval estimation
M2.0 Central
1. Open M2.0 Central Limit Theorem.xls. Limit Theorem
Parameters of 1 normal population
Parameters of 2 normal populations
2. Create 100 groups of sample with size, n=30 using sheet Pop 1.
Module 3 Key Learning
Up to 30
100
groups
116
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
M2.0 Central
Interval estimation 3. Copy/paste sample data to JMP, transpose to column format Limit Theorem
Up to 100 cols
Up to 30 rows
Can delete Label column…click on column, right click > Delete Column
117
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
M2.0 Central
Interval estimation Limit Theorem
Untick
118
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
M2.0 Central
Interval estimation Limit Theorem
Parameters of 1 normal population 4. Create confidence interval for all these 100 groups using:
Parameters of 2 normal populations a. 𝛼 = 0.10 𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝐿𝑒𝑣𝑒𝑙 = 90% .
b. 𝛼 = 0.05 𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝐿𝑒𝑣𝑒𝑙 = 95% .
Module 3 Key Learning
c. 𝛼 = 0.01 𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝐿𝑒𝑣𝑒𝑙 = 99% .
Pressing Ctrl key, click on hotspot, select Confidence interval, choose confidence level
Round1: 90% Round2: 95% Round3: 99%
119
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
M2.0 Central
Interval estimation Limit Theorem
There are 30 groups, each group with 3 sets of CI (90, 95, 99% CL)
120
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
M2.0 Central
Interval estimation Remove Std Dev rows Limit Theorem
121
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
M2.0 Central
Interval estimation Stack the data columns Limit Theorem
Y, Response: Data
X: Y
By: 1-Alpha
Click OK
122
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
M2.0 Central
Interval estimation Limit Theorem
123
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
Remove Grand Mean
Interval estimation
Click on hotspot > Display Options – untick Grand Mean
Parameters of 1 normal population
Parameters of 2 normal populations
Create box plot
Module 3 Key Learning Click on hotspot > Display Options – tick Box Plot
124
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
From exercise 1.1, we get population mean = 5.02, we will add pop mean line into our graph
Interval estimation
Double-click on Y axis of the graph, put value 5.02 then click Add > OK (Do for all 3 Oneway graph)
Parameters of 1 normal population
Parameters of 2 normal populations
CL(%) Tr P1 P2 P3 P4
90
95
99
125
Module 3
Estimation
Exercise #5
Point Estimation
Properties of Point Estimators
Compare Confidence Interval Width, use uniform scaling to easily see
Interval estimation
Right-click on Y axis of the graph (for 99%) > Edit > Copy Axis Settings
Parameters of 1 normal population
Click on 95% CL graph Yaxis > Edit > Paste Axis Settings
Parameters of 2 normal populations
Click on 90% CL graph Yaxis > Edit > Paste Axis Settings
Module 3 Key Learning
126
Module 3
Estimation
Frequency Interpretation – a numerical simulation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations Population Mean
Module 3 Key Learning
At 90% Confidence Level ➔ On average, 90/100 Confidence Intervals contain the Pop Mean 127
Frequency Interpretation – a numerical simulation
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations Pop Mean
At 95% Confidence Level ➔ On average, 95/100 Confidence Intervals contain the Pop Mean 128
Frequency Interpretation – a numerical simulation
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Pop Mean
Module 3 Key Learning
At 99% Confidence Level ➔ On average, 100/100 Confidence Intervals contain the Pop Mean 129
Module 3
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
130
Module 3
Estimation
Exercise #6
Point Estimation
Properties of Point Estimators
Scenario:
4. Decision? :
a. Release machine for production.
b. Re-setup machine.
131
Module 3
Estimation
Exercise #6
Point Estimation
Properties of Point Estimators
Generate descriptive statistics
Interval estimation
Click on Analyze > Distribution, select X & Y offset columns, then OK
Parameters of 1 normal population
Parameters of 2 normal populations
Input here
Do the same for another parameter down below
132
Module 3
Estimation
Exercise #6
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population Generate descriptive statistics
Parameters of 2 normal populations Click on Analyze > Distribution, select X & Y offset columns, then OK
Module 3 Key Learning
133
Module 3
Estimation
Exercise #6
Point Estimation
Properties of Point Estimators
Generate descriptive statistics
Interval estimation
Parameters of 1 normal population Click on Analyze > Distribution, select X & Y offset columns, then OK
Parameters of 2 normal populations
134
Module 3
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
𝒔
𝑰𝒇 𝝈 𝒖𝒏𝒌𝒏𝒐𝒘𝒏 𝑵
t 𝝁,
𝒏
Target Interest
𝟑. 𝟔𝟗 𝟏. 𝟑𝟕
t𝑵 −𝟏. 𝟐𝟕, t𝑵 −𝟗. 𝟗𝟎,
𝟑𝟎 𝟑𝟎
𝑆𝑡𝑑 𝐸𝑟𝑟: 0.67 𝑆𝑡𝑑 𝐸𝑟𝑟: 0.25
Target Interest
−𝟏. 𝟐𝟕 −𝟗. 𝟗𝟎
-2.65 CI 0..11 -10.41 CI -9.39
Knowledge Knowledge
At 95% CL, target within CI. At 95% CL, target not within CI.
Process Is Centered. Process Not Centered.
Step 3 User to Define Significant Level, 𝜶. Typically @ 0.05 or 95% Confidence Level.
Estimation
Exercise #7
Point Estimation
Properties of Point Estimators
Example #2
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations Same scenario as in Exercise 6 but with different data set
Module 3 Key Learning
136
Module 3
Estimation
Exercise #7
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations To add process capability
Go to data table, select both columns > Right-click > Column Properties > Specs Limits
Module 3 Key Learning
Input here
137
Module 3
Estimation
Exercise #7
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
138
Module 3
Confidence Intervals
Estimation FAQ Answer
Point Estimation Could we increase the precision Increase the sample size, n. 𝑺𝒂𝒎𝒑𝒍𝒆 𝑺𝒊𝒛𝒆, 𝒏𝟏
Properties of Point Estimators without sacrificing the frequencies?
Interval estimation
Parameters of 1 normal population 2.5% 2.5%
𝝈
Parameters of 2 normal populations
𝑰𝒇 𝝈 𝒌𝒏𝒐𝒘𝒏 𝑵 𝝁, 95%
Module 3 Key Learning
𝒏
𝒔 CI @ 95%
𝑰𝒇 𝝈 𝒖𝒏𝒌𝒏𝒐𝒘𝒏 𝑵
t 𝝁,
𝒏
𝑺𝒂𝒎𝒑𝒍𝒆 𝑺𝒊𝒛𝒆, 𝒏𝟐
𝒔 (𝒏𝟐 > 𝒏𝟏 )
Sample size increases ➔ decreases.
𝒏
CI @ 95% 139
Module 3
Estimation
Exercise #8
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Exercise File:
Module 3 Key Learning
Use M.20 Central Limit Theorem.xls file.
Exercise
1. Click on Sheet Pop 1.
2. Random sampling 100 groups of sample with size = 5 & 30.
3. Compute Confidence Interval for all 100 groups for both n = 5 and n = 30.
4. Use 95% Confidence Level for both groups.
140
Module 3
Estimation
Exercise #8
Point Estimation
Properties of Point Estimators
141
Module 3
Estimation
Exercise #8
Point Estimation
Properties of Point Estimators
Copy and paste to JMP
Interval estimation
Parameters of 1 normal population
Go to Tables > Transpose
Parameters of 2 normal populations
142
Module 3
Estimation
Exercise #8
Point Estimation
Properties of Point Estimators
Remove Quantiles and other Statistics except Confidence Intervals
Interval estimation
Parameters of 1 normal population
Pressing Ctrl key, Go to hotspot > Display Options, untick Quantiles
Parameters of 2 normal populations
Pressing Ctrl key, Go to hotspot of Summary Statistics, untick others except Confidence Intervals
143
Module 3
Estimation
Exercise #8
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations Right-Click somewhere here, select
Module 3 Key Learning
Make Combined Data Table
144
Module 3
Estimation
Exercise #8
Point Estimation
Properties of Point Estimators
Make a box plot
Interval estimation
Parameters of 1 normal population
Go to Analyze > Fit Y by X
Parameters of 2 normal populations
Have the Grand Mean removed and tick on Box Plots Add the Pop Mean line (double-click on Yaxis)
145
Hotspot > Display Options
Module 3
Estimation
Exercise #8
Point Estimation
Properties of Point Estimators
Do the same for n = 30 samples
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Go to n = 30 graph > right-click on Y axis > Edit > Paste Axis Settings
146
Module 3
Estimation
Exercise #8
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
147
Module 3
Estimation
Confidence Intervals
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
148
Using same CL at 95%, as sample size increases ➔ Precision improves maintaining same frequencies to contain pop mean.
Module 3
Estimation
Confidence Intervals
Increase
Point Estimation
Probability to Contain Sample Size
Properties of Point Estimators Population Mean
Interval estimation
𝑺𝒂𝒎𝒑𝒍𝒆 𝑺𝒊𝒛𝒆, 𝒏𝟏
(𝒏𝟐 > 𝒏𝟏 ) 𝑺𝒂𝒎𝒑𝒍𝒆 𝑺𝒊𝒛𝒆, 𝒏𝟐 > 𝒏𝟏
Parameters of 1 normal population
Parameters of 2 normal populations
Precision
Estimation
Point Estimation
Confidence Intervals
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Confidence intervals for parameters of one population
Module 3 Key Learning
Confidence Intervals
σ2 Known σ2 Unknown
150
Module 3
Estimation
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Assumption
- The population is normally distributed
Module 3 Key Learning
(n − 1)s 2
The random variable 2
n−1 =
σ2
follows a chi-square distribution with (n – 1) degrees of freedom
151
Confidence Interval for σ2
Module 3
Estimation
Point Estimation
Properties of Point Estimators
The (1 - )% confidence interval for the population variance is:
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations 𝒏 − 𝟏 𝒔𝟐 𝒏 − 𝟏 𝒔𝟐
< 𝝈𝟐 <
𝝌𝟐𝜶 𝝌𝟐
Module 3 Key Learning
𝜶
,𝒏−𝟏 𝟏− ,𝒏−𝟏
𝟐 𝟐
f(2n-1)
probability probability
α/2 = 0.025 (1-α)=0.95 α/2 = 0.025
2n-1
2n-1, α/2 2n-1, 1-α/2
152
Module 3
Estimation
Point Estimation
Confidence Interval for σ2
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population EXAMPLE
Parameters of 2 normal populations
You are testing the speed of a computer processor (X).
Module 3 Key Learning
Assuming that the population is normal, determine a 95% confidence interval for σx2 , the
true population variance.
153
Module 3
Point Estimation
Properties of Point Estimators
154
Module 3
Estimation
Point Estimation
Advanced Explanation
Properties of Point Estimators
Interval estimation
2 2 2
Parameters of 1 normal population
DEFINITION: if 𝑃 𝜒𝑛−1 ≥ 𝜒𝛼/2,𝑛−1 = 𝛼Τ2 then, 𝜒𝛼/2,𝑛−1 is the percentage point of a
Parameters of 2 normal populations Chi-Square distribution with (n-1) degrees of freedom.
Module 3 Key Learning
2
❑ Let 𝜒𝛼/2,𝑛−1 be the percentage point of a Chi-Square distribution with (n-1) DF such
2 2
that 𝑃 𝜒𝑛−1 ≥ 𝜒𝛼/2,𝑛−1 = 𝛼Τ2, (right tail).
2
❑ Let 𝜒1−𝛼/2,𝑛−1 be the percentage point of a Chi-Square distribution with (n-1) DF, such
2 2
that 𝑃 𝜒𝑛−1 ≥ 𝜒1−𝛼/2,𝑛−1 = 1 − 𝛼Τ2 (left tail).
(𝑛−1)𝑠 2
It can be shown that the quantity follows a Chi-Square distribution with (n-1) DF.
𝜎2
(𝑛−1)𝑠 2
A confidence interval at the level (1- 𝛼 ) for the quantity can be formed taking the
𝜎2
2 2
distance between the 2 percentage points previously shown: 𝜒𝛼/2,𝑛−1 and 𝜒1−𝛼/2,𝑛−1 .
2 (𝑛−1)𝑠 2 2
So, it can be stated that P 𝜒1−𝛼/2,𝑛−1 ≤ ≤ 𝜒𝛼/2,𝑛−1 =1−𝛼
𝜎2
156
Module 3
Interval estimation
Parameters of 1 normal population To obtain the confidence interval for the population variance, the term 𝜎 2 must be isolated.
Parameters of 2 normal populations
Dividing by (𝑛 − 1)𝑠 2 , we obtain
Module 3 Key Learning
2 2
𝜒1−𝛼/2,𝑛−1 1 𝜒𝛼/2,𝑛−1
P ≤ ≤ =1−𝛼
(𝑛 − 1)𝑠 2 𝜎 2 (𝑛 − 1)𝑠 2
2
𝜒1−𝛼/2,𝑛−1 1 (𝑛−1)𝑠 2
≤ 2 ≥ 𝜎2
(𝑛−1)𝑠 2 𝜎2 𝜒1−𝛼/2,𝑛−1
to solve for 𝜎 2 we need the reciprocal, and we get
1
2
𝜒𝛼/2,𝑛−1 (𝑛−1)𝑠 2
≤ 𝜎2 ≥ 2
𝜎2 (𝑛−1)𝑠 2 𝜒𝛼/2,𝑛−1
which leads to the 100(𝟏 − 𝜶 )% two-sided confidence interval for the population variance:
𝒏 − 𝟏 𝒔𝟐 𝒏 − 𝟏 𝒔𝟐
< 𝝈𝟐 <
𝝌𝟐 𝜶 𝝌𝟐 𝜶
𝒏−𝟏, 𝟐 𝒏−𝟏,(𝟏−𝟐 )
157
Module 3
Estimation
Confidence Interval for the population proportion
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations Let 𝑝Ƹ be the proportion of successes in n independent trials, having each a probability of success
equal to p.
Module 3 Key Learning
The following C.I. for the population proportion p is valid for large samples. Or, in other terms, the
method is valid under the assumption that the Binomial distribution is satisfactorily approximated
by a Normal distribution. This happens if np(1- p) > 9.
Under this assumption, a 100(1-α)% C.I. for the population proportion is given by:
ෝ(𝟏−ෝ
𝒑 𝒑) ෝ (𝟏−ෝ
𝒑 𝒑)
ෝ − 𝒁𝜶/𝟐
𝒑 ෝ + 𝒁𝜶/𝟐
<𝒑<𝒑
𝒏 𝒏
158
Module 3
Estimation
Confidence Interval for the population proportion
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
ෝ(𝟏−ෝ
𝒑 𝒑) ෝ(𝟏−ෝ
𝒑 𝒑) 𝟎.𝟏𝟖𝟕𝟓(𝟎.𝟖𝟏𝟐𝟓) 𝟎.𝟏𝟖𝟕𝟓(𝟎.𝟖𝟏𝟐𝟓)
ෝ − 𝒁𝜶/𝟐
𝒑 ෝ + 𝒁𝜶/𝟐
<𝒑<𝒑 ⟹ 0.1875-1.96 <p< 0.1875+1.96
𝒏 𝒏 𝟖𝟎 𝟖𝟎
IC=(0.102 – 0.273)
159
Module 3
Estimation
Exercise #9
Point Estimation
Properties of Point Estimators
1. Open the exercise File:
Interval estimation
2. Trainer will show using JMP:
Parameters of 1 normal population
a. How to perform one population Proportion Test
Parameters of 2 normal populations
Note:
• In JMP, the default proportion test is using Wilson Score Interval. Thus the result
obtained is different from the method used in slide # the previous slide.
Visit the JMP Learning Center under section of “Useful Add-In” and download
the add-in called Statistical Calculator.
https://stmicroelectronics.sharepoint.com/teams/jmplearningcenter9/SitePages/
Welcome-to-JMP-Video-Tutorial.aspx
160
Module 3
Estimation
Exercise #9
Point Estimation
Properties of Point Estimators
161
Module 3
Estimation
Exercise #9
Point Estimation
Properties of Point Estimators
Interval estimation Using this data: Go to Analyze > Distribution – Conformity column to Y > OK
Parameters of 1 normal population
Parameters of 2 normal populations
162
Module 3
Estimation
Exercise #9
Point Estimation
Properties of Point Estimators
Interval estimation Using this data: Go to Analyze > Distribution – Conformity column to Y, Freq col to Freq > OK
Parameters of 1 normal population
Parameters of 2 normal populations
163
Module 3
Estimation
Exercise #9
Point Estimation
Properties of Point Estimators
Interval estimation
JMP uses
Parameters of 1 normal population Wilson score method
Parameters of 2 normal populations
164
Module 3
Estimation
Exercise #9
Point Estimation
Properties of Point Estimators Go to Add-Ins > Statistics Calculator III > Confidence Interval for One Proportion
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
165
Module 3
Estimation
Exercise #9
Point Estimation
Properties of Point Estimators Go to Add-Ins > Statistics Calculator III > Confidence Interval for One Proportion
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
166
Module 3
Estimation
Exercise #9
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
167
Module 3
Estimation
Interval estimation
Parameters of 1 normal population Formulas for confidence intervals of parameters of one normal population:
Parameters of 2 normal populations
ෝ(𝟏−ෝ
𝒑 𝒑) ෝ(𝟏−ෝ
𝒑 𝒑)
Θ (population proportion) ෝ − 𝒁𝜶/𝟐
𝒑 ෝ + 𝒁𝜶/𝟐
<𝒑<𝒑
𝒏 𝒏
NOTE: 168
For more details, se also the Manual of Statistical Methodology – Ch. 6. (DMS 8482919_A)
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
169
Module 3
Estimation
Point Estimation
Interval Estimation – 2 populations
Properties of Point Estimators
Interval estimation When two populations are simultaneously considered, the goal of
Parameters of 1 normal population
Parameters of 2 normal populations
the confidence intervals is often to compare among them the
parameters of the two populations
Module 3 Key Learning
Population X Population Y
σX σY
μX μY
Samples x1, x2, ... ,xnx y1, y2, ... ,yny
Point estimates ഥ, 𝒔𝒙 𝟐
𝒙 & ഥ, 𝒔𝒚 𝟐
𝒚
Comparisons μX = μY ? & σ 2X = σ 2Y ?
170
Module 3
Estimation
Point Estimation
Interval Estimation(two populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
GENERAL SCHEME Confidence Interval for:
Module 3 Key Learning
Examples:
Comparison of means: Comparison of means: Comparison of
Group 1 vs. independent Same group before variances of two
Group 2 vs. after treatment normal distributions
171
Module 3
Estimation
Point Estimation
Interval Estimation(two populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
GENERAL SCHEME Confidence Interval for:
Module 3 Key Learning
172
Module 3
Estimation
Point Estimation
Interval Estimation(two populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
173
Module 3
Estimation
Point Estimation
C. I. for μX-μY (independent populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations Goal: Form a confidence interval
Population means,
Module 3 Key Learning
independent samples for the difference between two
population means, μX – μY
• Different data sources
• Uncorrelated
• Independent
• Sample selected from one population has no effect on the sample selected from the
other population
Interval estimation
Parameters of 1 normal population
Different cases can be identified
Parameters of 2 normal populations
𝟐 𝟐
𝒏𝒙 − 𝟏 𝑺𝒙 + 𝒏𝒚 − 𝟏 𝑺𝒚
Calculate the pooled variance 𝑺𝟐𝒑 : 𝑺𝟐𝒑 =
𝒏𝒙 + 𝒏𝒚 − 𝟐
177
Module 3
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
178
Module 3
Estimation
Point Estimation
C. I. for μX-μY (independent populations)
Properties of Point Estimators
Assuming that the two populations are normally distributed with unknown but equal
variances, determine a 95% confidence interval for the difference of CPU mean speeds.
179
C. I. for μX-μY (independent populations)
SOLUTION
2 2
2
n x − 1 Sx + n y − 1 Sy 17 − 1 742 + 14 − 1 562
The pooled variance is: Sp = = = 4427.03
(n𝑥 − 1) + (ny − 1) (17−1) + (14 − 1)
The t value for a 95% confidence interval is: t nx +ny −2 , α/2 = t 29 , 0.025 = 2.045
416.69 μX − μY 515.31
We are 95% confident that the mean difference in CPU speed is between 416.69 and 515.31 Mhz. Since zero
is not included in the interval, we cannot state that they are equally performing in terms of mean CPU speed.
180
Module 3
Estimation
Exercise #10
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations 1. Open the exercise File:
Module 3 Key Learning
2. Trainer will show using JMP:
a. How to perform Unequal Variance Test.
b. How to perform Confidence Interval.
3. Interpretation of results.
181
Module 3
Estimation
Exercise #10
Point Estimation
Properties of Point Estimators
Interval estimation 1. Open the exercise File and stack the data columns
Parameters of 1 normal population
Parameters of 2 normal populations
Go to Tables > Stack
182
Module 3
Estimation
Exercise #10
Point Estimation
Properties of Point Estimators
Go to hotspot > Unequal Variances
Does your CI
include 0?
183
C. I. for μX-μY (independent populations)
Module 3
Estimation
Point Estimation
Properties of Point Estimators
Population means, independent samples
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations σx2 and σy2 known
Module 3 Key Learning
σx2 and σy2 assumed equal
σx2 and σy2 unknown
σx2 and σy2 assumed unequal
1 Assumptions:
▪ Samples are randomly and independently drawn
▪ both population distributions are Normal
Context: This case
▪ Population variances are unknown and assumed unequal
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
185
Module 3
Estimation
Exercise #11
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations 1. Open the exercise File:
3. Interpretation of results.
186
Module 3
Estimation
Exercise #11
Point Estimation
Properties of Point Estimators
1. Follow Step1 & 2 in Exercise 10
Interval estimation
Parameters of 1 normal population Go to hotspot > Unequal Variances
Parameters of 2 normal populations
Does your CI
include 0?
187
Module 3
Estimation
Point Estimation
Interval Estimation(two populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
GENERAL SCHEME Confidence Interval for:
Module 3 Key Learning
Examples:
Comparison of means: Comparison of means: Comparison of
Group 1 vs. independent Same group before variances of two
Group 2 vs. after treatment normal distributions
188
Module 3
Estimation
Point Estimation
Interval Estimation(two populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population Interval estimation of unknown population parameters (two
Parameters of 2 normal populations
dependent populations)
Module 3 Key Learning
189
Module 3
Estimation
Point Estimation
C. I. for μX-μY (dependent populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Estimation
Point Estimation
C. I. for μX-μY (dependent populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
𝑺𝒅 𝑺𝒅
ഥ−𝒕
𝒅 𝜶 ഥ+𝒕
< 𝝁𝒅 < 𝒅 𝜶
𝒏−𝟏, 𝒏 𝒏−𝟏,
𝟐 𝟐 𝒏
Where,
n = the sample size (number of matched pairs in the paired sample)
191
Module 3
Estimation
Point Estimation
C. I. for μX-μY (dependent populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
192
Module 3
Estimation
Exercise #12
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
1. Open the exercise File:
Module 3 Key Learning
2. Trainer will show using JMP:
a. How to perform Confidence Interval.
3. Interpretation of results.
193
Module 3
Estimation
Exercise #12
Point Estimation
Properties of Point Estimators
194
Module 3
Estimation
Exercise #12
Point Estimation
Properties of Point Estimators
Interval estimation Method2: Using the Changes column (difference of T0 & T500 measurements)
Parameters of 1 normal population
Parameters of 2 normal populations
Go to Distribution
Cast Changes to Y, Response → OK
Module 3 Key Learning
195
C. I. for μX-μY (dependent populations)
Module 3
Estimation
Point Estimation
Properties of Point Estimators
EXAMPLE
Interval estimation
Parameters of 1 normal population Six people sign up for a weight loss program. You collect the following data:
Parameters of 2 normal populations
1 136 125 11
2 205 195 10
3 157 150 7
4 138 140 -2
5 175 165 10
6 166 160 6
42
di ∑ (d i- d) 2
Sd = = 4.82
d = n = 7.0 n -1
Form a 95% confidence interval for the difference of means and establish
if the weight loss program helps people loosing weight.
196
C. I. for μX-μY (dependent populations)
Module 3
Estimation
Point Estimation
Properties of Point Estimators
For a 95% confidence level, the appropriate t value is tn-1,/2 = t5,0.025 = 2.571
Interval estimation
SOLUTION The 95% confidence interval for the difference between means, μd , is
Parameters of 1 normal population
Parameters of 2 normal populations
𝐒𝐝 𝐒𝐝
ഥ
𝒅−t ഥ+𝐭
< 𝛍𝐝 < 𝒅
Module 3 Key Learning n−1,α/2 𝐧 n−1,α/2 𝐧
4.82 4.82
7−(2.571) < 𝛍𝐝 < 7+(2.571)
𝟔 𝟔
Interpretation:
• Since this interval does not contains zero, we can be 95% confident, given this
limited data, that the weight loss program produced a statistically significant effect
on people weight.
• Since the confidence limits are positive, we can state that the weight loss program
helps people to lose weight (weight before > weight after).
197
Module 3
Estimation
Point Estimation
C. I. for μd and σd (Dependent samples)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
EXAMPLE
Module 3 Key Learning
Confidence interval for the mean μd and for the standard deviation σd of the difference (d)
of the values drawn from 2 normal paired (or matched) samples
Use here the same data of previous example 2.6 for independent samples (same
parameter in two successive time periods).
Here, we assume that the data in columns A and B represent measurements on the
same units before and after a certain treatment was given (or, equivalently, in two
different periods of time: t-1 and t)
• Determine and interpret the 95% confidence intervals for μd and for σd
198
Module 3
Confidence interval
199
Module 3
Estimation
Point Estimation
Interval Estimation(two populations)
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
GENERAL SCHEME Confidence Interval for:
Module 3 Key Learning
Examples:
Comparison of means: Same group Comparison of
Group 1 vs. before vs. after variances of two
independent Group 2 treatment normal distributions
200
C. I. for (σX)2/(σY)2 (indep. Normal populations)
Module 3
Estimation
Point Estimation
Properties of Point Estimators
2 The confidence interval for the ratio of variances σX2/σY2 is given by:
3 σ2X and σ2Y comparison: “Form a (1-α)% confidence interval for the ratio σ2X
/σ2Y . If one (1) is included in the interval, we can state (with (1-α)% confidence)
that σ2X = σ2Y ”.
201
Module 3
Estimation
Confidence Interval for the difference of 2 population proportions
Point Estimation
Properties of Point Estimators
ASSUMPTIONS:
Interval estimation
Parameters of 1 normal population • Samples are randomly and independently drawn.
Parameters of 2 normal populations • Samples sizes are nX and nY .
• Both sample sizes are large and the Normal approximation to the Binomial distribution holds (np (1 - p) > 9).
Module 3 Key Learning
𝑝Ƹ𝑋 1 − 𝑝Ƹ𝑋 𝑝Ƹ 𝑌 1 − 𝑝Ƹ 𝑌+
𝑝Ƹ𝑋 − 𝑝Ƹ 𝑌 ± 𝑍𝛼 +
2 𝑛𝑋 𝑛𝑌
202
Module 3
Estimation
Confidence Interval for the difference of 2 population proportion
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
EXAMPLE
Two production lots have been inspected. From the first, a random sample of size nX = 270 units is drawn and then
Module 3 Key Learning checked. 11 units out of 270 have been found defective. From the second lot, a random sample of size nY = 352 units
is drawn and then checked. 19 units out of 352 have been found defective.
Form a 90% C.I. for the difference of proportions of defective units in lot 1 and in lot 2 to assess equality of proportions.
SOLUTION
11 19
From lot 1: 𝑝Ƹ𝑋 = = 0.0407. From lot 2: 𝑝Ƹ𝑌 = = 0.0053977
270 352
𝑝Ƹ 𝑋 1−𝑝Ƹ 𝑋 𝑝Ƹ 𝑌 1−𝑝Ƹ 𝑌+ 0.0407 0.9593 0.053977 0.946023
For 90% confidence level, 𝑍𝛼 = 1.645, and + = + = 0.01702
2 𝑛𝑋 𝑛𝑌 270 352
𝑝Ƹ 𝑋 1−𝑝Ƹ 𝑋 𝑝Ƹ 𝑌 1−𝑝Ƹ 𝑌+
The confidence limits are: 𝑝Ƹ𝑋 − 𝑝Ƹ𝑌 ± 𝑍𝛼 + = (0.0407 − 0.053977) ± 1.645(0.01702)
2 𝑛𝑋 𝑛𝑌
and the C.I. is:
𝑪𝑰 = −𝟎. 𝟎𝟒𝟏𝟐 < 𝒑𝑿 − 𝒑𝒀 < 𝟎. 𝟎𝟏𝟒𝟖
CONCLUSION
Since 0 ∈ C.I., with a 90% confidence level we can state that the proportion of defective is equivalent in the two lots.
203
Module 3
Estimation
Exercise #13
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
3. Interpretation of results.
204
Module 3
Estimation
Exercise #13
Point Estimation
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
Go to Tables > Stack
Stacked data
205
Module 3
Estimation
Exercise #13
Point Estimation Go to hot spot > Two Sample Test for Proportions
Properties of Point Estimators
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
CI for Proportion
206
Module 3
Estimation
Exercise #13
Point Estimation
Properties of Point Estimators
207
Module 3
Interval estimation
Parameters of 1 normal population
Parameters of 2 normal populations
208
Module 3
Estimation
Point Estimation
Module 3 Key Learnings
Properties of Point Estimators
209
END OF STATS 2 PART 1
210
File Revision
211