Professional Documents
Culture Documents
Lec09 Estimation
Lec09 Estimation
Lec09 Estimation
İnci M. Baytaş
Fall 2022
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 1
Statistical Inference
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 2
Point Estimate
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 3
Point Estimate
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 3
Point Estimate
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 3
Point Estimate
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 3
Unbiased Estimator
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 4
Unbiased Estimator
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 4
Unbiased Estimator
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 4
Unbiased Estimator
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 4
Variance of a Point Estimator
I If Θ̂1 and Θ̂2 are two unbiased estimators of the same population
parameter θ, choose the estimator whose sampling distribution has
the smaller variance.
I If σθ̂2 < σθ̂2 , Θ̂1 is a more efficient estimator of θ than Θ̂2 .
1 2
Definition
If we consider all possible unbiased estimators of some parameter θ, the
one with the smallest variance is called the most efficient estimator of θ.
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 5
Sampling Distributions of Different Θ̂
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 6
Interval Estimation
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 7
Interval Estimation
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 8
Estimating the Mean
I x̄ is likely to be a very accurate estimate of µ when n is large.
I If the sample is from normal population or, if n is sufficiently large, a
confidence interval for µ can be established by considering the
sampling distribution of X̄
I Recall the Central Limit Theorem, sampling distribution of X̄ is
√
approximately normal with µX̄ = µ and σX̄ = σ/ n.
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 10
Confidence Interval
I Most of the time, x̄ will not be exactly equal to µ and the point
estimate will be in error.
Theorem
If x̄ is used as an estimate of µ, we can be 100 (1 − α) % confident that
the error, |µ − x̄ |, will not exceed zα/2 √σn .
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 11
Confidence Interval
Theorem
If x̄ is used as an estimate of µ, we can be 100 (1 − α) % confident that
the error will not exceed a specified amount e when the sample size is
2
zα/2 σ
n=
e
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 12
One-Sided Confidence Bounds
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 13
Exercise
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 14
The Case of Unknown σ
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 15
The Case of Unknown σ
X̄ − µ
T = √
S/ n
S S
P X̄ − tα/2 √ < µ < X̄ + tα/2 √ = 1 − α.
n n
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 15
The Case of Unknown σ
Figure: P −tα/2 < T < tα/2 = 1 − α
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 16
The Case of Unknown σ
s
x̄ ± tα/2 √ = x̄ ± tα/2 s.e. (x̄ )
n
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 18
Prediction Intervals
I Assume the random sample from a normal population with unknown
µ and known σ 2 .
I To predict a new observation, we need to take into account
I variation due to estimating the mean
I variation of a future observation.
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 19
Prediction Interval
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 20
Prediction Interval
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 21
Prediction Limits for Outlier Detection
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 22
Exercise
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 23
Estimating the Difference between Two Means
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 24
Estimating the Difference between Two Means
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 25
Variances Unknown but Equal
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 26
Variances Unknown but Equal
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 27
Variances Unknown but Equal
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 28
Variances Unknown and Not Equal
I Now, we consider the difference of two means when the samples are
not independent and the variances of the two populations are not
necessarily equal.
I This situation is named as paired observations.
I A pair of observation is received.
I Using paired may reduce experimental error variance.
I i−th pair difference
Di = X1i − X2i
Var (Di ) = Var (X1i − X2i ) = σ12 + σ22 − 2Cov (X1i , X2i )
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 30
Paired Observations
I Pairing reduces variance and hence reduces the standard error of the
point estimate.
I Degrees of freedom are also reduced since it’s a one-sample problem
now.
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 31
Single Sample: Estimating a Proportion
I A point estimator of the proportion p in a binomial experiment is
given by the statistic P̂ = X /n, where X represents the number of
successes in n trials.
I If p is not too close to 0 or 1, a confidence interval can be established
considering the sampling distribution of P̂.
I Failure is represented by 0 and success is 1. Then, the number of
successes x is the sum of n values.
I For sufficiently large n, the Central Limit Theorem suggests that P̂ is
approximately normal with mean
X np
µP̂ = E P̂ = E = =p
n n
and variance
σ2 npq pq
σP̂2 = σX2 /n = X2 = 2 = .
n n n
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 32
Confidence Interval for p
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 33
Single Sample: Estimating the Variance
(n − 1) S 2
X2 = .
σ2
P χ21−α/2 < X 2 < χ2α/2 = 1 − α
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 34
Single Sample: Estimating the Variance
Figure: P χ21−α/2 < X 2 < χ2α/2 = 1 − α
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 35
Single Sample: Estimating the Variance
(n − 1) s 2 2 (n − 1) s 2
< σ <
χ2α/2 χ21−α/2
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 36
Estimating the Ratio of Two Variances
σ22 S12
F =
σ12 S22
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 37
Estimating the Ratio of Two Variances
Figure: P f1−α/2 (v1 , v2 ) < F < fα/2 (v1 , v2 ) = 1 − α
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 38
Estimating the Ratio of Two Variances
Note:
1
f1−α (ν1 , ν2 ) =
fα (ν2 , ν1 )
İnci M. Baytaş CMPE 343 Introduction to Probability and Statistics for Computer Engineers 39