Professional Documents
Culture Documents
Sampling and Sampling Distribution: September 12, 2017
Sampling and Sampling Distribution: September 12, 2017
Outline of Topics
1 Sampling
Statistical Inference
Parameter Statistic
Probability distribution Histogram
(Population) mean, µ (Sample) mean, X̄
(Population) variance, σ 2 (Sample) variance, s 2
(Population) standard deviation, σ (Sample) standard deviation, s
(Population) proportion, p (Sample) proportion, p̂
PN
i=1 Xi
µ=
N
X1 , X2 , · · · , XN
Population
PN
i=1 (Xi − µ)2
σ2 =
N
Pn
i=1 Xi
X̄ =
n
X1 , X2 , · · · , Xn
Sample
Pn
i=1 (Xi − X̄ )2
s2 =
n
Intuition tells us X̄ is similar to µ and s2 is similar to σ2
STAT 101 Class 5 Slide 8
Outline Sampling Sampling distribution of a mean Sampling distribution of a proportion
Sampling error
Example Sampling with replacement from a finite population
Population Sample∗
Units X1 , ..., X7 = 1, 2, 3, 4, 5, 6, 7 X1 , ..., X5 =3, 6, 5, 1, 6
Size N=7 n=5
X1 +...+XN 1+...+7 X1 +...+Xn 3+6+5+1+6
Mean µ= N = 7 =4 X̄ = n = 5 = 4.2
Sampling distribution
Every SRS is randomly drawn from the population, hence X̄ and its
sampling error X̄ − µ ≡ × are both random — we cannot make
definitive statements about anything random (c.f., class 1 slide 10)
× = X̄ − µ
Sample k
4, 5, 6, 1, 7 4.6 − µ = ×
Population
X̄ = 4.6
1, 2, 3, 4, 5, 6, 7
Sample 2
1, 4, 6, 2, 2
3−µ=×
X̄ = 3
Sample 1
3, 6, 5, 1, 6
4.2 − µ = ×
X̄ = 4.2
0 Sampling error
µ X̄
X̄ ∼ Normal
|{z} | {z } (µ, var(X̄ )) and × = X̄ − µ ∼ Normal
| {z } (0, var(×))
| {z } | {z } | {z }
statistic sampling sampling sampling error sampling sampling
distribution variation distribution variation
We do not know where exactly is × among the red ×s. However, using p the
empirical rules, we can be 95% certain that × is no more than 0 ± 2 var(×)
STAT 101 Class 5 Slide 13
Outline Sampling Sampling distribution of a mean Sampling distribution of a proportion
Sampling variation
Sample X̄ Sampling error ×
1 3, 6, 5, 1, 6 4.2 4.2 − µ
2 1, 4, 6, 2, 2 3 3 −µ
.. .. .. ..
. . . .
k 4, 5, 6, 1, 7 4.6 4.6 − µ
X1 +...+X5 X1 +...+X5
Any X1 , X2 , X3 , X4 , X5 5 5 −µ
3 X1 , X2 , ..., X5 are randomly drawn from the population, they must have the
same behaviour as any X randomly drawn from the population, i.e.,
Sampling distribution
Sampling error
0 0
What is a proportion?
Example We wish to estimate the proportion, p, of homeless people in
a population of N individuals. Let X indicate whether someone is
homeless:
1 homeless
X =
0 not homeless
X1 + ... + Xn
X̄ ≡ p̂ =
n
1 with probability p
X1 , ..., Xn are:
0 with probability 1 − p
var(X )
We use CLT for X̄ , i.e., X̄ ∼ N(µ, )
n }
| {z
var(X̄ )
p2
z}|{
var(X ) = E(X 2 ) − E(X )2 = (1)2 p + (0)2 (1 − p) − µ2
= p − p 2 = p(1 − p)