Sampling and Estimation
Research Methods: Lecture 6
Sarah Griffiths
sarah.griffiths@ucl.ac.uk
• Populations vs samples
• Different types of sampling
• Random vs convenience
• Sampling distributions
• Central limit theorem
• Standard error
• Confidence intervals
POPULATIONS AND SAMPLES
• When we conduct a study, we are not usually interested in the particular participants in
our study (our sample); we are interested in people in general (the population).
• Say our study was about the verbal skills of 5- to 6-year-old children:
Population: all 5- to 6-year-old children
Sample: the 5- to 6-year-old children who took part in our study
• How likely is it that the population mean is exactly the same as the sample
mean?
• Not very likely!
• The difference between the population parameter and the sample statistic is
called sampling error.
Sample mean = 3.83 cm; population mean = ???
SAMPLING ERROR
• There are many possible samples of 5 aliens that I could have selected.
• All of these samples will have slightly different means.
Example sample means: 3.64 cm, 4.06 cm, 3.83 cm, 3.98 cm
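The variation between samples can be reproduced with a short simulation. This is a minimal R sketch, assuming a hypothetical population of 10,000 measurements; the population values, the seed, and the names `population`, `sample_means` and `sampling_error` are invented for illustration and are not the lecture's data.

```r
set.seed(6)  # arbitrary seed so the sketch is reproducible

# Hypothetical population: 10,000 measurements in cm (not the lecture's data)
population <- rnorm(10000, mean = 3.8, sd = 0.8)

# Draw four separate random samples of 5 and record each sample mean
sample_means <- replicate(4, mean(sample(population, size = 5)))

round(sample_means, 2)      # four slightly different sample means
round(mean(population), 2)  # the population parameter they estimate

# Sampling error of each sample: sample statistic minus population parameter
sampling_error <- sample_means - mean(population)
round(sampling_error, 2)
```

In a real study the population mean is unknown, so the sampling error cannot be computed directly; the simulation only works because the population here is made up.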
SAMPLING DISTRIBUTION
• http://onlinestatbook.com/stat_sim/sampling_dist/
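A sampling distribution can also be approximated in R by drawing many samples and collecting their means. The sketch below assumes a hypothetical normal population; the helper `sampling_distribution()`, the population values and the number of samples are illustrative choices, not part of the lecture.

```r
set.seed(6)  # arbitrary seed so the sketch is reproducible

# Hypothetical population of 10,000 measurements in cm (not the lecture's data)
population <- rnorm(10000, mean = 3.8, sd = 0.8)

# Means of many random samples, each of size n
sampling_distribution <- function(n, n_samples = 5000) {
  replicate(n_samples, mean(sample(population, size = n)))
}

means_n5  <- sampling_distribution(5)
means_n50 <- sampling_distribution(50)

# Both sets of sample means centre on the population mean...
c(population = mean(population), n5 = mean(means_n5), n50 = mean(means_n50))

# ...but the means of larger samples are less spread out
c(n5 = sd(means_n5), n50 = sd(means_n50))

# hist(means_n5); hist(means_n50)  # uncomment to view the two distributions
```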
STANDARD ERROR
• The standard deviation of the sampling distribution of the mean is also called
the ‘standard error’.
• Standard error can be estimated from the sample standard deviation (s) and the
sample size (n) using the following formula:
$$\text{standard error} = \frac{s}{\sqrt{n}}$$
SAMPLE SIZES
Scientist     Sample size   Sample mean (cm)   Sample SD (cm)
Scientist 1   5             3.83               0.866
Scientist 2   15            3.98               0.766
Scientist 3   50            3.70               0.622

$$\text{standard error}_1 = \frac{0.866}{\sqrt{5}} = 0.387$$
$$\text{standard error}_2 = \frac{0.766}{\sqrt{15}} = 0.198$$
$$\text{standard error}_3 = \frac{0.622}{\sqrt{50}} = 0.088$$
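The same calculation can be done in R. This sketch divides each sample SD by the square root of the sample size, using the sample sizes, means and SDs from the table; the data frame and column names (`scientists`, `se_cm`, and so on) and the raw vector `x` are made up for illustration.

```r
# Sample size, mean and SD for each scientist, taken from the table
scientists <- data.frame(
  scientist = c("Scientist 1", "Scientist 2", "Scientist 3"),
  n         = c(5, 15, 50),
  mean_cm   = c(3.83, 3.98, 3.70),
  sd_cm     = c(0.866, 0.766, 0.622)
)

# Standard error of the mean: sample SD divided by the square root of n
scientists$se_cm <- scientists$sd_cm / sqrt(scientists$n)

print(scientists, digits = 3)

# From raw data the same quantity is sd(x) / sqrt(length(x));
# x is a made-up vector of measurements, not the lecture's data
x <- c(3.1, 4.6, 2.9, 4.4, 4.15)
sd(x) / sqrt(length(x))
```

The standard error shrinks as the sample size grows, which is why the third scientist's estimate is the most precise.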
PLOTTING STANDARD ERROR
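• In ggplot2, error bars spanning one standard error either side of each mean can be
added to a plot with geom_errorbar(), where means and SEs hold the sample means and
standard errors:
+ geom_errorbar(aes(ymin = means - SEs, ymax = means + SEs),
                position = position_dodge(.9))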
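For context, here is one way the error-bar layer might sit inside a complete plot. It is a sketch only: the data frame `plot_data`, the bar layer (`geom_col`) and the axis label are assumptions, since the slide shows just the `geom_errorbar()` line. The plot is drawn only if ggplot2 is installed.

```r
# Sample means and standard errors for the three scientists' samples
plot_data <- data.frame(
  scientist = c("Scientist 1", "Scientist 2", "Scientist 3"),
  means     = c(3.83, 3.98, 3.70),
  SEs       = c(0.866, 0.766, 0.622) / sqrt(c(5, 15, 50))
)

# Draw the plot only if ggplot2 is available
if (requireNamespace("ggplot2", quietly = TRUE)) {
  library(ggplot2)
  p <- ggplot(plot_data, aes(x = scientist, y = means)) +
    geom_col(position = position_dodge(.9)) +          # one bar per sample mean
    geom_errorbar(aes(ymin = means - SEs, ymax = means + SEs),
                  position = position_dodge(.9), width = 0.2) +
    labs(x = NULL, y = "Sample mean (cm)")
  print(p)
}
```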
INTERPRETING STANDARD ERROR
• We can be 68% confident that the population mean lies within one standard
error of the sample mean.
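• This is because, when the sampling distribution is approximately normal, about 68% of
sample means lie within one standard error (the standard deviation of the sampling
distribution) of the population mean.
• Note that this is not the same as saying there is a 68% probability that the
population mean lies within one standard error of a particular sample mean: the
population mean is a fixed value, and it is the sample mean that varies from sample
to sample.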
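The 68% figure can be checked by simulation. The sketch below assumes a hypothetical normal population (mean 3.8 cm, SD 0.8 cm) and samples of 50; these values and the helper `within_one_se()` are illustrative and not taken from the lecture. It counts how often the population mean falls within one estimated standard error of the sample mean.

```r
set.seed(6)  # arbitrary seed so the sketch is reproducible

pop_mean <- 3.8  # hypothetical population mean (cm)
pop_sd   <- 0.8  # hypothetical population SD (cm)
n        <- 50   # size of each sample

# For one random sample: is the population mean within 1 SE of the sample mean?
within_one_se <- function() {
  x  <- rnorm(n, mean = pop_mean, sd = pop_sd)
  se <- sd(x) / sqrt(n)
  abs(mean(x) - pop_mean) <= se
}

# Repeat for 10,000 samples and take the proportion of hits
coverage <- mean(replicate(10000, within_one_se()))
coverage  # roughly 0.68
```

The proportion describes the procedure across many samples. For any single sample, the population mean either is or is not inside the interval.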
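95% CONFIDENCE INTERVALS
• The 95% confidence interval (CI95) spans 1.96 standard errors either side of the
sample mean:
$$\text{CI}_{95} = \bar{x} \pm 1.96 \times \text{standard error}$$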
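As a worked example, a 95% confidence interval can be computed as the sample mean plus or minus 1.96 standard errors. This sketch applies that to the three scientists' samples (n = 5, 15, 50); the variable names are made up for illustration.

```r
# Sample size, mean and SD for the three scientists' samples
n       <- c(5, 15, 50)
mean_cm <- c(3.83, 3.98, 3.70)
sd_cm   <- c(0.866, 0.766, 0.622)

# Standard error of each sample mean
se_cm <- sd_cm / sqrt(n)

# 95% confidence interval: sample mean plus or minus 1.96 standard errors
ci95 <- data.frame(
  scientist = paste("Scientist", 1:3),
  lower_cm  = mean_cm - 1.96 * se_cm,
  upper_cm  = mean_cm + 1.96 * se_cm
)
print(ci95, digits = 4)

# 1.96 is the standard normal quantile that leaves 2.5% in each tail
qnorm(0.975)
```

The interval narrows as the sample size increases, because the standard error gets smaller.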
SUMMARY
• The choice of sampling method is an important consideration in study design and can
affect the validity of conclusions.
• Sample statistics will rarely match exactly the true population parameters we are
interested in, so it is important to present measures of confidence.
• Larger sample sizes increase the chance that our sample statistics will be an accurate
estimate of the true population parameters.
• Standard error is the standard deviation of the sampling distribution of the mean. We
can be 68% confident that the population mean lies within one standard error of the
sample mean.
• The 95% confidence interval contains the values that lie within 1.96 standard errors of
the sample mean. We can be 95% confident that the population mean lies within this
range.
READING
• Chapter 7: Sampling
• Statistics 101: Standard error of the mean
https://www.youtube.com/watch?v=uIHFbMn8SBc