Professional Documents
Culture Documents
Sampling Part 1 Notes
Sampling Part 1 Notes
1.Introduction
How data is generated
-Quantitative reasoning involved the use of data for decision making
-Very important to know the relevancy and reliability of the data
-Critical area to pay attention to: The way data are generated
Learning outcomes:
1.Generate our own data set as primary source of information?
-Takes longer time and costs more to generate our own set of data
-Can tailor it to suit our needs
2.Identify good existing data set as secondary source of information for our own use?
-Cheaper to use data set that is already available in the market
-The data set might be collected for other purposes
-Might not meet exactly what we are looking for
*To decide:
-Cost
-Time
Examples:
1.Quarterly unemployment rate in Singapore:
-Unit: Every adult, aged 15 and over who resides in Singapore
-Population: The collection of all the units defined above
-Sample: A portion of adults would be selected to provide their employment status
Census vs Sample:
Census:
-Measurement would be taken from every unit in the population
-This is the preferred method, but the task is too huge
-Most visible census: population census of a country
Sample:
-Measurements would be taken from some selected units in the population
-Used in situation when a census is not possible
-Example: Blood testing for a certain disease, determine the effectiveness of a drug
-As measurements are taken from only a portion of the population Cost will be smaller,
faster to obtain the results (Speed), obtain more accurate outcomes as we can devote more
resources to train a better task force to take measurements
Systematic sampling:
Method of selecting units from a list through the application of a selection interval, K, so
that every Kth unit on the list, following a random start, is included in the sample
*Value of K: chose based on the exact size/estimated size of the population + size of the
sample
-Produces the same result as simple random sampling, if it meets certain conditions
-Also a process that can apply to the situation where the exact size of the population is not
known, at the planning stage
-Need to know: rough estimate of the population size
Difficulties in Sampling:
1.Using imperfect sampling frame
To find a sampling frame that covers exactly the target population is not easy, especially
when the target population keeps changing over time
-A perfect sampling frame is the one that consists of exactly all units of the target
population
-In a lot of situations, a sampling frame might either include unwanted units or exclude
desired units
-A frame with too many unwanted units would increase the cost of field work
-For a frame that exclude desired units: Need to redefine the target population + Assess the
impact of excluding these units in our study
-Then, we can roughly determine the impact on the outcomes of the study when these
desired units are being excluded
2.Non-response
-Not all selected units are contactable (Be unavailable)
-Not all selected units are willing to take part in the study (Refuse to provide measurements)
-Non-response distorts the results of studies (If they come from some particular types on
the variable of interest, then we have a biased sample)
-Usually non-respondents differ from respondents Need to study the extent of the effect,
in order to reduce the bias of the collected information
*Extra effort need to made by field workers, to help convince selected units to take part in
the study + retain them for the whole duration of the study
Disasters in sampling:
Come from the use of non-probability sampling
-Worst disasters that can occur
-Quite common in practice, as the plan can be easily executed + cheaper
-Expect to obtain biased sample, that makes the extension of the results to the population
impossible
1.”Getting a volunteer or Self-selected sample”
-Media like to conduct opinion polls on their websites
-Asks readers/viewers who are interested to respond
-Mostly attract those who have strong view on the issues
-Results would reflect only the opinion of those who volunteer to respond, biased
2.”Using a convenience or haphazard sample”
-Samples are made up of individuals casually met or conveniently available, such as students
enrolled in a class or people passing by on a street corner
-Used when it is difficult to construct or find a proper sampling frame
-Easier to use the most convenient group available
-A convenient sample is often biased
-Information we can gather from respondents who are easily available is normally different
from the hard ones to get
3.”Taking a judgement sample”
-Sample units are chosen from the population by interviewers using their own discretion
about which informants are typical/representative
-Sample is biased, as selection is based on the opinion of experts
-Experience and knowledge of these experts would have a lot of influences on how sample
is to be constructed
-No one can verify whether the opinions given are of typical as well
4.”Selecting a quota sample”
-Process of selection in which the elements are chosen in the field by interviewers using
prearranged categories of sample elements to obtain a predetermined number of cases in
each category
-Each interviewer is assigned a fixed quota of units to interview
-Numbers falling into certain categories of variables like sex, age, race, housing type are
fixed
-Plan is bias in the way interviewers select their respondents
-Variable of interest might not be strongly associated with these variables
-Having the proportions of these categories in the sample similar to those in the population
does not make the extension of the results derived from the sample to the population
better
-This sampling plan Very popular in the market research