Professional Documents
Culture Documents
002 Population & Sample
002 Population & Sample
POPULATION
Introduction: Population is generally a large collection of individuals or objects that is
the main focus of a scientific query. Collection of data is an essential part of the
research process. In order to collect data for any research problem the researcher has
to select a representative sample from the population so as to enable him to draw
conclusions from the sample, which will hold good for the population.
A sample is selected since it is neither possible nor feasible to gather data for the whole
population. By correctly drawing a sample from a specific population, a researcher can
analyze the sample and make inferences about population characteristics. However,
due to the large sizes of populations, researchers often cannot test every individual in
the population because it is too expensive and time-consuming. If the sample is
representative of the population from which it is drawn, the researcher can have
confidence in concluding that the results are generalizable to the entire population.
Population is the entire group whom you want to generalize the research finding.
For example if you want to study the morbidity and mortality pattern of
children with Leukemia in Tamil Nadu, the population might be defined as all children
with Leukemia in Tamil Nadu.
The population may be finite or infinite. If you are looking for a job in the
Automobile Sector, so you search online for Automobile jobs. Here the universe is jobs
all around the World, but if you want to work in India, then your Population is jobs in
India or especially Automobile jobs in India. The sample is top 30 companies or top 30
positions in Automobile Sector.
The number of units constituting the population is called the size of population.
For example, in the study of socioeconomic conditions of a particular village, all
families or houses in the village will be a population. A population comprises all
possible cases (persons, objects, events) that constitute a known whole. The population
or universe can be finite or infinite. The population is said to be finite if it consists of a
fixed number of elements so that it is possible to enumerate it in its totality. For
instance, the population of a city, the number of nurses working in Government sectors
are examples of finite populations. The symbol 'N is generally used to indicate how
many elements (or items) are there in case of a finite elements.
Types of Population
Finite vs Infinite Population
Finite: A population containing a finite number
of units, individuals, or member is called a ‘finite
population’ (e.g. the students studying in a
particular Institution)
⮨
Infinite: A
⮩
Real (or) Existent vs Hypothetical Population
Real population:
In real population, all potential observations are
possible and accessible at the time of sampling.
This is otherwise known as existent population
which is the population of concrete individuals.
Example, Weight of all students in a School.
Page 2
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
Conceptual/hypothetical population:
It is a population which does not physically
exist or potential observations are not
accessible at the time of sampling is
referred to as the hypothetical population.
For example, all possible values of
tomorrow's highest temperature or rainfall
or all possible new cases of diabetes
mellitus in the next year.
⮩
Target Population vs Accessible Population:
Target population refers to the entire
group of individuals or objects to
which researchers are interested in
generalizing the conclusions. The
target population usually has varying
characteristics and it is also known as
the "theoretical population.
For example, we may want to study
the problems of adolescents in
Chennai. In this case, the target
population may be all those
adolescents residing at Chennai is our
population.
Page 3
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
SAMPLING
Although there are over 10000000000 TV
viewers in India and somewhat over half
that many TV sets, only about 1,000 of
those sets are sampled to determine what
programs Indians watch. Why select only
about 1,000 sets out of 100 crores?
Because time and the average cost of an
interview prohibit the rating companies
from trying to reach crores of people. And
since polls are reasonably accurate,
interviewing everybody is unnecessary.
Introduction to Sampling
Shoppers often sample a small piece of cheese before purchasing
any. They decide from one piece what the larger chunk will taste
like. A chemist does the same thing when he takes a sample of
alcohol from a still, determines that it is 90 proof, and infers that
all the alcohol in the still is 90 proof, If the chemist tests all the
alcohol or the shoppers taste all the cheese, there will be none to
sell. Testing all of the product often destroys it and is
unnecessary. To determine the characteristics of the whole, we
have to sample only a portion.
Suppose that, as the personnel director of a large bank, you need to write a report
describing all the employees who have voluntarily left the company in the last 10 years.
You would have a difficult task locating all these thousands of people. They are not
easily accessible as a group many have died, moved from the community, left the
country, or acquired a new name by marriage. How do you write the report? The best
idea is to locate a representative sample and interview them in order to generalize
about the entire group.
Page 4
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
Time is also a factor when managers need information quickly in order to adjust an
operation or change a policy. Consider an automatic machine that sorts thousands of
pieces of mail daily. Why wait for an entire day's output to check whether the machine
is working accurately (whether the population characteristics are those required by the
postal service)? Instead, samples can be taken at specific intervals, and if necessary,
the machine can be adjusted right away.
Statisticians use the word population to refer not only to people but to all items that
have been chosen for study. In the cases we have just mentioned, the populations are
all the cheese in the chunk, all the whiskey in the vat, all the employees of the large
bank who voluntarily left in the last 10 years, and all mail sorted by the automatic
machine since the previous sample check. Statisticians use the word sample to
describe a portion chosen from the population.
The sample refers to those individuals or units or elements who are selected for
a study. A sample is simply a subset of the population. The concept of sample arises
from the inability of the researchers to test all the individuals in a given population.
The sample must be representative of the population from which it was drawn and it
must have good size to warrant statistical analysis.
According to Goode and Hatt, "A sample as the name applies, is a smaller
representative of a large whole, According to Pauline V Young, "A statistical sample is a
miniature of cross selection of the entire group or aggregate from which the sample is
taken'. A sample is a part of the population from which it was drawn.
Sampling
Sampling is the process of selection of samples from a population. According to
Bogardus, "Sampling is the selection of certain percentage of a group of items according
to a context of funds available for the research study. It should be such that the results
of sample study can be applied, in general, for the universe with a reasonable level of
confidence. It should result in small sampling error. Sampling is the process of
selecting a group of subjects for a study in such a way that the individuals represent
the larger group from which they were selected. This representative portion of a
population is called as sample. Sampling may involve the selection of people, events,
organizations, and periods of time, record and many other phenomena of concern to
the field of nursing.
Page 5
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
Sampling Frame
Sampling frame is a list of all elements of the population and it is the list from
which we draw a sample. Sampling frame can be prepared for the finite population
whereas it is impossible in infinite population. The elementary units or the group or
cluster of such units may form the basis of sampling process in which case they are
called sampling units is known as sampling frame. Thus sampling frame consists of a
list of items from which the sample is to be drawn. If the population is finite and the
time frame is in the present or past, then it is possible for the frame to be identical with
the population. In most Cases they are not identical because it is often impassible to
draw a sample directly from population. As such this frame is either constructed by a
researcher for the purpose of his study or may consist of some existing list of
population. For instance, one can use telephone directory as a frame for conducting
opinion survey in a city. Whatever the frame may be, it should good representative of
the population.
Ideally, the sampling frame and the population are identical. But practically, it is not
possible to include all the units of the population. We may omit few elements or it may
be an incomplete list. The goal is to get accurate sampling frame list as much as
possible.
Page 6
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
Budgetary constraints: The size as well as the type of sample depends upon
the cost consideration.
Sampling procedure: Finally the researcher must decide about the techniques
to be used in selecting the items for the sample, i.e, he must decide the type of
sample. This technique stands for the sample design itself. He should select the
sample design in such a way that for a given sample size and for a given cost,
the sample design has a smaller sampling error.
Characteristics of Sampling
Sample design should be such, so that systematic bias can be controlled in a
better way
It should result in a truly representative sample
It should be viable in the context of funds available for the research study
It should be such that the results of sample study can be applied, in general,
for the universe with a reasonable level of confidence
It should result in small sampling error.
Steps in Sampling
Identify the target population
Identity the accessible population
Specify sampling frame
Determine the size of the sample
Decide sampling plan
Select the sample.
Page 7
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
SAMPLING TECHNIQUES
There are various types of sampling techniques. Basically, they are classified as
probability and non-probability sampling techniques.
Sampling techniques
Page 8
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
Disadvantages
The selector has no control over the selection of units. The researcher cannot
contact the far situated units
He cannot prepare the whole field when the universe is vast
If units have no homogeneity, the method is not appropriate
There is no question of alternatives. The selected units cannot be replaced or
changed.
Systematic Sampling
In some instances the most practical way of sampling is to select every 1st, say
example, 10th name on a list, every 10th house on one side of a street and So on.
Sampling of this type is known as systematic sampling. An element of randomness is
usually introduced into this kind of sampling by using random numbers to pick up the
unit with which to start. 2his procedure is useful when sampling frame is available in
the form of a list. In such a design the selection process starts by picking some random
point in the list and then every nth element is selected until the desired number is
secured.
Advantages
It can be taken as an improvement over a simple random sampling
It is an easier and less costly method of sampling and can be conveniently used
even in case of large populations.
Disadvantages
If there is a hidden periodicity in the population, systematic sampling will prove to
be an inefficient method. For example, if we take every 4th piece of thermometer in
a production unit which is detective, we will get all defective thermometer as our
samples.
Stratified Sampling
If the population from which a sample is to be drawn does not constitute a
homogeneous group, then stratified sampling technique is applied so as to obtain a
representative sample, In this technique, the population is stratified into a number of
non-overlapping subpopulations or strata and sample items are selected from each
stratum. If the items selected from each stratum is based on simple random sampling,
the entire procedure, must stratification and then simple random sampling, is known
as stratified random sampling Process of stratifying: The stratification of domain or
data should be with great care, because the success of the technique depends upon
successful stratification.
Following points should be kept in mind:
We should process extensive information of all items included in a domain and
should know which item make a coherent whole on the basis of similar traits and
which others are different from them and why?
The size of each stratum should be large to enable the use of random sampling
technique.
In stratifying, it must be kept in mind that various strata should have similar
relation to the domain and should be themselves homogeneous.
The various strata which differ from each other should be the same as the
proportion of stratum from the domain. Suppose a domain has four strata,
1
accordingly the proportion of each stratum of domain is . Now if the number of
4
total items of the sample is 64, we shall select 16 items from each stratum and
1
thus the proportion of selected items from each stratum will be .
4
Page 9
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
The strata may be formed on the basis of common characteristics of the items to be put
in each stratum. This means that various strata be formed in such a way as to ensure
elements being most homogeneous within each stratum and most heterogeneous
between the different strata. Thus, strata are purposively formed and are usually based
on past experience and personal judgment of the researcher. From the strata, samples
can be selected using simple random sampling technique or systematic sampling
technique. Regarding the number of samples to be selected from each strata, a
proportionate sampling can be done, i.e. the samples from different strata are kept
proportional to the sizes of the strata.
Example:
Suppose, if we want to select 50 samples from 6000 population which is divided into 3
strata of 1000, 2000 and 3000 then the proportionate sample can be selected from
each strata as follows:
1000
The number of sample to be selected from the strata one 50 8.3 8
6000
2000
The number of sample to be selected from the strata two 50 16.66 17
6000
3000
The number of sample to be selected from the strata three 50 25
6000
The total sample size 8 17 25 50
Advantages
Neither group nor class of importance is totally neglected as units of each are
represented in the sample
If different classes are divided properly, selection of few units represents the
whole group
On the classification of regional basis, units are not in contact easily. This leads
to economy of time and money
There is a facility in substitution of units. If someone is not contacted easily,
the other person of the same class can be substituted for him. Such inclusion
will not show any contradicting result.
Disadvantages
The sample does not become representative if selected sample has more or less
units of a class
If the sizes of different group are different, no equal proportional quality can be
viewed
Non-proportional selection leads to more emphasis in the end. During such
time researcher can be biased, hence samples will not be a representative
If group is not expressed properly, the difficulty is seen about the unit to be
kept under which group or class.
Cluster Sampling
Cluster sampling involves grouping the population and then selecting the groups or the
clusters rather than individual elements for inclusion in the sample. Suppose a college
wishes to sample its alumni members for some purpose and if it has 20,000 alumni
members, and the sample size is to be kept say 200. For cluster sampling this list of
Page 10
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
20,000 students could be formed into 10 clusters of 100 students each. Two clusters
might then be selected for the sample randomly. The clustering approach can, however,
make the sampling procedure relatively easier and increase the efficiency of field work,
especially in the case of personal interviews.
Advantages
It is no doubt, reduces cost by concentrating surveys in selected clusters.
Cluster sampling is used only because of the economic advantage it possesses;
estimates based on cluster samples are usually more reliable per unit cost
The disadvantage is that it is less precise than random sampling.
Area Sampling
It is quite close to cluster sampling and is often talked about when the total &
geographical area of interest happens to be big one. Under area sampling we must
divide the total area into a number of smaller non-overlapping areas, generally called
geographical clusters, then a number of these smaller areas are randomly selected, and
all units in these small areas are included in the Sample. Area sampling is especially
helpful where we do not have the list of the population concerned. It also makes the
field interviewin8 more efficient since interviewer can do many interviews at each
location.
Multi-stage Sampling
This is a further development of the idea of cluster sampling. This technique is meant
for big inquiries extending to a considerably large geographical area like an entire
country. Under multi-stage sampling the first stage may be to select large primary
sampling units such as states, then districts, then towns and finally certain families
within towns. If the technique of random sampling is applied at all stages, the sampling
procedure is described as multi-stage random sampling.
Purposive Sampling
Purposive sampling, also known as Judgmental,
Selective or Subjective sampling, reflects a group
of sampling techniques that rely on the
judgment of the researcher; when it comes to
selecting the units that are to be studied.
For example, Specific People, specific
cases/Organizations, specific pieces of data.
Page 11
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
item for the purpose of study depends entirely upon the wishes of investigator and he
will chose items or units which in his judgment are representative of the whole data.
This is a very simple technique of choosing the samples and is useful in cases where
the whole data is homogeneous and the investigator has full knowledge of the various
aspects of the problem.
Judgment Sampling
Expert Sampling
Expert sampling is a type of purposive sampling used in Delphi technique.
Sample of experts on a particular phenomenon is selected based on the belief that they
are the expertise in those field.
Convenience Sampling
Convenience samples involve selection on the basis of
ease or convenience. When population elements are
selected for inclusion in the sample based on the ease of
access, it can be called 'convenience sampling. At times
such a procedure may give very biased results
particularly when the population is not homogeneous.
Advantages
More representation is possible in this method
As sample is small in size, the method is less expensive and less time consuming
The utility of this method increases when few units of universe have special
importance
When units are less in number, sample is profitable.
Disadvantages
Units are selected by researcher at his will. Hence sample is biased
The error of the sample cannot be detected
Researcher is unable to understand the whole group
Those hypotheses on which inference of error of sample is attributed, are less used.
Page 12
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
Quota Sampling
In stratified sampling the cost
of taking random samples
from individual strata is often
so expensive that interviewers
are simply given quota to be
filled from different strata, the
actual selection of items for
sample being left to the
interviewer's judgment. This is
called quota sampling. The
size of the quota for each
stratum is generally proportionate to the size of that stratum in the population. Quota
sampling is thus an important form of non-probability sampling. Quota samples
generally happen to be judgment samples rather than random samples.
In practice, several of the methods of sampling described above may well be used in
the same study in which case it can be called mixed sampling! I maybe pointed out
here that normally one should resort to random sampling so that bias can be
eliminated and sampling error can be estimated. But purposive sampling is considered
desirable when the universe happens to be small and a known characteristic of it is to
be studied intensively. Also, there are conditions under which sample designs other
than random sampling may be considered better for reasons like convenience and low
costs. The sample design to be used must be decided by the researcher taking into
consideration the objectives, availability and feasibility, time and the cooperation of the
others.
Page 13
SAAM SAMPLING DISTRIBUTION, ESTIMATION AND HYPOTHESIS TEST
Page 14