METHODS OF
COLLECTING DATA
povecaimafion of cat 7
Statistics, as defined scientifically, entails systematic collection of data to ensure the validity and
reliability of the result, By definition, data (singular datum) refer to a set of observation, values, or
element taken into consideration. Data, when fully interpreted, can be converted into information which
| will be useful in testing hypothesis, or making conclusions, inferences, and recommendations. Some of
| the methods in collecting data are listed below:
1. The Direct or Interview Method. This is a method wherein a person can directly obtain data from
another person or a group of persons. _
This can be categorized into two:
Structured interview - questions are already made and only those listed can be asked.
Unstructured interview - questions relevant to the data to be collected can be freely asked,
2. The Indirect or Questionnaire Method. This is the opposite of the person-to-person method. In this
process, the researcher makes a questionnaire to be answered in written form by the intended
sample. Also, some essay-type questions may not be answered by the respondents due to some
other factors or may be answered by other persons.
' 3. The Registration Method. This, by far, is one of the easiest methods of collecting data.
However, this method of collecting data is governed by certain laws. Some data, especially those
that are "classified", cannot be given by the registrar even if a request letter is made.
4. The Observation Method. In this method, the researcher observes what he wants to observe about
Persons, groups, or organizations.
‘There are two kinds of observation method:
Natural or passive method - the investigator does not interfere with the situation; he only
observes.
Participatory method - the investigator participates in the making of the event.
5. The Experiment Method. This method is used for sophisticated scientific research studies. It tries to
identify the cause-effect relationship between and among variables in a carefully made laboratory
tosh tt Uber on Soe ceeuinions group and the experiment group. The
must establist validity and reliability of the data collected in
other internal and extemal factors. " eset inertorence of
| Thus, there are three categories of collecting data: objective method, subjective method, and use of
existing records.
Two types of data:
|. Primary data are those collected directly from the source and is jh objective or
Se rene toes collecten les is obtained through obj
12 Secondary data, on the other hand, are those which have been acquired through the use of existing
| records.
' a
Pa pe eeDETERMINING THE SAMPLE SIZE
In research, we seldom use the entire population because of the cost and time Involved, In fact,
‘most researchers do not use the population in their study. Instead, the sample which Is a small
Tepresentative of a population is used. The characteristics of the whole or entire population is described
using the characteristics observed from the sample.
To, determine the ‘sample size from a given population size, the Slovin's formula is used:
— |n=
Where: n= sample size
/ 1+ Ne*
| aie N= population size
| Someliney 7 weil vegrenentred e@= margin of error - pz ctcbowes!
| emettion gene voliza\l9 Cr (i Akuccbional re Searchers, sine
te om "% 7s medical acventeg
1 samging gure
= Yow wrany wgiry oi «wo Bp
Lctoving, Fesrrnl «
| Adventures:
Find the sample size of the following: -
A group of researchers will conduct a survey to find out the opinion
community regarding the oil price hike. If there are 14000 residents in th
researchers plan to use a sample using a 15% margin of error, what sh
Of residents of a particular
je community and the
lould be the sample size
be?10%75%? 5
Solution: 162 ,, Oh” one £%",
4 Maen (Diy ny> NIE
CHD Y CORT” ire 10D 4 7
44.3 Ny» 36
4S , ,
[Rend 09] Aare Pie Os
Conduin Ye Ay
Cena) Limit TWeover”
WF wormeal, (Me mare ve\alde Ke daly
SAMPLING TECHNIQUE
Itis the Process of selecting a number of individuals (a sample) from a population, preferably in
Such a way that individual selected represent the larger group from which they were selected,
itis the procedure used to determine the individuals,
Biased vs. Unbiased Sample
1. Biased if it is calculated in ‘such a way that is ‘systematically different from the population
Parameter of interest.
2. Unbiased when the probability of rejecting the null hypothesis is less than or equal to the
Sianificance level when the null hypothesis is true, and the probability of rejecting the null
hypothesis is greater than or equal to the significance level when the alternative hypothesis is
true.
“
—‘Sampling w/ Replacement (SWR) vs. wio Replacement (SWOR)
1. Sampling with replacement the two sample values are independent. Practically, this means
that what we get on the first one does not affect what we get on the second. Mathematically, this
means that the covariance between the two is zero.
Sampling without replacement the two sample values are not independent. Practically, this
means that what we got on the first one affects what we can get for the second one.
Mathematically, ee means that the covariance between the two is not zero.
“Ys
| i Gece
A. Probability Or Random Sampling
Itis the most commonly used sampling technique in which each member in the population is
i it i iproloaloility ~ eave Chance 7 creyoctersies
given an equal chance of being selected in the sample. ¢ °a\\-" ae ‘ees I, erocterst
Types of probability or Random Sampling
1. Simple Random Sampling
This is the simplest form of random sampling. It consist of choosing a sample from a set of all
possible samples of pre-chosen size, giving each sample an equally likely chance of being the
selected one.
a. Lottery or Fishbowl sampling
It is done by simply writing the names or numbers of all individual members of the population in
a small-rolied piece of papers, this is then placed in a container. Shake the container thoroughly — then
draw as many pieces of paper as there were desired in the sample.
o
Ke.
Adventures
Suppose Mrs. Tababa wanted to send five students to attend a 3-day seminar in basic computer
programming. To avoid bias in selecting these five students from her 40 students, she used the lottery
technique. She did this by assigning a number to each student and then writing these numbers on
pieces of papers. Then, these pieces of paper were folded and placed in a box. She shakes it
thoroughly and then picked five pieces of papers. Who will be chosen?
b. Sampling with the used of Table of Random Numbers
This refers to the used of Table of Random Numbers which contain of,rows and columns of
digits randomly ordered by a computer. A sample sized n can be penerated beginning at en art
point in Table of Random Numbers, closing your eyes and haphazardly pointing at an entry in the
Table. - Calowotorsnumbers? If there are 50 res
numbers?
Advantage
-Itis simple and is more easily
understood than other sampling
design.
2. Systematic Sampling
This technique of sampling is done
first unit being selected at random and the Test will be sel
| Population sized is known, k value/sampling interval can
eat
Using the random table, what are ti
on a
Advantures
31871 | 60770 | 59235-[ 41702
871134 | 32839 | 17850 | 37059
06728 | 16614-| 811 42072
95646 | 67486-| 05{67-| 07819
44085
87248-| 47678] 98638
The above figure is an example of a Table of Random Numbers
where: N = population size
If there is a population of 1
Sample if the interval is every 12"
}000 and a researcher needs 2:
number? Who is the 200"
n= the sample size
= sampling interval
#e
&
Adventures
the 10 sample numbers if we are the 3 and 4” digit of the
pondents, how many are only included using the table of random
Disadvantages
-It is a list of all members in the population is needed.
-lt may be difficult to collect the sample data with SRS if the
‘Samples are spread inconveniently throughout the population.
by taking every k”unit from the ordered Population. Only the
lected in a systematic manner. If the
be calculated as
'86 respondents who will his 94"
sample? 286" sample?
Advantages
Disadvantages
tis easier to apply and less Iikely
tohave mistakes
altis possible to select a sample in
the field without a sampling frame.
-It could give a more precise
estimate than SRS when there is.
order in the ‘sample
-If periodic regularities are found in
the list, a systematic sample may
consist only of similar types
-if the population is not in random
order, one cannot validly estimate
the variance of the mean from a
Single systematic sample.
It could be less precise than SRS
When to use
-If periodic regularities are found in
the list, a systematic sample may
consist only of similar types
-If the population is not in random
order, one cannot validly estimate
the variance of the mean froma
Sit matic sample.
ia en ns
o3. Stratified Random Sampling
This is a technique in which the Population of size N is divided into a number, L, of non-
the stratum are more or less homogenous and samples
8 that we will be able to represent not only the overall Population,
but also key subgroups of the population It is more statistically-precise than simple random sampling. It
‘can be simple or proportional,
'. Simple Stratified Random Sampling
This is done by taking equal number of individual from each stratum.
>. Stratified Proportional Random Sampling
[This is done by division of population into smaller groups known as strata,
“
Suppose a community consist of 5000 families belo
draw 200 families
nging to different income bracket. We will
a8 Our sample using stra
tified random sampling,
Using Simple Stratified Random Sampling
Number of families in the ‘sample in each stratum m0 = 66.66666667
Strata Number of families | Number of Families in the ‘Sample
High Income-Families 1000 66
Average Income-Families 2500 o7
Low income-Families 1500 67
N= 5000 n= 200
Using Stratiied Proportional Random Samplin
Number of
Strata pall Percentage
1000 _ 5
High income-Families 1000 5000 020°
20%
2500
Average Income- 2500 «os
Families 2800 5000 = 5 or
50%
1500
Low Income-Families 1500 5000 “2-3 or 0.3: 200 = 60
30%
N= 6000 paste,‘Advantages
Disadvantages
When to use
-ttis gain in the precision of the
estimates of characteristic of the
population.
tallows a more comprehensive
data analysis since information is
provided for each stratum,
-Itaccommodates administrative
convenience; fieldwork is organized
by strata, which usually result in
saving in cost.
sttaccommodates different
‘sampling plans in different strata.
-Sampling frame is necessary for
‘every stratum.
-Prior information about the
Population and its subpopulation is
necessary for stratification
Purposes.
-Population is known to be
heterogeneous or when the
Population can be subdivided into
‘mutually exclusive and exhaustive
groups
bbStudy and solve the following problems.
1. A biologist plants 100 seeds in pots numbered 1 to 100. Describe how to create a systematic
‘sample of size. '
] a. 20
| 2. There are 45 incoming sophomores seeking transfer to a certain high school. They come from
} different types of school. The guidance counsellor wants to get a profile of the transferees. With
the sample size of 15, find the average using the systematic sampling,
) hee
| 3. Aresearcher would like to investigate the perception of students of mathematics. He divided
{he Population into sub-populations, as shown below. Use stratified random sampling if the
sample to be drawn consists of 500 students. e = 5%, wv a
Strata | Number of students
First year 1500 '
Second year 1000
Third year 1500 \
Fourth year 2000
4. ATV journalist would like to know the most favourite noontime show for this month, He decided
to conduct a survey on 5 barangays. The table below shows the list of barangays and the
umber of residents in each barangay. Use proportional stratified random sampling to draw
1000 residents who will be ingluded in the survey. e = 1%
Barangay | Number of Residents
Mapayapa 2000
Maganda 1000
Makisig 1500
Malinis 2500
Mahangin 1800
x
[ S4. Cluster Sampling
Example 7:
are 250 Barangay in Manila, We
sampling, and then the numbers.
This is a sampling wherein
Suppose that we want to determine the aver
can draw
of families from the 20
groups or clusters instead of individuals are randomly chosen This is
‘sometimes called area sampling because this is usually applied when the Population is large.
rage income of the families in Manila. Let us assume there
a random sample of 20 Barangay using a simple random
Barangay will be the samples.
treduces listing and transportation
cost.
|| -The procedure saves time, effort
|| and money.
especially when the clusters are of
unequal size.
‘Advantages Disadvantages ‘When to use
; ~Sampling frame is not available
Piette ella ayy and the cost of constructing such a
of clu: eva list of ~ It entails more statistical analysis. | frame is very high.
ou in the . . - i -Estimation procedures are difficult, | -It is used for economic
consideration, i.e., when the time,
effort and cost involved in obtaining
information on the population units
increase as the distances
Separating these units increase
5. Multi-Stage Or Multiple Sampling
This sampling was named because sampling is done in stages.
The number of stages depends
on the number of population and the sample size needed in the study.
vo
Advent
and the households, the five-stage sampling units.
Advantages: Disadvantages When to use
.