Professional Documents
Culture Documents
Ch5: Data Collection
Ch5: Data Collection
1 Data Collection
Types of Data
What is data collection
2 Data Collection Techniques
3 Data Sampling
4 Data Sampling Methods
5 Data Analysis
A collective recording of
observations either numerical
or otherwise is called data.
Quantitative Data:
Qualitative Data:
Deals with numbers.
Deals with description.
Data can be measured.
Data can be observed
but not measured. length, height, width,
speed, time,
colors, textures, tests,
temperature, sound cost,
appearance,. . .
levels, ages, . . .
Tools
Techniques Questionnaire.
Administering written Interview guide,
questionnaires. checklist, tape recorder.
Interviewing. Eyes and other senses,
Observation. pen/paper, microscope,
Using available devices.
information. Check lists, data
compilation forms.
Basic Definitions:
Population: Collection of all the units that are of
interest to the investigator.
Sample: Representative part of population.
Sampling: Technique of selecting a representative
group from a population.
Basic Definitions:
Sampling error: The difference that occurs purely by
chance between the values of sample statistic and that
of the corresponding population parameter.
SamplingError = x̄ − µ,
Sampling Methods
Systematic Quota
sampling sampling
Probability Non-probability Judgment
Stratified
sampling sampling
Cluster Snowball
sampling sampling
Every individual
is chosen
entirely by
chance and each
member of the
population has
an equal chance
of being
selected.
(Jimma University Institute of Technology: Chapter
FECE) 5: Data Collection, Sampling Data Analysis December 16, 2022 16 / 31
Data Sampling Methods . . .
Systematic Sampling
In this type of
sampling, the
first individual is
selected
randomly and
others are
selected using a
fixed “sampling
interval”.
(Jimma University Institute of Technology: Chapter
FECE) 5: Data Collection, Sampling Data Analysis December 16, 2022 17 / 31
Data Sampling Methods . . .
Stratified Sampling
In this type of
sampling, we divide
the population into
subgroups (called
strata) based on
different traits like
gender, category,
etc. And then we
select the sample(s)
from these
subgroups.
Cluster Sampling
Use the subgroups of
the population as the
sampling unit rather
than individuals. Data
Sampling Methods
Cluster sampling The
population is divided
into subgroups, known
as clusters, and a
whole cluster is
selected to be included
in the study.
Convenience Sampling
Individuals are
selected based on
their availability
and willingness to
take part. Data
Sampling Methods
Convenience
sampling Let’s say
individuals
numbered 4, 7, 12,
15 and 20 want to
be part of our
sample.
(Jimma University Institute of Technology: Chapter
FECE) 5: Data Collection, Sampling Data Analysis December 16, 2022 20 / 31
Data Sampling Methods . . .
Quota Sampling
In this type of
sampling, we
choose items based
on predetermined
characteristics of
the population.
Consider that we
have to select
individuals having a
number in multiples
of four for our
sample.
(Jimma University Institute of Technology: Chapter
FECE) 5: Data Collection, Sampling Data Analysis December 16, 2022 21 / 31
Data Sampling Methods . . .
Judgment Sampling
It depends on the
judgment of the
experts when choosing
whom to ask to
participate. Suppose,
our experts believe that
people numbered 1, 7,
10, 15, and 19 should
be considered for our
sample as they may
help us to infer the
population in a better
way.
(Jimma University Institute of Technology: Chapter
FECE) 5: Data Collection, Sampling Data Analysis December 16, 2022 22 / 31
Data Sampling Methods . . .
Snowball Sampling
Existing people are asked
to nominate further people
known to them so that the
sample increases in size like
a rolling snowball. This
method of sampling is
effective when a sampling
frame is difficult to identify.
Here, we had randomly
chosen person 1 for our
sample, and then he/she
recommended person 6,
and person 6 recommended
person 11, and so on.
1. Descriptive Analysis
Refers to the description of the data from a particular
sample - hence the conclusion must refer only to the
sample;
Summarizes the data and describe sample
characteristics;
Descriptive statistics are numerical values obtained
from the sample that gives meaning to the data
collected;
1. Descriptive Analysis-Classification
A. Frequency distribution
A systematic arrangement of numeric values from the
lowest to the highest or highest to lowest
X
f = N, (5.1)
1. Descriptive Analysis-Classification
B. Measure of central Tendency
A statistical index that describes the average of the
set values.
(xi − x)2
P P
xi 2
x̄ = , σ =
N N
n
x[ ],
M od(x) =
2 if n is even
n−1 n+1
x[ 2 ]+x[ 2 ] , if n is odd
2
1. Descriptive Analysis-Classification
C. Measure of variability
Statistics that concern the degree to which the cores
in a distribution are different from or similar to each
other.
Range:
The distance between the highest score and the lowest
score in a distribution:
1. Descriptive Analysis-Classification
C. Measure of variability- Standard deviation:
The most commonly is used measure of variability
that indicates the average to which the scores deviate
from the mean.
rP
(x − µ)2
σ= , (5.3)
N
where µ is the mean.
1. Descriptive Analysis-Classification
D. Bivariate Descriptive Statistics
Correlation:
The correlation coefficient that indicates the strength
of the relationship between two variables can be found
using the following formula:
P
(xi − x̄)(yi − ȳ)
rxy = pP , (5.4)
(xi − x̄)2 (yi − ȳ)2
where rxy is the correlation coefficient that measures
the linear relationship between variables x, y.
(Jimma University Institute of Technology: Chapter
FECE) 5: Data Collection, Sampling Data Analysis December 16, 2022 31 / 31
The End !