Professional Documents
Culture Documents
Q3 - Module 2 (DATA COLLECTION AND PRESENTATION)
Q3 - Module 2 (DATA COLLECTION AND PRESENTATION)
STATISTICS AND
PROBABILITY
QUARTER 3: MODULE 2&3
DATA COLLECTION AND PRESENTATION
PREPARED BY:
MS. JOAN L. MONCAWE, LPT
TABLE OF CONTENTS
Introduction ……………………………………………………………
Module 1……………………………………………………………….
Warm up Discussion……………………………………………
Lecture………………………………………………………….
Activity ………………………………………………………...
Module 2.
Warm up Discussion……………………………………………
Lecture………………………………………………………….
Activity ………………………………………………………...
INTRODUCTION
2
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
MODULE NO.2: DATA COLLECTION AND PRESENTATION
Question must not be worded in manner that will influence the respondent to answer in
a certain way, that is to favor certain response or to be against it.
An unbiased question is stated in neutral language and no element of pressure.
3
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
Type of Data
1. Primary Data are information collected from an original source of data, which is first-hand in
nature. Examples are data collection from interview, and survey.
2. Secondary Data are information collected from published or unpublished sources like books,
newspaper and thesis.
Exercise 1:
A. Change the following question to make it simpler and clearer.
1. What is your opinion about beauty contest being demeaning to womanhood and against
the vaunted equality of the sexes?
2. What is your mass measure in metric units?
B. Determine which of the following question is best to ask about someone’s health _________.
a. Are you a weakling?
b. Are you bursting over with health?
c. Can you be an advertisement for health club?
d. Are you in good health?
C. To find how much someone likes music, which question is the best to ask?
a. Do you go for the present trend- all noise and sound?
b. How much time do you spend listening to the radio?
c. Do you prefer popular music, jazz, classics, or rock?
d. Do you listen to boring classical tunes’?
4
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
ACTIVITY NO. ____
NAME: ADVISER:
GRADE AND SECTION: CONTACT NUMBER
1. Get a partner. Work together to write a plan for a survey using questionnaire method.
4. Submit the plan and the questionnaire to you teacher for approval. Finalize the
questionnaire.
5
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
MODULE NO.3: PRESENTATION OF DATA
Concept of Sampling:
Sampling is the process of selecting units, like people, organizations, or objects from population
of interest in order to study and fairly generalize the results back to the population from which the
sample was chosen.
Types of Sampling:
Simple Random - Sampling every member of the population has an equal chance of being selected.
Your sampling frame should include the whole population.
Example: You want to select a simple random sample of 100 employees of Company X. You assign
a number to every employee in the company database from 1 to 1000, and use a random number
generator to select 100 numbers.
Systematic sampling is similar to simple random sampling, but it is usually slightly easier to
conduct. Every member of the population is listed with a number, but instead of randomly
generating numbers, individuals are chosen at regular intervals.
Example: All employees of the company are listed in alphabetical order. From the first 10
numbers, you randomly select a starting point: number 6. From number 6 onwards, every 10th
person on the list is selected (6, 16, 26, 36, and so on), and you end up with a sample of 100 people.
Stratified sampling involves dividing the population into subpopulations that may differ in
important ways. It allows you draw more precise conclusions by ensuring that every subgroup is
properly represented in the sample.
To use this sampling method, you divide the population into subgroups (called strata) based on the
relevant characteristic (e.g. gender, age range, income bracket, job role).
Example: The company has 800 female employees and 200 male employees. You want to ensure
that the sample reflects the gender balance of the company, so you sort the population into two
strata based on gender. Then you use random sampling on each group, selecting 80 women and 20
men, which gives you a representative sample of 100 people.
Cluster sampling also involves dividing the population into subgroups, but each subgroup should
have similar characteristics to the whole sample. Instead of sampling individuals from each
subgroup, you randomly select entire subgroups.
Example: The company has offices in 10 cities across the country (all with roughly the same
number of employees in similar roles). You don’t have the capacity to travel to every office to
collect your data, so you use random sampling to select 3 offices – these are your clusters.
6
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
Methods of Data Presentation
Types of FDT
1. Qualitative or Categorical FDT a frequency distribution table where the data grouped
according to some qualitative characteristics; data are grouped into no numerical categories.
2. Quantitative FDT – frequency distribution table where the data are grouped according to
some numerical or quantitative characteristics.
Example of a Quantitative FDT:
Table 3: Frequency Distribution for the Weights of 50 PIECES OF Luggage.
3. Determine the class size (c) by calculating first the preliminary class size c’/
Preliminary Class size c’
' R
c=
k
Conditions for the actual c:
a. It should have same number of decimal places as in the raw data; i.e. if the
observation in the data set are all whole number, then your c should also be a
whole number.
b. It should be odd in the last digit.
4. Enumerate the classes or categories.
5. Tally the observations (note: Sometimes the number of classes (k) is not followed. An extra
class will be added to accommodate then highest observed value in the data set and a class
will be deleted if it turns out to be empty)
6. Computed for values in others columns of the FDT as deemed necessary.
2. Class Mark (CM) – midpoint of the class interval where the observations tend to cluster
about.
CM = ½ (LL + UL) or CM = ½ (LTCB + UTCB)
3. Relative Frequency (RF) – proportion of observations falling in a class and is expressed
in percentage.
frequency frequency
RF= %RF= x 100 %
N N
4. Cumulative Frequency (CF) – accumulated of the classes.
a. Less than CF (<CF) – total number of observation whose values do not exceed the
upper limit of the class.
b. Greater than CF (>CF) – total number of observation whose values are not less than
the lower limit of the class.
5. Relative Cumulative Frequency (RCF)
c. Less than RCF (<RCF)
d. Greater than RCF (>RCF
' 52
Step 3: c = =8.67 ≅ 9 . thus c=9
6
Step 4:
Table: Frequency Distribution of Age (In years) of 40 Patients confined at a certain Hospital
Age Classes
5-13
14-22
23-31
32-40
41-49
50-58
TOTAL
Step 5:
Table: Frequency Distribution of Age (In years) of 40 Patients confined at a certain Hospital
Table: Frequency Distribution of Age (In years) of 40 Patients confined at a certain Hospital
A graph or a chary is device for showing numerical values or relationships in pictorial form.
Advantages:
1. Main features and implications of a body of data can be seen at once.
2. Can attract attention and hold the readers’ interest
9
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
3. Simplifies concepts that would otherwise have been expressed in so many words.
4. Can readily clarify data, frequency bring hidden facts and relationship.
3. Pie Chart – a
categories. The “pieces of pie” represent the
proportions of the total that fall into each category.
1. Frequency Histogram – a bar graph that displays that classes on the horizontal axis and
the frequencies of the classes on the vertical axis; the vertical line of the bars are erected at
the class boundaries and the height of the bars correspond to the class frequency.
2. Relative Frequency Histogram – graph that displays the classes on the horizontal axis
and the relative frequencies on the vertical axis.
3. Frequency Polygon – a line chart that is constructed by plotting the frequencies at the
class marks and connecting the plotted points by means of straight lines; in polygon is
closed by considering an addition class at each end and the ends of the lines are brought
down to the horizontal axis at the midpoints of the additional classes.
4. Ogives – graphs of the cumulative frequency distribution
a. <Ogives – the <CF is plotted against the UTCB
b. >Ogives – the >CF is plotted against the LTCB
10
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
ACTIVITY NO. ____
NAME: ADVISER:
GRADE AND SECTION: CONTACT NUMBER
1. Survey taken at a hotel in Malate indicated that 30 guests preferred the following means of
transportation.
car car bus plane train bus bus plane car plane plane
bus plane car car train train car car car car plane
plane car bus car bus car plane car plane plane car
car car bus train car bus car
_______________________________________________________________________________________________________________
11
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
2. Construct a graph for the given FDT and write a brief interpretation.
_______________________________________________________________________________________________________________
12
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
3. Fill up the missing values in the FDT below .
13
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY
Question 1
18
16
14
12
10 Series 1
8
6
4
2
0
41-44 45-48 49-52 53-56 57-60 61-64 65-68 69-72 73-76
14
MARY THE QUEEN COLLEGE OF QUEZONCITY
STATISTICS AND PROBABILITY