Professional Documents
Culture Documents
P102 Lesson 2
P102 Lesson 2
Objectives:
1. The set of categories or scores that make up the original measurement scale.
The simplest frequency distribution table presents the measurement scale by listing
the different measurement categories (S values) in a column from highest to lowest.
Beside each S value, we indicate the frequency, or the number of times that particular
measurement occurred in the data. It is customary to use an S as the column heading
for the scores and an f as the column heading for the frequencies.
A. Ungrouped Frequency Distribution Table
This is useful in organizing small data, specifically when the number of scores
obtained is below 30.
Example: The following set of scores was obtained from a 10-point statistics quiz.
These scores can be organized by constructing a frequency distribution table.
8 9 8 7 10 9 6 4 9 8
7 8 10 9 8 6 9 7 8 8
Steps:
1. Make 2 columns – S and f wherein S stands for the scores and f for the
frequency
2. Take note of the highest and lowest score
3. List all scores from the highest to the lowest score on the S column.
i. Even if the score does not exist, include it. All scores between the
highest and lowest scores should be present.
ii. Make sure to arrange the scores in order, either ascending or
descending.
4. Count the number of scores in the data set. Example: How many 10s are
in the data set? This will be the frequency and will be recorded in the f
column.
5. Add all the values on the f column. The ∑f is also equal to N or the total
number of individuals.
∑f = N
S (Scores) f (Frequency)
4 1
5 0
6 2
7 3
8 7
9 5
10 2
∑f = 20
6. Add additional columns to your table namely the proportion and percentage.
Proportion is represented by rf while percentage by p.
𝒇
𝒓𝒇 =
𝑵
p = rf x 100
S f rf p
4 1 0.05 5%
5 0 0.00 0%
6 2 0.10 10%
7 3 0.15 15%
8 7 0.35 35%
9 5 0.25 25%
10 2 0.10 10%
∑f = 20 ∑rf = 1 ∑p = 100
53 15 33 3 21 19 17 16 44 25
30 3 5 9 35 32 26 13 14 14
47 47 18 11 5 19 24 17 6 25
8 18 29 3 19 22 3 22 29 2
6 10 29 10 29 21 38 41 16 17
Steps:
k = 1+3.3log(50)
k = 7
i = R/k
i = 51/7
i = 7.2 (always round up)
i = 8
i. Since you already have your class width and number of class interval,
you can start making your class intervals. Since the lowest class
interval must contain the lowest score, you must start with 2. Use
the class width to determine the class interval. The class width
represents how many values should be present within the class
interval. The first class interval would be 2, 3, 4, 5, 6, 7, 8, and 9
or 2-9. There are 8 values within the class interval of 2-9 so you
have met the requirement of your class width. Continuing counting
until you reach 7 class intervals as you have determined in step 3.
The next class interval would start at 10 so it will be 10, 11 12, 13,
14, 15, 16, and 17 or 10-17.
C.I.
8 values 2-9
10-17
18-25
7 class
26-33
intervals
34-41
42-49
50-57
5. Make 2 columns first, Class intervals and f. Class intervals will contain your
class intervals while f will be the frequency.
6. Tally the frequency for each class intervals. Go to the data set and count
how many scores are present in each class interval. Fill in the column of
f with your tally.
53 15 33 3 21 19 17 16 44 25
30 3 5 9 35 32 26 13 14 14
47 47 18 11 5 19 24 17 6 25
8 18 29 3 19 22 3 22 29 2
6 10 29 10 29 21 38 41 16 17
C.I. f
2-9 11
10-17 12
18-25 12
26-33 8
34-41 3
42-49 3
50-57 1
7. Get your ∑f or N.
8. Add another column for the cumulative frequency (cf). This is a common
factor used in future computations.
A cumulative frequency distribution indicates the number of scores
that fall below the upper real limit of each interval.
9. In getting the cf, you must start with the lowest class interval. Copy the f
of the lowest class interval in the cf column. After that, continue adding
the frequency until you reach the highest class interval. You know that
what you are doing is right when you reach the highest class interval and
the corresponding cf is equal to your N.
ii. Example:
Class Intervals f cf
2-9 11 11
10-17 12 23
18-25 12 35
26-33 8 43
34-41 3 46
42-49 3 49
50-57 1 50
N = 50
iii. The lowest class interval 2-9 has a f of 11 so you can just copy
that for its cf. For the cf of 10-17 add its f to the cf of 2-9 which
is 11 so that will be 23. For the cf of 18-25, add its f to the cf of
10-17 so that would be 35. Do this until you reach the bottom.
Guidelines:
1. A graph has two axes: vertical and horizontal. The vertical axis is called the
ordinate, or Y axis, and the horizontal axis is the abscissa, or X axis.
2. Very often the independent variable is plotted on the X axis and the dependent
variable on the Y axis. In graphing a frequency distribution, the score values are
usually plotted on the X axis and the frequency of the score values is plotted on
the Y axis.
3. Suitable units for plotting scores should be chosen along the axes.
4. To avoid distorting the data, it is customary to set the intersection of the two
axes at zero and then choose scales for the axes such that the height (Y-axis)
of the graphed data is about two-thirds or three-fourths of the length (X-axis).
Bar Graph
Frequency distributions of nominal or ordinal data are customarily plotted
using a bar graph. This type of graph is shown below. A bar is drawn for each
category, where the height of the bar represents the frequency or number of
members of that category. Since there is no numerical relationship between
the categories in nominal data, the various groups can be arranged along the
horizontal axis in any order. In the figure found below, they are arranged from
left to right according to the magnitude of frequency in each category. Note
that the bars for each category in a bar graph do not touch each other. This
further emphasizes the lack of a quantitative relationship between the
categories. For a nominal scale, the space between bars emphasizes that the
scale consists of separate, distinct categories. For ordinal scales, separate bars
are used because you cannot assume that the categories are all the same
size.
Sample figure:
Steps:
1. List the categories of measurement along the X-axis and then the scale of
frequencies along the Y-axis.
2. Draw a bar above each category so that the height of the bar corresponds to
the frequency for the category.
3. Since the values of a nominal variable are not in any particular order, you
leave a space between the bars.
II. Graphs for Interval and Ratio Data
A. Histogram
The histogram is used to represent frequency distributions composed of interval
or ratio data. It resembles the bar graph, but with the histogram, a bar is
drawn for each class interval. The class intervals are plotted on the horizontal
axis such that each class bar begins and terminates at the real limits of the
interval. The height of the bar corresponds to the frequency of the class
interval. Since the intervals are continuous, the vertical bars must touch each
other rather than be spaced apart as is done with the bar graph. This type of
graph is most appropriate for discrete data.
Sample figure:
Histogram for Grouped Frequency Table
Sample figure:
B. Frequency Polygon
The frequency polygon is also used to represent interval or ratio data. The
horizontal axis is identical to that of the histogram. However, for this type of
graph, instead of using bars, a point is plotted over the midpoint of each
interval at a height corresponding to the frequency of the interval. The points
are then joined with straight lines. Finally, the line joining the points is extended
to meet the horizontal axis at the midpoint of the two class intervals falling
immediately beyond the end class intervals containing scores. This closing of
the line with the horizontal axis forms a polygon, from which the name of this
graph is taken. This type of graph is mot appropriate for continuous data.
Frequency Polygon for Ungrouped Frequency Table
Sample figure:
The major difference between a histogram and a frequency polygon is the following: The
histogram displays the scores as though they were equally distributed over the interval,
whereas the frequency polygon displays the scores as though they were all concentrated
at the midpoint of the interval. Some investigators prefer to use the frequency polygon
when they are comparing the shapes of two or more distributions. The frequency polygon
also has the effect of displaying the scores as though they were continuously distributed,
which in many instances is actually the case.
Using SPSS