Professional Documents
Culture Documents
Classification and Frequency Dist
Classification and Frequency Dist
Lecture #3
WHAT IS CLASSIFICATION?
The term classification is defined as the process of dividing a
set of observations
Or objects into classes or groups in such a way that:
i. observations or objects in each class or group are similar,
ii. observations or objects in each class or group are
dissimilar to observations or objects in other class or
group.
AIMS OF CLASSIFICATION
Or
• It is a grouping of all the (numerical) observations into intervals
or classes together with a count of the number of observations that
fall in each interval or class.
In general, we can say frequency distribution is use to change
the ungrouped or Raw data into grouped data. There are two
types of frequency distribution
1) Discrete frequency distribution
used when data is given in the form of whole number for
example No. of students in universities, No. of colleges in
cities, No. of chemicals in labs etc.
2) Continuous Frequency distribution
used when the data is given in the form of intervals such as
weight, length, width etc.
WHAT IS CLASS-LIMITS?
The following may be used as a guide in constructing an FDT (note that if there is an existing set of
class intervals for the specific data, then steps 1 to will not apply)
Steps in Constructing a frequency distribution
table
Step 1 Step 2 Step 3 Step
4
Step Step Step
5 6 7
Step 1: Find the range R.
Note: If the resulting value is fractional, then we take the next higher
integer.
Solution: k=1+3.3log(N)
k = 1+3.3 log(32)
K=5.96 approximately
equal to 6
k= 6
Step 3 Step 4 Step 5 Step 6 Step 7 Back
Step 3: Estimate the class width c of each interval.
c = R/k
Note: Round off the answer to the same number of decimal
places
that the observations have.
Solution:
c = 132/6
= 22
Step 4 Step 5 Step 6 Step 7 Back
Step 4:
List the lower and upper class limits of the first
interval.
Class Interval
20 – 41
42 – 63
64 – 85
86 – 107
108 – 129
130 – 152
List all the succeeding lower and upper class limits by adding the class with c
to the lower limit of the first class interval. The upper class limit of the first
interval should be the number before the lower class interval of the second
interval. The highest class should contain the largest observation.
Note: Class limits must have the same number of decimal places as the raw
data.
Step 6 Step 7 Back
Step 6: • From the data, tally the observations according to the
interval which it belongs to. Summarize the tallies
in a column for the frequencies.
Note: The number 2.5 is accurate to the tenth unit (or 0.1) while
3.42 is accurate to the hundredth unit (or 0.01).
Next
Step 7:
Class Interval Tally Frequency Class boundary
20 – 41 IIII-II 7 19.5 – 41.5
42 – 63 IIII-III 8 41.5 – 63.5
64 – 85 IIII-II 7 63.5 – 85.5
Subtract 0.5
86 – 107 Add 0.5 in all IIII 4 85.5 – 107.5
108 – 129 I 1 107.5 – 129.5
130 – 151 IIII 5 129.5 – 152.5