Professional Documents
Culture Documents
Organisation of Data CH 3
Organisation of Data CH 3
Organisation of Data
Organisation of Data
Organisation of data is the second
statistical tool under which data are
arranged in such a form that comparison
of masses of similar data may be
facilitated a further analysis may be
possible. The most popular way of
organization of data is classification of
data.
Meaning of Classification of Data
Classification is the of arranging data in various
groups or classes according to their characteristics.
There are two features of classification of data
1- Data are classified in various groups or classes.
2- The basis of classification of data is their
characteristics (resemblances and affinities).
Objectives of Classification:
a] To simplify complex data
b] To facilitate understanding
c] To facilitate comparison
2 BIHAR 12899.89
3 KARNATAKA 1834.78
4 PUNJAB 21788.20
1 1941 31.87
2 1951 36.11
3 1961 43.91
4 1971 54.82
5 1981 68.33
Qualitative Classification-
When data are classified on the basis of
quality is known as qualitative classification
of data.
QUALITATIVE
CLASSIFICATION
We may first divide the population to
male and female on the basis of the
attribute “ sex” each of this class may
be further subdivide into “literate and
‘illiterate’ on the basis of attribute
‘literacy’ further classification can be
made on the basis of same other
attribute , say , employment .
QUANTITATIVE CLASSIFICATION
When data are classified on the basis of quantity is
known as quantitative classification of data.
WEIGHT (Kg) NO. OF STUDENTS
40-50 60
50-60 50
60-70 28
70-80 20
80-90 12
90-100 170
Characteristics of a Good
Classification
1.Comprehensiveness
2.Clarity
3.Homogeneity
4.Suitability
5.Stability
6.Elastic
Concept of Variable
Types of Variable
A) Discrete Variable
Discrete variables are those variables that increase in jumps
or in complete numbers. (No fraction is possible)
Eg. Number of students in a class, Number of cars in a show
room etc. (1,2, 10,or 15 etc.)
B) Continuous Variables
Variables that assume a range of values or increase not in
jumps but continuously or in fractions are called continuous
variables.
Eg. Height of the boys –5’1’’ , 5’3’’ and so on, Marks in any
range 0-10, 10-15, 15-20
Meaning of Discrete Variable
Which are measured in complete numbers
like numbers of students, teachers, office
staff etc.
EMPLOYEE/ NO
STUDENTS
STUDENTS 500
TEACHERS 32
OFFICE STAFF 6
D-GROUP EMPLOYEE 4
Meaning of Continuous
Variable
Which are not measured in complete
numbers always like height in meter,
weight
Heightin
inKg
Cm etc. No of Students
110-120 10
120-130 12
130-140 11
140-150 8
Types of Statistical Series-
Types of Series
1 10
2 15
3 18
4 20
5 22
Frequency Distribution
Series-
Types Frequency Distribution
Series-
C.I. Frequency
1-10 8
11-20 9
21-30 10
31-40 9
41-50 8
Conversion of Inclusive into
Exclusive Series
11-20 9 10.5-20.5 9
21-30 10 20.5-30.5 10
31-40 9 30.5-40.5 9
41-50 8 40.5-50.5 8
Open End Series
The lowest value of highest value of
the distribution are not defined.
00-10 8 10 8
10-20 9 20 8+9=17
20-30 10 30 17+10=27
30-40 9 40 27+9=36
40-50 8 50 36+8=44
More Than Cumulative Frequency
Series
C.I Frequency More No of
Than Items
0 44
00-10 8
10 44-8=36
10-20 9
20 36-9=27
20-30 10
30 27-10=17
30-40 9
40 17-9=8
40-50 8
50 8-8=0
LOSS OF INFORMATION
•The frequency distribution summarizes the raw data by
making it concise and comprehensible. However, it does not
show the details that are found in your data and leads to loss
of information.
•When the raw data is grouped into classes, an individual
observation has no significance in further statistical
calculations.
•For example, suppose class 10-20 contains 6 values: 12,
15,16,18,14,19. When such data is grouped as a class 10-20,
study material and individual values have no significance and
only frequency, i.e.6 is recorded and not their actual values.
•Statistical calculations are based only on the values of the
class mark instead of the actual values. As a result, it leads to
considerable loss of information.
FREQUENCY ARRAY