Measures of Central Tendency
Measure of Central Tendency
Objectives of Averaging:-
• To get a single value that describes the characteristics
of the entire data set.
• To facilitate comparison.
Characteristics of a good average
• It should be easy to understand.
• It should be simple to compute.
• It should be based on all the observations.
• It should be rigidly defined.
• It should have sampling stability.
• It should be capable of further algebraic treatment.
• It should not be unduly affected by the presence of
extreme values.
The various measures of central tendency or
averages commonly used are:-
Arithmetic mean
The arithmetic mean (AM) is the most popular and widely
used measure for representing the entire data. Its value
is obtained by adding together all the observations and
dividing this total by the number of observations.
Calculation of AM - Ungrouped data:-
Direct Method:-
$$\bar{X} = \frac{x_1 + x_2 + \dots + x_N}{N} = \frac{1}{N}\sum_{i=1}^{N} x_i$$
i.e. $\bar{X} = \frac{\sum x}{N}$
Short-Cut Method:-
The AM can also be calculated by taking deviations from
any point A; in that case the formula is
$$\bar{X} = A + \frac{\sum d}{N}$$
where $d = x - A$
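The direct and short-cut formulas above can be compared with a minimal Python sketch. The function names and the sample values are illustrative assumptions, not taken from the slides.

```python
def mean_direct(xs):
    """Direct method: sum of the observations divided by their count N."""
    return sum(xs) / len(xs)


def mean_shortcut(xs, a):
    """Short-cut method: A + (sum of deviations d = x - A) / N."""
    deviations = [x - a for x in xs]
    return a + sum(deviations) / len(xs)


data = [12, 15, 11, 18, 14]      # illustrative observations (assumed)
print(mean_direct(data))         # 14.0
print(mean_shortcut(data, 15))   # 14.0, whichever point A is chosen
```

Both methods give the same value; the short-cut method only keeps the intermediate numbers small when working by hand.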
Direct Method:-
$$\bar{X} = \frac{\sum f x}{N}$$
Where, x = mid-point of each class
f = frequency of each class
N = total frequency, i.e. $N = \sum f$
Short-Cut Method:-
$$\bar{X} = A + \frac{\sum f d}{N} \times h$$
Where, $d = \frac{x - A}{h}$
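As a rough check that the two grouped-data formulas agree, the sketch below computes the mean of a frequency distribution both ways. The class mid-points, frequencies, point A and class interval h are made-up illustrative values, and equal class intervals are assumed.

```python
def grouped_mean_direct(midpoints, freqs):
    """Direct method: sum(f * x) / N, with N = sum(f)."""
    n = sum(freqs)
    return sum(f * x for x, f in zip(midpoints, freqs)) / n


def grouped_mean_shortcut(midpoints, freqs, a, h):
    """Short-cut method: A + (sum(f * d) / N) * h, with d = (x - A) / h."""
    n = sum(freqs)
    d = [(x - a) / h for x in midpoints]
    return a + sum(f * di for f, di in zip(freqs, d)) / n * h


# Assumed classes 0-10, 10-20, ..., 40-50 with these frequencies.
midpoints = [5, 15, 25, 35, 45]
freqs = [3, 7, 10, 6, 4]

direct = grouped_mean_direct(midpoints, freqs)
shortcut = grouped_mean_shortcut(midpoints, freqs, a=25, h=10)
print(direct, shortcut)   # both approximately 25.33
```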
Combined mean of two groups:-
$$\bar{x}_{12} = \frac{N_1 \bar{x}_1 + N_2 \bar{x}_2}{N_1 + N_2}$$
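The formula above gives the mean of two groups taken together, from their sizes and individual means. A small sketch with assumed group sizes and means (not from the slides):

```python
def combined_mean(n1, mean1, n2, mean2):
    """Mean of two groups pooled together, weighted by group sizes."""
    return (n1 * mean1 + n2 * mean2) / (n1 + n2)


# Assumed example: 40 observations with mean 50 and 60 with mean 60.
print(combined_mean(40, 50.0, 60, 60.0))   # 56.0
```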
Merits:-
• The calculation of AM is simple and it is unique,
that is, every data set has one and only one mean.
• The calculation of AM is based on all the values
given in the data set.
• The AM is a reliable single value that reflects all
values in the data set.
• The AM is least affected by sampling fluctuations,
i.e. its value varies least from sample to sample.
Limitations:-
Weighted Arithmetic Mean:
The AM, as discussed earlier, gives equal importance
to each observation in the data set. However, there
are situations in which the individual observations
in the data set are not of equal importance. Under
these circumstances, we may attach to each
observation a 'weight' $w_1, w_2, \dots, w_n$ as an
indicator of its importance. The formula for
computing the weighted AM is
$$\bar{X}_w = \frac{\sum x w}{\sum w}$$
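A minimal sketch of the weighted AM formula, with assumed scores and weights chosen only for illustration:

```python
def weighted_mean(values, weights):
    """Weighted AM: sum(x * w) / sum(w)."""
    return sum(x * w for x, w in zip(values, weights)) / sum(weights)


# Assumed example: three scores, the first counting twice as much.
scores = [80, 70, 90]
weights = [2, 1, 1]
print(weighted_mean(scores, weights))   # 80.0
```

With equal weights the result reduces to the ordinary AM.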
Geometric Mean:-
Geometric mean is defined as the Nth root of the product of the N
observations of a given data set. If there are two observations,
we take the square root; if three, the cube root; and so on.
$$GM = \sqrt[N]{x_1 x_2 x_3 \cdots x_N}$$
To simplify calculations, logarithms are used:
$$\log GM = \frac{1}{N}\left(\log x_1 + \log x_2 + \dots + \log x_N\right)$$
$$GM = \operatorname{antilog}\left(\frac{\sum \log x}{N}\right)$$
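The logarithmic form above can be sketched in Python as follows. Natural logarithms are used here as an assumption (any base works, provided the antilog uses the same base), and the sample values are illustrative.

```python
import math


def geometric_mean(xs):
    """GM via logarithms: antilog of the mean of log x."""
    if any(x <= 0 for x in xs):
        # The logarithm is undefined for zero or negative observations.
        raise ValueError("all observations must be positive")
    return math.exp(sum(math.log(x) for x in xs) / len(xs))


print(geometric_mean([4, 9]))   # approximately 6.0, the square root of 36
```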
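For grouped data the GM is calculated as
$$GM = \left(x_1^{f_1} \, x_2^{f_2} \cdots x_n^{f_n}\right)^{1/N}$$
$$GM = \operatorname{antilog}\left(\frac{\sum f \log x}{N}\right)$$
where $N = \sum f$ is the total frequency.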
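For grouped data, each logarithm is weighted by its frequency. A short sketch under the same assumptions as before (natural logarithms, made-up values and frequencies):

```python
import math


def grouped_geometric_mean(values, freqs):
    """Grouped GM: antilog of sum(f * log x) / N, with N = sum(f)."""
    n = sum(freqs)
    return math.exp(sum(f * math.log(x) for x, f in zip(values, freqs)) / n)


# Assumed data: 2 occurs once, 4 twice, 8 once, so the product is 256
# and its fourth root is 4.
print(grouped_geometric_mean([2, 4, 8], [1, 2, 1]))   # approximately 4.0
```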
Application of GM:-
1. The GM is used to find the average percent
increase in sales, production, population or other
economic or business data.
2. It is the most suitable average when large
weights have to be given to small values of
observations and vice versa.
Merits:-
1. The value of GM is not much affected by extreme
observations and is computed by taking all the
observations into account.
2. It is useful in averaging ratio and percentage as
well as in determining rate of increase and
decrease.
Limitations:-
1. The calculation of GM as compared to AM is more
difficult.
2. The value of GM cannot be calculated when any of the
observations in the data set is either negative or zero.
Harmonic Mean:
The harmonic mean (HM) of a set of observations is
defined as the reciprocal of the arithmetic mean of the
reciprocals of the observations, i.e.
$$HM = \frac{N}{\frac{1}{x_1} + \frac{1}{x_2} + \dots + \frac{1}{x_N}} = \frac{N}{\sum \frac{1}{x}}$$
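A minimal sketch of the HM definition above, using assumed sample values:

```python
def harmonic_mean(xs):
    """HM: N divided by the sum of the reciprocals of the observations."""
    return len(xs) / sum(1 / x for x in xs)


# Assumed data: 1/2 + 1/3 + 1/6 = 1, so HM = 3 / 1.
print(harmonic_mean([2, 3, 6]))   # approximately 3.0
```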
Calculation of Median:- Ungrouped Data
Arrange the data in ascending or descending order of
magnitude.
If the number of observations (N) is an odd number,
then
Median = value of the $\left(\frac{N+1}{2}\right)$th observation in
the data set.
If the number of observations (N) is an even number,
then the median is
$$\text{Median} = \frac{\left(\frac{N}{2}\right)\text{th observation} + \left(\frac{N}{2} + 1\right)\text{th observation}}{2}$$
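The odd and even cases above translate directly into code. This is only a sketch with illustrative data; note that the positions in the formulas are 1-based, while Python lists are 0-based.

```python
def median(xs):
    """Median of ungrouped data, following the odd/even rule."""
    s = sorted(xs)                 # arrange in ascending order
    n = len(s)
    if n % 2 == 1:
        # Odd N: the ((N + 1) / 2)th observation (1-based position).
        return s[(n + 1) // 2 - 1]
    # Even N: average of the (N/2)th and (N/2 + 1)th observations.
    return (s[n // 2 - 1] + s[n // 2]) / 2


print(median([7, 3, 9, 1, 5]))   # 5
print(median([7, 3, 9, 1]))      # 5.0
```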
Calculation of Median – Grouped Data
First identify the class interval which contains the
median value, i.e. the $\left(\frac{N}{2}\right)$th observation of the data set.
$$\text{Median} = L + \frac{\frac{N}{2} - c.f.}{f} \times h$$
Where L is the lower limit of the median class
c.f. is the cumulative frequency of the class preceding
the median class
f is the frequency of the median class
h is the class interval of the median class
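The two steps above (locate the median class, then interpolate within it) might be sketched as follows. The class limits and frequencies are assumed for illustration, and equal class intervals are assumed throughout.

```python
def grouped_median(lower_limits, h, freqs):
    """Median = L + ((N/2 - c.f.) / f) * h for the median class."""
    n = sum(freqs)
    target = n / 2                 # position of the median observation
    cf = 0                         # cumulative frequency before this class
    for lower, f in zip(lower_limits, freqs):
        if f > 0 and cf + f >= target:
            return lower + (target - cf) / f * h
        cf += f
    raise ValueError("empty frequency distribution")


# Assumed classes 0-10, 10-20, ..., 40-50.
lower_limits = [0, 10, 20, 30, 40]
freqs = [3, 7, 10, 6, 4]
print(grouped_median(lower_limits, 10, freqs))   # 25.0
```

Here N = 30, so the 15th observation falls in the class 20-30, which has c.f. = 10 and f = 10.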
Merits:-
1. Median is unique, i.e. like the mean, there is only one median for a
set of data.
2. The value of the median is easy to understand and may be
calculated from any type of data.
3. The sum of absolute differences of all the observations in the
data set from the median value is minimum, i.e.
$\sum |X - \text{Med}|$ is minimum.
4. The extreme values in the data set do not affect the
calculation of the median value.
5. The median value may be calculated for an open-ended
distribution of data set.
6. The median is considered the best statistical technique for studying
the qualitative attributes of an observation in the data set.
Limitations:-
1. The median is not capable of algebraic treatment,
i.e. the combined median of two or more sets of data
cannot be determined from their individual medians.
2. The median is more affected by sampling
fluctuations than the mean.
3. Median is an average of position, therefore
arranging the data in ascending or descending
order of magnitude is time consuming in case of a
large number of observations.
Related positional measures i.e. Partition Values:
Quartiles:-
The values of observations in a data set, when arranged in an
ordered sequence, can be divided into four equal parts, or
quarters, using three quartiles, namely Q1, Q2 and Q3.
The generalized formula for calculating quartiles in case of
grouped data is:
$$Q_i = L + \frac{\frac{iN}{4} - c.f.}{f} \times h \quad \text{for } i = 1, 2, 3$$
For deciles, the corresponding formula is:
$$D_j = L + \frac{\frac{jN}{10} - c.f.}{f} \times h \quad \text{for } j = 1, 2, 3, \dots, 9$$
Percentiles:
The values of observations in a data set, when arranged
in an ordered sequence, can be divided into hundred
equal parts using ninety-nine percentiles, Pk (k = 1, 2,
…, 99).
The generalized formula for calculating percentiles in
case of grouped data is:
$$P_k = L + \frac{\frac{kN}{100} - c.f.}{f} \times h \quad \text{for } k = 1, 2, 3, \dots, 99$$
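Quartiles, deciles and percentiles for grouped data differ only in the target position: i·N/4, j·N/10 and k·N/100 respectively. One possible sketch uses a single helper for all three. The helper name `partition_value`, its `parts` parameter and the sample distribution are assumptions for illustration, and equal class intervals are assumed.

```python
def partition_value(lower_limits, h, freqs, k, parts):
    """kth partition value: L + ((k*N/parts - c.f.) / f) * h.

    parts = 4 gives quartiles, 10 deciles, 100 percentiles.
    """
    n = sum(freqs)
    target = k * n / parts
    cf = 0                         # cumulative frequency before this class
    for lower, f in zip(lower_limits, freqs):
        if f > 0 and cf + f >= target:
            return lower + (target - cf) / f * h
        cf += f
    raise ValueError("empty frequency distribution")


# Assumed classes 0-10, 10-20, ..., 40-50 with N = 40.
lower_limits = [0, 10, 20, 30, 40]
freqs = [4, 6, 10, 12, 8]

q1 = partition_value(lower_limits, 10, freqs, 1, 4)      # first quartile
q2 = partition_value(lower_limits, 10, freqs, 2, 4)      # median
d1 = partition_value(lower_limits, 10, freqs, 1, 10)     # first decile
p90 = partition_value(lower_limits, 10, freqs, 90, 100)  # 90th percentile
print(q1, q2, d1, p90)   # 20.0 30.0 10.0 45.0
```

The second quartile, fifth decile and fiftieth percentile all coincide with the median.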
Mode:-
Mode is defined as that value which occurs the
maximum number of times i.e. having the maximum
frequency.
The concept of mode is of great use to large-scale
manufacturers of consumable items such as ready-made
garments and shoes. In all such cases it is important
to know the size that fits most persons rather than
the 'mean size'.
Calculation of Mode:-
Ungrouped Data:- To determine the mode, count the
number of times each value occurs; the value which
occurs the maximum number of times is the modal
value.
Grouped Data:- In discrete and continuous series, if
items are concentrated at one value only, then the mode
can be located easily. But if items are
concentrated at more than one value, we find the
point of concentration by the method of grouping.
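For ungrouped data, the counting procedure can be sketched with the standard library's `collections.Counter`. The shoe-size data are invented for illustration; the function returns every value that ties for the highest count, so multi-modal data are visible.

```python
from collections import Counter


def modes(xs):
    """Return the value(s) that occur the maximum number of times."""
    counts = Counter(xs)           # value -> number of occurrences
    top = max(counts.values())
    return sorted(v for v, c in counts.items() if c == top)


shoe_sizes = [6, 7, 8, 8, 9, 8, 7]   # assumed sample of sizes sold
print(modes(shoe_sizes))             # [8]
print(modes([1, 1, 2, 2, 3]))        # [1, 2]
```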
After finding the modal class we will use the following
formula:-
$$\text{Mode} = L + \frac{f_1 - f_0}{2 f_1 - f_0 - f_2} \times h$$
Where L is the lower limit of the modal class
f1 is the frequency of the modal class
f2 is the frequency of the class succeeding the
modal class
f0 is the frequency of the class preceding the
modal class
h is the class interval of the modal class
It must be noted that the value of the mode must lie in
the modal class. If it does not lie in the modal class, it
is considered to be incorrect. In such a situation we
use the following alternative formula:
$$\text{Mode} = L + \frac{f_2}{f_0 + f_2} \times h$$
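The rule above (use the main formula, and fall back to the alternative when the result falls outside the modal class) could be sketched as below. The function name and the frequencies are illustrative assumptions. The second call imagines a modal class chosen by the method of grouping whose own frequency is not the largest, which is the situation in which the main formula can fall outside the class.

```python
def grouped_mode(lower, h, f0, f1, f2):
    """Mode of grouped data with the fallback described in the text.

    lower: lower limit L of the modal class, h: its class interval,
    f0, f1, f2: frequencies of the preceding, modal and succeeding class.
    """
    denom = 2 * f1 - f0 - f2
    if denom != 0:
        mode = lower + (f1 - f0) / denom * h
        if lower <= mode <= lower + h:   # result must lie in the modal class
            return mode
    # Alternative formula: L + f2 / (f0 + f2) * h (assumes f0 + f2 > 0).
    return lower + f2 / (f0 + f2) * h


print(grouped_mode(20, 10, f0=6, f1=10, f2=6))   # 25.0
print(grouped_mode(20, 10, f0=9, f1=8, f2=5))    # fallback, about 23.57
```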
Merits:-
1. Mode value is easy to understand and to calculate.
Modal class can also be located by inspection.
2. The mode is not affected by the extreme values in
the distribution. The mode value can also be
calculated for open-ended frequency distributions.
3. The mode can be used to describe qualitative as
well as quantitative data.
Limitations:-
1. Mode is not a rigidly defined measure as there are
several methods for calculating its value.
2. It is difficult to locate modal class in the case of
multi-modal frequency distributions.
3. Mode is not suitable for algebraic manipulations.
4. When data sets contain more than one mode, such
values are difficult to interpret and compare.
Relationship between Mean, Median and
Mode:-
In a symmetrical distribution, the values of mean, median and
mode are equal. When these three values are not equal to
each other, the distribution is not symmetrical.
For asymmetrical distributions, Karl Pearson has suggested a
relationship between these three measures of central
tendency:
$$\text{Mean} - \text{Mode} = 3\,(\text{Mean} - \text{Median})$$
or
$$\text{Mode} = 3\,\text{Median} - 2\,\text{Mean}$$
or
$$\text{Median} = \frac{1}{3}\,(2\,\text{Mean} + \text{Mode}) \quad \text{or} \quad \text{Mean} = \frac{1}{2}\,(3\,\text{Median} - \text{Mode})$$
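The three rearrangements of the relationship above can be checked against one another with a short sketch. The function names and the mean and median values are assumed for illustration; the relationship itself is empirical, so the results are estimates rather than exact values.

```python
def mode_from(mean, median):
    """Mode = 3 * Median - 2 * Mean."""
    return 3 * median - 2 * mean


def median_from(mean, mode):
    """Median = (2 * Mean + Mode) / 3."""
    return (2 * mean + mode) / 3


def mean_from(median, mode):
    """Mean = (3 * Median - Mode) / 2."""
    return (3 * median - mode) / 2


# Assumed values: mean 30 and median 28.
mode = mode_from(30, 28)
print(mode)                    # 24
print(median_from(30, mode))   # 28.0
print(mean_from(28, mode))     # 30.0
```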