Professional Documents
Culture Documents
Fundamentals of Statistics
Fundamentals of Statistics
CHAPTER - 2
CENTRAL TENDENCY
In statistics, a central tendency is a central or typical value for a probability distribution. It
may also be called a center or location of the distribution. The central tendency may give a
fairly good idea about the nature of the data (mean, median and mode shows the "middle
value").
In statistics we always find measures of central tendency because it always makes sense to
compare individual scores to the overall group of scores in order to be able to correctly
interpret the result. Measures of central tendency help you find the middle, or the average, of a
data set
The Arithmetic Mean is one of the most commonly used measures of central tendency. For a
set of numbers, the mean is simply the average, i.e. sum of all the numbers divided by the
number of observations.
Therefore if you want to find the average length of a group of insects, you simply take the
length of each insect, add up all these lengths and divide by the number of insects. If the
lengths of 5 insects are 6.5mm, 5.4mm, 5.8mm, 6.2mm and 5.9mm, then the mean is
(6.5+5.4+5.8+6.2+5.9)mm/5 = 5.96mm.
Merits or Advantages: It is rigidly defined. It is based on all the observation. It can be easily
calculated and understandable. It is also least affected by the sampling fluctuation. It is
amenable to algebraic treatment.
The Median is another frequently used measure of central tendency. The median is simply the
midpoint of the distribution, i.e. there are as many numbers above it as below it.
If the number of data points is odd, then the median is simply the middle number. Therefore
the median of 3, 5, 6, 9, 15 is 6.
If the number of data points is even, then the median is the mean of the middle two numbers.
Therefore the median of 2, 7, 15, 20 is (7+15)/2 = 11.
The median is particularly useful when there are a few data points that are hugely different.
The Mode is simply the most commonly occurring value. For example, in a class of 50
students graded on a scale of 1-5, the distribution may be as shown in the figure. The mode of
this data is 4.
Different types of data need different measures of central tendency to calculate and describe
the distribution of data.
Demerits or Disadvantages: Mode is ill defined. It is not always possible to find a clearly
defined mode. It is not based upon all the observation. It is not capable for further
mathematical treatment. It is also more affected by sampling fluctuations.
CONCEPTS:
Mean:
(1)
However, equation (1) can only be used when the error associated with each measurement is
the same or unknown. Otherwise, the weighted average, which incorporates the standard
deviation, should be calculated using equation (2) below.
(2)
Median:
Individual Series:
Continuous Series:
Where l1 and l2 are the lower and upper limits of the group in which the median is located,
f is the frequency of the median group, m is the size of the middle item (n+1/2)th and c is
the cumulative frequency of the next lower group than the median group.
Mode:
Discrete Series:
Here the location of the mode is only simple arithmetic based upon successive groups.
Continuous Series:
where; l1 and l2 are the lower and upper limits of the modal class, fm is the maximum
frequency , f1 is the frequency of the class preceding the modal class and f2 frequency of
the class following the modal class.
For the moderately skew distribution. It should be used only when the frequency
distribution has two or more equal maximum frequencies.
Solved Examples:
Arithmetic Mean:
1. The following figures are the heights in cms. of 7 plants chosen at random. Calculate the
arithmetic mean of the heights (a) Direct Method (b) Short cut method
S. No. 1 2 3 4 5 6 7 N=7
Ht. (Xi) 64 59 67 69 65 70 68 ΣXi = 462
Di=Xi -65 -1 -6 2 4 0 5 3 ΣDi = 7
2. The following data give the weekly increment (in mm) of 60 plants. Calculate the average
increment per plant by direct as well as short cut method.
Weekly In. 2 3 4 5 6 7 8 9
No. of Plants (fi) 10 16 11 8 6 4 3 2 Σfi = 60=N
fi Xi 20 48 44 40 36 28 24 18 Σfi Xi = 258
Di=Xi -A -3 -2 -1 0 1 2 3 4 (if A=5)
fi Di -30 -32 -11 0 6 8 9 8 Σ fi Di = -42
3. Make a frequency table and calculate the Arithmetic Mean by Direct and Short cut
method (Change of origin and scale) from the following data.
14 16 16 14 22 13 15 24 12 23 14 20
17 21 18 18 19 20 17 16 15 11 12 21
20 17 18 19 22 23
Example
Group 11-13 13-15 15-17 17-19 19-21 21-23 23-25
Freq. fi 3 4 5 6 5 4 3 Σfi = 30 =N
Mid Value Xi 12 14 16 18 20 22 24
fi Xi 36 56 80 108 100 88 72 Σfi Xi = 540
Di=(Xi-18)/2 -3 -2 -1 0 1 2 3
fi Di -9 -8 -5 0 5 8 9 Σ fi Di = 0
Median:
1. Obtain the Median of the following items: 14, 8, 6, 12, 15, 9, and 18
Median = Size of {(n+1)/2}th item = 81st item and lies in the height group 20-30
Mode:
1. Find out the mode of the following set of numbers.
2, 5, 2, 3, 5, 5, 6, 4, 5, 3, 5, 2, 5, 7, 1
Mode = 5 (Since Frequency of 5 is repeated 6 times
Group 1-5 6-10 11-15 16-20 21-25 26-30 31-35 36-40 41-45
Freq. 7 10 16 32 24 18 10 5 1
Mode = 16 + {(32-16)/(64-16-24)}(20-16)
= 16 + {(16/24) x 4} = 18.66
Geometric Mean:
G = ( 2 x 4 x 8)1/3 = 641/3 = 4
2. The marks obtained by 18 students in a test are given below. Calculate the H.M.
Marks 20 21 22 23 24 25
No. of Students 4 2 7 1 3 1 Σfi = N = 18
1/Xi .05000 .04762 .04545 .04368 .04167 .04000
fi 1/Xi .20000 .09524 .31815 .04348 .12501 .04000
Σ fi 1/Xi = 0.82188