Professional Documents
Culture Documents
Chapter 3 Statistics
Chapter 3 Statistics
Measure of Central Tendency: Usually when two or more different data sets are
to be compared it is necessary to condense the data, but for comparison the
condensation of data set into a frequency distribution and visual presentation are
not enough. It is then necessary to summarize the data set in a single value. Such a
value usually somewhere in the center and represent the entire data set and hence it
is called measure of central tendency or averages. Since a measure of central
tendency (i.e. an average) indicates the location or the general position of the
distribution on the X-axis therefore it is also known as a measure of location or
position.
1. Arithmetic Mean
2. Geometric Mean
3. Harmonic Mean
4. Mode
5. Median
Numerical Example:
Calculate the arithmetic mean for the following the marks obtained by 9
students are given below:
x i
xi
x i1
45
n
32
n9 37
46
360 39
x 40 marks 36
9 41
48
36
n
Numerical Example: x
i1
i 360
Calculate the arithmetic mean for the following data given below:
x i1
n , n fi
f i
i1
i1
The weight recorded to the nearest grams of 60 apples picked out at random from a
consignment are given below:
f
i1
i 60 fx
i1
i i 7350.0
n
fx i i
7350.0 122.5 grams
x i1
n 60
f i
i1
Using formula of short cut method of arithmetic mean for grouped data:
n
fD n
n fi
i i
x A i1 ,
n
i1
fi
i1
f
i1
i 60 fD
i1
i i =480
fD
n
i i
480
xA i1
n 114.5 122.5 grams
f i 60
i1
Using formula of step deviation method of arithmetic mean for grouped data:
fu
n
i i
xi A
xA i1
n
h u , where h is the width of the class interval:
i
, h
fi
i1
Xi A
Weight (grams) Midpoints ( xi ) Frequency ui fi ui
h
( fi ) A 114.5 , h=20
65----84 (65 84) 2 74.5 09 -2 -18
85----104 94.5 10 -1 -10
105----124 114.5 17 0 0
125----144 134.5 10 1 10
145----164 154.5 05 2 10
165----184 174.5 04 3 12
185----204 194.5 05 4 20
n n
f
i1
i 60 fu
i1
i i =24
fu
n
i i
24
xA i1
n
114.5 114.5 122.5 grams (Answer).
h 08
20
f i 60
i1
Chapter 03 Measures of Central Tendency
Geometric Mean: “The nth root of the product of “n” positive values is called
geometric mean”
Geometric Mean product of " n" positive
n
xi 45 32 37 46 39 36 41 48 36
n9
xi log xi
n
45 log 45 =1.65321 log xi
32 G.M anti log i1
1.50515 n
37
1.56820
46 1.66276 14.38700
G.M anti log
39 1.59106
9
36 1.55630 G.M anti log 1.59856
41 1.61278
48 1.62124 G.M 39.68 (Answer).
36 1.55630
n
log x
i1
i =14.38700
f
i1
i
fi log xi
65----84 (65 84) 2 74.5 09 1.8722 16.8498 G.M anti log i1
n
85----104 10 1.9754 19.7540
fi
105----124 94.5 17 2.0589 35.0013 i1
124.2483
125----144 114.5 10 2.1287 21.2870 anti log
60
145----164 134.5 05 2.1889 10.9445
165----184 154.5 04 2.2418 8.9672 anti log 2.0708
124.2483
xi 45 32 37 46 39 36 41 48 36
Using formula of harmonic mean for ungrouped data:
n9
xi 1 xi H .M n
1
n
45 0.02222 x
32 0.03125 i1 i
37 0.02702 9
H .M
46 0.02173 0.22862
39 0.02564
H .M 39.36663 (Answer).
36 0.02777
41 0.02439
48 0.02083
36 0.02777
n
1
x =0.22862
i1 i
f i
H .M n
i1
fi
x
i1 i
Weight (grams) Midpoints ( xi ) Frequency
( fi ) fi x i
65----84 (65 84) 2 74.5 09 0.12081
85----104 94.5 10 0.10582
105----124 114.5 17 0.14847
125----144 134.5 10 0.07435
145----164 154.5 05 0.03236
165----184 174.5 04 0.02292
185----204 194.5 05 0.02571
n
n
fi
f
i1
i 60 x =0.53044
i1 i
f
n
i
H .M i1
60
113.11grams (Answer).
n fi 0.53044
x
i1 i
Calculate the median for the following the marks obtained by 9 students are
given below:
xi 45 32 37 46 39 36 41 48 36
Arrange the data in
ascending order
32, 36, 36, 37, 39, 41, 45, 46, 48. n “n” is odd
th 9
n 1
Median Size of 2 observation
th
9 1
Median Size of 2 observation
Median Size th
5 observation
of
Median 39 (Answer).
Calculate the median for the following the marks obtained by 10 students
are given below:
xi 45 32 37 46 39 36 41 48 36 50
32, 36, 36, 37, 39, 41, 45, 46, 48, 50. n “n” is even
10
n th n
observation
th
Size of
2 12
Median
2
th
10
10 th
Size of 1 observation
2
2
Median
2
Median Size of 5 th
6
th
observation
2
39
Median 40 (Answer).
41
2
The number of values above the median balances (equals) the number of
values below the median i.e. 50% of the data falls above and below the
median.
Numerical examples: The following distribution relates to the number of
assistants in 50 retail establishments.
No.of 0 1 2 3 4 5 6 7 8 9
assistant
f 3 4 6 7 10 6 5 5 3 1
f
i1
i =50
n fi “n” is even
50 n th
i1
n
th
Size of observation
2 12
Median
th 2
50
50 th
Size of 1 observation
2 2
Median
2
Median Size of 25 th
26
th
observation
2
44
Median
(Answer).
4
2
Numerical example: Find the median, for the distribution of examination marks
given below:
f
i1
i =905
n
905= =452.5th student which corresponds to marks in the class 59.5---69.5
2 2
Therefore hn
Median l C 10 905 59.5 452.5 285
59.5 285 10
304 2 304
f 2
1675
Median 59.5
Median 59.5 5.5 Median 65 marks (Answer).
, ,
304
These values are denoted by Q1, Q2 and Q3. It is to be noted that 25% of the data falls
below Q1, 50% of the data falls below Q2 and 75% of the data falls below Q3.
These values are denoted by D1, D2,…,D9. It is to be noted that 10% of the data falls
below D1, 20% of the data falls below D2,…, and 90% of the data falls below D9.
These values are denoted by P1, P2,…,P99. It is to be noted that 1% of the data falls
below P1, 2% of the data falls below P2,…, and 99% of the data falls below P99.
Median Q2 D5 are same and are equal to median.
P50
Calculate the quartiles for the following the marks obtained by 9 students
are given below:
xi 45 32 37 46 39 36 41 48 36
1
where j=1, 2, 3.
n
th
Q1 Marks obtained by
2.25 1
th
student
Q1 Marks obtained
2 1
th
student
by
th
Q Marks obtained by 3 student
1
Q1 36 (Answer).
1
f
i1
i =50
3 50
th
Q3 Marks obtained by student
1 4
th
150
Q3 Marks obtained by student
1 4
th
Q3 Marks obtained by 38 student
Q3 6 (Answer).
Calculate the Decile for the following the marks obtained by 9 students
are given below:
xi 45 32 37 46 39 36 41 48 36
Arranged the observation in ascending order
32, 36, 36, 37, 39, 41, 45, 46, 48.
1
where j=1, 2, 3, 4, 5, 6, 7, 8, 9.
4n
th
student
Put j=4, n=9, D4 Marks obtained by
1 10
4 9
th
th
D4 Marks obtained by 4 student
D4 37 (Answer).
1
f
i1
i =50
6 50
th
D6 Marks obtained by 1
10
student
300
th
D6 Marks obtained by 10 student
1
th
D6 Marks obtained by 31 student
Q3 5 (Answer).
Calculate the percentile for the following the marks obtained by 9 students
are given below:
xi 45 32 37 46 39 36 41
48 36
Arranged the observation in ascending order
1
where j=1, 2, …, 99.
68n
th
D4 45 (Answer).
f
i1
i =50
49 50
th
P49 Marks obtained by 100 student
1
th
2450
P49 Marks obtained 100 student
by 1
P49 Marks obtained by 24thstudent
Q3 4 (Answer).
Numerical example of Quartile, Decile and Percentile for continuous grouped
data:
Class boundaries Midpoints xi Frequency fi Cumulative frequency c. f
29.5---39.5 34.5 8 8
39.5---49.5 44.5 87 95
49.5---59.5 54.5 190 285
59.5---69.5 64.5 304 589
69.5---79.5 74.5 211 800
79.5---89.5 84.5 85 885
89.5---99.5 94.5 20 905
n
f
i1
i =905
10
Q2 69.5 678.75 589
211
897.5
Q 69.5
2
211
Q2 (Answer)
73.7535
10
D4 59.5 362 285
304
770
D 59.5
4
304
D4 (Answer)
62.0328
h 8n
D8 l C
f 10
8n 8 905 =724
=
10 10
D8 69.5 10
724 589
211
1350
D 69.5
8
211
D8 (Answer)
75.8981
18
905 =162.9
100
P18 49.5 10
162.5 95
190
675
P 49.5 53.05 (Answer).
18
190
h 76n
P76 l C
f 100
76
905 =687.8
100
P76 69.5 10
687.8 589
211
The main object (purpose) of the average is to give a bird’s eye view
(summary) of the statistical data. The average removes all the unnecessary
details of the data and gives a concise (to the point or short) picture of the
huge data under investigation.
Average is also of great use for the purpose of comparison (i.e. the
comparison of two or more groups in which the units of the variables are
same) and for the further analysis of the data.
Averages are very useful for computing various other statistical measures
such as dispersion, skewness, kurtosis etc.
Requisites (desirable qualities) of a Good Average: An average will be
considered as good if:
It is mathematically defined.
It utilizes all the values given in the data.
It is not much affected by the extreme values.
It can be calculated in almost all cases.
It can be used in further statistical analysis of the data.
It should avoid to give misleading results.
Mode in case of Ungrouped Data: “A value that occurs most frequently in a data
is called mode”
OR
“if two or more values occur the same number of times but most frequently than the
other values, the there is more than one whole”
“If two or more VAlues occur the SAme number of times but most frequently
thAn the other vAlues, then there is more thAn one mode”
Mode in case of Discrete Grouped Data: “A value which has the largest frequency
in a set of data is called mode”
xi : 2, 3, 8, 4, 6, 3, 2, 5, 3.
Mode = 3 (Answer).
No. of assistants fi
0 3
1 4
2 6
3 7
4 10
5 6
6 5
7 5
8 3
9 1
n
f
i1
i =50
Mode = 4
f
i1
i =905
Purpose: