Professional Documents
Culture Documents
MAT 161 Lesson - 3
MAT 161 Lesson - 3
MAT 161 Lesson - 3
0:19 / 0:19
The measure of Central Tendency
In this lecture, we want to learn how to obtain values from our data which
describes the general location of the data set and will later be used for
statistical analysis.
Arithmetic Mean
The mean x̄ or arithmetic mean. This is a value that is representative of a
set of data. It is defined as the sum of all the numbers in the data divided by
the number in the data.
sum ofallthedata values
x̄ =
number ofallthedata values
0:19 / 0:19
x1 +x2 +...xn
x̄ =
n
, where n=1,2,3...
Σxi
x̄ =
n
where i=1,2,3....n
Where x = value of 1st observation
1
0:19 / 0:19
f1 x1 +f2 x2 +f3 x3 +...+fn xn
x̄ =
f1 +f2 +f3 +...+fn
Σfx
x̄ =
Σf
Example
The following are the weights in a kilogram of six cows bought from Malu
Market, Sabo-Osogbo, 142, 97, 84, 125 127, 130. Calculate the mean weight of
the cows bought.
0:19 / 0:19
Solution:
Σxi
x̄ =
n
142+97+84+125+127+130
x̄ =
6
705
x̄ =
6
=117.5 kg
0:19 / 0:19
Example
Find the arithmetic mean of the following data
x 4 5 6 7 8
f 2 4 2 3 5
Σfx
x̄ =
Σf
2(4)+5(4)+6(2)+7(3)+8(5)
x̄ =
2+4+2+3+5
101
x̄ = = 16.31
16
0:19 / 0:19
Features of a grouped frequency table
Score Frequency
30-39 3
40-49 14
50-59 14
60-69 10
70-79 7
80-89 2
0:19 / 0:19
The table above will explain the following features of a grouped frequency
table.
• Class Limits:
These are the endpoints of a class. Thus, the lower and upper-class limits
of the 3rd class (50-59) of the mark distribution table of Table 1 are 50 and 59
respectively.
• Class Interval:
This is the gap between the beginning and the end values of a class. For
example the interval 40-49 is the class interval of the second class in the table
above.
0:19 / 0:19
• Class Boundary:
This is the point of demarcation between a class and the next lower or higher
class. It is the average of the upper class limit of a class and the lower class
limit of the next higher class. For example, in the table above, the lower class
boundary (l.c.b) of the second class (40-49) is obtained by finding the average
of the upper class limit of the first class and the lower class limit of the class
second class. That is,
39+40
l. c. b = = 39.5
2
0:19 / 0:19
while the upper-class boundary (u.c.b) of the second class (40-49) is
obtained by finding the average of the upper-class limit of the second class
and the lower-class limit of the third class. That is,
49+50
u. c. b = = 49.5
2
So, for the second class, the class boundaries are l.c.b {class} u.c.b That
is,
39.5 {40 - 49} 49.5
0:19 / 0:19
• Class Mid-point:
The average of the lower and upper-class limits or boundaries of a class
is called class mid-point. Thus, the class mid-point of the class (70-79) is
70+79 69.5+79.5
2
= 74.5 or 2
= 74.5
0:19 / 0:19
Example: The weights of 100 logs of wood cut down in a rainforest are given in
the table below. Calculate the mean weight.
0:19 / 0:19
Weight (kg) Class Mid-point f fx
60-62 61 15 915
63-65 64 28 1792
66-68 67 27 1809
69-71 70 22 1540
72-74 73 8 584
Total 100 6640
0:19 / 0:19
First, we need to calculate the class mid-point of each group. For the group
60−62
60-62, the mid-point is 2
= 61 , similarly for other groups 64, 67, 70 and
73.
Σfx
x̄ =
Σf
6640
x̄ = = 66.4kg
100
The value obtained indicates the center of the data set. Thus, x̄ is a
measure of location.
0:19 / 0:19
Weighted Mean
Sometimes the set of data available for computation may be similar but
have different weights. If the numbers x1 , x2 , x3 . . . given the weights
w1 , w2 . . . wn then the weighted mean is
¯
¯¯¯¯¯¯ x1 w1 +x2 w2 +...xn wn
Xw =
w1 ,w2 ...wn
.
Geometric Mean
The geometric mean of n values, x1 , x2 , x3 . . . is given by
1
¯¯¯¯¯¯¯ −−−−−−−−−
XG .
= (√x1 , x2 . . . xn ) n
0:19 / 0:19
\overline{X_G}=n\sqrt{x^f_1_1,x^f_2_2...x^f_n_n}
1
¯¯¯¯¯¯¯ −−−−−−−−−
XG = (√x1 , x2 . . . xn ) Σf
.
Harmonic Mean
The harmonic mean of n values, x1 , x2 , x3 . . . is given by
¯
¯¯¯¯¯¯¯ 1
XH = 1 1 1 1
( + +... )
n x x xn
1 2
0:19 / 0:19
Solution:
¯¯¯¯ 1+2+3+4+5
X =
5
¯¯¯¯ 15
X = = 3
5
¯¯¯¯¯¯¯−−− −−−− 1
XG = (√1.2.3.4.5 ) 5 = 2.61
¯
¯¯¯¯¯¯¯ 1
XH = 1 1 1 1 1 1
( + + + + )
5 1 2 3 4 5
¯
¯¯¯¯¯¯¯ 1
XH = = 2.22
0.45
0:19 / 0:19
The Relationship among Arithmetic, Geometric and Harmonic Means.
The arithmetic mean of a set of data, x1 , x2 . . . xn is greater or equal to
the geometric mean, while the geometric mean is greater or equal to the
¯¯¯¯ ¯¯¯¯¯¯¯ ¯
¯¯¯¯¯¯¯
harmonic mean of the same set of data. That is, X ≥ XG ≥ XH .
Median for Discrete Data
To find the median for a set of discrete data, you follow these steps.
Arrange the data in order of magnitude x1 , x2 . . . xn .
If n is odd, the median is the middle item, i.e the 12 (n + 1)th item.
If n is even, the median is the arithmetic mean of the middle two items,
i.e. the 1
2
nth and 1
2
(n + 2)th items.
0:19 / 0:19
Example
Find the median of:
11, 4, 1, 7, 8, 2, 5, 9, 3, 10, 1
9, 5, 1, 4, 7, 6, 8, 5, 7, 4, 9, 3
Solution
In order: 1, 1, 2, 3, 4, 5, 7, 8, 9, 10, 11
The median is 5.
In order: 1, 3, 4, 4, 5, 5, 6, 7, 7, 8, 9, 9
The median is 1
2
(5 + 6) = 5.5
0:19 / 0:19
Median for Grouped Data
For grouped data of discrete or continuous data, the following graphical
method can be used to estimate the median and quantities.
From the cumulative frequency (CF) distribution. The cumulative
frequency for any class is the sum of frequencies of that class and lower class.
Plot each cumulative frequency against the upper limit of the
corresponding class interval to form the cumulative frequency curve (or
Ogive).
0:19 / 0:19
Example
The table below is the frequency distribution of marks obtained in a test
by 200 students.
Marks 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89
Frequency 18 34 58 42 24 10 6 8
Table 2.1
Draw a cumulative frequency polygon to illustrate the data and estimate the
median from the graph.
0:19 / 0:19
Marks Cf
10-19 18
20-29 52
30-39 110
40-49 152
50-59 176
60-69 186
70-79 192
80-89 200
0:19 / 0:19
From the cumulative frequency distribution table above we have the
Ogive
0:19 / 0:19
Median by calculation:
N
−(Σf )
2 pmedian
M edian(M ) = Lmedian + × C
fmedian
0:19 / 0:19
First, we need to determine the median class from Table 2.1 and the Cf
table. Since the total frequency is 200, the median is lying in the middle value
i.e. 200/2= 100. From the table, the median class lies in cumulative frequency
100 which is class 30-39.
Lm edian = lower class boundary of the median class= 29.5
N= Total frequency=200
(Σf )pmedian =sum of frequencies preceding the median= 18+34=52
.
2
M edian = 29.5 + × 10 = 38marks
58
0:19 / 0:19
Quartiles
The quartiles divide the distribution into four equal parts. The three
corresponding values of the variable are denoted by Q1 , Q2 , Q3
where Q1 (Lower or first quartiles) is at 1
4
N
4
N
3
Q3 (Upper or third quartile) is at 4
N
and calculated as
N
−(Σf )
4 q1
Q1 = Lq1 + × C
f
q1
0:19 / 0:19
Example
Find the Q1 from Table 2.1 above
1 1
Q1 = N = (200) = 50
4 4
200
−18
Q1 = 19.5 +
4
34
× 10 = 28.9 .
200
−18
Q1 = 19.5 +
4
34
× 10 = 28.9 .
3 3
For the third quartile Q3 , Q3 = 4 N = 4 (200) = 150
From the cumulative frequency table, 150 falls into the 40-49 class
From the cumulative frequency table, the lower class boundary of class 40-49
is 39.5
3×200
−110
Q3 = 39.5 +
4
42
× 10 = 49.0 .
0:19 / 0:19
Deciles
The deciles D1 , D2 , D3 , . . . D9 divide the distribution into ten equal parts
1
D1 = N
10
2
D2 = N
10
9
D9 = N
10
Example
Find the 5th decile from Table 2.1 above
5
D5 = (200) = 100
10
From the cumulative frequency table, the 5th decile class is 30-39
0:19 / 0:19
5N
−(Σf )
10 d5
D5 = Ld5 + × C
fd5
5×200
−52
10
D5 = 29.5 + × 10
58
D5 = 29.5 + 8.5 = 38 .
Percentiles
The percentiles P1 , P2 , . . . P99 divide a distribution into one hundred equal
parts.
P1 , P2 , . . . P99
1
P1 = N
100
99
,..,P99
2
P2 = N = N
100 100
0:19 / 0:19
Example
Find the 60th percentile from Table 2.1 above
Solution:
60
P60 = (200) = 120
100
P60 = Lp60 + × C
f
p60
60×200
−110
100
P60 = 39.5 + × 10
42
0:19 / 0:19