Professional Documents
Culture Documents
UNIT 3 Stat I
UNIT 3 Stat I
3.1 INTRODUCTION
Averages should be computed with sufficient ease and rapidity or averages should not involve
more of mathematical complexities. The most popular and widely used measure for representing
the entire data by one value is arithmetic mean.
3.2 SUMMATION NOTATION AND ITS PROPERTIES
Summation operator,, implies that the values that follow it are to be summed or added together.
n upper lim it
5
Example x = x1 + x2 + x3 + x4 + x5
i 1
i
Properties:
x yi
n n n n n n
i 1
i xi
i 1
yi
i 1
, xi
i 1
yi xi
i 1
y
i 1
i
Example: Suppose x1 = 1, x2 = 3, x3 = 4, y1 = 2 , y2 = 5 , y3 = 3
/
x yi
3 3 3
Then
i 1
i xi
i 1
y
i 1
i
(x1 + y1) + (x2 + y2) + (x3 + y3) = (x1 + x2 + x3) + (y1 + y2 + y3)
(1 + 2) + (3 + 5) + (4 + 3) = (1 + 3 + 4) + (2 + 5 + 3)
3 + 8 + 7 = 8 + 10
18 = 18
3 3 3
xi
i 1
yi xi
i 1
y
i 1
i …… left for the student
Page 1 of 24
2. Multiplication by a constant
n n
kx
i 1
i k xi
i 1
14 + 35 + 28 = 7(11)
77 = 77
3. Summation of a constant
n
Case 1: If lower limit equal to 1, k nk
i 1
6 46
i 1
6+ 6 + 6 + 6 = 24
k n m 1k
im
k n m 1k
im
8 6 4 18
i4
8 + 8 + 8 = 3(8)
24 = 24
Page 2 of 24
4. Sum of summations
n k n
x x
i 1
i
i 1
i x
i k 1
i , for any k < n
xi xi
i 1 i 1
x
i k 1
i
3 6 6
xi
i 1
xi
i 4
x
i 1
i
xi 10, x 148
2
i
Exercise 1 Let i 3 i 3 , ,
6 6 6
xi xi x (x 2)
2
Find i. ii. iii. i i
i 1 i 1 i 1
6 2
iv.
i 1
( 2 xi 3) 2 v.
i 1
( x i 4)
3.3.0 Definition
The arithmetic mean is the sum of the values in a group divided by the number of items in that
group. Let x1, x2… xn be n values of a variable x, and then their arithmetic mean is defined by:
n
x x 2 xn x i
x
x 1 i 1
n n n
Where x –sum of all observations
n –total number of observations
Page 3 of 24
3.3.1 Computation of Arithmetic Mean for Ungrouped and Grouped Data
x i
d
Direct method: x i 1
Short cut method: x A
n n
Where n – number of items A = Assumed mean d = sum of deviations i.e. (xi-A)
Solution:
8
8 x i
180.4
i. xi = 180.4, n = 8
i 1
x i 1
n
8
22.55
ii. Let A = 22 then di : 1.4, -6.4, 0.1, -2, 4.7, 9.4, -3.1, 0.3
8
8 d i
4.4
di = 4.4, n = 8 x A
i 1
i 1
n
22
8
= 22 + 0.55 = 22.55
Discrete Series:
n
fx i i
fx fd
Direct method: x i 1
Short cut method: x A
n n n
Where f - frequency d - deviation of items from assumed mean (xi – A)
A - assumed mean n - number of observations
Example: Given data of 50 students of marks of a test in a class. Calculate the arithmetic mean
by i. direct method ii. Short cut method.
Marks 20 30 40 50 60 70
No. of students 8 10 16 8 5 3
Page 4 of 24
Solution:
40 16 640 0 0
50 8 400 10 80
60 5 300 20 100
70 3 210 30 90
50 2010 10
i. x
fx
2010
40.20
n 50
ii. x A
fd 40
10
40.20
n 50
fcm fd '
Direct method x Step deviation method x A c
i
n n
Where,
f – Frequency
n – Number of observation
Cmi –Class mark
A – Assumed mean
d – Deviation of class marks from assumed mean, i.e d=cmi – A)
c – class width
d ' =d
d/c
Page 5 of 24
Example: In a survey, the number of persons at different ages is found as follows:
Age in Year 5 - 15 15 - 25 25 - 35 35 – 45 45 - 55 55 - 65
No. of Persons 8 10 14 20 16 12
Solution:
25 - 35 14 30 420 0 0 0 0
35 - 45 20 40 800 10 200 1 20
45 - 55 16 50 800 20 320 2 32
55 - 65 12 60 720 30 360 3 36
80 3020 620 62
i. x
fcm 3020 37.75
n 80
ii. x A
fd 30
620
30 7.75 37.75
n 80
fd 1
iii. x A c 30 62 10 30 0.775 10 37.75
n 80
Exercise 2 Find the arithmetic mean of the following data.
i. 53 54 52 32 30 60 47 46 35 29
ii.
iii.
Classes 10 - 15 15 - 20 20 - 25 25 - 30 30 - 35
Frequencies 5 6 7 7 5
Page 6 of 24
3.3.3 Combined (Pooled) Arithmetic Mean
N x N 2 x2 N n xn N x i i
For n number of groups, xc 1 1 i 1
N1 N 2 N n n
N
i 1
i
Example: The mean height of 25 male and 20 female is 161.0cm and 155.6cm. What will be
the combined mean height?
xm = 161.0cm, xF = 155.6cm, NM = 25, NF = 20
xm N m x F N F
xc
Nm NF
161.0 25 155.6 20 7137
xc 158.60cm
25 20 45
Exercise 4 In a factory, 120 workers get an average wage of birr 30 a day, 160 workers get Birr
50 a day, 80 workers get Birr 60 a day and 40 workers get birr 80 a day. Find:
i. The average of averages.
An item or value may be relatively more important or less important than other items. This
relative importance is technically known as weight. In case where the relative importance of the
different items is not the same we compute weighted arithmetic mean.
If w1, w2, …, wn are weights attached to the values x1, x2, … , xn respectively, then the weighted
AM is defined as
xw
x1 w1 x2 w2 xn wn
wx
w1 w2 wn w
Page 7 of 24
Example: An auto ride costs Birr 5 for the first km, Birr 4 for the next 3kms and Birr 9 for each
of the subsequent kms. Find the average cost per km for 10 kms.
4.00 3 12.00
9.00 6 54.00
10 71.00
xw
xw
71.00
7.10Birr
w 10
1. The average mark of 100 students was found to be 40 but latter it is discovered that a score of
33 was misread as 83. Find the correct average corresponding to the correct sum.
Page 8 of 24
x 40
xi x N 40 100 4000 Wrong sum
N 100
Wrong Entry = 83
Correct Entry = 33
4000 83 33 3950
Correct Mean 39.5
100 100
2. The average of a class having 35 pupils is 14 years. When the age of the class teacher is
added to the sum of the ages of the pupils, the average rises by 0.5 year. What must be the
age of the teacher?
x 14
xi 14 35 490 … Sum of ages of the pupils
N 35
x 14.5
xi 14.5 36 522 … Sum of ages of the pupils and the teacher
N 36
Exercise 6 The mean of 200 items is 50. Later on it is discovered that two items were wrongly
taken as 92 and 8 instead of 192 and 88. Find out the correct mean.
Exercise 7 The average rainfall for a week, excluding Sundays, was 10cm. Due to heavy
rainfall on Sunday, the average rainfall for the week rose to 15cm. How much rain fall
was there on Sunday?
Page 9 of 24
Geometric Mean
The geometric mean like arithmetic mean is calculates an average. It is used when observed values
are measured as ratios, percentages, proportions, indices or growth rates.
Geometric mean for individual series: The geometric mean, G.M. of an individual series of
positive numbers (> 0) , is defined as the nth root of their product.
G.M n x1.x2 xn
∑ )
Solution: a) GM 3 12 36 6 ; b) GM= √ √ =4
Example 3.8: Compute the geometric mean of the following values: 3, 3, 4, 4, 4, 5, 6 and 6.
Solution
Values 3 4 5 6
Frequency 2 3 1 2
G.M. = √ = 4.236
Page 10 of 24
Geometric mean for continuous grouped FD:- The above formula can also be used whenever
the frequency distribution is grouped continuous, class marks of the class intervals are
considered as xi.
Harmonic Mean
It is a suitable measure of central tendency when the data pertains to speed, rate and time. The
harmonic Mean of n values is defined as n divided by the sum of their reciprocal.
Harmonic mean for individual series: If , are n observations, then harmonic
mean can be represented by the following formula:
Example 3.9 A man travels from Addis to Awassa by a Motorbike and takes 4 hours
to cover the whole distance. In the first hour he travels at a speed of 50 km/hr, in
the second hour his speed is 64 km/hr, in third hour his speed is 80 km/hr and in
the fourth hour he travels at the speed of 55 km/hr. Find the average speed of the
motorist.
Solution
50 65 80 55 Total
⁄ 0.0200 0.0154 0.0125 0.0182 0.0661
Page 11 of 24
Where,
No of Cars 3 4 8 9 2
Solution:
130 3 0.0231
135 4 0.0091
140 8 0.0571
145 9 0.0621
150 2 0.0133
Total 26 0.1648
2-6 4 10 ⁄ 2.5
6-10 8 12 ⁄ 1.5
10-14 12 18 ⁄ 1.5
Total n=40 5.5
Page 12 of 24
∑
MODE
A value, which occurs most frequently in a series of observations, is called Mode. So by looking
the observations mode can be identified. It is the value, which has the greatest frequency density
in its immediate neighborhood.
For ungrouped data: Mode x̂ = that value in the data set, which occurs most often.
For grouped data: Discrete Series: Mode x̂ = the value of the variable corresponding to the
maximum frequency.
Continuous Series: The class corresponding to the maximum frequency is called the modal
class. The value of mode is obtained by the following interpolation formula.
Page 13 of 24
f1 f 0
Mode xˆ l c
f1 f 0 f1 f 2
or
1
Mode xˆ l c
1 2
Where l – LCB of the modal class f2 – frequency succeeding f1
f1 – maximum frequency C – magnitude of the class
f0 – frequency preceding f1 ∆1 = f1 – f0
f2 – frequency succeeding f1 ∆2 = f1 – f2
i. 25, 15, 23, 40, 27, 25, 23, 25, 20, 19, 22, 24, 25
ii.
x 10 20 30 40 50 60
f 4 9 16 25 22 15
iii.
Classes 0-9 10 - 19 20 - 29 30 - 39 40 - 49 50 – 59 60 - 69 70 - 79
Solution:
Page 14 of 24
l = 19.5 f0 = 350 f1 = 720 f2 = 664 c = 10
f1 f 0
Mode xˆ l c
f1 f 0 f1 f 2
720 350 3700
19.5 10 19.5 28.1854
720 350 720 664 426
i. 27, 33, 42, 25, 23, 27, 25, 33, 27, 28, 16, 18
ii.
No. of Persons 4 6 10 8 20 24 9 5
iii.
Classes 0 - 400 400 - 800 800 - 1200 1200 - 1600 1600 - 2000
Fi 4 12 40 41 27
MEDIAN
The median is as its name indicates the middle most value in the arrangement which divides the
data into two equal parts. It is obtained by arranging the data in an increasing or decreasing order
of magnitude and denoted by ̃.
It is the value which cuts a given data in to two. It is the mid point of a data.
Page 15 of 24
N 1
th
Median ~
x vallue of the item (where n is odd)
2
xn 1
2
For
1
th th
N N
Median x
~
Value of item Value of 1 item Where n is even
2 2 2
1
xn xn 1
2 2 2
grouped data:
Discrete Series:
i. a) 27, 33, 42, 25, 23, 25, 33, 28, 27, 16, 18, 12
Page 16 of 24
b) 8, 5, 2, 6, 15, 10, 25
ii.
x 4 6 8 10 12 14 16
F 2 4 5 3 2 4 1
iii.
fi 20 21 50 40 53 16
Solution:
i. a. Rearranging:
12 16 18 23 25 25 27 27 28 33 33 42
n = 12 … even
1
~
x xn xn 1
1
x6 x7 1 25 27 26
2 2 2 2 2
b. Rearranging: 2 5 6 8 10 15 25
n = 7 … odd
~ x
x n 1 x4 8
2
ii.
x 4 6 8 10 12 14 16
f 2 4 5 3 2 4 1
<cfi 2 6 11 14 16 20 21
th
n = 21 Median = The value of N + 1 item
2
th
= 21 + 1 item
2
= The value of the 11th item
= 8
Page 17 of 24
iii.
c n
Median ~
x l 100 91 80 9 82.25
10
c. f 80
f 2 40 4
Exercise 15 Find the median of the following data.
i. a. 20 15 21 13 22 24 22 25 26 27 25
b. 120 125 112 137 129 127
ii.
x 28 30 32 34 36 38 40 42
f 14 15 16 24 16 10 6 4
iii.
X 30 - 34 35 - 39 40 - 44 45 - 49 50 - 54 55 - 59
fi 5 10 15 20 6 4
38 51 55 62 67
40 51 56 62 69
41 52 56 63 69
45 52 57 64 71
48 53 59 65 77
48 54 59 66 78
50 55 59 66 79
50 55 62 67 79
Page 18 of 24
QUARTILES, DECILES AND PERCENTILES
Quartiles: are the three values, which divide the given data in to four equal parts. They are
denoted by Q1, Q2 and Q3.
Q1 - The lower or first quartile. It covers 25% of the distribution.
Q2 - The middle or second quartile. It covers 50% of the distribution.
Q3 - The upper or third quartile. It covers 75% of the distribution.
Deciles: are the nine values, which divide the series in to ten equal parts. They are denoted by
D1, D2, … , D9.
D1 = Covers 10% of the distribution
D2 = Covers 20% of the distribution ……. .
D9 = Covers 90% of the distribution
Percentiles: are the 99 values, which divide the series in to 100 equal parts. They are denoted by
P1, P2 , … , P99.
First, for ungrouped data, rearrange the values in the order of magnitude and for discrete series,
compute the <Cfi column. Then apply the following formula.
i N 1
th
Q i value of item
4
i N 1
th
D i value of item
10
i N 1
th
Pi value of item
100
Page 19 of 24
Example:
38 51 55 62 67
40 51 56 62 69
41 52 56 63 69
45 52 57 64 71
48 53 59 65 77
48 54 59 66 78
50 55 59 66 79
50 55 62 67 79
1
th th
N N
Median x
~
Value of item Value of 1 item
2 2 2
1
x x
2 2 1
n n
2
1 40
th
40
th
Median ~x item 1 item
2 2 2
1
2
20th item 21 th
item
1
116
2
58
Page 20 of 24
i N 1
th
Q i value of item
4
i N 1
th
D i value of item
10
i N 1
th
Pi value of item
100
i N 1
th
Qi item
4
i 40 1
th
Q1 10.25
4
I.e. in between 10th and 11th item
10th item = 51 and 11th item =52
The difference =52-51=1
0.25(1) =0.25
Q 51.25
1
i N 1
th
Qi item
4
3 40 1
th
Q3 30.75
4
I.e. in between 30th and 31th item
65+0.75 (66-65)
Q 65.75
3
Page 21 of 24
Marks Number of Cumulative
students frequency
2 1 1
1 1 2
4 1 3
5 3 6
6 4 10
7 4 14
8 4 18
9 2 20
N=20
i N 1
th
Qi item
4
i 20 1
th
Q1 5.25
4
Since it is not existed in the <cf column we will take the next one
For continuous series:
Example: For the data given below, compute the value of Quartiles, D3, D7, P15 and P88 and
interpret.
Page 22 of 24
Marks Below 10 10 - 20 20 - 40 40 - 60 60 - 80 Above 80
No. of Students 10 15 25 30 14 6
<cfi 10 25 50 80 94 100
Solution:
th
N
Q1 – size of item = 25th item 10 – 20 quartile class
4
l = 10, c = 10, f = 15, c.f = 10
c n
Q1 l c. f 10
10
25 10 20
f 4 15
Mark of 25% of students is less than 20.
th
2N
Q2 – size of item = 50th item 20 – 40 quartile class
4
l = 20, c = 20, f = 25, c.f = 25
c n
Q2 l c. f 20
20
50 25 40
f 2 25
Mark of half of students is below 40.
th
3N
Q3 – size of item = 75th item 40 – 60 quartile class
4
l = 40, c = 20, f = 30, c.f = 50
c 3n
Q3 l c. f 40
20
75 50 56.6
f 4 30
3 th
Mark of of students is below 73.33.
4
th
3N
D3 – size of item = 30th item 20 – 40 decile class
10
L = 20, c = 20, f = 25, c.f = 25
c 3n
D3 l c. f 20
20
30 25 24
f 10 25
Mark of 30% of students is below 24.
Page 23 of 24
th
7N
D7 – size of item = 70th item 40 – 60 docile class
10
L = 40, c = 20, f = 30, c.f = 50
c 7n
D7 l c. f 40
20
70 50 53.33
f 10 30
Mark of 70% of students is below 53.33.
th
15N
P15 – size of item = 15th item 10 – 20 percentile class
100
L = 10, c = 10, f = 15, c.f = 10
c 15n
P15 l c. f 10
10
15 10 13.3
f 100 15
Mark of 15% of students is below 13.3.
th
88N
P88 – size of item = 88th item 60 – 80 percentile class
10
L = 60, c = 20, f = 14, c.f = 80
c 88n
P88 l c. f 60
20
88 80 71.43
f 100 14
Mark of 88% of students is below 71.43.
Exercise 16 Compute the value of Quartiles, D4, P69 and interpret for the data given below.
i. 46,35,28,52,54,43,35, 49, 46, 50,41
ii.
Daily Wages 40 45 50 55 60 65 70
No. of Workers 9 22 26 18 13 8 5
iii.
No. of 8 10 15 25 40 20 15 7
Houses
Page 24 of 24