Professional Documents
Culture Documents
CHAPTER 3-Basic Statistics
CHAPTER 3-Basic Statistics
Cont…
For example, you want to describe the age of
students attending the Adama Science
Technology University. Therefore if you randomly
ask 700 students for their age, the data will be
as follows:
10/25/2021
10/25/2021
Cont…
5 10
xi x1 x2 x3 x4 x5 , xi x4 x5 x6 x7 x8 x9 x10
i 1 i 4
x
i 1
i x1 x2 x3 x4 x5 5 7 7 6 8 33
Properties of Summation
10/25/2021
Exercise
10/25/2021
10/25/2021
10/25/2021
x i
x1 x2 ... xn
x i 1
10/25/2021 n n
Cont..
If we take an entire population the mean is denoted by μ and
is given by: n
x x x ... x i
i 1
1 2 N
N N
Where N stands for the total number of observations in the
population.
Example 1: Find the mean of the mark of 9 students (out of
100) given below: 52, 75, 70, 67, 35, 52, 70, 70, and 49.
Solution: n = 9
n
x i
x1 x2 ... x9 52 75 70 67 35 52 70 70 49 540
x i 1
60.
n 9 9 9
Exercise:
10/25/2021
Find the mean of the following data: 10.5, 2.4 ,3.6, 5.9 & 8.7
f i xi
f x f 2 x2 ... f k xk fx i i k
x i 1
1 1 i 1
, n fi
k
f1 f 2 ... f k
f
n i 1
i
i 1
No of 1 2 3 4 5 6 7 Total
children
frequency 5 9 12 17 14 10 6 73
10/25/2021
Solution:
k
fx i i
5 1 9 2 ... 6 7 299
x i 1
4.09 4
n 73 73
f i xi
x i 1
where n
k is the number of classes
n is total frequencies
xi is the ith class mark
10/25/2021
Cont…
Example: Find the mean for the following continuous
data.
C.L 1-5 6-10 11-15 16-20 21-25 26-30 31-35 Total
fi 4 8 12 6 3 4 3 40
C.M (xi) 3 8 13 18 23 28 33
fixi 12 64 156 108 69 112 99 620
f i xi
4 3 8 8 ... 3 33
x i 1
n 40
620
10/25/2021
15.5
40
Cont…
Exercise
1. The following table gives the daily wages of
laborers. Calculate the average daily wages paid
to a laborer.
Wages in dollar 11-13 13-15 15-17 17-19 19-21 21-23 23-25
Number of 3 4 5 6 6 4 3
n=31
laborer
C.M 12 14 16 18 20 22 24
2.
10/25/2021
1. The algebraic n
sum of deviations from the mean is always
zero. i.e. ( xi x) 0
i 1
2. The sum of squares of deviations from the mean is
minimum. i.e. ( x A) when A x .
n
2
i
i 1
Cont…
Example: The mean of 200 observations was 50. Later on,
it was discovered that two observations were wrongly
read as 92 and 8 instead of 192 and 88. Find the correct
mean.
Solution: n = 200, wrong mean = 50
wrong values = 92+8 = 100
correct values = 192+88 = 280
Correct values - wrong values
Correct Mean Wrong Mean
n
280 - 100
50 50.9.
200
10/25/2021
Cont…
5. Combined mean:
n1 x1 n2 x2 ... nk xk
xc
n1 n2 ... nk
Example: Last year there were three sections taking Probability &
Statistics course course in ASTU. At the end of the semester,
the three sections got average marks of 80, 83 and 76. There
were 28, 32 and 35 students in each section respectively. Find the
mean mark for the entire students.
n1 x1 n2 x2 n3 x3 7556
xc 79.54
n1 n2 n3 95
10/25/2021
Weighted Mean ( 𝒙𝒘 )
In the calculation of arithmetic mean, all items
were assumed to be of equally importance.
That is, each value in the data set has equal weight.
When the observations have different weight, we
use weighted average.
Weights are assigned to each item in proportion
to its relative importance.
If 𝑥1 , 𝑥2 ,…, 𝑥𝑛 represent values of the
observations and 𝑤1, 𝑤2 ,…, 𝑤𝑛 are the
corresponding weights, then the weighted mean is
given by
10/25/2021
Cont…
n
w x i i
w1 x1 w2 x2 ... wn xn
xw i 1
n
w1 w2 ... wn
w
i 1
i
Example: Suppose that a student was registered for five courses with 4, 4,
3, 2 and 3 credit hours and she obtained grades B, A, C, D and A,
respectively. Find her GPA. n
wi xi
48
x w GPA i 1n 3.0
wi
16
i 1
Exercise
1. A student’s final mark in Mathematics, Basic Statistics, Accounting
and Operation Mgmt are respectively 82, 80, 90 and 70.If the respective
credits received for these courses are 3, 5, 3 and 1, determine the
approximate average mark the student has got for one course.
2. If a student gets A in 4 cr. hrs, B in 3 cr. hrs and D in 2 cr. hrs courses,
what is his GPA in this semester?
10/25/2021
10/25/2021
Example
Find the G. M of a) 3 and 12 b) 2, 4 and 8
Solution:
a) 𝐺. 𝑀 = 𝑥1 . 𝑥2 = 3 × 12 = 36 = 6
3 3
b) 𝐺. 𝑀= 3 𝑥1 . 𝑥2 . 𝑥3 = 2×4×8 = 64 = 4
Properties of geometric mean
• It is less affected by extreme values.
• It takes each and every observation into consideration.
• If the value of one observation is zero its values
becomes zero.
10/25/2021
Harmonic Mean
It is a suitable measure of central tendency when the data
relates to speed, rate and time.
The harmonic mean of n values is defined as n divided by
the sum of their reciprocal.
𝒏
𝑯. 𝑴 =
𝟏 𝟏 𝟏
+
𝒙𝟏 𝒙𝟐 + ⋯ + 𝒙𝒏
H.M for discrete and continuous data:
𝒏
𝑯. 𝑴 = 𝒘𝒉𝒆𝒓𝒆 𝒏 = 𝒇𝒊
𝒇𝟏 𝒇 𝟐 𝒇𝒌
𝒙𝟏 + 𝒙𝟐 + ⋯ + 𝒙𝒌
For continuous data, xi is the ith class mark.
10/25/2021
Median
o It divided a given set of data into two equal parts
o It is obtained by arranging the data in an increasing or decreasing
order of magnitude
o It denoted by 𝑥
Case 1: Median for individual series of dat a
To determine the median:
arranging the data in an increasing or decreasing order
Identify the total number of observations is either odd or even.
Then,
(𝑛:1
2
)𝑡ℎ 𝑣𝑎𝑙𝑢𝑒 𝑖𝑓 𝑛 𝑖𝑠 𝑜𝑑𝑑
𝑥= (𝑛 2 )𝑡ℎ 𝑣𝑎𝑙𝑢𝑒 + (𝑛 2 +1)𝑡ℎ 𝑣𝑎𝑙𝑢𝑒
𝑖𝑓 𝑛 𝑖𝑠 𝑒𝑣𝑒𝑛
2
10/25/2021
Cont…
Example: Find the median of the following discrete
data
Number of 1 2 3 4 5 6 7 Total
children
fi 5 9 12 17 14 10 6 73
L.C.F 5 14 26 43 57 67 73
Solution: n = 73 is odd
73+1 th
𝑥 = (𝑛:1
2
)𝑡ℎ 𝑣 = ( ) v= 37th v = 4
2
10/25/2021
Solution: n = 40
𝑛 40
= = 20.
2 2
The minimum L.C.F greater than or equal to 20 is 24.
Therefore, the 3rd class is the median class.
Thus, 𝐿𝑚𝑒𝑑 =10.5, w=5, 𝑓𝑚𝑒𝑑 =12 , C.F = 12
𝑥 = 𝐿𝑚𝑒𝑑 + 𝑓 𝑤 𝑛2−𝐶.𝐹 =10.5+12 5
20−12 = 13.83
𝑚𝑒𝑑
10/25/2021
Merits of median
• It is less affected by extreme values.
• Median can be calculated even in case of open-ended
intervals.
• It can be computed for ratio, interval, and ordinal
level of data.
Demerits of median
Its value is not determined by each & every
observation.
It is not a good representative of the data if the
number of items (data) is small.
The arrangement of items in order of magnitude is
sometimes very boring process if the number of items
is very large.
10/25/2021
Wages in 126 and 127-135 136-144 145-153 154-162 163-171 172 and
Birr below above
No. of 3 5 9 12 5 4 2
Employees
10/25/2021
Mode
It is the third measure of central tendency.
The mode is the value that occurs most often in the data
set.
The mode is the value with the highest frequency
It denoted by 𝑥 (read as “x-hat”).
A data set may not have a mode or may have more than
one mode.
A data set that has only one value that occurs with the
greatest frequency is said to be unimodal.
If a data set has two values that occur with the same
greatest frequency, both values are considered to be the
mode and the data set is said to be bimodal.
10/25/2021
Cont…
If a data set has more than two values that occur with the
same greatest frequency, each value is used as the mode,
and the data set is said to be multimodal.
Example: Find the mode of the following data.
Data X: 3, 4, 6, 12, 31, 8, 9, 8. The Mode (𝑥 ) = 8
Data Y: 6, 8, 12, 13, 11, 12, 6. The Mode (𝑥 ) = 6 and 12
Data Z: 2, 6, 3, 5, 7, 8, 12, 11. No Mode
Exercise: The marks obtained by ten students in a semester
exam in statistics (out of 100) are: 70, 65, 68, 70,75, 73, 80,
70, 83 and 86. Find the mode of the students’ marks.
10/25/2021
10/25/2021
Merits of mode
Mode is not affected by extreme values.
We can change the size of the observations without
changing the mode.
It can be computed for all level of data i.e. ratio, interval,
ordinal or nominal.
Demerits of mode
It may not exist.
It does not take every value into consideration.
Mode may not exist in the series and if it exists it may
not be unique.
10/25/2021
Some
10/25/2021
of these are quartiles, deciles and percentiles.
Quartiles
Quartiles: are values which divide the data set in to
approximately four equal parts, denoted by 𝑄1 ,
𝑄2 𝑎𝑛𝑑 𝑄3.
𝑄1- the first quartile (the lower quartile)
- 25% of the observations value is below it.
𝑄2 - the 2nd quartile
- 50% of the observations value is
below/above
𝑄3 - the 3rd quartile (the upper quartile)
- 75 % the observations value is below it.
10/25/2021
10/25/2021
10/25/2021
xi 16 11 12 13 14 15 10 17 18
fi 20 8 25 48 65 40 2 9 2
Solution:
10/25/2021
1(𝑛+1) 𝑡ℎ (219+1) 𝑡ℎ
𝑄1 = 𝑣 = 𝑣 = 55th v = 13.
4 4
2(𝑛+1) 𝑡ℎ 2(219+1) 𝑡ℎ
𝑄2 = 𝑣 = 𝑣 = 110th v = 14
4 4
3(𝑛+1) 𝑡ℎ 3(219+1) 𝑡ℎ
𝑄3 = 𝑣 = 𝑣 = 165th v = 15
4 4
10/25/2021
where
𝐿𝑄𝑖 is the LCB of the ith quartile class,
𝑓𝑄𝑖 is frequency of the ith quartile class,
𝐶. 𝐹 is the L.C.F of the class immediately
preceding the ith quartile class
𝑄1 = 𝐿𝑄1 + 𝑓𝑤 𝑛
4
−𝐶.𝐹 , 𝑄2 = 𝐿𝑄2 + 𝑓𝑤 2𝑛
4
−𝐶.𝐹 and
𝑄1 𝑄2
𝑄3 = 𝐿𝑄3 + 𝑓𝑤 3𝑛
−𝐶.𝐹
𝑄3 4
10/25/2021
fi 4 8 15 5 9 5 4
L.C.F 4 12 27 32 41 46 50
Solution:
𝑄1 : 𝑛4 = 50
4
=12.5.
𝑄2 : 2𝑛 2×50
4
=
4
=25.
10/25/2021
𝑄3 : 3𝑛
4
=
3×50
4
= 37.5.
Thank you!!!
10/25/2021