Professional Documents
Culture Documents
Lecture 3 PDF
Lecture 3 PDF
&
Research
Methodology
Dr. Sybil Rose
Basic Biostatistics
Measures of central tendency
2/9/23 2
Measures of Central Tendency
1. Mean - average (arithmetic mean)
2/9/23 3
To avoid biased reporting central tendency must be addressed
tendency of a group of values- that is, the tendency for values in a group
4
Formulas for Mean: (arithmetic mean)
𝛴Xi
X9 = -----
n
2/9/23 5
𝛴Xi
Mean X9 = -----
n
The mean is the sum of all the values in a data set, divided by the
2/9/23 6
To calculate the mean: 𝛴Xi
X9 = -----
n
ü Sum up all the values.
2/9/23 7
Mean of the ungrouped data:
Example:
The results of HbA1c of patients with diabetes is; 4.0, 5.4, 4.6, 6.0.
Calculate the mean of the data?
Sum of all data values
Mean = --------------------------------
Number of data values
Symbolically,
𝛴x Where x̄ (read as ‘x bar’) is the mean of the set
of x values, 𝛴 x is the sum of all the x values, and
x̄ = ----- n is the number of x values
2/9/23
n 8
Result
(4.0+5.4+4.6+6.0)
Mean = ----------------- = 20/4 = 5
4
v Data set is 4, 7, 5, 9, 5.
2/9/23 10
Result
4+7+5+9+5
M = ---------------- = 6
5
10+12+16+14
M = ---------------- = 13
4
2/9/23 11
Mean of the grouped data
In calculating the mean from grouped data, we assume that all
𝛴mi%i
M = -----
2/9/23
𝛴%i 12
Example:
Age fi mi mifi
15-19 11 17 187
Where
k= the number of class intervals
20-24 36 22 792 mi= the mid-point of the ith class interval
25-29 28 27 756 fi= the frequency of the ith class interval
30-34 13 32 416
35-39 7 37 259
Mean = 2630/100 = 26.3
40-44 3 42 126
45-49 2 47 94
No specific amount of trimming is always best, but 20% trimming is often a good
choice in the literature. This means that the smallest 20%, as well as the largest 20%,
are trimmed and the average of the remaining data is computed. Although there are
2/9/23 14
Computation of trimmed mean:
• First compute 0.2 x n
• Round down to the nearest number, call this result g,
• The formula of 20% trimmed mean is given by:
1
X t = ----------- (X (g+1) +· · ·+X(n−g))
n−2g
2/9/23 15
Example
Data values are:
46,12,33,15,29,19,4,24,11,31,38,69,10
2/9/23 16
Ordered Data:
4,10,11,12,15,19,24,29,31,33,38,46,69.
The number of values is n = 13, 0.2(n) = 0.2(13) = 2.6,
• Rounding this down to the nearest integer yields g = 2.
• That is, trim the two smallest values, 4 and 10, trim the two
largest values, 46 and 69
• Average the numbers that remain yielding.
1
M t = ----------- (11+12+15+19+24+29+31+33+38) = 23.56.
9
2/9/23 17
Median
It is the second measure, is the middle number of a set of numbers
2/9/23 18
To calculate the median of the
ungrouped data?
• First arrange the values in order of size and then Zind the
middle value.
• If the number of observations, n, is even, Then location of
the sample median is, m=n/2. Then the median is the two
middle numbers divided by 2. Or we can use the formula m
= (n+1)/2 for both odd an even.
• If the number of observations, n, is odd, Then the
• location of the sample median is m = (n+1)/2.
2/9/23 19
Finding the location of the median
Median = (n+1)/2
Example 1
Median of the Ungrouped data
Find the median of (13, 3, 20, 22, and 25)
Ordered data: 3, 13, 20, 22, and 25. The median = n+1/2 =
5+1/2 = 3 so the location of the median is third data value
which is = 20
2/9/23 20
Example 2
If there is an even number of values, use the mean of the two
middle values. For example the values 3, 13, 13, 20, 22, 25:
number 3 and 4.
2/9/23 22
Example
• n/2 = 75/2 = 37.5
• Median class interval = 35-44
• Lm = 34.5
• Fc = 35
Age fi Cum. F
5-14 5 5 • W = 10
15-24 10 15 • n = 75
25-34 20 35 • Fm = 22
35-44 22 57 • Median = 34.5 + (37.5 - 35)/ 22 x 10 = 35.64
45-54 13 70
55-64
2/9/23
5 75 23
The mean versus the median
vThe mean is sensitive to outliers
vWhen the data are highly skewed, the median is usually preferred
vWhen the data are not skewed, the median and the mean will be very close
2/9/23 24
Mode
The last measure is the mode, which is the most frequent
occurring number.
Example: 3, 13, 13, 20, 22, 25: the mode = 13. It is usually more
informative to quote the mode accompanied by the percentage
of times it happened; e.g, the mode is 13 with 33% of the
occurrences. In medical research, mean and median are usually
presented. A set can have more than one mode; if it has two, it is
said to be bimodal.
2/9/23 25
Example
Data values:
Ordered data : 1,1,3,3,4,5, 60
The mean is : 77/7 = 11
(n+1) 7+1
Median is = ------ ---- = 4 (loca/on)
2 2
So the median is the fourth data value , m = 3
Mode = most frequent number in the data set Which is = 1 & 3 ,
so the mode is bimodal
2/9/23 26
Mode of the grouped data
D1
Mode = Lo + --------- X Co
D1 + D2
Lo = the lower boundary of the modal class
D1 = difference in frequency between modal class and the one
before
D2 = difference in frequency between modal class and the one
after
Co = the width of the modal class
Note , the modal class is the one that contains the
highest frequency
2/9/23 27
Example
class mi (midpoint) fi fc
9.5 – 13.5 11.5 3 3
13.5 – 17.5 15.5 4 7
17.5 – 21.5 19.5 8 15
21.5 – 25.5 23.5 3 18
25.5 – 29.5 27.5 2 20
Sum 20
Calculate :
Mode , mean and median of the data.
2/9/23 28
Mode, the third class has the largest frequency = 8
Co = (21.5 -17.5) = 4
2/9/23 29
Result
ü Median = 19
2/9/23 30
Thank you
Dr. Sybil Rose
sybil.rose@superior.edu.pk