Professional Documents
Culture Documents
Measures of The Centre Adv
Measures of The Centre Adv
Measures of Central Tendency
• The mode is the data value or datum (or value) which appears the
largest number of times in the set or the most frequently occurring
figure in the set
• If no data value is repeated, we say there is no mode.
Using the following data set;
2.7kg, 3.4kg, 3.0kg, 4.1kg, 5.2kg, 1.9kg, 2.3kg, 3.0kg, 3.3kg, 3.0kg.
The mode is 3.0kg (highest frequency)
The Median
• The median is defined as the middle figure after the data set is ranked
or placed in order of magnitude.
Example
22, 29, 35, 24, 26, 15, 28, 36, 45, 21, 33, 5, 46, 21, 19, 41, 5, 84, 58, 63,
5, 23
Find the median.
Solution
Rank the data in ascending order
5, 5, 5, 15, 19, 21, 21, 22, 23, 24, 26, 28, 29, 33, 35, 36, 41, 45, 46, 58,
63, 84
The Median
• Then pick the two middle numbers (because the total number of
observations is even, i.e. = 22)
5, 5, 5, 15, 19, 21, 21, 22, 23, 24, 26, 28, 29, 33, 35, 36, 41, 45, 46, 58,
63, 84
• The two middle figures are 26 and 28. The average of these two
figures is the median i.e. (26+28)/2 = 27 is the median.
The Arithmetic Mean
• This is another measure of the centre of observations
• The (arithmetic) mean of set of observations is the sum of the
observations divided by the number of the observations
• The mean of a sample data set is denoted by x
• The mean of a population data set is denoted by
The Arithmetic Mean
Example
• The following data are journey time of college students from their
place of residence to College:
17, 30, 14, 16, 26, 15, 27, 18, 26 minutes
• The mean of the journey times is
17+30+14+16+26+15+27+18+26 = 189/9 = 21 minutes
Median and mean of grouped discrete data
No. of letters 0 1 2 3 4 5
per day
Frequency 48 32 17 2 0 1
Cummulative 48 80 97 99 99 100
frequency
Calculate: (a) the median; (b) the mean; of the letter data
• (a) Median. There are 100 observations. The median is half the sum of
the 50th and 51st observations in the ranked order. We see from the
cumulative frequencies that both these observations equal 1. Hence
the median number of letters per day is 1.
• (b) Mean. Of the 100 observations 48 are 0’s, 32 are 1’s, etc. Hence
the mean equals (40x0+32x1+17x2+2x3+0x4+1x5)/100 = 0.77.
Exercises
• Work on the following exercises from Clarke & Cooke
• 2.2.2
• 2.3.2
Median and mean of grouped continuous data
Example
• Calculate (a) the median; (b) the mean, of the following data on the
height in centimetres of 10 plants in pots. The data have been
grouped
(a) Median
• The median is obviously inside the interval (16.5-21.5). If we assume
that the five observations which lie in the interval are equally spread
out with it, and put each at the centre of its own small interval, we
obtain the diagram
Median and mean of grouped continuous data
(a) Median
• From the definition, the median is half the sum of the and
observations. This value is at the end of the third of the five equal
intervals into which the interval (16.5, 21.5) is divided, and is 19.5
Median and mean of grouped continuous data
(a) Mean
• When calculating the mean from grouped continuous data we act as if
all the observations in a given interval are equal in value to the class-
centre of that interval.
• We then proceed as with grouped discrete data. The mean is
therefore
Median and mean of grouped continuous data
Exercise
• Try exercise 2.3.2???
-notation (Sigma notation)
• We can express the definition of the arithmetic mean in a simple
formula using the -notation.
• For example, in our student journey times example, we can represent
each journey time by as below
17 30 14 16 26 15 27 18 26
1. Show that
(i) , (ii) , (iii)
(iv) , (v) , (vi)
2. If (=1, 2, 3) and (=1, 2, 3) take the values shown in the
following table
6 1 2 5 3 4
confirm the following relations
-notation: exercise
2. , where c is a constant;
3. , where c is a constant;
4. , provided
The Arithmetic Mean
n
• Mean is given by x
i 1
i
x
n
Where n is number of observation in the sample
Example
Use the following data set to compute a sample mean
1.65kg, 3.3kg, 4.1kg, 3.0kg, 3.1kg 2.9kg 2.8kg, 3.2 kg, 3.0kg, 3.0kg
The Arithmetic Mean
• No Variability – No Dispersion
Measures of Variation
• There are 3 values that we will look at to measure
the amount of dispersion or variation. (The
spread of the group)
1. Range
2. Standard Deviation
3. Quartile deviation
Why is it Important?
• You want to choose the best brand of
medicine for your patients. You are
interested in how long the drugs takes to
cure a disease. The choices are narrowed
down to 2 different drugs. The results are
shown in the chart. Which drug would
you choose?
Drug A Drug B
The chart 10 35
indicates the 60 45
number of days a 50 30
drug takes to cure 30 35
a particular 40 40
disease. 20 25
210 210
Does the Average Help?
• Drug A: Avg = 210/6 = 35 days
• Range = 100 – 2 = 98
Deviation from the Mean
• A deviation from the mean, x – x , is the difference
between the value of x and the mean x
n 1 S2 i 1
n 1
( xi ) 2 N
(x
2
x
N i ) 2
2
N 2 i 1
N
Standard Deviation
• The standard deviation is the square root of the
variance.
2
s s
Example – Using Formula
• Find the variance of the following
dataset 6, 3, 8, 5, 3 (in hours)
x x 2
6 36
3 9
8 64
5 25
3 9
2
x 25 x 143
2 ( x) 2
x
s2 n
n 1
25 2
143
2 5 143 125 18
s 4. 5
4 4 4
Find the standard deviation
• The standard deviation is the square root of the
variance.
s 4.5 2.12