Professional Documents
Culture Documents
Measures of Spread
Measures of Spread
The Range is the difference between the lowest and highest values.
So the range is 9 − 3 = 6.
It is that simple!
The single value of 3616 makes the range large, but most values are around 10.
Like this:
Example: 5, 7, 4, 4, 6, 2, 8
Quartile 1 (Q1) = 4
Quartile 2 (Q2), which is also the Median, = 5
Quartile 3 (Q3) = 7
Sometimes a "cut" is between two numbers ... the Quartile is the average of the
two numbers.
Example: 1, 3, 3, 4, 5, 6, 6, 7, 8, 8
Q2 = (5+6)/2 = 5.5
And the result is:
Quartile 1 (Q1) = 3
Quartile 2 (Q2) = 5.5
Quartile 3 (Q3) = 7
Interquartile Range
The "Interquartile Range" is from Q1 to Q3:
Example:
Q3 − Q1 = 7 − 4 = 3
Also:
So now we have enough data for the Box and Whisker Plot:
Q3 − Q1 = 15 − 4 = 11
Percentiles
Percentile: the value below which a percentage of data falls.
If your height is 1.85m then "1.85m" is the 80th percentile height in that group.
In Order
Have the data in order, so you know which values are above and below.
Grouped Data
When the data is grouped:
In the test 12% got D, 50% got C, 30% got B and 8% got A
In other words you did "as well or better than 77% of the class"
(Why take half of B? Because you shouldn't imagine you got the "Best B", or the
"Worst B", just an average B.)
Deciles
Deciles are similar to Percentiles (sounds like decimal and percentile together),
as they split the data into 10% groups:
The 1st decile is the 10th percentile (the value that divides the data so
that 10% is below it)
The 2nd decile is the 20th percentile (the value that divides the data so
that 20% is below it)
etc!
Example: (continued)
Quartiles
Another related idea is Quartiles , which splits the data into quarters:
Example: 1, 3, 3, 4, 5, 6, 6, 7, 8, 8
Q2 = (5+6)/2 = 5.5
Quartile 1 (Q1) = 3
Quartile 2 (Q2) = 5.5
Quartile 3 (Q3) = 7
The Quartiles also divide the data into divisions of 25%, so:
For 1, 3, 3, 4, 5, 6, 6, 7, 8, 8:
Estimating Percentiles
We can estimate percentiles from a line graph .
Example: Shopping
0 0
2 350
4 1100
6 2400
8 6500
10 8850
12 10,000
First draw a line graph of the data: plot the points and join them with a smooth
curve:
Draw a line horizontally across from 3,000 until you hit the curve, then draw a
line vertically downwards to read off the time on the horizontal axis:
So the visits at 11 hours were about 9,500, which is the 95th percentile.
Mean Deviation
How far, on average, all values are from the middle.
Calculating It
Find the mean of all values ... use it to work out distances ... then find the
mean of those distances!
In three steps:
Like this:
3 6
6 3
6 3
7 2
8 1
11 2
15 6
16 7
Which looks like this:
It tells us how far, on average, all values are from the middle.
In that example the values are, on average, 3.75 away from the middle.
Formula
The formula is:
Absolute Deviation
Each distance we calculate is called an Absolute Deviation, because it is
the Absolute Value of the deviation (how far from the mean).
To show "Absolute Value" we put "|" marks either side like this:
|-3| = 3
Absolute Deviation = |x - μ|
Sigma
The symbol for "Sum Up" is Σ (called Sigma Notation ), so we have:
μ = 3 + 6 + 6 + 7 + 8 + 11 + 15 + 168 = 728 = 9
x |x - μ|
3 6
6 3
6 3
7 2
8 1
11 2
15 6
16 7
Σ|x - μ| = 30
Note: the mean deviation is sometimes called the Mean Absolute Deviation
(MAD) because it is the mean of the absolute deviations.
Here is an example (using the same data as on the Standard Deviation page):
Example: You and your friends have just measured the heights of
your dogs (in millimeters):
The heights (at the shoulders) are: 600mm, 470mm, 170mm, 430mm and
300mm.
x |x - μ|
600 206
470 76
170 224
430 36
300 94
Σ|x - μ| = 636
So, on average, the dogs' heights are 127.2 mm from the mean.
A Useful Check
The deviations on one side of the mean should equal the deviations on
the other side.
6+3+3+2+1 = 2+6+7
15 = 15
Likewise:
Example: Dogs
If they are not equal ... you may have made a msitake!
Standard Deviation and
Variance
Deviation just means how far from the normal
Standard Deviation
The Standard Deviation is a measure of how spread out numbers are.
The formula is easy: it is the square root of the Variance. So now you ask,
"What is the Variance?"
Variance
The Variance is defined as:
Example
You and your friends have just measured the heights of your dogs (in
millimeters):
The heights (at the shoulders) are: 600mm, 470mm, 170mm, 430mm and
300mm.
Find out the Mean, the Variance, and the Standard Deviation.
Answer:
Mean = 600 + 470 + 170 + 430 + 3005
= 19705
= 394
so the mean (average) height is 394 mm. Let's plot this on the chart:
Variance
= 1085205
= 21704
And the Standard Deviation is just the square root of Variance, so:
Standard Deviation
σ = √21704
= 147.32...
So, using the Standard Deviation we have a "standard" way of knowing what is
normal, and what is extra large or extra small.
Rottweilers are tall dogs. And Dachshunds are a bit short ... but don't tell
them!
Using
All other calculations stay the same, including how we calculated the mean.
Formulas
Here are the two formulas, explained at Standard Deviation Formulas if you
want to know more:
4 + 4 − 4 − 44 = 0
That looks good (and is the Mean Deviation ), but what about this case:
Oh No! It also gives a value of 4, Even though the differences are more spread
out.
So let us try squaring each difference (and taking the square root at the end):
In fact this method is a similar idea to distance between points , just applied in
a different way.
And it is easier to use algebra on squares and square roots than absolute
values, which makes the standard deviation easy to use in other areas of
mathematics.
Return to Top