Professional Documents
Culture Documents
Virtual University of Pakistan
Virtual University of Pakistan
Virtual University of Pakistan
Variation
Same center,
different variation
Range:
• Simplest measure of variation
• Difference between the largest and the smallest
observations:
Range = Xlargest – Xsmallest
Example:
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Range = 14 - 1 = 13
Disadvantages of the Range
• Ignores the way in which data are distributed
7 8 9 10 11 12 7 8 9 10 11 12
Range = 12 - 7 = 5 Range = 12 - 7 = 5
• Sensitive to outliers
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
Range = 5 - 1 = 4
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 120 - 1 = 119
Interquartile Range
• Can eliminate some outlier problems by using the interquartile range
• Eliminate some high- and low-valued observations and calculate the range from the remaining values
= Q3 – Q 1
Interquartile Range
Example:
X minimum Q1 Median Q3 X
maximum
(Q2)
25% 25% 25%
25%
12 30 45 57 70
Interquartile range
= 57 – 30 = 27
Variance
• Average (approximately) of squared deviations of values
from the mean
n
– Sample variance:
2
(X i X) 2
s i 1
n-1
s i 1
n-1
Calculation Example:
Sample Standard Deviation
Sample
Data (Xi) : 10 12 14 15 17 18 18 24
n=8 Mean = X = 16
(10 X) 2 (12 X) 2 (14 X) 2 (24 X) 2
S
n 1
130
4.309
7
Population Variance
i
(X μ) 2
i
• Shows variation about the mean 2
(X μ)
• Has the same units as the original data
σ i 1
N
Measuring variation
Small standard deviation
Data B
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 S = 0.92
Data C
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 S = 4.84
Coefficient of Variation:
• Measures relative variation
• Always in percentage (%)
• Shows variation relative to mean
• Can be used to compare two or more sets of data measured in different units
S
CV 100%
X
Comparing Coefficient
of Variation
• Stock A:
– Average price last year = $50
– Standard deviation = $5
Both stocks
S $5 have the same
CVA 100%
100% 10% standard
• Stock B: X $50 deviation, but
– Average price last year = $100 stock B is less
variable relative
– Standard deviation = $5 to its price
S $5
CVB
X
100%
100% 5%
$100
Shape of a Distribution:
– Symmetric or skewed
• Click OK
Excel output
Microsoft Excel
descriptive statistics output,
using the house price data:
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Exploratory Data Analysis:
• Box-and-Whisker Plot: A Graphical display of data using 5-
number summary:
Minimum
Minimum 1st
1st Median
Median 3rd
3rd Maximum
Quartile
Quartile Quartile
Quartile
Shape of Box-and-Whisker Plots
• The Box and central line are centered between the endpoints if data are
symmetric around the median
Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3
Practical Exercise