Professional Documents
Culture Documents
S6 Skewness2
S6 Skewness2
Shape
Learning Objectives
➢ Illustrate the measures of shape.
➢ Solve problems involving measures of shape.
➢ Use appropriate measures of position and other statistical methods in
analyzing and interpreting research data.
➢ Imbibe endurance to understand lessons.
James 1: 2-3 “Count it all joy, my brothers, when you meet trials of various
kinds, for you know that the testing of your faith produces steadfastness.”
Activity
It was alright. However our
teacher mentioned that the
test scores of our class are
How was your negatively skewed. What
last test, John? did she mean? Is that good
or bad?
Measures of Shape
Measures of Shape
describe the distribution (or pattern) of the data within a dataset.
What are the shapes of the dataset?
Symmetrical Distribution
SKEWNESS
❑ It gives the amount and direction of the skew of a data set or
the degree of symmetry of the distribution.
Skewed to the Right or Positively Skewed
❑ If majority of the data is at the left side and the right tail is longer.
Skewed to the Right or Positively Skewed
❑ If majority of the data is at the left side and the right tail is longer.
Example:
Skewed to the Right or Positively Skewed
❑ If majority of the data is at
the left side and the right
tail is longer.
❑ Hence, the right tail of the
data set is longer than the
left and is more stretched
on the side above mean.
❑ For positively skewed data
sets, the mode is less
than the median and the
median is less than the
mean.
Skewed to the Left or Negatively Skewed
❑ If majority of the data is at the right side and the left tail is longer.
Skewed to the Left or Negatively Skewed
❑ If majority of the data is at the right side and the left tail is longer.
Example:
Skewed to the Left or Negatively Skewed
❑ If majority of the data is at
the right side and the left
tail is longer.
❑ The left tail is more
stretched on the side
below the mean.
❑ For negatively skewed
data sets, the mode is
greater than the median
and the median is greater
than the mean.
Key Features of Skewed Distribution
❑ Asymmetrical shape
❑ Mean and median have different values and do not all lie at the center of the
curve
❑ There can be more than one mode.
❑ The distribution of the data tends towards the high or low end of the dataset.
Example 1
Determine whether the data set illustrated is positively
skewed, negatively skewed, or approximately symmetric.
a.
Example 1
Determine whether the data set illustrated is positively
skewed, negatively skewed, or approximately symmetric.
a.
Positively Skewed;
majority of the data
are on the left side
Example 1
Determine whether the data set illustrated is positively
skewed, negatively skewed, or approximately symmetric.
b.
Example 1
Determine whether the data set illustrated is positively
skewed, negatively skewed, or approximately symmetric.
b.
Approximately Symmetric;
the data set has
approximately the same
appearance on the left
and on the right of the
center line.
Example 1
Determine whether the data set illustrated is positively
skewed, negatively skewed, or approximately symmetric.
c.
Example 1
Determine whether the data set illustrated is positively
skewed, negatively skewed, or approximately symmetric.
c.
Negatively Skewed;
majority of the data
are on the right side.
Skewness is a pure number without a unit. Karl Pearson gave
this formula to estimate skewness 𝑠𝑘,
𝒎𝒆𝒂𝒏 − 𝒎𝒆𝒅𝒊𝒂𝒏 ഥ − 𝒙
𝟑 𝒙
𝒔𝒌 = 𝟑 =
𝒔𝒕𝒂𝒏𝒅𝒂𝒓𝒅 𝒅𝒆𝒗𝒊𝒂𝒕𝒊𝒐𝒏 𝒔
Example 2
a. Determine the skewness of the following data on the height (in cm) of
15 preschool students.
80 82 88 88 90
91 92 92 93 94
94 95 96 96 96
Solution
a. The problem is an example of ungrouped data.
b. First, determine the mean, median and standard deviation.
σ𝒙 𝟖𝟎+𝟖𝟐+𝟖𝟖+𝟖𝟖+𝟗𝟎+𝟗𝟏+𝟗𝟐+𝟗𝟐+𝟗𝟑+𝟗𝟒+𝟗𝟒+𝟗𝟓+𝟗𝟔+𝟗𝟔+𝟗𝟔 1 367
ഥ
𝒙 = = = = 𝟗𝟏. 𝟏𝟑 .
𝑵 𝟏𝟓 15
𝒏+𝟏 𝟏𝟓+𝟏 𝟏𝟔
Applying the formula of the median,
𝒙 = = = = 𝟖𝒕𝒉 𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏
𝟐 𝟐 𝟐
ഥ − 𝒙
𝟑 𝒙 3(91.13 − 92) 3 −0.87 2.61
𝒔𝒌 = = = =− ≈ −𝟎. 𝟓𝟑
𝒔 4.9 4.9 4.9
Note that the value of the skewness is a negative number. This implies that the
data set is skewed to the left. That is, majority of the heights of the students are
greater than the mean and median height of the group.
Example 3
a. Determine the skewness of the data below.
The table below shows the weight (in pounds) of 100 randomly
selected dogs in a dog shelter.
60 – 62 5
63 – 65 17
66 – 68 42
69 – 71 27
72 – 74 9
Solution
a. The problem is an example of a grouped data.
b. First, find the class mark, then determine the following; mean, median and standard
deviation (s) using the formula for grouped data.
Solving for the standard deviation, refer to the table above and substitute the values in the
𝒏 σ𝑓𝑥𝑚 2 −(σ𝑓𝑥𝑚 )𝟐
formula 𝒔= .
𝒏(𝒏−𝟏)
𝒎𝒆𝒂𝒏 −𝒎𝒆𝒅𝒊𝒂𝒏 𝟑 ഥ
𝒙−
𝒙
Solving for the skewness, applying the formula 𝒔𝒌 = 𝟑 = .
𝒔𝒕𝒂𝒏𝒅𝒂𝒓𝒅 𝒅𝒆𝒗𝒊𝒂𝒕𝒊𝒐𝒏 𝒔
where ഥ
𝒙 = 𝟔𝟕. 𝟓𝟒, 𝒙 = 𝟔𝟕. 𝟓 and 𝒔 = 𝟐. 𝟗𝟕 Substituting,
𝟑 ഥ
𝒙 − 𝒙 3(67.54 − 67.5) 3 0.04 0.12
𝒔𝒌 = = = = = 𝟎. 𝟎𝟒
𝒔 2.97 2.97 2.97
The skewness of the given data set is 0.04. It is positively skewed. Therefore, most of
the dogs have weighs that are less than the mean and median weights.
When interpreting skewness, you may refer to the following:
a. If the skewness value is less than – 1 or greater than 1, then the data set is highly skewed.
b. If the skewness value is between – 1 and – 0.5 or between 0.5 and 1, then the data set is
moderately skewed.
c. If the skewness value is between – 0.5 and 0.5, then the data set is approximately symmetric.
A skewness value that is closer to zero indicates that the data set is approximately symmetric.
Example 4
Is the data in example 3 highly skewed, moderately skewed, or
approximately symmetric?
Example 4
Is the data in example 3 highly skewed, moderately skewed, or
approximately symmetric?
Solution
Since the skewness of the data set is 0.04, you can conclude that the distribution of
the weights of the dogs is approximately symmetric.
ACTIVITY