Professional Documents
Culture Documents
Lecture 4 Measure of Central Tendency Ungrouped Freq Dist
Lecture 4 Measure of Central Tendency Ungrouped Freq Dist
MA 339
LECTURE 4
1
Measures of Central
Tendency and Dispersions
Part 1
Statistical Measures for
ungrouped data
2
The various measures of central
tendency and dispersion for
ungrouped data will be developed
with reference to Example 1 below.
3
Example 1.
Table 1 lists the number of employees for a sample of 8 major
Australian building societies, drawn from the 1992 KPMG Peak
Marwick financial database.
1 n
xi
x1 x2 x3 ....... xn
x
n n i 1
6
In terms of the sample data of example 1, the
mean is calculated to be;
7
The Median
The median (Me) is simply the central value by
virtue of its position. To obtain this “middle
value” we begin by rearranging the observations
of Table 1 in ascending order as in figure 1 below;
8
The Quartile and allied measures
Like median, quartiles are measures of location
by virtue of position. However, instead of
dividing into halves, they divide it into quarters –
hence the term quartiles.
272+517
𝑄 3= =394.5 𝐸𝑚𝑝𝑙𝑜𝑦𝑒𝑠𝑠
2
10
The Mode
The mode (Mo) is the most common or typical
value of the variable. Since the only value to
occur more than once in our example is 46, the
model number of employees is 46.
11
1.2 Measures of dispersion
Measures of dispersion introduced in here
includes among others the range, variance,
standard deviation and coefficient of variation.
12
The Range
The range of a set of sample observations is
simply the difference between the largest and
the smallest values. Referring to figure 1, we find
that
13
Interquartile range and quartile deviation
The interquartile range () is the difference between the
1st and 3rd quartiles (and respectively)
14
The Mean absolute Deviation
One logical way of preventing the sum of
deviations about the mean summing to zero, is to
use the absolute value of each deviation (ie.
Simply disregard its sign). This is precisely the
approach used to compute the mean absolute
deviation (MD), which is defined as;
15
16
The Variance
Another way of preventing deviations about the
mean from cancelling each other is to square
them, which is the approach adopted for the
variance (represented by the symbol ). As we
shall see, this is the most important measure of
dispersion.
By definition;
17
18
19
The Standard Deviation
The standard deviation (s) is simply the square
root of the variance.
By definition;
20
s
s
s
Or
ss
s
s
21
Measures of Central
Tendency and Dispersions
Part 2
Statistical Measures for
Grouped data
22
Constructing a Frequency Distribution
Using Grouped Quantitative Data
• Ideally, the number of classes in a frequency
distribution should be between 4 and 20;
– Some data sets, particularly those with continuous
data, require several values to be grouped
together in a single class.
– This grouping prevents having too many classes in
the frequency distribution, which can make it
difficult to detect patterns.
2-23
Number of Classes
• One method to determine the number of classes
in a frequency distribution is the rule;
2k > n
where k = Number of classes
n = Number of data points
• Find the lowest value of k that satisfies the rule
• Suppose n = 50
25 = 32 < 50 (k = 5 is too small)
26 = 64 > 50 (k = 6 is a good choice)
2-24
Class Width
• Once k is known, the width of each class can
be found;
– The width is the range of numbers to put into each
class
Maximum data value Minimum data value
Estimated class width
k
2-26
Presentation of grouped data
How to present a data into a grouped frequency distribution.
Class Frequency
20 – 40 2
50 – 70 6
80 – 90 12
100 – 110 14
120 – 140 4
150 – 170 2
Total 40
28