Professional Documents
Culture Documents
Central Tendency (Ungrouped)
Central Tendency (Ungrouped)
Central Tendency (Ungrouped)
2. Do it yourself
This exercise requires the learners to produce an output (not necessarily tangible) based on the
concepts covered in the lesson. Technology and software may be involved in this exercise.
After going through this module, you are expected to:
1. Compare and contrast the mean, median, and mode.
2. Identify the most appropriate measure of central tendency to be used in describing a set of data.
3. Calculate the mean, median, and mode of a set of data.
DEFINITION
A measure of center is a value at the center or middle of a data set.
Notes:
Measures of the central tendency of a set of data can be also called an average of that set of data.
There are several ways of computing the measure of center (or average) of a set of data. The most
common measures are: mean, median, and mode.
Being the measures of central tendency, when appropriately applied, we can locate the position where
the data are clustered around.
The measures of central tendency (or averages) can also be used as representative of a set of data.
Let us now proceed to probably the most familiar measure of tendency: the mean.
DEFINITION
The mean of a set of data is the measure of the center found by adding the data values and
dividing the total by the number of data values. This can be computed using the formula:
x=
∑ of all data values
number of datavalues
Comments:
21 10 36 42 39 52 30 25 26
EXAMPLE 2
The ages of 15 students in a certain class were taken and shown as follows:
15 18 17 16 19 21 18 23 24 18 16 17 20 21 19
The outliers are the extreme value(s) of data set. They are too large or too small data values.
Comments:
The mean is sensitive to outliers. It very important to know if there are outliers in a set of data. The
outliers can make the mean an ineffective measure of central tendency.
To illustrate the effect of outliers to the mean, let us have the following examples.
EXAMPLE 3
In example 2, these are the ages of the 15 students:
15 18 17 16 19 21 18 23 24 18 16 17 20 21 19
Let us replace the first value (i.e. 15) and second value (i.e. 18) as 70 and 75 respectively. This
leads to a new set of data as:
70 75 17 16 19 21 18 23 24 18 16 17 20 21 19
Comments:
EXAMPLE 4
The following are the scores of 10 students in a 100 – item examination in Mathematics.
85 78 90 80 17 91 81 81 77 83
Compute for the mean of the scores of the students.
Comments:
EXAMPLE 5
Compute for the mean of the following scores:
85 78 90 80 91 81 81 77 83
Solution:
85+78+ 90+ 80+91+81+81+77+ 83
x= =82.89
9
Hence, the mean of the distribution is 82.89. I would like you to reflect whether 82.89 is a more sensible
center of the distribution (compared to 76.3).
Suppose we would like to deal with outliers by means of another measure of central tendency. In this
case, we will have the concept of the median.
DEFINITION
The median of a data set is the measure of center that is the middle value when the original data values are
arranged in order of increasing (or decreasing) magnitude. The median is denoted by ~x.
Comments:
Unlike the mean, the median is more resistive to the effects of the outliers. In other words, median is
more effective measure of central tendency when a distribution contains outliers.
The median of a distribution may be unchanged even if one of the data values is changed.
In determining the median of a distribution, it is important to check first if the data values are arranged
in increasing or decreasing order.
EXAMPLE 6
Consider the set of data presented in Example 3. The following are the ages of 15 students.
70 75 17 16 19 21 18 23 24 18 16 17 20 21 19
16 16 17 17 18 18 19 19 20 21 21 23 24 70 75
Looking at the arrangement above, let us look for the middlemost value. In this case, we see that 19 is
the middlemost value.
16 16 17 17 18 18 19 19 20 21 21 23 24 70 75
Middlemost value
Thus, the median of the set of data is 19. Try to reflect if 19 is a more sensible center (compared to
26.27).
Let consider the set of data in Example 2.
EXAMPLE 7
Consider the set of data presented in Example 2. The following are the ages of 15 students.
15 18 17 16 19 21 18 23 24 18 16 17 20 21 19
15 16 16 17 17 18 18 18 19 19 20 21 21 23 24
15 16 16 17 17 18 18 18 19 19 20 21 21 23 24
Middlemost value
Looking at Examples 6 and 7, we see that the outliers have minimal effect on the median.
Let us look at the case where there are even number of data values.
EXAMPLE 8
The following are the scores of 10 students in a 100 – item examination in Science.
86 78 90 80 71 91 93 81 77 83
Determine the median.
Comments:
Unlike in Examples 6 and 7 where there are 15 (hence, odd number) data values, it is not
straightforward to spot the median if the number of data values is even. In this case, we have 10 data
values.
Solution:
First, let us arrange the data in increasing order.
71 77 78 80 81 83 86 90 91 93
As you can see, there are two middle values in the set. These are 81 and 83.
71 77 78 80 81 83 86 90 91 93
Middle values
In this case, let us take the two values and then get the midpoint. Recall the midpoint formula in Module 1 –
Lesson 3. That midpoint will be the median of the set of data. Thus,
~ 81+83
x= =82
2
Therefore, the median is 82.
EXAMPLE 9
The NCEE scores of 12 students in a certain college were taken and are shown below.
93 65 87 56 99 76 58 87 85 93 68 69
Determine the median.
Solution:
First, let us arrange the data in increasing order.
56 58 65 68 69 76 85 87 87 93 93 99
Looking at the arrangement above, we see that the middle values are 76 and 85.
56 58 65 68 69 76 85 87 87 93 93 99
~ 76+85
x= =80.5
2
Therefore, the median is 80.5.
All of our examples so far deal with quantitative data. The natural question that can be raised at this
point is: Can we have a measure of central tendency when the data are qualitative especially the
nominal data?
Here are the key points:
The computation of mean requires us to add and then divide. Both involve meaningful mathematical
operations that are NOT applicable to qualitative data.
The identification of median requires us to arrange the data in increasing (or decreasing order).
However, we learned that in the nominal level of measurement, there is no meaningful arrangement.
Thus, both mean and median cannot be used as measure of central tendency when the set
contains nominal data.
In the case of nominal data, we will have the third measure of central tendency: mode.
DEFINITION
There are instances where the mode of a distribution may not exist. It happens when all of the data
points occur equally in a distribution.
There are instances where we can have more than one mode. This is different from the case of mean
and median since for every distribution, there is only one mean and only one median.
EXAMPLE 10
Suppose that 10 students were asked about their favorite color. Their responses as the following:
Red Green Yellow Blue Green White Black Violet Orange Brown
Determine the mode.
Solution:
Looking at the given data set, the color Green occurs twice while the rest of the colors occurs only once. Thus,
Green is the most frequent color.
It follows that the mode is Green.
EXAMPLE 11
Fifteen (15) students were interviewed about the their most favorite subject. Their responses were
recorded and shown below:
Example 11 illustrates that it is possible for a distribution to have more than one mode.
EXAMPLE 12
The following are the ages of the first 11 students enrolled in ABC College.
19 21 18 17 16 22 24 25 20 23 26
Determine the mode.
Solution:
If you look closely to the set of data, you will see that no data point occurs more than the other. In fact,
each age in the given set of data occurs only once. Therefore, the distribution has no mode.
Comments:
While mode is the best candidate when the data is nominal, it is NOT limited to that type of data. It can
also be used even if the data is quantitative in nature.
Example 12 illustrates that it is possible that the mode of a distribution does not exist.
Republic of the Philippines
SOUTHERN LUZON STATE UNIVERSITY
LABORATORY SCHOOL
Lucban, Quezon
DO IT YOURSELF
Identify the mean, median and mode of the following set of data. Show your solution.
1. The following are the IQ scores of 20 selected college students.
25 31 26 19 20 30 25 31
21 33 25 21 24 33 36 26