CSD102 Measures of Central Tendency
Data Science
SESSION 4
• CENTRAL TENDENCY
The central tendency is the extent to which the data values group around a typical or
central value.
• VARIATION
The variation is the amount of dispersion, or scattering, of values away from a central
value.
• SHAPE
The shape is the pattern of the distribution of values from the lowest value to the highest
value.
CENTRAL TENDENCY
• A central value within the range of a data set that represents all the values in the
data set.
• Arithmetic Mean
• Weighted Mean
• Geometric Mean
• Median
• Mode
Arithmetic Mean
• The mean, or average, is probably the most commonly used measure of central
tendency.
• The mean is the average of all values in a distribution.
• Every data value in a distribution contributes to the determination of the mean.
• It is also known as the arithmetic average.
• To compute the mean, all the values are added and the sum is divided by the total number of values. It is the
ratio of the sum of all scores to the total number of scores.
• Using the mean, one can compare different groups.
• It has mathematical properties that make it attractive to use in inferential statistical analysis.
• It is the best measure of central tendency in the case of symmetrical and moderately skewed data
sets.
• The mean is affected by each and every value, which is an advantage. The mean uses all the data,
and each data item influences the mean. It is also a disadvantage because extremely large or
small values can cause the mean to be pulled toward the extreme value.
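The pull of an extreme value on the mean can be seen in a minimal sketch. The sample below is hypothetical and is not taken from the slides; it only illustrates the last point above.

```python
from statistics import mean

# Hypothetical sample (an assumption for illustration, not slide data).
scores = [16, 18, 19, 21, 23]
# The same sample with one extremely large value added.
with_outlier = scores + [95]

mean_without = mean(scores)        # sum 97 over 5 values
mean_with = mean(with_outlier)     # sum 192 over 6 values

print(mean_without, mean_with)
```

With the extreme value included, the mean rises above five of the six observations, so it no longer describes a typical score.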
Computation of Arithmetic Mean
(Individual Observation)
• The arithmetic mean can be computed by using the following
statistical formula:
$$\bar{x} = \frac{\sum x}{n}$$
Where
x = each value in the data set
n = total number of values
EXAMPLE
The following data represents test scores. Find the arithmetic mean of
the following data set:
16, 18, 19, 21, 23, 23, 27, 29, 29, 35
Solution:
$$\bar{x} = \frac{16 + 18 + 19 + 21 + 23 + 23 + 27 + 29 + 29 + 35}{10} = \frac{240}{10} = 24$$
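The same calculation can be checked with a short Python sketch. The variable names are illustrative; the data are the ten test scores from the example.

```python
# Test scores from the example above.
scores = [16, 18, 19, 21, 23, 23, 27, 29, 29, 35]

total = sum(scores)            # sum of all values
n = len(scores)                # total number of values
arithmetic_mean = total / n    # sum divided by count

print(total, n, arithmetic_mean)  # 240 10 24.0
```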
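• For data given as a frequency distribution, the arithmetic mean can be computed by
using the following statistical formula:
$$\bar{x} = \frac{\sum fx}{\sum f}$$
Where
x = each value
f = frequency of that value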
EXAMPLE
• The following data represents test scores. Find the arithmetic mean of
the following data set:
Test score Frequency
12 8
20 16
27 48
33 90
42 30
54 8
Solution:
Test score (x)   Frequency (f)   fx
12               8               96
20               16              320
27               48              1296
33               90              2970
42               30              1260
54               8               432
∑                200             6374
$$\bar{x} = \frac{\sum fx}{\sum f} = \frac{6374}{200} = 31.87$$
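The frequency-table calculation can be reproduced with a small sketch: each score is multiplied by its frequency, the products are summed, and the total is divided by the sum of the frequencies. The variable names are illustrative.

```python
# (test score x, frequency f) pairs from the table above.
table = [(12, 8), (20, 16), (27, 48), (33, 90), (42, 30), (54, 8)]

total_f = sum(f for _, f in table)        # sum of frequencies
total_fx = sum(x * f for x, f in table)   # sum of score * frequency
grouped_mean = total_fx / total_f

print(total_f, total_fx, round(grouped_mean, 2))  # 200 6374 31.87
```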
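Weighted Mean
• The weighted mean can be computed by using the following statistical formula:
$$\bar{x}_w = \frac{\sum wx}{\sum w}$$
Where
x = each value
w = weight assigned to that value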
Example
• A candidate obtains the following marks in an examination: English –
46 ; Mathematics – 67 ; Management – 72 ; Economics – 58 ; Political
Science – 53. It is agreed to give double weights to marks in English
and Mathematics as compared to other subjects. What is the
weighted mean?
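Solution:
Subject             Marks (x)   Weight (w)   wx
English             46          2            92
Mathematics         67          2            134
Management          72          1            72
Economics           58          1            58
Political Science   53          1            53
∑                               7            409
$$\bar{x}_w = \frac{\sum wx}{\sum w} = \frac{409}{7} = 58.43$$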
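A minimal sketch of the weighted mean applied to the candidate's marks follows. The helper name `weighted_mean` is hypothetical, chosen only for this illustration.

```python
def weighted_mean(values, weights):
    """Return sum(w * x) / sum(w) for paired values and weights."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    return sum(w * x for x, w in zip(values, weights)) / total_weight

# English, Mathematics, Management, Economics, Political Science
marks = [46, 67, 72, 58, 53]
weights = [2, 2, 1, 1, 1]   # double weight for English and Mathematics

result = weighted_mean(marks, weights)
print(round(result, 2))  # 58.43
```

When every weight is equal, the function returns the ordinary arithmetic mean.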
Example
• The Carter Construction Company pays its hourly employees $16.50, $19.00, or $25.00 per hour.
There are 26 hourly employees, 14 of whom are paid at the $16.50 rate, 10 at the $19.00 rate,
and 2 at the $25.00 rate. What is the mean hourly rate paid to the 26 employees?
• Solution: To find the mean hourly rate, we multiply each of the hourly rates by the number of
employees earning that rate, add the products, and divide by the total number of employees.
The mean hourly rate is
$$\bar{x}_w = \frac{14(16.50) + 10(19.00) + 2(25.00)}{26} = \frac{471.00}{26} \approx 18.12$$
that is, about $18.12 per hour.
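For product 1:
Labor grade   Value (x)   Weight (w)
Unskilled     5           1/8
Semiskilled   7           2/8
Skilled       9           5/8
∑                         8/8
Solution
$$\bar{x}_w = 5\left(\tfrac{1}{8}\right) + 7\left(\tfrac{2}{8}\right) + 9\left(\tfrac{5}{8}\right) = \frac{64}{8} = 8$$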
Examples
1. Andrews and Associates specialize in corporate law. They charge $100 an hour for researching a
case, $75 an hour for consultations, and $200 an hour for writing a brief. Last week one of the
associates spent 10 hours consulting with her client, 10 hours researching the case, and 20
hours writing the brief. What was the weighted mean hourly charge for her legal services?
2. In June, an investor purchased 300 shares of Oracle (an information technology company) stock
at $20 per share. In August, she purchased an additional 400 shares at $25 per share. In
November, she purchased an additional 400 shares, but the stock declined to $23 per share.
What is the weighted mean price per share?
Geometric Mean
• The geometric mean (GM) is used to find the average rate of change of quantities that
change over a period of time.
• Geometric mean is used to show multiplicative effects over time in compound
interest and inflation calculations.
• The geometric mean is useful in finding the average change of percentages, ratios,
indexes, or growth rates over time.
• It has a wide application in business and economics because we are often
interested in finding the percentage changes in sales, salaries, or economic figures,
such as the Gross Domestic Product, which compound or build on each other.
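• The geometric mean can be computed by using the following statistical formula:
$$GM = \sqrt[n]{x_1 \cdot x_2 \cdots x_n}$$
Where
x_1, x_2, ..., x_n = the positive values being averaged
n = number of values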
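For percent changes, the usual approach is to convert each percent into a growth factor (1 + rate), take the n-th root of the product, and subtract 1. The sketch below assumes that convention; the function name and the sample rates are hypothetical and are not taken from the slides.

```python
import math

def geometric_mean_rate(percent_changes):
    """Average percent change per period, via the geometric mean of growth factors."""
    if not percent_changes:
        raise ValueError("at least one value is required")
    # Convert each percent change into a growth factor, e.g. 10% -> 1.10.
    factors = [1 + p / 100 for p in percent_changes]
    if any(f <= 0 for f in factors):
        raise ValueError("growth factors must be positive")
    # n-th root of the product of the growth factors.
    gm_factor = math.prod(factors) ** (1 / len(factors))
    return (gm_factor - 1) * 100

rates = [10, 20]   # hypothetical percent increases in two consecutive years
avg_rate = geometric_mean_rate(rates)
print(round(avg_rate, 2))  # 14.89
```

The result is slightly below the arithmetic mean of 15, because growth compounds: two years at the geometric-mean rate give the same overall growth as 10% followed by 20%.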
Compute the average growth factor for the following data:
SOLUTION
Examples
1. The percent increases in sales for the last 4 years at Combs Cosmetics were: 4.91, 5.75,
8.12, and 21.60. Find the geometric mean percent increase.
2. Compute the geometric mean of the following percent increases: 8, 12, 14, 26, and 5.
3. Compute the geometric mean of the following percent increases: 2, 8, 6, 4, 10, 6, 8,
and 4.
4. Listed below are the percent increases in sales for the MG Corporation over the last 5
years:
9.4 13.8 11.7 11.9 14.7
Determine the geometric mean percent increase in sales over the period.
Median
• The median is the positional average that divides a distribution into two equal parts, so that one half of the items
fall above it and the other half below it.
• It is the midpoint of a distribution of values.
• It measures the central observation in the data.
• The median is unaffected by the magnitude of extreme values. This characteristic is an advantage, because
large and small values do not inordinately influence the median. For this reason, the median is often the best
measure of location to use in the analysis of variables such as house costs, income, and age.
• The data must be arranged in order before the median is calculated.
• One way to compute the median is to list all observations in numerical order, and then locate the
observation in the center of the sample.
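• The median can be computed by using the following statistical formula:
$$\text{Median} = \text{value of the } \left(\frac{n+1}{2}\right)\text{th item in the ordered data}$$
Where
n = number of observations (when n is even, the median is the average of the two middle values)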
Thus, half of the test scores are less than 28, while the other half are
more than 28.
Example
The following data represents test scores. Find the median :
12, 17, 3, 14, 5, 8, 7, 15
Solution:
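Arrange the data in ascending or descending order:
3, 5, 7, 8, 12, 14, 15, 17 (n = 8)
Since n is even, the median is the average of the two middle values (the 4th and 5th items):
$$\text{Median} = \frac{8 + 12}{2} = 10$$
Thus, half of the test scores are less than 10, while the other half are
more than 10.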
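The procedure of sorting and then picking the middle position can be sketched as follows. The function is written out by hand for illustration; Python's standard library also provides `statistics.median`.

```python
def median(values):
    """Middle value of the sorted data; average of the two middle values if n is even."""
    if not values:
        raise ValueError("at least one value is required")
    ordered = sorted(values)       # arrange the data in ascending order
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]        # odd n: single middle item
    return (ordered[mid - 1] + ordered[mid]) / 2   # even n: mean of the two middle items

scores = [12, 17, 3, 14, 5, 8, 7, 15]
print(median(scores))  # 10.0
```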
Example
• A real estate broker wants to determine the median selling price of 10 houses listed at the
following prices:
• The median is the average of the two middle terms, $116,000 and $122,000, or
$119,000. This price is a reasonable representation of the prices of the 10 houses.
• Note that the house priced at $5,250,000 did not enter into the analysis other than
to count as one of the 10 houses. If the price of the tenth house were $200,000,
the results would be the same.
• However, if all the house prices were averaged, the resulting average price of
the original 10 houses would be $635,000, higher than 9 of the 10
individual prices.
EXAMPLE
• Nutritional data about a sample of seven breakfast cereals includes the number
of calories per serving: