Lesson 6c, 7, 8
Central Tendency
The measures of central tendency in statistics are
numerical values that represent the center or
average of a set of data. They provide a summary of
where the bulk of the data is concentrated. There are
three main measures of central tendency, namely:
Mean, Median, and Mode.
Mean
• Arithmetic mean
• Weighted mean
• Grand mean of combined data
• Geometric mean
The mean or arithmetic mean, often referred to as the average, is
calculated by adding up all the values in a dataset and then dividing by the
number of values. The symbols used are μ for a population and x̄ for a sample.
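Written as formulas, the definition above takes the following form. This is a sketch in the notation used elsewhere in this lesson, where x is the given data and n is the number of given data:

$$\bar{x} = \frac{\sum x}{n} \qquad \mu = \frac{\sum x}{n}$$

The computation is the same in both cases; only the symbol changes depending on whether the data are a sample or a whole population.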
... know his GPA. The table shows the midshipman’s grades and the equivalent
credit units for each grade. Determine his Grade Point ...

Course          Units   Grade
NGEC 1          3       87
NGEC 4          3       84
NGEC 7          3       85
ELECTRO (LEC)   3       95
ELECTRO (LAB)   1       82
P.E.            1       82
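One way to work this example is as a weighted mean, with each grade weighted by its credit units. The sketch below assumes that reading, and it also assumes the last row means P.E. with 1 unit and a grade of 82, which the extracted table does not make certain. The function name `weighted_mean` is illustrative.

```python
# Rows as (course, credit units, grade).
# Assumption: the last row is read as P.E., 1 unit, grade 82.
courses = [
    ("NGEC 1", 3, 87),
    ("NGEC 4", 3, 84),
    ("NGEC 7", 3, 85),
    ("ELECTRO (LEC)", 3, 95),
    ("ELECTRO (LAB)", 1, 82),
    ("P.E.", 1, 82),
]


def weighted_mean(values, weights):
    """Sum of value*weight divided by the sum of the weights."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


units = [u for _, u, _ in courses]
grades = [g for _, _, g in courses]

gpa = weighted_mean(grades, units)
print(f"Weighted mean grade: {gpa:.2f}")
```

Under these assumptions the weighted sum is 1217 over 14 units, which gives roughly 86.93.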
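For a set of data:

$$\bar{x}_{\text{geometric}} = \sqrt[n]{x_1 x_2 x_3 \cdots x_n}$$

Where:
x̄_geometric − geometric mean
x − given data
n − number of given data

For rates of return:

$$\bar{x}_{\text{geometric}} = \sqrt[n]{R_1 R_2 R_3 \cdots R_n}$$

Where:
x̄_geometric − geometric mean
R − rates of return
n − number of years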
Example:
Find the geometric mean of the following data,
2, 3, 6, 7, 7, 8, 9, 9, 9, 10
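A minimal sketch of how this example could be computed: multiply the ten values together and take the 10th root of the product. The function name `geometric_mean` is an illustrative choice.

```python
import math

data = [2, 3, 6, 7, 7, 8, 9, 9, 9, 10]


def geometric_mean(values):
    """n-th root of the product of n positive values."""
    n = len(values)
    return math.prod(values) ** (1 / n)


gm = geometric_mean(data)
print(f"Geometric mean: {gm:.2f}")
```

The product of the values is 102,876,480, and its 10th root is about 6.33.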
The Characteristics of the Mean
1. The mean is affected by all values in the distribution.
2. The mean is sensitive to extreme scores (known as outliers).
3. The sum of the deviations about the mean is equal to zero:
$\sum (x_i - \bar{x}) = 0$.
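The third characteristic follows from the definition of the mean. A short derivation, assuming the sample notation $\bar{x} = \frac{1}{n}\sum x_i$ so that $\sum x_i = n\bar{x}$:

$$\sum_{i=1}^{n} (x_i - \bar{x}) = \sum_{i=1}^{n} x_i - n\bar{x} = n\bar{x} - n\bar{x} = 0$$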
The median is a positional average. When the scores are ranked, the median
is the point at which half of the scores are greater and half are lesser. In
other words, the median of a set of scores is the middle value when the
scores are arranged in order of increasing magnitude. After arranging the
original scores in increasing (or decreasing) order, the median will be
either of the following:
• the middle score, if the number of scores is odd; or
• the mean of the two middle scores, if the number of scores is even.
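As an illustration, the sketch below ranks the scores and then returns the middle score when the count is odd, or the mean of the two middle scores when the count is even. The function name `median` is an illustrative choice, and the two datasets are borrowed from other examples in this lesson.

```python
def median(scores):
    """Middle value of the ranked scores (mean of the two middle values if even)."""
    ranked = sorted(scores)
    n = len(ranked)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: a single middle score exists.
        return ranked[mid]
    # Even count: average the two middle scores.
    return (ranked[mid - 1] + ranked[mid]) / 2


odd_case = median([2, 4, 7, 12, 15])
even_case = median([2, 3, 6, 7, 7, 8, 9, 9, 9, 10])
print(odd_case, even_case)
```

For the five-value set the median is the third ranked value, 7. For the ten-value set it is the mean of the fifth and sixth ranked values, (7 + 8) / 2 = 7.5.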
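Population standard deviation:

$$\sigma = \sqrt{\frac{\sum (x - \mu)^2}{n}}$$

Where:
σ − population standard deviation
x − given data
μ − population mean
n − number of given data

Sample standard deviation:

$$s = \sqrt{\frac{\sum (x - \bar{x})^2}{n - 1}}$$

Where:
s − sample standard deviation
x − given data
x̄ − sample mean
n − number of given data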
Procedure for Computing a Standard Deviation
1. Determine the mean of the 𝑛 numbers.
2. For each number, calculate the deviation (difference) between the
number and the mean of the numbers.
3. Calculate the square of each of the deviations and find the sum of these
squared deviations.
4. If the data is a population, then divide the sum by 𝑛. If the data is a
sample, then divide the sum by 𝑛 − 1.
5. Find the square root of the quotient in Step 4.
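The five steps above can be sketched as a small function. The name `standard_deviation`, the `sample` flag, and the eight values used to try it out are illustrative assumptions, not part of the lesson.

```python
import math


def standard_deviation(values, sample=True):
    """Follow the five-step procedure for a sample (n - 1) or a population (n)."""
    n = len(values)
    mean = sum(values) / n                             # Step 1
    deviations = [x - mean for x in values]            # Step 2
    sum_of_squares = sum(d ** 2 for d in deviations)   # Step 3
    divisor = n - 1 if sample else n                   # Step 4
    return math.sqrt(sum_of_squares / divisor)         # Step 5


# Illustrative data (not from the lesson): mean 5, squared deviations sum to 32.
values = [2, 4, 4, 4, 5, 5, 7, 9]
population_sd = standard_deviation(values, sample=False)
sample_sd = standard_deviation(values, sample=True)
print(population_sd, round(sample_sd, 3))
```

Dividing by n − 1 instead of n makes the sample result slightly larger than the population result for the same numbers.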
The variance for a given set of data is the square of the
standard deviation.
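Population variance:

$$\sigma^2 = \frac{\sum (x - \mu)^2}{n}$$

Where:
σ² − population variance
x − given data
μ − population mean
n − number of given data

Sample variance:

$$s^2 = \frac{\sum (x - \bar{x})^2}{n - 1}$$

Where:
s² − sample variance
x − given data
x̄ − sample mean
n − number of given data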
Example:
The following numbers were obtained by
sampling a population.
2, 4, 7, 12, 15
Find the variance and the standard deviation of
the sample.
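A worked sketch of this example, following the five-step procedure with the sample divisor n − 1 because the numbers were obtained by sampling:

$$\bar{x} = \frac{2 + 4 + 7 + 12 + 15}{5} = \frac{40}{5} = 8$$

$$s^2 = \frac{(2-8)^2 + (4-8)^2 + (7-8)^2 + (12-8)^2 + (15-8)^2}{5 - 1} = \frac{36 + 16 + 1 + 16 + 49}{4} = \frac{118}{4} = 29.5$$

$$s = \sqrt{29.5} \approx 5.43$$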
The z-score for a given value x is the number of standard
deviations that x is above or below the mean of the data.

Population:

$$z_x = \frac{x - \mu}{\sigma}$$

Where:
z_x − standard score or z-score
σ − population standard deviation
x − given data
μ − population mean

Sample:

$$z_x = \frac{x - \bar{x}}{s}$$

Where:
z_x − standard score or z-score
s − sample standard deviation
x − given data
x̄ − sample mean
Example:
A 4th class midshipman has taken two tests in his NGEC 4 class. He
scored 72 on the first test, for which the mean of all scores was 65
and the standard deviation was 8. He received a 60 on a second
test, for which the mean of all scores was 45 and the standard
deviation was 12. In comparison to the other students, did the 4th
class midshipman do better on the first test or the second test?
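One way to answer this is to convert each score to a z-score and compare them, since a z-score expresses a score relative to the rest of the class. The sketch below assumes that approach; the function name `z_score` is illustrative.

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mean) / sd


z_first = z_score(72, mean=65, sd=8)
z_second = z_score(60, mean=45, sd=12)

better = "second" if z_second > z_first else "first"
print(f"z (first test)  = {z_first}")
print(f"z (second test) = {z_second}")
print(f"Relative to the class, the {better} test was the stronger result.")
```

The first score is 0.875 standard deviations above its mean and the second is 1.25 above, so the lower raw score of 60 is the better relative performance.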
THE NORMAL DISTRIBUTION
[Figure: relative frequency distribution table and relative frequency histogram]
Example:
Use the relative frequency distribution table
to determine the following:
a. percent of subscribers who required at
least 25 seconds to download the file.
b. probability that a subscriber chosen at
random will require at least 5 but less
than 20 seconds to download the file.
A normal distribution is a common pattern which forms a
bell-shaped curve that is symmetric about a vertical line
through the mean of the data. The normal distribution is a
fundamental concept in statistics and is essential in various
fields for analyzing and understanding data.
A normal distribution can be described in terms of the following components:
• Shape: The normal distribution has a distinctive symmetric bell-shaped curve.
• Central Tendency: The highest point on the curve, the peak, represents the
mean (average) of the data.
• Spread: The curve is more concentrated around the mean and gradually
decreases as you move away from the mean in either direction.
• Standard Deviation: The spread of the curve is measured by the standard
deviation. A smaller standard deviation indicates a narrower and taller curve,
while a larger standard deviation results in a wider and flatter curve.
[Figure: normal distribution of data in a graph, forming a bell-shaped curve with a mean value (μ) of 5]
Properties of a Normal Distribution
• The graph is symmetric about a vertical line through the mean of
the distribution.
• The mean, median, and mode are equal.
• The y-value of each point on the curve is the percent (expressed
as a decimal) of the data at the corresponding x-value.
• Areas under the curve that are symmetric about the mean are
equal.
• The total area under the curve is 1.
The 68-95-99.7 rule, also known as the
empirical rule or three-sigma rule, is a
guideline that describes the percentage of
data that falls within certain ranges in a
normal distribution.
Empirical Rule for a Normal Distribution
In a normal distribution, approximately
• 68% of the data lie within 1 standard deviation of the mean.
• 95% of the data lie within 2 standard deviations of the mean.
• 99.7% of the data lie within 3 standard deviations of the
mean.
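The three percentages can be checked numerically. The sketch below relies on a standard identity that is not stated in this lesson: for a normal distribution, the fraction of data within k standard deviations of the mean equals erf(k / √2). The function name `fraction_within` is illustrative.

```python
import math


def fraction_within(k):
    """Fraction of a normal distribution within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))


for k in (1, 2, 3):
    print(f"within {k} standard deviation(s): {100 * fraction_within(k):.2f}%")
```

The computed values round to the 68%, 95%, and 99.7% quoted in the rule, which is why the rule is described as approximate.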
[Figure: the empirical rule on a normal distribution graph]
Example:
A survey of 1000 U.S. gas stations found that the price charged
for a gallon of regular gas could be closely approximated by a
normal distribution with a mean of $3.10 and a standard
deviation of $0.18. How many of the stations charge:
a. between $2.74 and $3.46 for a gallon of regular gas?
b. less than $3.28 for a gallon of regular gas?
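A sketch of one way to work this example with the empirical rule: express each price as a number of standard deviations from the mean, then apply the 68% and 95% figures. The variable names and the `z` helper are illustrative.

```python
mean, sd, stations = 3.10, 0.18, 1000

# Empirical-rule fractions within 1, 2, and 3 standard deviations of the mean.
WITHIN = {1: 0.68, 2: 0.95, 3: 0.997}


def z(price):
    """Standard deviations from the mean, rounded to absorb float noise."""
    return round((price - mean) / sd, 6)


# a. $2.74 and $3.46 are 2 standard deviations below and above the mean.
z_low, z_high = z(2.74), z(3.46)
count_a = round(stations * WITHIN[2])

# b. $3.28 is 1 standard deviation above the mean: the half of the data
#    below the mean plus half of the 68% band.
z_b = z(3.28)
count_b = round(stations * (0.5 + WITHIN[1] / 2))

print(z_low, z_high, count_a)
print(z_b, count_b)
```

Under the empirical rule this gives about 950 stations for part a and about 840 stations for part b.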
The standard normal distribution is the normal distribution of z-scores that
has a mean of 0 and a standard deviation of 1. The purpose of having a
standard normal distribution is to simplify statistical calculations and
comparisons across different normal distributions.
Example:
Find the area of the standard normal
distribution between z = −1.44 and z = 0.
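In class this area is normally read from a standard normal table. As a cross-check, the sketch below computes it from the cumulative distribution function, written with `math.erf`. The helper names `phi` and `area_between` are illustrative and not part of the lesson.

```python
import math


def phi(z):
    """Area under the standard normal curve to the left of z."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))


def area_between(a, b):
    """Area under the standard normal curve from z = a to z = b."""
    return phi(b) - phi(a)


area = area_between(-1.44, 0)
print(f"Area between z = -1.44 and z = 0: {area:.4f}")
```

The result is about 0.425, so roughly 42.5% of the data lie between z = −1.44 and z = 0.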
The Area of a Tail Region
A tail region is a region of the standard normal
distribution to the right of a positive z-value or
to the left of a negative z-value.
Example:
Find the area of the standard normal
distribution to the right of 𝑧 = 0.82.
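A sketch of the usual approach: the area to the right of the mean is 0.5, so the tail is what remains after subtracting the area between z = 0 and z = 0.82. The value 0.2939 used here is the standard normal table entry for z = 0.82, rounded to four places; it is not given in this lesson.

$$A_{\text{tail}} = 0.5 - A(0 \le z \le 0.82) \approx 0.5 - 0.2939 = 0.2061$$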
The Standard Normal Distribution, Areas, Percentages, and Probabilities
In the standard normal distribution, the area of the distribution from
𝑧 = 𝑎 to 𝑧 = 𝑏 represents
• the percentage of z-values that lie in the interval from a to b.
• the probability that z lies in the interval from a to b.
Because the area of a portion of the standard normal distribution can
be interpreted as a percentage of the data or as a probability that the
variable lies in an interval, we can use the standard normal distribution
to solve many application problems.