Professional Documents
Culture Documents
PME-lec7-ch4-a
PME-lec7-ch4-a
PME-lec7-ch4-a
Engineering
Lecture 7
Chapter#4a
Analyzing Data
The mean. median and mode of a data set are collectively known as
measures of central tendency as these three measure focus on where
the data is centered or clustered. To analyses data using the mean,
median and mode, we need to use the most appropriate measure of
central tendency. The following points should be remembered
The mean is useful for predicting future results when there are no
extreme values in the data set. However, the impact of extreme values
on the mean may be important and should be considered. E.g. The
impact of a stock market crash on average investment returns.
The median may be more useful than the mean when there are extreme
values in the data set as is not affected by the extreme values.
The mode is useful when the most common item. characteristic or
value of a data set is required.
16 Example
A student has gotten the following grades on his tests: 87,
95, 76, and 88. He wants an 85 or better overall, What is
the minimum grade he must get on the last test in order to
achieve that average?
17 Solution
18 Example
Find the mean, median, mode, and range for the
following list of values:
13, 18, 13, 14, 13, 16, 14, 21, 13
19 Solution
20 Example
21 2. Expectation
22 Expectation: in brief
23 Simple way: MEAN
The mean of a discrete random variable X is a weighted average of the
possible values that the random variable can take. Unlike the sample
mean of group of observations, which gives each observation
equal weight the mean of a random variable weights each outcome x.
according to s probability, Pi The common symbol for the mean (also
known as the expected value of) is ux, formally defined by
{
x, 0<x<1
f(x) = 2-x, 1≤x<2
0, elsewhere
(in hundreds of
x 3
2 x3
x 2 x dx
1 2 1 2
hours.)
E(X) x dx 2
3
|0 x 3 |
1
0 1
= 1 * 100 = 100.0 hours of operation annually, on average
28 Expected Value - discrete Variables
Example# 5
Example: Repair costs for a particular machine are
represented by the following probability distribution:
X $4000 -$1000
P(X) 0.3 0.7
{
f(x) = 2-x,
0,
1≤x<2
elsewhere
(in hundreds of hours of operation.)
What is the variance of X? The variable is
continuous, therefore we will need to evaluate the
integral.
Variance – Discrete Variables