Professional Documents
Culture Documents
2022 CSN373 Lec 8
2022 CSN373 Lec 8
● Concept of probability
● Random variables
● Distribution functions: discrete and continuous
● Moments and moment generating functions
2
Chebyshev’s Theorem:
• Chebyshev’s theorem can be used to find the minimum percentage of data values that will
fall between any two given values. This theorem can be applied to any distribution
regardless of its shape.
3
Chebyshev’s Theorem:
4
Measures of Position: z-score
5
Example:
6
Measures of Position: Percentiles
7
Example:
8
Measures of Position: Quartiles and
Deciles
• Quartiles divide the distribution into four groups, separated by Q1, Q2, Q3.
• Note that Q1 is the same as the 25th percentile; Q2 is the same as the 50th percentile, or
the median; Q3 corresponds to the 75th percentile, as shown:
• Deciles divide the distribution into 10 groups, as shown. They are denoted by D1, D2,
etc.
9
Measures of Position: Quartiles and
Deciles
10
Example:
11
Interquartile Range (IQR):
• In addition to dividing the data set into four groups, quartiles can be used as a rough
measurement of variability. The interquartile range (IQR) is defined as the difference
between Q1 and Q3 and is the range of the middle 50% of the data.
• The interquartile range is used to identify outliers, and it is also used as a measure of
variability in exploratory data analysis.
12
Measures of Position: Outliers
• Outliers:
A data set should be checked for extremely high or extremely low values. These values are
called outliers.
• An outlier can strongly affect the mean and standard deviation of a variable. For example,
suppose a researcher mistakenly recorded an extremely high data value. This value would
then make the mean and standard deviation of the variable much larger than they really
were. Outliers can have an effect on other statistics as well.
• There are several ways to check a data set for outliers. One method is shown here:
13
Example:
14
Discrete Probability Distributions:
15
Example:
16
Examples: Illustrations of Theoretical
Probability Distributions
• You did not need to actually perform the experiments to compute the probabilities. In
contrast, to construct actual probability distributions, you must observe the variable over
a period of time.
17
Example:
18
Next Class…
19