Professional Documents
Culture Documents
Chapter One - Probability Distribution
Chapter One - Probability Distribution
Chapter One - Probability Distribution
1.1 Introduction
In this chapter we discuss probability distributions of random variables and these are customary
used to model some problems in various fields such as business, finance, economics and in
general life. To clearly understand this chapter, you need some basic knowledge in probability
fundamentals.
Traditionally random variables are denoted by capital letters, X, Y, Z or X1, X2, X3 etc.
There are two types of random variable (R.V), these are discrete and continuous random
variables. A discrete random variables takes on only a finite number of values and these are
integers. Examples of discrete random variables are; the number of cars passing through the
roadblock, an experiment of tossing one or more fair coins, number of defective items in a sample,
number of death by COVID-19 in year 2020, etc. On the other hand a continuous random variable
is a random variable that can take on any value (always real numbers) in a given interval of values.
Examples of continuous R.V are, height, weight, rainfall, temperature etc.
General Examples of random variables in the real life are; the unemployment rate, consumer
price index, number of sales made in week, yearly profit of a company, share prices, return on
investments, money supply, GDP, wages, cash flows, interest rates, etc.
If X is a discrete random variable, the function denoted by 𝑓(𝑥) = 𝑃(𝑋 = 𝑥) for each 𝑥 within the
range of X is called Probability Mass Function of X. To capture clearly the meaning of the
probability distribution of a discrete random variable consider Example 1.1.
Example 1.1
Consider an experiment of tossing two fair coins simultaneously. Find the probability distribution
of obtaining a total number of heads.
Solution:
The following are the procedures for building probability distribution:
The list of all possible events can be obtained easily by using a structure of tree diagram as shown
below:
H
T
Start H
T
Figure 2: Tree Diagram
From a tree diagram as shown in Figure 2 the list of all possible outcomes of an experiment (i.e
sample space, S) is as shown below:
𝐻𝐻 → 2heads
𝐻𝑇 → 1 heads
𝑇𝐻 → 1 head
𝑇𝑇 → 0 head
Probability of an event can be obtained by employing traditional definition of probability, that is:
n( E )
P( E ) =
n( S )
Let 𝑋 be the number of observed heads. The probabilities of the number of heads showing up are
as indicated below and the probability distribution is shown in Table 1.
𝑛(0) 1
𝑓(0) = 𝑃(𝑋 = 0) = =
𝑛(𝑠) 4
𝑛(1) 1
𝑓(1) = 𝑃(𝑋 = 1) = =
𝑛(𝑠) 2
𝑛(2) 1
𝑓(2) = 𝑃(𝑋 = 2) = =
𝑛(𝑠) 4
Probability Distribution
Number of heads 𝑓(𝑥) = 𝑃(𝑋 = 𝑥)
(𝑋)
0 𝟏/𝟒
1 𝟏/𝟐
2 𝟏/𝟒
Total 1.00
Properties of PMF
1. 𝑓(𝑥) ≥ 0 for each 𝑥 ∈ 𝑋
2. 0 ≤ 𝑓(𝑥) ≤ 1
3. ∑ 𝑓(𝑥) = 1
Example 1.2
An employment rate of a certain country A (𝑅𝐴 ) in percentage is assumed to be a discrete random
variable whose probability distribution is as shown below:
𝑅𝐴 -12 -10 -6 0 4 8 10 12
Prob. 0.10 0.15 0.10 0.15 0.1 0.15 0.1 0.15
Find
a) 𝑃(𝑅𝐴 ≥ 0)
b) 𝑃(𝑅𝐴 ≥ −10)
Solution:
a) 𝑃(𝑅𝐴 ≥ 0) = 𝑃(𝑅𝐴 = 0) + 𝑃(𝑅𝐴 = 4) + 𝑃(𝑅𝐴 = 8) + 𝑃(𝑅𝐴 = 10) + 𝑃(𝑅𝐴 = 12)
= 0.15 + 0.1 + 0.15 + 0.1 + 0.15
= 0.65
c) 𝑃(𝑅𝐴 ≥ −10) = 1 − 𝑃(𝑅𝐴 < −10)
= 1 − 𝑃(𝑅𝐴 = −12)
= 1 − 0.10
= 0.9
Where 𝑃(𝑋 ≤ 𝑥) means the probability that a random variable 𝑋 takes a value of less than or equal
to a specific value 𝑥, where 𝑥 is given. For example 𝑃(𝑋 ≤ 2) means the probability that the
random variable 𝑋 takes the value less than of equal to 2.
Example 1.3
Find Probability Mass Function (PMF) and Cumulative Mass Function (CMF) of a total number
of heads obtained by tossing a fair coin three times.
Solution:
The following tree diagram is used to obtain the sample space 𝑆
H T
H
H
T T
Start
H
T T
H
T
T
Therefore 𝑺 = {𝐻𝐻𝐻, 𝐻𝐻𝑇, 𝐻𝑇𝐻, 𝐻𝑇𝑇, 𝑇𝐻𝐻, 𝑇𝐻𝑇, 𝑇𝑇𝐻, 𝑇𝑇𝑇 }
By letting 𝑋 =number of heads shown up, we find that 𝑋 can take values 0, 1, 2 or 3 and hence
the corresponding PMF will be obtained as indicated below:
1
𝑓(0) = 𝑃(𝑋 = 0) =
8
3
𝑓(1) = 𝑃(𝑋 = 1) =
8
3
𝑓(2) = 𝑃(𝑋 = 2) =
8
1
𝑓(3) = 𝑃(𝑋 = 3) =
8
In Tabula form:
𝑥 0 1 2 3
𝑓(𝑥) 1/8 3/8 3/8 1/8
It follows that, the Cumulative Mass Function (CMF) will be obtained as indicated here:
From:
𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥)
1
𝐹(0) = 𝑃(𝑋 ≤ 0) = 𝑃(𝑋 = 0) =
8
4
𝐹(1) = 𝑃(𝑋 ≤ 1) = 𝑃(𝑋 = 0) + 𝑃(𝑋 = 1) =
8
7
𝐹(2) = 𝑃(𝑋 ≤ 2) = 𝑃(𝑋 = 0) + 𝑃(𝑋 = 1) + 𝑃(𝑋 = 2) =
8
𝐹(3) = 𝑃(𝑋 ≤ 3) = 𝑃(𝑋 = 0) + 𝑃(𝑋 = 1) + 𝑃(𝑋 = 2) + 𝑃(𝑋 = 3) = 1
However in plotting the CMF the following will be the ranges of:
0 for x0
1
for 0 x 1
8
4
F ( x) = for 1 x 2
8
7
8 for 2 x3
1 for 3 x
With reference to the previous example, it can be observed that, a CMF is merely an
accumulation of PMF for the values of 𝑋 less than or equal to a given 𝑥. That is,
𝐹(𝑥) = ∑ 𝑓(𝑥)
𝑥
Note: The results for both PMF and CMF can also be summarized in the following Table:
Number of
heads values of PMF values of CMF
𝑿 𝒙 𝒇(𝒙) 𝒙 𝑭(𝒙)
0 0≤𝑥<1 1/8 𝑥=0 1/8
1 1≤𝑥<2 3/8 𝑥≤1 4/8
2 2≤𝑥<3 3/8 𝑥≤2 7/8
3 3≤𝑥 1/8 𝑥≤3 1
∑ 𝑘 = 𝑛𝑘
𝑖=1
2. If 𝑘 is constant, then
𝑛 𝑛
∑ 𝑘𝑋𝑖 = 𝑘 ∑ 𝑋𝑖
𝑖=1 𝑖=1
∑(𝑎 + 𝑏𝑋𝑖 ) = 𝑛𝑎 + 𝑏 ∑ 𝑋𝑖
𝑖=1 𝑖=1
3. Also
𝐸(𝑋𝑌) ≠ 𝐸(𝑋)𝐸(𝑌)
That is, generally, the expected value of the product of two random variables is not equal
to product of the expected values of those random variables. However, there is an
exception to the rule, if X and Y are independent then
𝐸(𝑋𝑌) = 𝐸(𝑋)𝐸(𝑌)
4. If 𝑘 is a constant, then
𝐸(𝑘𝑋) = 𝑘𝐸(𝑋)
That is to say, the expected value of a constant times a random variable X, is equal to the
constant times the expected value of the R.V
5. If 𝑎 and 𝑏 are constants, then
𝐸(𝑎𝑋 + 𝑏) = 𝐸(𝑎𝑋) + 𝐸(𝑏)
= 𝑎𝐸(𝑋) + 𝑏
1.5.2 Variance of a Discrete Probability Distribution
1.5.2.1 Variance and standard deviation
Variance indicates how individual values are spread, dispersed or distributed around the mean
value. But also the statistical concept of variance is a useful measure of risk of any kind. Generally
if X is a discrete random variable, then its variance is given by:
2
𝑉𝑎𝑟(𝑋) = 𝐸(𝑋 − 𝐸(𝑋))
= 𝐸(𝑋 2 ) − (𝐸(𝑋))2
= 𝐸(𝑋 2 ) − 𝜇2
= 𝜎2
Where 𝐸(𝑋 2 ) = ∑ 𝑥 2 𝑓(𝑥)
The standard deviation of X is therefore given by
𝑆𝐷(𝑋) = √Var(𝑋)
=𝜎
Example 1.4
A company estimates the net profit for a new product to be launched with its corresponding
probabilities under different market conditions as follows;
Required:
a) Calculate the expected value of the net profit for the Company
b) What is standard deviation of the net profit
Solution:
a) We know that
n
E ( X ) = xP( X = x)
i =1
Market Condition Net Profit (in million Tsh) Probability 𝒇(𝒙) 𝒙𝒇(𝒙)
(𝒙)
Good 30 0.15 4.5
Fair 10 0.25 2.5
Poor -3 0.6 -1.8
Total 5.2
𝑛
Therefore the expected value of the net profit for the company under all three given market
conditions is 5.2 million Tsh.
𝐸(𝑋) = ∑ 𝑥𝑓(𝑥)
= 1.9
2
𝑉𝑎𝑟(𝑋) = 𝐸(𝑋 2 ) − (𝐸(𝑋))
= ∑ 𝑥 2 𝑓(𝑥) − (∑ 𝑥𝑓(𝑥))2
= 69.4 − 1.92
= 65.79
𝑆𝐷(𝑋) = √𝑉𝑎𝑟(𝑋)
= √65.79
= 8.11
Example 1.6
A monthly income of workers in millions of TShs from a certain sector with their associated
probabilities are as indicated in the following probability distribution
𝐸(𝑋) = ∑ 𝑥𝑓(𝑥)
= 2.34
2
𝑉𝑎𝑟(𝑋) = 𝐸(𝑋 2 ) − (𝐸(𝑋))
= ∑ 𝑥 2 𝑓(𝑥) − (∑ 𝑥𝑓(𝑥))2
= 6.321 − 2.342
= 0.8454
𝑆𝐷(𝑋) = √𝑉𝑎𝑟(𝑋)
= √0.8454
= 0.9194
Properties of variance
3. If 𝑏 is a constant, then
𝑉𝑎𝑟(𝑏 + 𝑋) = 𝑉𝑎𝑟(𝑋)
4. If 𝑎 is a constant, then
𝑉𝑎𝑟(𝑎𝑋) = 𝑎2 𝑉𝑎𝑟(𝑋)
5. If X and Y are independent random variables and 𝑎 and 𝑏 are constants, then
𝑉𝑎𝑟(𝑎𝑋 + 𝑏𝑌) = 𝑎2 𝑉𝑎𝑟(𝑋) + 𝑏 2 𝑉𝑎𝑟(𝑌)
A binomial experiment is a statistical experiment which consist of 𝑛 repeated trials. Each trial can
result in just two possible outcomes. We call one of these outcome a success and the other, a
failure. The probability of success, denoted by P, is the expected to be constant on every trial.
Consider the following random experiment. You flip a coin 10 times and count the number of
times the coin lands on tails. This is a binomial experiment because:
n
b( x; n, p) = P( X = x) = p x q n − x for x = 0, 1, 2, , n
x
n n n!
Note:
x = Cx = x!(n − x)!
1.12.2 Mean, Variance, and Standard Deviation for a Binomial Random Variable
The expected value (mean) of binomial random variable is given by 𝐸(𝑋) = 𝑛𝑝, and the standard
deviation is given by 𝑆𝐷(𝑋) = √𝑛𝑝(1 − 𝑝. That is,
Mean: 𝜇 = 𝑛𝑝
Variance: 𝜎 2 = 𝑛𝑝𝑞
Examples 1.10
Given that the expected value of a binomial distribution is 40 and standard deviation is 6.
Required:
Calculate n, p and q.
Solution:
From
= np = 40
and
2 = npq = 36
Put 1 into 2, we have
40q = 36
q = 0.9 p + q = 1 p = 0.1
Thus p = 0.1, q = 0.9, and n = 400
Example 1.11
Assume that on an average one telephone number out of 15 is busy.
Required:
What is the probability that if six randomly selected telephone numbers are dialled
a) Not more than three will be busy?
b) At least three of them will be busy?
Solution:
1 14
p= , q = , n = 6 , then
15 15
a) p( x 3) = p( x = 0) + p( x = 1) + p( x = 2) + p( x = 3) = 0.9997 (Use the
Binomial distribution formula)
b) P( x 3) = 1 − P( x 3) = 1 − P( x = 0) + P( x = 1) + P( x = 2) = 0.0051 (use the same
approach as in part a)
Example 1.12
The probability that a student is accepted to a prestigious college is 0.3. If 5 students from the
same school apply, what is the probability that at most 2 are accepted?
Data Given:
𝑝 = 0.3 𝑛 = 5
Find 𝑃(𝑋 ≤ 2)
𝑃(𝑋 ≤ 2) = 𝑃(𝑋 = 0) + 𝑃(𝑋 = 1) + 𝑃(𝑋 = 2)
From
𝑛
𝑃(𝑋 = 𝑥) = ( ) 𝑝 𝑥 (1 − 𝑝)𝑛−𝑥
𝑥
𝑛!
= 𝑝 𝑥 (1 − 𝑝)𝑛−𝑥
(𝑛 − 𝑥)! 𝑥!
5!
𝑃(𝑋 = 0) = × 0.30 × 0.75
(5 − 0)! 4!
5!
= × 0.30 × 0.75
5! 0!
= 𝟎. 𝟏𝟔𝟖𝟎𝟕
5!
𝑃(𝑋 = 1) = × 0.31 × 0.74
(5 − 1)! 1!
5!
= × 0.31 × 0.74
5! 1!
= 𝟎. 𝟑𝟔𝟎𝟏𝟓
5!
𝑃(𝑋 = 2) = × 0.32 × 0.73
(5 − 2)! 2!
5!
= × 0.32 × 0.73
5! 2!
= 𝟎. 𝟑𝟎𝟖𝟕
Therefore:
Note: the specified region could take many forms. For instance, it could be a length, an area, a
volume, a period of time, etc.
1.12.2.1 Application of Poisson Distribution
• The number of deaths by COVID-19 in the global in 2020
• The number of birth defects and genetic mutations
• The number of car accidents in Dar es Salaam city
• The number of typing errors on a page
• The spread of an endangered animal in Sub Saharan Africa
• The number of failure of a machine in one month
Notation
The following notation is helpful, when we talk about the Poisson distribution.
• 𝑒: A constant equal to approximately 2.71828. (Actually, e is the base of the natural
logarithm system.)
• λ: The mean number of successes that occur in a specified region.
• x : The actual number of successes that occur in a specified region.
• P ( x, ) : The Poisson probability that exactly x successes occur in a Poisson
e − x
P ( x, ) = for x = 0,1,2...
x!
Or
𝑒 −𝜆 𝜆𝑥 𝑥 = 0,1,2, ⋯ 𝑛
𝑃(𝑋 = 𝑥) =
𝑥!
Where λ represent the average number of outcomes occurring in the specified time or region.
Furthermore, if X has a Poisson distribution, then 𝐸(𝑋) = 𝜆 and 𝑆𝐷(𝑋) = √𝜆
Examples 1.13
The average number of days a school is closed due to snow during winter in a certain City in USA
is 4. Calculate the probability that the schools in this city will close for 6 days during a winter?
Solution:
Example 1.14
Suppose that on average, 1 person in every 1000 is an alcoholic. Find the probability that a random
sample of 8000 people will yield fewer than 7 alcoholics.
Solution:
Let 𝑥 represent the number of alcoholic persons
1
p( x) = = 0.001, n = 8000
1000
Since p is very small, and n is very large, then
𝜆 = 𝑛𝑝 = 0.001 × 8000 = 8
Now,
p( x 7) = p( x = 0) + p( x = 1) + p( x = 2) + ... + p( x = 6)
e −8 8 0 e −8 81 e −8 8 2 e −8 8 6
= + + + ... + = 0.3134
0! 1! 2! 6!
Example 1.15
The number of customers attended at CRDB bank follows Poisson distribution with a mean of
10 customers per hour, find the probability that in any given hour
Solution:
Data Given
𝜆 = 10
From:
𝑒 −𝜆 𝜆𝑥
𝑃(𝑋 = 𝑥) =
𝑥!
𝑒 −10 ×106
a) 𝑃(𝑋 = 6) = 6!
= 𝟎. 𝟎𝟔𝟑𝟎𝟓𝟓
𝑒 −10 ×100
b) 𝑃(𝑋 = 0) = 0!
= 𝟎. 𝟎𝟎𝟎𝟎𝟒𝟓𝟒
Most often, the equation used to describe a continuous probability distribution is called a
probability density function (PDF). Sometimes, it is referred to as a density function. For
a continuous probability distribution, the density function has the following properties:
1. 0 f ( x) 1
The continuous random variable is defined over a continuous range of values (called the
domain of the variable), the graph of the density function will also be continuous over
that range.
2. f ( x)dx = 1
The area bounded by the curve of the density function and the x-axis is equal to 1, when
computed over the domain of the variable.
b
3. A = f ( x)dx
a
The probability that a random variable assumes a value between a and b is equal to the
area under the density function bounded by a and b.
1. 14 Normal Distribution
Normal distribution is perhaps the single most important probability distribution involving
continuous random variable. By definition, a continuous random variable X has a normal
distribution if its probability distribution function (PDF) is given by:
𝟏 𝑿−𝝁 𝟐
𝟏 − ( )
𝒇(𝒙) = 𝒆 𝟐 𝝈 , −∞ < 𝑋 < ∞
𝝈√𝟐𝝅
Therefore, a normally distributed random variable with a given mean and variance can be
converted to a standard normal variable (aka normal deviate), which greatly simplifies our task
of computing probabilities.
Example 1.15
Find the probability that a 𝑍-score will be greater than 3.00 from the standard normal Table.
Solution:
Required to find 𝑃(𝑍 > 3.00)
𝑃(𝑍 > 3.00) = 0.5 − 𝑃(0 ≤ 𝑍 ≤ 3.00)
= 0.5 − 0.4987
= 0.0013
Example 1.16
It is given that, the daily sale of bread in a bakery, follows the normal distribution with a mean of
70 loaves and variance of 9, i.e 𝑋~𝑁(70,9). What is the probability that on any given day the sale
of bread is greater than 75 loaves?
Solution:
Data Given:
𝜇 = 70, 𝜎 2 = 9, 𝜎 = 3
Let 𝑋 represent the daily sale of the bread in a bakery, then:
𝑋 − 𝜇 75 − 70
𝑃(𝑋 > 75) = 𝑃 ( > )
𝜎 3
= 𝑃(𝑍 > 1.67)
= 𝑃(𝑍 > 1.67)
= 0.5 − 𝑃(0 ≤ 𝑍 ≤ 1.67)
= 0.5 − 0.4525
= 𝟎. 𝟎𝟒𝟕𝟓
Example 1.17
An investor is considering to purchase a stock whose monthly return is approximately normally
distributed with an expected return of 0.01 and a standard deviation of 0.02. Use the standard
normal distribution table to find the probability that the stock return is positive.
Data Given:
𝜇 = 0.01, 𝜎 = 0.02
Let 𝑋 represent the stock return, then required to find:
𝑋 − 𝜇 0 − 0.01
𝑃(𝑋 ≥ 0) = 𝑃 ( ≥ )
𝜎 0.02
= 𝑃(𝑍 ≥ −0.5)
= 𝑃(𝑍 ≥ −0.5)
≅ 𝑃(𝑍 ≤ 0.5)
= 0.5 + 𝑃(0 ≤ 𝑍 ≤ 0.5)
= 0.5 + 0.1915
= 𝟎. 𝟔𝟗𝟏𝟓
Example 1.18
The income in thousands of dollars of a given company are normally distributed with the mean
20 and the standard deviation of 5. Find the probability that a selected income will be
a) More than twenty five thousand dollars
b) Anywhere between eighteen twenty four thousand dollars
Data Given:
𝜇 = 20, 𝜎 = 5
Let 𝑋 represent the income of the company, then required to find:
𝑋 − 𝜇 25 − 20
𝑃(𝑋 > 25) = 𝑃 ( > )
𝜎 5
= 𝑃(𝑍 > 1)
= 𝑃(𝑍 > 1)
= 0.5 − 𝑃(0 ≤ 𝑍 ≤ 1)
= 0.5 − 0.0000393
= 𝟎. 𝟒𝟗𝟗𝟗
18 − 20 𝑋 − 𝜇 24 − 20
𝑃(18 ≤ 𝑋 ≤ 24) = 𝑃 ( ≤ ≤ )
5 𝜎 5
= 𝑃(−0.4 ≤ 𝑍 ≤ 0.8)
≅ 𝑃(0 ≤ 𝑍 ≤ 0.4) + 𝑃(0 ≤ 𝑍 ≤ 0.8)
= 0.1554 + 0.2881
= 𝟎. 𝟒𝟒𝟑𝟓
Example 1.19
Applicants for a certain job are given an aptitude test. Past experience shows that score from the
test are normally distributed with a mean of 60 points and a standard deviation of 12 points. What
percentage of candidates would be expected to pass the test, if a minimum score of 75 is required?
Data Given;
𝜇 = 60 𝜎 = 12
Let 𝑋 represent the scores of candidate, required to find 𝑃(𝑋 ≥ 75)
Solution:
𝑋 − 𝜇 75 − 60
𝑃(𝑋 ≥ 75) = 𝑃 ( ≥ )
𝜎 12
= 𝑃(𝑍 ≥ 1.25)
= 0.5 − 𝑃(0 ≤ 𝑍 ≤ 1.25)
= 0.5 − 0.3944
= 01056
Conclusion: Almost 10.56% of candidates would be expected to pass the test
Example 1.20