Download as pdf or txt
Download as pdf or txt
You are on page 1of 23

BAMS1743 QUANTITATIVE METHODS

CHAPTER 5: PROBABILITY DISTRIBUTION

PROBABILITY DISTRIBUTION

 A frequency distribution gives an analysis of the number of times


each particular value occurs in a set of items. A probability
distribution simply replaces the actual numbers (frequencies) with
proportions of the total.

 For example, in a statistics test, the marks out of ten awarded to 50


students might be as follows:

Marks out of 10 Number of students Proportion or prob.


(Freq. distribution) (prob. distribution)
0 0 0.00
1 0 0.00
2 1 0.02
3 2 0.04
4 4 0.08
5 10 0.20
6 15 0.30
7 10 0.20
8 6 0.12
9 2 0.04
10 0 0.00
Total 50 1.00

 A probability distribution of a discrete random variable is called a


discrete probability distribution. The discrete probability
distributions:

 the Binomial distribution;


 the Poisson distribution.

 A probability distribution of a continuous random variable is called a


continuous probability distribution. A very important continuous
probability distribution is the normal distribution.
BINOMIAL DISTRIBUTION

A Binomial experiment must satisfy the following 4 conditions:


1. There are n identical trials
 an experiment is repeated n times under identical conditions
2. Each trial has only two possible outcomes
 success or failure
 Success or failure can be interpreted in any way that is
convenient.
For example (i) situation good or bad
(ii) item defective or non-defective
(iii) company gain profit or loss
3. The probabilities of the two outcomes remain constant for each
independent trial.
 The probability of success is denoted by p
 The probability of failure is denoted by q
 p+q=1
4. The trials are independent
 Outcome of one trial does not affect the outcome of another trial

Let random variable X ≡ number of successes in n independent trials


 X is having Binomial distribution with parameters n and p
 denoted by X ~ B(n, p)
The probability of obtaining exactly x successes in n trials is given by
P( X  x )  nCx p x (1 p)nx; x  0, 1, , n
Where n ≡ total number of trials
p ≡ probability of success
x ≡ number of successes in n trials

 For a Binomial distribution,


 Mean,  = np
 Variance, 2 = np(1 – p) = npq
 Standard deviation,  = npq

Example:

Suppose we are interested in the number of heads showing face up on


the three tosses of a coin. The possible results are (0 head), (1 head), (2
heads) and (3 heads).

(a) Find the probability for each number of heads.


(b) If X denotes the number of heads, find the mean and standard
deviation of X.

Solution:

Let X ≡ number of heads showing face up on the three tosses of a coin.


X ~ B (n = 3, p = 0.5)

(a) P( X  x )  nCx p x (1 p)nx ; x  0, 1, , n


 3Cx (0.5)x (0.5)3x ; x  0, 1, 2, 3

X=x P( X = x )
0  3C (0.5)0(0.5)3  0.125
0
1  3C (0.5)1(0.5)2  0.375
1
2  3C (0.5)2(0.5)1  0.375
2
3  3C (0.5)3(0.5)0  0.125
3
1.000

(b) Mean,  = np = 3 × 0.5 = 1.5


Standard deviation,
  npq  3  0.5  0.5  0.75  0.8660

Example:

The probability that a college student will graduate is 0.4. Determine the
probability that out of 5 students

a) none, b) 1, c) at least 1, d) all will graduate

Solution:

Let X ≡ number of students that will graduate out of 5 students


X ~ B( n = 5, p = 0.4 )
P( X  x )  5Cx 0.4 x (0.6)5x ; x  0,1, ,5
a) P( X  0)  5C 0.40 (0.6)5  0.07776  0.0778
0
b) P ( X  1)  5C 0.41(0.6)4  0.2592
1

c) P( X  1)  P( X  1)  P( X  2)  P( X  3)  P( X  4)  P( X  5)
Or
P( X  1)  1 P( X  1)
 1  P ( X  0)
 1  0.0778
 0.9222

d) P( X  5)  5C5 0.45 (0.6)0  0.01024  0.0102

Example:

A new-car salesperson knows from past experience that she will make a
sale to about 20% of her customers. Find the probability that in five
randomly selected attempts, she makes sale to
(a) exactly three customers;
(b) at most one customer;
(c) at least one customer;
(d) fewer than three customers;
(e) more than three customers.

If X denotes the number of sales made out of the five attempts, find the
mean and standard deviation of X.
(Answer: (a) 0.0512, (b) 0.7373, (c) 0.6723 (d) 0.9421,
(e) 0.0067, 1.0, 0.8944)

Solution:
Let X ≡ number of sales made out of 5 attempts
X ~ B( n = 5, p = 0.2 )

P( X  x )  5Cx (0.2)x (0.8)5x; x  0, 1, 2, 3, 4, 5


Remarks:
Students may encounter inequalities as listed:
Inequality sign
At least x≥
At most x≤
Less than / fewer than x<
Exceed / more than x>
Not more than x≤
Not less than x≥

Notes:
 The mean of binomial distribution is also equal to its expected value.
 Standard deviation of a probability distribution is a measure of
dispersion. When  is small, the probability that we will get a value
close to the mean is high and when  is large, we are more likely to
get a value far away from the mean.
POISSON DISTRIBUTION
A discrete random variable having probability function of the form
e   x
P( X  x)  ; x  0,1,2, 
x!
where  can take any positive value, is said to follow the Poisson
distribution.

If an event is randomly scatted in time (or space) and has mean number
of occurrence  in a given interval of time (or space) and X is the random
variable “the number of occurrences in the given interval”.
 X is a Poisson distribution with parameter 
 denoted by X ~ Po () and x = 0, 1, 2, ...

Note: If  is not given, then  = np

For a Poisson distribution with parameter ,


 mean =  and variance = 

Example:

The mean number of bacteria per ml of a liquid is known to be 4. Assuming


that the number of bacteria follows a Poisson distribution, find the
probability that, in 1 ml of liquid, there will be
a) no b) 4 c) less than 3 bacteria

Solution:

Let X ≡ the number of bacteria in 1 ml of liquid.


e 4 4 x
 X ~ Po (4) where P( X  x)  ; x  0,1,2, 
x!
a) b)

c)
Example:

Based on information obtained in previous example, find the probability


that
a) in 3 ml of liquid, there will be less than 2 bacteria.
b) in 0.5 ml of liquid, there will be more than 2 bacteria.

Solution:

a) In 1 ml of liquid we expect 4 bacteria


 In 3 ml of liquid we expect 4 × 3 = 12 bacteria

Let Y ≡ the number of bacteria in 3 ml of liquid.


e 1212 y
 Y ~ Po (12) where P(Y  y)  ; y  0,1,2, 
y!

P(Y  2)  P(Y  0)  P(Y  1)


e 1212 0 e 12121
 
0! 1!
5
 7.987  10

b)

Poisson approximation to the Binomial distribution

 A Binomial distribution with parameters n and p can be approximated


by a Poisson distribution with parameter  = np if n is large and p is
small (such that np < 5).
 The approximation gets better as n   and p  0
 X ~ B(n, p)  X ~ Po (  np) when n   and p  0
Example:

Eggs are packed in boxes of 500. On average, 0.8% of the eggs are found
to be broken when the eggs are unpacked. Find the probability that in a
box of 500 eggs
(i) exactly 3 (ii) less than 2 (iii) more than 2 will be broken.

Solution:

NORMAL DISTRIBUTION

The normal distribution is an important probability distribution that is often


applied to continuous variables.

The Characteristics of a Normal Distribution


 The normal curve representing the normal probability distribution is often
described as being bell-shaped and is symmetrical about its central
mean.
f (x)

X
Mean
 The two tails of the normal curve extend indefinitely in both directions, i.e.
they never actually touch the X-axis.
 Total area under the normal curve equals to total probability, i.e. one.
 The normal curve has a single peak at the exact center of the distribution.
 The mean, median and mode of the distribution are equal and located at
the peak.
 A normal distribution is completely specified by two parameters the mean
() and the standard deviation ()
1 ( x  )2 2 2 
f (x)  e  

2 2
where    x   , e = 2.7183 and  = 3.142
 If the random variable X (for e.g. height, weight etc.) has a normal
distribution with mean () and standard deviation () then X is denoted
by X ~ N (, 2).
 Every different combination of  and  would generate a different normal
probability distribution. The shape of the normal curve depends on the
value of standard deviation; a larger standard deviation results a flatter,
wider spread distribution. The value of the mean determines the position
of the normal curve.
 The relative dispersion of frequencies around the mean can be measured
exactly in terms of standard deviations. Examples:
(a) About 68.26% of frequencies have a value within   .
(b) About 95.45% of frequencies have a value within   2.
(c) About 99.73% of frequencies have a value within   3.
STANDARD NORMAL DISTRIBUTION

 Normal distribution with mean,  = 0 and standard deviation,  = 1 is


known as standard normal distribution.
 Any normally distributed set of values x can be converted into the
standard normal values z by using the formula
x μ
z
σ
which indicates the number of standard deviations away from the mean.
X μ
 If the random variable X ~ N ( , 2 ) then Z  and Z ~ N (0, 12 ) is
σ
known as a standard normal variable or standardized normal variable.
 By determining the Z value using the above formula, we can find the area
or the probability under the normal curve by referring to the standard
normal distribution tables.

Note:
(a) Area under the whole curve = 100% = 1

z
0

(b) P( Z > 0 ) = P( Z < 0 ) = 50% = 0.5

z
0

(c) P( Z  z ) = P( Z > z )

z
0

(d) P( Z < –z ) = P( Z > z )

z
0

(e) P( Z < z ) = 1 – P( Z > z )

(f) P( Z > –z ) = 1 – P( Z > z ) z


0

z
0
E.g. Take z = 1, P ( Z > 1 ) = 0.1587

z
0
E.g. Take z = 2.77, P ( Z > 2.77 ) = 0.00280

z
0
Example:

The lifetime of an electrical component is known to follow a normal


distribution with a mean 2000 hours and a standard deviation 200 hours. If
a component is randomly selected find the probability that the component
will last
(a) more than 2200 hours;
(b) less than 1800 hours;
(c) at least 1900 hours;
(d) not more than 2100 hours;
(e) between 1800 and 2200 hours;
(f ) between 2100 and 2300 hours;
(g) between 1700 and 1900 hours;
(h) between 2000 and 2400 hours.
Solution:
Let X ≡ lifetime of electrical component (hours)
X ~ N ( = 2000,  2 = 200 2 )

(a) P ( X > 2200 ) = P ( Z > 2200  2000 )


200
=P(Z>1)
= 0.1587 z
0
(b) P ( X < 1800 ) = P( Z < 1800  2000 )
200
= P( Z < – 1 )
= P( Z > 1 ) = 0.1587
z
(c) P ( X ≥ 1900 ) = P ( Z ≥ 1900  2000 ) 0
200
= P ( Z ≥ – 0.5 )
= P ( Z < 0.5 )
= 1 – P ( Z > 0.5 ) z
0
= 1 – 0.3085 = 0.6915

(d) P ( X ≤ 2100 ) = P( Z ≤ 2100  2000 )


200

z
0
= P ( Z ≤ 0.5 )
= 1 – P ( Z > 0.5 )
= 1 – 0.3085 = 0.6915
(e) P ( 1800 < X < 2200 )
= P( 1800  2000 < Z < 2200  2000 )
200 200
=P(–1<Z<1)
= 1 – P ( Z > 1 ) – P( Z < – 1 ) z
0
=1–2P(Z>1)
= 1 – 2(0.1587)
= 0.6826
(f) P ( 2100 < X < 2300 )
= P ( 2100  2000 < Z < 2300  2000 )
200 200
= P ( 0.5 < Z < 1.5 ) z
= P ( Z > 0.5 ) – P ( Z > 1.5 ) 0
= 0.3085 – 0.0668
= 0.2417
(g) P ( 1700 < X < 1900 )
= P ( 1700  2000 < Z < 1900  2000 )
200 200
= P (– 1.5 < Z < – 0.5 )
= P ( 0.5 < Z < 1.5 ) z
= 0.2417 0

(h) P ( 2000 < X < 2400 )


= P ( 2000  2000 < Z < 2400  2000 )
200 200
=P(0<Z<2)
= P ( Z > 0 ) – P( Z > 2 ) z
= 0.5 – 0.02275 0
= 0.47725
Example:

Refer to the previous Example:


(a) Determine the lifetime of the component such that only 10% of the
components will fail before that time.
(b) If the mean life time of the component is to be increased such that
20% of the components will last for more than 2200 hours, find the
new mean life.

Solution:

(a) Let k ≡ the required lifetime (hours)


P( X < k ) = 0.1
P( Z < k  2000 ) = 0.1
200
From table, k  2000   1.2816 z
200 0
k   1.2816  200  2000
 1743.68 hours  1744 hours

(b) Let 1 ≡ the mean life required


P( X > 2200 ) = 0.2
2200  
P( Z > 1 ) = 0.2
200
2200   z
From table, 1  0.8416
200 0
1  0.8416  200  2200
1  2031.68 hours
 2032 hours

Example:

A local bank has 4 teller windows. By monitoring the customer waiting times,
the mean is found to be 3.8 min, the standard deviation is 1.5 min, and the
distribution is normal. Past experience has shown that customers become
irritated when the waiting time exceeds 5 min.
(a) What proportion of customers will be irritated by the present system?
(b) By adding a teller to the system, the percentage of customers irritated
would be reduced to only 10%. What is the mean waiting time with an extra
teller? Assuming that the standard deviation remains unchanged.
Solution:

Example:

The grade-point averages of students in a large college are normally


distributed, with a mean 2.4 and a standard deviation 0.5.
(a) What proportion of the students possesses grade-point average in
excess of 3.0?
(b) If students who possess grade-point average below 1.9 are excluded
from the college, what percentage of the students will be excluded?
(c) Suppose that three students are selected at random from the student
population. What is the probability that all three possess grade-point
average in excess of 3.0?

Solution:
Normal approximation to the Binomial distribution

 In a Binomial situation when np > 5 and nq > 5, the Normal distribution


with mean=  =np and standard deviation=  = np(1  p) can be
used to approximate the Binomial distribution.
 X ~ B(n, p)  X ~ N ( ,  ) when np > 5 and nq > 5
2

Continuity correction factor


 The addition and/or subtraction of 0.5 from the value(s) of x when the
Normal distribution is used as an approximation to the Binomial
distribution, where x is the number of successes in n trials

Remarks:
Binomial Normal
P( X = 2 ) P( 1.5 < X < 2.5 )
P( 3  X  5 ) P( 2.5 < X < 5.5 )
P( 3  X  5 ) P( 3.5 < X < 5.5 )
P( 3  X  5 ) P( 2.5 < X < 4.5 )
P( 3  X  5 ) P( 3.5 < X < 4.5 )
P( X < 4 ) P( X < 3.5 )
P( X  4 ) P( X < 4.5 )
P( X  4 ) P( X > 3.5 )
P( X > 4 ) P( X > 4.5 )
P( X  0 ) P( X > – 0.5 )
P( X > 0 ) P( X > 0.5 )
P( X = 0 ) P( – 0.5 < X < 0.5 )

Example:

Find the probability that 200 tosses of a bias coin will result in less than
51 heads in which the probability of getting a head is 0.2 for each toss.

Solution:
Example:

Find the probability of getting between 3 and 6 heads inclusive in 10


tosses of a fair coin by using
(a) the Binomial distribution,
(b) the Normal approximation to the Binomial distribution.

Solution:

Example:

Find the probability that 200 tosses of a fair coin will result in
(a) between 80 and 120 heads inclusive,
(b) less than 90 heads,
(c) less than 85 or more than 115 heads.

Solution:

16
The Normal Approximation to the Poisson distribution

If X ~ Po ( ) then mean    and variance    .


2

 For large , X ~ N ( ,  ) approximately.
 Generally, we require  > 20 for a good approximation.
 X ~ Po ( )  X ~ N ( ,  ) when  is large.

Example:

The number of calls received by an office switchboard per hour follows a


Poisson distribution with parameter 30. Using the normal approximation
to the Poisson distribution, find the probability that in one hour, there are
(a) more than 23 calls, (b) between 25 and 28 calls inclusive,
(c) 34 calls.

Solution:

17
Taken from Statistical Tables by J Murdoch and JA Barnes

18
Taken from Statistical Tables by J Murdoch and JA Barnes

19
BAMS1743 QUANTITATIVE METHODS
Tutorial 5 (Probability Distribution)

1. 85% of households in an area have a notebook. If six households are


randomly selected from the area, what is the probability that the
number having a notebook is
(a) exactly four,
(b) at least four,
(c) at most five,
(d) between two and five inclusive,
(e) exceed four,
(f) fewer than four.

2. On a rainy day, 10% of the production employees at Super Steel are


absent from work. Ten production employees are to be selected at
random for a study on absenteeism.
(a) What is the random variable in this problem?
(b) Is the random variable discrete or continuous? Why?
(c) What is the probability that none of the 10 production
employees selected at random on a rainy day is absent?
(d) Compute the mean, variance and standard deviation of the
distribution.

3 A factory finds that, on the average, 20% of the bolts produced by a


given machine will be defective for certain specified requirements. If
10 bolts are selected at random from the day’s production of this
machine, find the probability that
(a) exactly 2 will be defective;
(b) 2 or more will be defective;
(c) more than 5 but at most 8 will be defective.

4. Customers arrive randomly at a department store at an average rate


of 3.4 per minute. Assuming the customers arrivals form a Poisson
distribution, calculate the probability that
(a) no customers arrive in any particular minute,
(b) exactly one customer arrives in any particular minute,
(c) two or more customers arrive in any particular minute,
(d) One or more customers arrive in any 30-second period.

20
5. An insurance company receives on average 2 claims per week from
a certain factory. Assuming that the number of claims follows a
Poisson distribution, find the probability that
(a) it receives more than 3 claims in a given week,
(b) it receives more than 2 claims in a given fortnight,
(c) It receives no claims on a given day, assuming that the factory
operates on a 5-day week.

6. Cars arrive at a petrol station at an average rate of 30 per hour.


Assuming that the number of cars arriving at the petrol station follows
a Poisson distribution, find the probability that
(a) No cars arrive during a particular 5 minutes interval,
(b) More than 3 cars arrive during a 5 minutes interval,
(c) More than 2 cars arrive during a 15 minutes interval,
(d) In a period of half an hour, 10 cars arrive,
(e) Less than 3 cars arrive during a 10 minutes interval.

7. The probability that a particular make of light bulb is faulty is 0.01. The
light bulbs are packed in boxes of 100. Find the probability that in a
certain box there are
(a) no faulty light bulbs,
(b) 2 faulty light bulbs,
(c) More than 3 faulty light bulbs.

8. A sample of 200 items is taken from a large batch, which is known to


produce defective with probability 0.005. Find the probabilities that
there are 0, 1, 2, 3, 4, and 5 defectives in the sample using a Poisson
approximation to the binomial.

9. Orders received by a particular firm have amounts that follow a


normal distribution with a mean of RM103.60 and a standard
deviation of RM8.75.
(a) What percentage of orders will have amount below
RM120.05?
(b) What percentage of orders will have amount over RM92.75?
(c) What will be the amount such that approximately 25% of
orders are for greater amounts?
(d) Above what amount will 90% of orders lie?

21
10. A manufacturer makes chocolate bars with a mean weight of 110
grams and a standard deviation of 2 grams. The weights are normally
distributed.
(a) What proportion of the bars is likely to be less in weight than
106 grams?
(b) The manufacturer decides to make “bigger bars” with the
same standard deviation as before. It is decided that the
covers of these bigger bars will be marked ‘minimum weight
115 grams’. What mean weight will have to be aimed if no
more than one bar in 100 is to be less than 115 grams in
weight?

11. The amount of time required for a certain type of automobile


transmission repair at a service garage is normally distributed with
mean = 45 min, and standard deviation = 8 min. The service manager
plans to have work begin on the transmission of a customer’s car
10min, after the car is dropped off, and he tells the customer that the
car will be ready within one hour in total time.
(a) What is the probability that he will be wrong?
(b) What is the probability that the working time will be between
40 and 52 min?
(c) What is the required working time allotment such that there is
90% chance that the transmission repair will be completed
within that time?
(d) What is the working time allotment such that there is just 30%
chance that the transmission repair can be completed within
that time?

12. The average weight of a packet of sugar packed by a certain machine


is 500g and the standard deviation is 20g. Assume that the weights
are normally distributed. In a batch of 2,000 packets, how many
weight:
(a) (i) more than 520g,
(ii) less than 470g,
(iii) between 520g and 530g.
(b) Find the minimum weight of the 5% heaviest packets.

22
13. 10% of males suffer from a certain disease. Use the normal
approximation to Binomial distribution to find the probability that more
than 60 men in a randomly selected group of 500 will suffer from the
disease.

14. 10% of the chocolates produced in a factory are mis-shapes. In a


sample of 1000 chocolates find the probability that the number of mis-
shapes is
(a) less than 80,
(b) between 90 and 115 inclusive,
(c) 120 or more.

15. The number of bacteria on a plate viewed under a microscope follows


a Poisson distribution with parameter 60. Find the probability that
there are between 55 and 75 bacteria on a plate. A plate is rejected if
less than 38 bacteria are found. If 2000 such plate are viewed, how
many will be rejected?

Answers:
1. (a) 0.1762 (b) 0.9527 (c) 0.6229 (d) 0.6225
(e) 0.7765 (f) 0.0473
2. (c) 0.3487 (d) 1.00, 0.90 0.949
3. (a) 0.3020 (b) 0.6242 (c) 0.0064
4. (a) 0.0334 (b) 0.1135 (c) 0.8531 (d) 0.8173
5. (a) 0.1429 (b) 0.7619 (c) 0.6703
6. (a) 0.0821 (b) 0.2424 (c) 0.9797 (d) 0.0486 (e) 0.1247
7. (a) 0.3679 (b) 0.1839 (c) 0.0190
8. 0.3679, 0.3679, 0.1839, 0.0613, 0.0153, 0.0031
9. (a) 96.99% (b) 89.25% (c)RM109.50 (d) RM92.40
10. (a) 0.02275 (b) 120 gms
11. (a) 0.2643 (b) 0.5463 (c) 55.25 mins (d) 40.80 mins
12. (a) (i) 317 (ii) 134 (iii) 184 (b) 533g
13. 0.0582
14. (a) 0.01539 (b) 0.8149 (c) 0.0197
15. 0.6883, 4

23

You might also like