Professional Documents
Culture Documents
Main Module Business Statistics Class 4
Main Module Business Statistics Class 4
STATISTICS
MAIN MODULE
SESSION - IV
FLOW N HANDS ON
Probability
Distributions
Discrete Continuous
Probability Probability
Distributions Distributions
Binomial Normal
Poisson Uniform
Exponential
One important function of the Ricknel AIS is to continuously audit accounting information, looking for errors or incomplete or
improbable information. For example, when customers submit orders online, the Ricknel AIS reviews the orders for possible
mistakes.
Any questionable invoices are tagged and included in a daily exceptions report. Recent data collected by the company show that
the likelihood is 0.10 that an order form will be tagged. As a member of the AIS team, you have been asked by Ricknel
management to determine the likelihood of finding a certain number of tagged forms in a sample of a specific size. For
example, what would be the likelihood that none of the order forms are tagged in a sample of four forms? That one of the order
forms is tagged? How could you determine the solution to this type of probability problem?
The event of interest is defined as a tagged order form. You want to determine the number of tagged order forms in a given
sample of orders.
If the sample contains four orders, there could be none, one, two, three, or four tagged order forms. No other value can occur
because the number of tagged order forms cannot be more than the sample size, n, and cannot be less than zero. Therefore,
the range of the binomial variable is from 0 to n.
What is the probability of having three tagged order forms in a sample of four orders in this particular sequence? Because the
historical probability of a tagged order is 0.10, the probability that each order occurs in the sequence is
Each outcome is essentially independent of the others because the order forms were selected from an extremely large or
practically infinite population and each order form could only be selected once. Therefore, the probability of having this
particular sequence is
represents the probability of exactly x events of interest from n observations in a particular sequence.
is the number of combinations of the x events of interest from the n observations possible.
d. What are the mean and standard deviation of the binomial distribution for the number of orders filled correctly?
A manufacturing company regularly conducts quality control checks at specified periods on the products it manufactures.
Historically, the failure rate for LED light bulbs that the company manufactures is 5%. Suppose a random sample of 10
LED light bulbs is selected. What is the probability that
c. no questions correct?
P(X=x|5, 0.5)
.6
.4
.2
0
Here, n = 5 and π = 0.5.
0 1 2 3 4 5 x
Whenever p = 0.5, the binomial distribution is symmetrical, regardless of how large or small the value of n. When p ≠ 0.5, the
distribution is skewed. The closer p is to 0.5 and the larger the number of observations, n, the less skewed the distribution
becomes.
30/09/22 Business Statistics: MAIN MODULE SESSION IV
30/09/22 Business Statistics: MAIN MODULE SESSION IV
Poisson Distribution
Many studies are based on counts of the occurrences of a particular event in a fixed interval of time or space (often referred
to as an area of opportunity). In such an area of opportunity there can be more than one occurrence of an event. The Poisson
distribution, named after French mathematician Denis Poisson, can be used to compute probabilities in such situations.
x
e
P( X x | )
x!
Where,
The positive real number λ is equal to the expected value of X and also to its variance
What is the probability that in a given minute, exactly two customers will arrive?
And what is the probability that more than two customers will arrive in a given minute?
a. Making the assumptions for X to be distributed as a Poisson random variable, if you purchased a Ford in the 2015 model
year, what is the probability that in the past 12 months, the car had more than two problems.
b. zero problems?
d. Give an operational definition for problem. Why is the operational definition important in interpreting the initial quality
score?
An operational definition for problem can be “a specific feature in the
car that is not performing according to its intended designed function.”
a. What properties must be true about the situation described here in order to use the Poisson distribution to calculate
probabilities concerning the number of phone calls received in a one-minute period?
Assuming that this situation matches the properties discussed in (a), what is the probability that during a one-minute
period
d. What is the maximum number of phone calls that will be received in a one-minute period 99.99% of the time?
In the MyTVLab scenario, you are examining the load time, a continuous numerical variable.
You are no longer considering a table of discrete (specific) values, but a continuous range of values. For example, the phrase
“load times are between 5 and 9 seconds” includes any value between 5 and 9 and not just the values 5, 6, 7, 8, and 9.
If you plotted the phrase on a graph, you would draw a continuous line from 5 to 9 and not just plot five specific points.
When you add information about the shape of the range of values, such as two-thirds of the load times are between 5 and 9
seconds or about 95% of the load times are between 3 and 11 seconds, you can visualize the plot of all values as an area
under a curve.
If that area under the curve follows the well-known pattern of certain continuous distributions, you can use the continuous
probability distribution for that pattern to estimate the likelihood that a load time is within a range of values.
That is, the probability that X takes on a value in the interval [a, b] is the area above this interval and under the graph of the
density function, as illustrated in Figure below. The graph of f(x) is often referred to as the density curve.
Although a pdf is defined for -∞ ≤ x ≤ ∞, we will typically display a pdf for the values in its support, and it is always
understood that f(x) = 0 otherwise.
For f(x) to be a legitimate pdf, it must satisfy the following two conditions:
for all x
When X is a discrete random variable, each possible value is assigned positive probability. This is not true of a continuous
random variable, because the area under a density curve that lies above any single value is zero:
The probability that X lies in some interval between a and b does not depend on whether the lower limit a or the upper
limit b is included in the probability calculation:
Climate change has made effective modeling and management of floodwaters ever more important in coastal areas. One variable
of particular importance is the flow rate of water above some minimum threshold (typically where the rate becomes hazardous
and requires intervention). The following pdf of X = hazardous flood rate (m 3/s) is suggested under certain conditions by the
article “A Framework for Probabilistic Assessment of Clear-Water Scour Around Bridge Piers” (Structural Safety 2017: 11–22):
The graph of f(x) is given in Figure below ; there is no density associated with flow rates below 10 m 3/s, because such flow
rates are deemed nonhazardous under these particular conditions. The flow rate density decreases rapidly R (exponentially
fast) as x increases from 10.
1. The probability that flood rate is at most 50 m3/s is
2. The probability the flood rate hits at least 200 m3/s, the point at which a nearby bridge will collapse