Professional Documents
Culture Documents
Probability Distribution
Probability Distribution
Probability Distribution
Probability Distributions
An example will make clear the relationship between random variables and
probability distributions. Suppose you flip a coin two times. This simple
statistical experiment can have four possible outcomes: HH, HT, TH, and TT.
Now, let the variable X represent the number of Heads that result from this
experiment. The variable X can take on the values 0, 1, or 2. In this example,
Consider the coin flip experiment described above. The table below, which
associates each outcome with its probability, is an example of a probability
distribution.
2
Number of heads Probability
0 0.25
1 0.50
2 0.25
variable X.
3
Random Variable Definition
4
Expectation and Variance
The expected value (or mean) of X, where X is a discrete random variable, is a
weighted average of the possible values that X can take, each value being
weighted according to the probability of that event occurring. The expected value
of X is usually written as E(X) or m.
E(X) = S x P(X = x)
So the expected value is the sum of: [(each of the possible outcomes) × (the
probability of the outcome occurring)].
In more concrete terms, the expectation is what you would expect the outcome of
an experiment to be on average.
Example
What is the expected value when we roll a fair die?
There are six possible outcomes: 1, 2, 3, 4, 5, 6. Each of these has a probability
of 1/6 of occurring. Let X represent the outcome of the experiment.
Therefore P(X = 1) = 1/6 (this means that the probability that the outcome of the
experiment is 1 is 1/6)
P(X = 2) = 1/6 (the probability that you throw a 2 is 1/6)
P(X = 3) = 1/6 (the probability that you throw a 3 is 1/6)
P(X = 4) = 1/6 (the probability that you throw a 4 is 1/6)
P(X = 5) = 1/6 (the probability that you throw a 5 is 1/6)
P(X = 6) = 1/6 (the probability that you throw a 6 is 1/6)
E(X) = 1×P(X = 1) + 2×P(X = 2) + 3×P(X = 3) + 4×P(X=4) + 5×P(X=5) +
6×P(X=6)
Therefore E(X) = 1/6 + 2/6 + 3/6 + 4/6 + 5/6 + 6/6 = 7/2
So the expectation is 3.5 . If you think about it, 3.5 is halfway between the
possible values the die can take and so this is what you should have expected.
Expected Value of a Function of X
To find E[ f(X) ], where f(X) is a function of X, use the following formula:
E[ f(X) ] = S f(x)P(X = x)
5
Example
For the above experiment (with the die), calculate E(X2)
Using our notation above, f(x) = x2
f(1) = 1, f(2) = 4, f(3) = 9, f(4) = 16, f(5) = 25, f(6) = 36
P(X = 1) = 1/6, P(X = 2) = 1/6, etc
So E(X2) = 1/6 + 4/6 + 9/6 + 16/6 + 25/6 + 36/6 = 91/6 = 15.167
The expected value of a constant is just the constant, so for example E(1) = 1.
Multiplying a random variable by a constant multiplies the expected value by that
constant, so E[2X] = 2E[X].
A useful formula, where a and b are constants, is:
E[aX + b] = aE[X] + b
[This says that expectation is a linear operator].
Variance
The variance of a random variable tells us something about the spread of the
possible values of the variable. For a discrete random variable X, the variance of
X is written as Var(X).
Var(X) = E[ (X – m)2 ] where m is the expected value E(X)
This can also be written as:
Var(X) = E(X2) – m2
The standard deviation of X is the square root of Var(X).
Note that the variance does not behave in the same way as expectation when we
multiply and add constants to random variables. In fact:
Var[aX + b] = a2Var(X)
You is because: Var[aX + b] = E[ (aX + b)2 ] - (E [aX + b])2 .
= E[ a2X2 + 2abX + b2] - (aE(X) + b)2
= a2E(X2) + 2abE(X) + b2 - a2E2(X) - 2abE(X) - b2
= a2E(X2) - a2E2(X) = a2Var(X)
6
What is a Probability Distribution?
A probability distribution is a table or an equation that links each
possible value that a random variable can assume with its probability of
occurrence.
7
The table below shows the probabilities associated with each possible
value of X. The probability of getting 0 heads is 0.25; 1 head, 0.50; and
2 heads, 0.25. Thus, the table is an example of a probability
distribution for a discrete random variable.
0 0.25
1 0.50
2 0.25
8
Continuous Probability
Distributions
Continuous probability
distribution: A probability distribution in which the
random variable X can take on any value (is continuous).
Because there are infinite values that X could assume, the
probability of X taking on any one specific value is zero. Therefore
we often speak in ranges of values (p(X>0) = .50). The normal
distribution is one example of a continuous distribution. The
probability that X falls between two values (a and b) equals the
integral (area under the curve) from a to b:
9
y=1
y = 1 - 0.5x
10
Binomial Distribution
11
Criteria of Binomial Distribution
Binomial distribution models the probability of occurrence of an event
when the specific criteria are met. Binomial distribution involves the
following rules that must be present in the process in order to use the
binomial probability formula:
1. Fixed trials
The process under investigation must have a fixed number of trials that
cannot be altered in the course of the analysis. During the analysis, each
trial must be performed in a uniform manner, although each trial may yield
a different outcome.
2. Independent trials
The other condition of a binomial probability is that the trials are
independent of each other. In simple terms, the outcome of one trial
should not affect the outcome of the subsequent trials.
12
In the next trial, there will be 49 boys out of 999 students. The probability
of picking a boy in the next trial is 0.049. It shows that in subsequent trials,
the probability from one trial to the next will vary slightly from the prior
trial.
13
Example of Binomial Distribution
Suppose, according to the latest police reports, 80% of all petty crimes are
unresolved, and in your town, at least three of such petty crimes are
committed. The three crimes are all independent of each other. From the
given data, what is the probability that one of the three crimes will be
resolved?
Solution
The first step in finding the binomial probability is to verify that the situation
satisfies the four rules of binomial distribution:
Next:
We find the probability that one of the crimes will be solved in the three
independent trials. It is shown as follows:
= 0.2 x 0. 8 x 0.8
= 0.128
14
= 0.128
= 0.128
= 0.384
Where:
15
Properties of binomial distribution
1. Binomial distribution is applicable when the trials are independent and
each trial has just two outcomes success and failure.
This means that if the values of n and p are known, then the distribution is
known completely.
μ = np
To know the mode of binomial distribution, first we have to find the value of
(n+1)p.
16
σ² = npq
npq < np
That is, variance of a binomial variable is always less than its mean.
and
Then (X+Y) will also be a binomial variable with the parameters (n₁ + n₂) and p
17
Poisson Distribution
What Is a Poisson Distribution
In statistics, a Poisson distribution is a statistical distribution that shows how many
times an event is likely to occur within a specified period of time. It is used for
independent events which occur at a constant rate within a given interval of time.
The Poisson distribution is a discrete function, meaning that the event can only be
measured as occurring or not as occurring, meaning the variable can only be
measured in whole numbers. Fractional occurrences of the event are not a part
of the model. it was named after French mathematician Siméon Denis Poisson.
DEFINITION
normal distribution
The standard normal distribution is a special case of the normal distribution . It is the distribution that
occurs when a normal random variable has a mean of zero and a standard deviation of one.
The normal random variable of a standard normal distribution is called a standard score or a z score. Every
normal random variable X can be transformed into a z score via the following equation:
z = (X - μ) / σ
where X is a normal random variable, μ is the mean, and σ is the standard deviation.
6) The total area of normal curve is 1. The area to the left and
the area to the right of the curve is 0.5.
19
8) The curve becomes parallel to x-axis which is supposed to
meet it at infinity.
2) In case the sample size is large the normal distribution serves
as good approximation.
20