6.4 Hypergeometricprobability PDF

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 8

M06_SULL8028_03_SE_C06CD.

QXD 7/30/08 6:36 PM Page 6–1

Section 6.4 The Hypergeometric Probability Distribution 6–1

6.4 THE HYPERGEOMETRIC PROBABILITY


DISTRIBUTION
Preparing for This Section Before getting started, review the following:
• Classical Method (Section 5.1, pp. 263–266) • Multiplication Rule of Counting (Section 5.5,
• Independence (Section 5.3, pp. 286–287) pp. 302–305)

• Multiplication Rule (Example 6, Section 5.4, p. 297) • Combinations (Section 5.5, pp. 307–309)

Objectives 1 Determine whether a probability experiment is a hypergeometric


experiment
2 Compute the probabilities of hypergeometric experiments
3 Compute the mean and standard deviation of a hypergeometric
random variable

1 Determine Whether a Probability


Experiment Is a Hypergeometric
Experiment
In Section 6.2, we presented binomial experiments. Recall, the binomial probability
distribution can be used to compute the probabilities of experiments when there are
a fixed number of trials in which there are two mutually exclusive outcomes and the
probability of success for any trial is constant. In addition, the trials must be inde-
pendent. Based on the results from Example 6 in Section 5.3, we learned that, when
small samples are obtained from large finite populations, it is reasonable to assume
independence of events. That is, when obtaining a sample of size n from a population
whose size is N, we are willing to assume independence of the events provided that
n 6 0.05N (the sample size is less than 5% of the population size).
What if the requirement of independence is not satisfied? Under these circum-
stances, the experiment is a hypergeometric experiment.

Criteria for a Hypergeometric Probability Experiment


A probability experiment is said to be a hypergeometric experiment provided:
1. The finite population to be sampled has N elements.
2. For each trial of the experiment, there are two possible outcomes, success
or failure. There are exactly k successes in the population.
3. A sample of size n is obtained from the population of size N without
replacement.

If a probability experiment satisfies these three requirements, the random


variable X, the number of successes in n trials of the experiment, follows the
hypergeometric probability distribution.We now introduce the notation that we will use.

Notation Used in the Hypergeometric Probability Distribution


• The population is size N. The sample is size n.
• There are k successes in the population.
• Let the random variable X denote the number of successes in the sample of
size n, so x must be greater than or equal to the larger of 0 or n - 1N - k2,
and x must be less than or equal to the smaller of n or k.

Copyright © 2010 Pearson Education, Inc.


M06_SULL8028_03_SE_C06CD.QXD 7/30/08 6:36 PM Page 6–2

6–2 Chapter 6 Discrete Probability Distributions

EXAMPLE 1 A Hypergeometric Probability Experiment

Problem: Suppose that a researcher goes to a small college with 200 faculty, 12 of
Historical which have blood type O-negative. She obtains a simple random sample of n = 20
Note of the faculty and finds that 3 of the faculty have blood type O-negative. Is this
experiment a hypergeometric probability experiment? List the possible values of
The name hypergeometric
is attributed to Leonhard Euler. Euler the random variable X, the number of faculty that have blood type O-negative.
was born in Basel, Switzerland, on
April 15, 1707. His father was a
Approach: We need to determine if the three criteria for a hypergeometric exper-
minister and wanted Leonhard to iment have been satisfied.
study theology as well. However, after
a discussion with Johann Bernoulli, a
Solution: This is a hypergeometric probability experiment because
friend from college, Euler’s father 1. The population consists of N = 200 faculty.
allowed him to study mathematics
2. Two outcomes are possible: the faculty member has blood type O-negative or the
at the University of Basel. Euler
completed his studies in 1726. Euler faculty member does not have blood type O-negative. The researcher obtained
married Katharina Gsell on January 7, k = 3 successes.
1734. They had 13 children, only 5 of 3. The sample is size n = 20.
whom survived. Euler claims to have
made many of his greatest discoveries The possible values of the random variable are x = 0, 1, 2, Á , 12. The largest value
with a child in his arms and children of X is 12, because we cannot have more than 12 successes since there are only
crawling at his feet. In 1740, Euler lost 12 faculty with blood type O-negative in the population.
sight in his right eye.One of his famous
quotes on this loss is “Now I will have
less distraction.” He eventually lost
sight in his other eye as well, but this Notice that we cannot use the binomial probability distribution to determine
did not slow him down. Euler died on the likelihood of obtaining three successes in 20 trials in Example 1 because the
September 18, 1783, in St. Petersburg. sample size is large relative to the population size. That is, n = 20 is more than 5%
of the population size, N = 200.
Now Work Problem 5

2 Compute the Probabilities of


Hypergeometric Experiments
The basis for computing probabilities in a hypergeometric experiment lies in the
fact that each sample of size n is equally likely to be chosen. Consider an urn that
contains 8 white chips and 6 black chips for a total of N = 14 chips. If we decide to
randomly select n = 3 chips, all possible combinations of chips are equally likely.
That is, if we let W1, W2, Á , W8 represent the 8 white chips and B1, B2, Á , B6 rep-
resent the 6 black chips, selecting W1, W2, B3 is just as likely as selecting W3, W6, B4.
Notice in both cases that we selected 2 white chips and 1 black chip. So, if X repre-
sents the number of black chips selected, we have x = 1 in both cases; however, the
chips selected are different (so each represents a different sample).

Hypergeometric Probability Distribution


The probability of obtaining x successes based on a random sample of size n
from a population of size N is given by

1kCx21N - kCn - x2
P1x2 = (1)
NCn

where k is the number of successes in the population.

The logic behind Formula (1) is based on the Classical Method given on page
263, along with the Multiplication Rule of Counting given on page 304. The Classi-
cal Method for computing probabilities states that the probability of an event is the
number of ways the event can occur, divided by the total number of outcomes in

Copyright © 2010 Pearson Education, Inc.


M06_SULL8028_03_SE_C06CD.QXD 7/30/08 6:36 PM Page 6–3

Section 6.4 The Hypergeometric Probability Distribution 6–3

the experiment. The denominator of Formula (1) represents the number of ways n
objects can be selected from N objects. This represents the number of possible out-
comes in the experiment. The numerator consists of two factors. The first factor,
kCx, represents the number of ways we can select the x successes from the k suc-
cesses in the population. The second factor, 1N - k2C1n - x2, represents the number of
ways we can select n - x failures from the N - k failures in the population. Using
the Multiplication Rule of Counting, we find the number of ways we could obtain x
successes from n trials of the experiment.

EXAMPLE 2 Using the Hypergeometric Probability Distribution

Problem: Suppose a researcher goes to a small college of 200 faculty, 12 of which


have blood type O-negative. She obtains a simple random sample of n = 20 of the
faculty. Let the random variable X represent the number of faculty in the sample of
size n = 20 that have blood type O-negative.
(a) What is the probability that 3 of the faculty have blood type O-negative?
(b) What is the probability that at least one of the faculty has blood type O-negative?

Approach: This is a hypergeometric experiment with N = 200, n = 20, and


k = 12. The possible values of the random variable X are x = 0, 1, 2, Á , 12. (Our
sample cannot have more than k = 12 faculty with blood type O-negative.) We use
Formula (1) to compute the probabilities.

Solution
(a) We are looking for the probability of obtaining 3 successes, so x = 3.
112C321200 - 12C 20 - 32 112C321188C172
P132 = = = 0.0833
C
200 20 200C20

There is a 0.0833 probability that, in a random sample of 20 faculty, exactly 3 have


blood type O-negative. If we conducted this experiment 100 times, we would expect
to select 3 faculty that have blood type O-negative about 8 times.
(b) The phrase at least means greater than or equal to. The values of the random
variable X that are greater than or equal to 1 are 1, 2, 3, Á , 12. Computing proba-
bilities for all these random variables is time consuming. It is much easier to use the
Complement Rule and compute P1X Ú 12 = 1 - P102.
112C021200 - 12C20 - 02 112C021188C202
P1X Ú 12 = 1 - P102 = 1 - = 1 - = 0.7282
200C20 200C20

There is a 0.7282 probability that, in a random sample of 20 faculty, at least 1 has


blood type O-negative. If we conducted this experiment 100 times, we would expect
to select at least one of the faculty that have blood type O-negative about 73 times.

EXAMPLE 3 Using the Hypergeometric Probability Distribution

Problem: The hypergeometric probability distribution is used in acceptance sam-


pling. Suppose that a machine shop orders 500 bolts from a supplier. To determine
whether to accept the shipment of bolts, the manager of the facility randomly selects
12 bolts. If none of the 12 randomly selected bolts is found to be defective, he con-
cludes that the shipment is acceptable.
(a) If 10% of the bolts in the population are defective, what is the probability that
none of the selected bolts are defective?
(b) If 20% of the bolts in the population are defective, what is the probability that
none of the selected bolts are defective?

Copyright © 2010 Pearson Education, Inc.


M06_SULL8028_03_SE_C06CD.QXD 8/12/08 4:00 PM Page 6–4

6–4 Chapter 6 Discrete Probability Distributions

Approach: This is a hypergeometric experiment with N = 500 and n = 12. In


part (a), we have that k = 0.115002 = 50 defectives. The possible values of the ran-
dom variable X are x = 0, 1, 2, Á , 12 (you cannot have more successes than the
sample size). Notice that a success means finding a defective bolt. In part (b), we
have that k = 0.215002 = 100 defectives. The possible values of the random vari-
able X are x = 0, 1, 2, Á , 12. We use Formula (1) to compute the probabilities.

Solution
(a) We are looking for the probability of obtaining 0 successes, so x = 0.
150C021500 - 50C12 - 02 150C021450C122
P102 = = = 0.2783
500C12 500C12

There is a 0.2783 probability that, in a random sample of 12 bolts, none are defective
(if 10% of the bolts in the population are defective). If we conducted this experi-
ment 100 times, we would expect to observe no defective bolts about 28 times.
(b) We are looking for the probability of obtaining 0 successes, so x = 0.
1100C021500 - 100C12 - 02 1100C021 400C122
P102 = = = 0.0665
C
500 12 500C12
There is a 0.0655 probability that, in a random sample of 12 bolts, the manager will
select none that are defective (if 20% of the bolts in the population actually are
defective). If we conducted this experiment 100 times, we would expect to observe
no defective bolts about 7 times. Notice that, as the number of defective bolts
increases, the probability of not selecting a single defective bolt decreases.
Now Work Problems 17(a)–(e)

EXAMPLE 4 Computing Hypergeometric Probabilities Using Technology

Problem: The hypergeometric probability distribution is used in acceptance sam-


pling. Suppose that a machine shop orders 500 bolts from a supplier. To determine
whether to accept the shipment of bolts, the manager of the facility randomly selects
12 bolts. If none of the 12 randomly selected bolts are found to be defective, he con-
cludes that the shipment is acceptable. If 10% of the bolts in the population are
defective, what is the probability that none of the selected bolts are defective?

Approach: Statistical software or graphing calculators with advanced statistical


features have the ability to determine hypergeometric probabilities. We use
MINITAB to determine the probabilities. The steps for determining hypergeomet-
ric probabilities using MINITAB or Excel can be found in the Technology Step-by-
Step on page 6–7.

Solution: We use MINITAB to determine the probability. Recall that N = 500,


k = 50, and n = 12. See Figure 15.

Figure 15 Probability Density Function*


Hypergeometric with N ⫽ 500, M ⫽ 50, n ⫽ 12

x P( X ⫽ x )
0.00 0.278250

Interpretation: There is a 0.2783 probability that, in a random sample of 12 bolts,


none are defective (if 10% of the bolts in the population are defective). If we con-
ducted this experiment 100 times, we would expect to observe no defective bolts
about 28 times.

*MINITAB’s notation differs slightly from the notation that we use in this text. Instead of using k to
represent the number of successes in the population, MINITAB uses M.

Copyright © 2010 Pearson Education, Inc.


M06_SULL8028_03_SE_C06CD.QXD 7/30/08 6:36 PM Page 6–5

Section 6.4 The Hypergeometric Probability Distribution 6–5

3 Compute the Mean and Standard Deviation


of a Hypergeometric Random Variable
We discussed finding the mean and standard deviation of a discrete random variable
in Section 6.1. The formulas can be used to find the mean and standard deviation of
a hypergeometric random variable as well. However, a simpler method exists.

Mean and Standard Deviation of a Hypergeometric Random Variable


A hypergeometric random variable X has mean and standard deviation given
by the formulas

a b n
k N - n # # k #N - k
mX = n # and sX = (2)
N B N - 1 N N
where n is the sample size
k is the number of successes in the population
N is the size of the population

k
The ratio is the proportion of successes in the population. If you look carefully at
N k
the formulas for the mean and standard deviation and replace with p, we almost
N
have the formulas for the mean and standard deviation of a binomial random
N - n
variable. (Note that is a finite population correction factor that approaches
N - 1
1 as the population size increases, while n stays fixed and small relative to N. For
this reason, we ignore its effect on the standard deviation of a binomial random
variable)

EXAMPLE 5 Computing the Mean and Standard Deviation


of a Hypergeometric Random Variable

Problem: Suppose that a researcher goes to a small college of 200 faculty, 12 of


which have blood type O-negative. She obtains a simple random sample of n = 20
of the faculty. Determine the mean and standard deviation of the number of ran-
domly selected faculty that will have blood type O-negative.

Approach: This is a hypergeometric probability experiment with N = 200,


n = 20, and k = 12. We use Formula (2) to find the mean and the standard devia-
tion, respectively.

Solution
k 12
mX = n # = 20 # = 1.2
N 200
and

a b n a b 20
N - n # # k #N - k 200 - 20 # # 12 # 200- 12
sX = = = 1.01
B N - 1 N N B 200 - 1 200 200

Interpretation: We expect that, in a random sample of 20 faculty members, 1.2 will


have blood type O-negative. If we take many different samples of size 20 from this
population, the mean number of faculty that have blood type O-negative will
approach 1.2.
Now Work Problem 17(f)

Copyright © 2010 Pearson Education, Inc.


M06_SULL8028_03_SE_C06CD.QXD 7/30/08 6:36 PM Page 6–6

6–6 Chapter 6 Discrete Probability Distributions

6.4 ASSESS YOUR UNDERSTANDING


Concepts and Vocabulary Applying the Concepts
1. Explain the similarities and differences between the hyperge- 17. Michigan’s Classic Lotto 47 In Michigan’s Classic Lotto 47
ometric probability distribution and the binomial probability NW Lottery, a player must choose 6 numbers between 1 and 47, in-
distribution. clusive. Six balls numbered from 1 and 47 are then randomly
2. What criteria must be satisfied for a random variable X to be selected from an urn. The random variable X represents the
a hypergeometric random variable? number of matching numbers.
(a) What is the probability of matching 3 numbers?
3. When listing the possible values of the hypergeometric ran-
(b) What is the probability of matching 4 numbers?
dom variable X, it must be the case that x is less than or equal
(c) What is the probability of matching 5 numbers?
to the smaller of n or k. Why?
(d) What is the probability of matching 6 numbers?
4. In your own words, explain the logic behind Formula (1). (e) A winning ticket is one in which the player matches 3, 4,
5, or 6 numbers. What is the probability of purchasing a
winning ticket? Would it be unusual to purchase a win-
Skill Building ning ticket?
In Problems 5–8, verify that the following probability experiments (f) What is the mean and standard deviation of the random
represent hypergeometric probability experiments. Then determine variable X? For a randomly selected ticket, how many
the values of N, n, k and list the possible values of the random numbers do you expect to match?
variable X.
18. Got a Pet? In a neighborhood of 95 homes, 35 have pets.
5. In Michigan’s Winfall Lottery, a player must choose 6 num- Suppose that 12 homes are selected at random. The random
NW bers between 1 and 49, inclusive. Six balls numbered variable X represents the number of homes in the sample
between 1 and 49 are then randomly selected from an urn. that have pets.
The random variable X represents the number of matching (a) What is the probability of obtaining 8 homes with a pet?
numbers. (b) What is the probability of obtaining 9 homes with a pet?
(c) What is the probability of obtaining 12 homes with a pet?
6. In a neighborhood of 95 homes, 35 have pets. Suppose that
Would it be unusual to select 12 homes that have a pet?
12 homes are selected at random. The random variable X
(d) What is the mean and standard deviation of the random
represents the number of homes in the sample that have
variable X?
pets.
19. Acceptance Sampling A manufacturer received an order of
7. A manufacturer received an order of 250 computer chips. 250 computer chips. Unfortunately, 12 of the chips are defec-
Unfortunately, 12 of the chips are defective. To test the tive. To test the shipment, the quality-control engineer ran-
shipment, the quality-control engineer randomly selects 20 domly selects 20 chips from the box of 250 and tests them.
chips from the box of 250 and tests them. The random vari- The random variable X represents the number of defective
able X represents the number of defective chips in the chips in the sample.
sample of 20.
(a) What is the probability of obtaining 4 defective chips?
8. A baseball team has 25 players, 7 of whom bat left-handed. (b) What is the probability of obtaining 3 defective chips?
Suppose that the manager of this team is frustrated with (c) What is the probability that the quality-control engineer
the way the team is playing, so he decides to randomly will not find any defective chips?
select 9 players to play in the upcoming game. The random (d) What is the probability of obtaining 14 defective chips?
variable X is the number of left-handed batters in the (e) How many defective chips would you expect to select?
game.
20. Baseball Lineup A baseball team has 25 players, 7 of whom
In Problems 9–12, a hypergeometric probability experiment is bat left-handed. Suppose that the manager of this team is
conducted with the given parameters. Compute the probability frustrated with the way the team is playing, so he decides to
of obtaining x successes. randomly select 9 players to play in the upcoming game. The
random variable X will be the number of left-handed batters
9. N = 150, n = 20, k = 30, x = 5 in the game.
10. N = 60, n = 8, k = 25, x = 3 (a) What is the probability of creating a lineup with 2 lefties?
(b) What is the probability of creating a lineup with 1 lefty?
11. N = 230, n = 15, k = 200, x = 12 (c) What is the probability of creating a lineup with no lefties?
12. N = 150, n = 10, k = 10, x = 1 (d) What is the probability of creating a lineup with 8 lefties?
(e) How many lefties would you expect to find in the lineup?
In Problems 13–16, compute the mean and standard deviation 21. Hung Jury A hung jury is one that is unable to come to a
of the hypergeometric random variable X. unanimous decision regarding the guilt of the defendant.
13. N = 150, n = 20, k = 30 Suppose that there is a pool of 30 potential jurors, but 2 of
the 30 potential jurors would never be willing to convict,
14. N = 60, n = 8, k = 25 regardless of the evidence presented. What is the proba-
bility that the trial will result in a hung jury, regardless of
15. N = 230, n = 15, k = 200
the evidence, if the jury consists of 12 randomly selected
16. N = 150, n = 10, k = 10 jurors?

Copyright © 2010 Pearson Education, Inc.


M06_SULL8028_03_SE_C06CD.QXD 7/30/08 6:36 PM Page 6–7

Section 6.4 The Hypergeometric Probability Distribution 6–7

22. Messy Sock Drawer Suppose that you wake up for work in the inspection. What is the probability that the shipment is
the dark and find that the lights don’t work in your bed- accepted:
room. In addition, your sock drawer is a mess and contains (a) If 10% of the 200 cylinders are defective?
12 black socks and 17 blue socks that otherwise look alike. (b) If 20% of the 200 cylinders are defective?
What is the probability that you randomly select two black (c) If 40% of the 200 cylinders are defective?
socks if you select exactly 2 socks? (d) If 60% of the 200 cylinders are defective?
(e) If 80% of the 200 cylinders are defective?
23. Acceptance Sampling Suppose that a concrete manufactur-
(f) Draw a horizontal axis and label it Percent Defective.
er has made 200 concrete cylinders that are supposed to
Draw a vertical axis and label it Probability Accept Ship-
withstand 4,000 pounds per square inch of pressure. As the
ment. Plot probability accept shipment against the percent
quality-control manager, you decide to randomly test 4 of
defective and connect the points in a smooth curve. This
the cylinders to be sure they are manufactured to specifica-
curve is referred to as an operating characteristic curve.
tion. You will only accept the shipment if all 4 cylinders pass

TECHNOLOGY STEP-BY-STEP Computing Hypergeometric Probabilities


Using Technology

TI-83/84 Plus Excel


The TI-83/84 Plus graphing calculators do not have Computing P(x)
this feature. 1. If desired, enter the possible values of the random
variable X whose probability you wish to compute
MINITAB in column A. For example, if we want the probability
Computing P(x) that x = 0, 1, 2, or 3 in Example 3(a), we enter 0, 1, 2,
1. If desired, enter the possible values of the random and 3 into column A.
variable X whose probability you wish to compute in 2. With the cursor in cell B1, select the fx icon.
C1. For example, if we want the probability that x = 0, Highlight Statistical in the Function category window.
1, 2, or 3 in Example 3(a), we enter 0, 1, 2, and 3 into C1. Highlight HYPGEOMDIST in the Function name
Computing P (X ◊ x) window. Click OK.
Follow the same steps as for computing P1x2. 3. Fill in the window as shown to obtain the
In the window that comes up after selecting probabilities from Example 3(a). Click OK.
Hypergeometric Á , select the radio button for
Cumulative probability.
2. Select the Calc menu, highlight Probability
Distributions, then highlight Hypergeometric . . .
3. Fill in the window as shown to obtain the
probabilities from Example 3(a). Click OK.
Note that, if we only want P102, it is simplest to
select the Input constant: radio button and enter
0 in the box.

4. Copy the contents in cell B1 to the remaining cells.

Copyright © 2010 Pearson Education, Inc.


Z01_SULL8028_03_SE_ANS_C06CD.QXD 7/4/08 12:18 AM Page 1

6.4 Assess Your Understanding Answers AN6–1

6.4 Assess Your Understanding (page 000)


5. N = 49, n = 6, k = 6, X = 0, 1, 2, Á , 6
7. N = 250, n = 20, k = 12, X = 0, 1, 2, Á , 12
9. 0.1856
11. 0.1939
13. mX = 4, sX = 1.67
15. mX = 13.0, sX = 1.26
17. (a) 0.01986
(b) 0.001146
(c) 0.0000229
(d) 0.000000093
(e) 0.02103
(f) 0.766; 0.772; 0.766
19. (a) 0.0087
(b) 0.0507
(c) 0.3590
(d) 0
(e) 0.96
21. 0.6483
23. (a) 0.6539
(b) 0.4065
(c) 0.1270
(d) 0.0245
(e) 0.0014
(f)
0.7
Probability Accept Shipment

0.6

0.5

0.4

0.3

0.2

0.1

0
20 40 60 80 100
Percent Defective

Copyright © 2010 Pearson Education, Inc.

You might also like