Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 14

ASSIGNMENT 1

Name: Nguyen Bao Thu


Student number: SE140803

Q1.
The US government wants in know how American citizens feel about the war in Iraq. They
randomly select 500 citizens from each state and ask them about their feeling. What are the
population and the sample?
Answer:
- Population: American citizens
- Sample: 500 citizens randomly from each state

Q2.
Determine whether the given value is a statistic or a parameter.
a. A sample of 120 employees of a company is selected, and the average is found to be 37
years.
b. After inspecting all of 55,000 kg of meat stored at the Wurst Sausage Company, it was
found that 45,000 kg of the meat was spoiled.
Answer:
- “A sample of 120 employees of a company is selected, and the average is found to be 37
years.” is a statistic. Because it is a value of sample.
- “After inspecting all of 55,000 kg of meat stored at the Wurst Sausage Company, it was
found that 45,000 kg of the meat was spoiled.” is a parameter. Because it is a value of
population.
Q3.
Is the experimental or observational study?
a. A marketing firm does a survey to find out how many people use a product. Of the one
hundred people contacted, fifteen said they use the product.
b. A clinic gives a drug to a group of ten patients and a placebo to another group of ten
patients to find out if the drug has an effect on the patients' illness.
Answer:
- “A marketing firm does a survey to find out how many people use a product. Of the one
hundred people contacted, fifteen said they use the product.” is a observational study.
- “A clinic gives a drug to a group of ten patients and a placebo to another group of ten
patients to find out if the drug has an effect on the patients' illness.” is a experimental study.

Q4.
Identify the type of observational study.
a. A statistical analyst obtains data about ankle injuries by examining a hospital's
records from the past 3 years.
b. A researcher plans to obtain data by following those in cancer remission since January of
2015.
c. A town obtains current employment data by polling 10,000 of its citizens this month.
Answer:
- “A statistical analyst obtains data about ankle injuries by examining a hospital's
records from the past 3 years.” is a retrospective study.
- “A researcher plans to obtain data by following those in cancer remission since January of
2015.” is a prospective study.
- “A town obtains current employment data by polling 10,000 of its citizens this month.” is a
cross-sectional study.
Q5.
Identify the number as either continuous or discrete.
a. The total number of phone calls a sales representative makes in a month is 425.
b. The average height of all freshmen entering college in a certain year is 68.4 inches.
c. The number of stories in a Manhattan building is 22.
Answer:
a. It is discrete.
b. It is continuous.
c. It is discrete.

Q6.
Classify each set of data as discrete or continuous.
a. The number of suitcases lost by an airline.
b. The height of corn plants.
c. The number of ears of corn produced.
d. The time it takes for a car battery to die.
Answer:
a. It is discrete.
b. It is continuous.
c. It is discrete.
d. It is continuous.

Q7.
Fill in the bank
a. Observational Study is a basic method of ...
b. Designed Study is a basic method of ...
c. Retrospective Study, observational study and designed experiment are three basis
methods of ...
d. A designed experiment is a method of ...
Answer:
a. collecting data
b. collecting data
c. collecting data
d. collecting data

Q8.

Answer:
Each of the possible five outcomes of random experiment is equally likely.
=> Each probability = 1/5 = 0.2
a. P(A) = 0.2 + 0.2 = 0.4
b. P(B) = 0.2 + 0.2 + 0.2 = 0.6
c. P(A’) = 1 - P(A) = 1 – 0.4 = 0.6

d. Because A ∩ B =  , P(A ∪ B) = P(A) + P(B) = 0.4 + 0.6 = 1

e. P(A | B) = P(A ∩ B) / P(B) =  / 0.6 = 0

Q9.
The Ski Patrol at Criner Mountain Ski Resort has determined the following probability
distribution for the number of skiers that are injured each weekend:

What is the probability that the number of injuries per week is at most 3?
Answer:
Probability that the number of injuries per week is at most 3:
P(x <= 3) = P(0) + P(1) + P(2) + P(3) = 0.9
Q10.
The probability of a New York teenager owning a skateboard is 0.37, of owning a bicycle is
0.81 and of owning both is 0.36.
a. If a New York teenager is chosen at random, what is the probability that the
teenager owns a skateboard or a bicycle?
b. If a New York teenager is chosen at random, what is the probability that the teenager
owning a skateboard but not owning a bicycle.
c. Find the probability that the teenager owns a bicycle given that the teenager owns a
skateboard.
Answer:
Probability of a New York teenager owning a skateboard: P(A) = 0.37
Probability of a New York teenager owning a bicycle: P(B) = 0.81
Probability of a New York teenager owning both: P(A ∩ B) = 0.36:
a. If a New York teenager is chosen at random, probability that the teenager owns a
skateboard or a bicycle:
P(A ∪ B) = P(A) + P(B) – P(A ∩ B) = 0.37 + 0.81 – 0.36 = 0.82
b. If a New York teenager is chosen at random, probability that the teenager owning a
skateboard but not owning a bicycle:
P(A / B) = P(A) - P(A ∩ B) = 0.37 – 0.36 = 0.01
c. P(B | A) = P(A ∩ B) / P(A) = 0.36 / 0.37 ≈ 0.97

Q11.

Answer:
a. P(AB) = P(A) + P(B) - P(A ∪ B) = 0.4 + 0.5 – 0.7 = 0.2
b. P(A’B) = P(B) - P(AB) = 0.5 – 0.2 = 0.3
c. P(B | A) = P(AB) / P(A) = 0.2 / 0.4 = 0.5
Q12.
Samples of a cast aluminum part are classified on the basis of surface finish (in micro-inches)
and length measurements. The results of 100 parts are summarized as follows:

Let A denote the event that a sample has excellent surface finish, and let B denote the event
that a sample has excellent length. Determine:
a. P(A), P(B) b. P(AB), P(A+B) c. P(A|B), P(B|A)
Answer:
a. P(A) = 82 / 100 = 0.82
P(B) = 90 / 100 = 0.9
b. P(AB) = 80 / 100 = 0.8
P(A+B) = P(A) + P(B) - P(AB) = (82 + 90 - 80) / 100 = 92 / 100 = 0.92
c. P(A|B) = P(AB) / P(B) = (80 / 100) / (90 / 100) = 80 / 90 ≈ 0.89
P(B|A) = P(AB) / P(A) = (80 / 100) / (82 / 100) = 80 / 82 ≈ 0.98

Q13.
A batch of 350 samples of rejuvenated mitochondria contains eight that are mutated (or
defective). Two are selected, at random, without replacement from the batch.
a. What is the probability that the second one selected is defective given that the first one
was defective?
b. What is the probability that both are defective?
c. What is the probability that both are acceptable?
Answer:
a. Event that the first one selected is defective: A
Event that the second one selected is defective: B
Probability that the second one selected is defective given that the first one was defective:
P(B | A) = 7 / 349 ≈ 0.02
b. The probability that both are defective: P(AB) = (8 / 350) x (7 / 349) = 4 / 8725
≈ 0.00046

c. The probability that both are acceptable: P(A’B’) = (342 / 350) x (341 / 349) ≈ 0.95
Q14.
Suppose that A and B are independent events, P(A) = 0.4 and P(B) = 0.5. Determine the
following:
a. P(AB) and P(AB’) b. P(A+B) and P(B|A)
Answer:
a. P(AB) = P(A) x P(B) = 0.4 x 0.5 = 0.2
P(AB’) = P(A) – P(AB) = 0.4 – 0.2 = 0.2
b. P(A + B) = P(A) + P(B) – P(AB) = 0.4 + 0.5 – 0.2 = 0.7
P(B | A) = P(B) = 0.5

Q15.
The probability that a lab specimen contains high levels of contamination is 0.1. Five
samples are checked, and the samples are independent.
a. What is the probability that none contain high levels of contamination?
b. What is the probability that exactly one contains high levels of contamination?
c. What is the probability that at least one contains high levels of contamination?
Answer:
a. P(x = 0) = (5C0) x 0.10 x 0.95 ≈ 0.59
b. P(x = 1) = (5C1) x 0.11 x 0.94 ≈ 0.33
c. P(x >= 1) = 1 – P(x < 1) = 1 – P(x = 0) = 1 – 0.59 = 0.41

Q16.
The sample space of a random experiment is {a, b, c, d, e, f}, and each outcome is equally
likely. A random variable is defined as follows:
Use the probability mass function to determine the following probabilities:
a. P(X=1.5) b. P(0.5<X<2.7) c. P(0<=X<2) d. P(X=0 or X=2)

Answer:
Because each outcome is equally likely => Each probability = 1/6
P(X = 0) = 1/6 + 1/6 = 1/3
P(X = 1.5) = 1/6 + 1/6 = 1/3
P(X = 2) = 1/6
P(X = 3) = 1/6

a. P(X=1.5) = 1/3
b. P(0.5<X<2.7) = P(X = 1.5) + P(X = 2) = 1/3 + 1/6 = 1/2
c. P(0<=X<2) = P(X = 0) + P(X = 1.5) = 1/3 + 1/3 = 2/3
d. P(X=0 or X=2) = P(X = 0) + P(X = 2) = 1/3 + 1/6 = 1/2

Q17.
Verify that the following functions are probability mass functions, and determine the
requested probabilities.

a. P(X<=2) b. P(X>-2) c. P(-1<=X or x=2)


d. Compute E(X), V(X), sigma_X
Answer:
a. P(X<=2) = P(X=-2) + P(X=-1) + ... + P(X=2) = 1
b. P(X>-2) = 1 - P(X=-2) = 0.8
c. P(X<=-1 or X=2) = P(X=-2) + P(X=-1) + P(X=2) = 0.7
d. E(X) = -0.4
V(X) = 1.64
sigma_X = 1.28
Q18.
The thickness of wood paneling (in inches) that a customer orders is a random variable with
the following cumulative distribution function:

Determine the following probabilities:


a. P(X<=1/4) b. P(X<=5/16) c. P(X>1/2) d. Compute E(X), V(X)
Answer:
f(1/8) = 0.2
f(1/4) = 0.7
f(3/8) = 0.1

a. P(X<=1/4) = P(X=1/8) + P(X=1/4) = 0.2 + 0.7 = 0.9


b. P(X<=5/16) = P(X=1/8) + P(X=1/4) = 0.2 + 0.7 = 0.9
c. P(X>1/2) = 1
d. E(X) ≈ 0.24
V(X) ≈ 0.0045

Q19.
Verify that the following functions are probability mass functions, and determine the
requested probabilities.
a. P(X<=1) b. P(2<=X<=4) c. Compute E(X), V(X) and sigma_X
Answer:
P(X = 0) = 0.04
P(X = 1) = 0.12
P(X = 2) = 0.2
P(X = 3) = 0.28
P(X = 4) = 0.36
a. P(X<=1) = P(X = 0) + P(X = 1) = 0.16
b. P(2<=X<=4) = P(X = 2) + P(X = 3) + P(X = 4) = 0.84
c. E(X) = 2.8
V(X) = 1.36
sigma_X ≈ 1.17

Q20:
Let the random variable X have a discrete uniform distribution on the integers 1<=X<=3.
Determine the mean and variance of X.
Answer:
The random variable X have a discrete uniform distribution 1<=X<=3
E(X) = (3 + 1) / 2 = 2
V(X) = [(3 – 1 + 1)2 - 1] / 12 = 2/3 ≈ 0.67

Q21:
The random variable X has a binomial distribution with n = 10 and p = 0.5. Determine the
following probabilities:
a. P(X=5) b. P(X<=2) c. P(X>7) d. E(X), V(X)
Answer:
a. P(X=5) = 10C5 x (0.5)5 x (0.5)5 ≈ 0.25
b. P(X<=2) = P(x=0) + P(x=1) + P(x=2)
2
x 10−x
= ∑ 10 Cx∗0.5 ∗(1−0.5)
0

= 10C0 x (0.5)0 x (0.5)10 + 10C1 x (0.5)1 x (0.5)9 + 10C2 x (0.5)2 x (0.5)8 ≈ 0.055

c. P(X>7) = P(x=8) + P(x=9) + P(x=10)


10
x 10−x
= ∑ 10 Cx∗0.5 ∗(1−0.5)
8

= 10C8 x (0.5)8 x 0.52 + 10C9 x (0.5)9 x 0.51 + 10C10 x 0.510 x 0.50 ≈ 0.055
d. E(X) = np = 5
V(X) = np(1-p) = 2.5

Q22:
A multiple-choice test contains 25 questions, each with four answers. Assume that a student
just guesses on each question.
a. What is the probability that the student answers more than 20 questions correctly?
b. What is the probability that the student answers fewer than 5 questions correctly?
Answer:
A number of questions: n = 25
P(exactly x correct) = (nCa) x (0.25)x x (1 – 0.25)25-x
a. P(x>20) = P(x=21) + P(x=22) + P(x=23) + P(x=24) + P(x=25)
25
x 25−x
= ∑ 25 Cx∗0.25 ∗(1−0.25)
21

= 9.68 x 10-10
b. P(x<5) = P(x=0) + P(x=1) + P(x=2) + P(x=3) + P(x=4)
4
x 25−x
= ∑ 25 Cx∗0.25 ∗(1−0.25) ≈ 0.21
0

Q23:
Suppose that the random variable X has a geometric distribution with p = 0 5.
a. Determine the following probabilities: P(X=4), P(X>3).
b. Determine the mean and variance of X.
Answer:
P(X=x) = (1 – 0.5)x-1 * 0.5
a. P(X=4) = 1/16 = 0.0625
P(X>3) = 1 – (P(X=3) + P(X=2) + P(X=1))
3
x−1
= 1 - ∑ (1−0.5) ∗0.5 = 0.125
1

b. E(X) = 1/p = 1/0.5 = 2


V(X) = (1 - p) / p2 = (1 – 0.5) / 0.52 = 2

Q24:
Suppose that X is a negative binomial random variable with p = 0.2 and r = 4. Determine the
following:
a. E(X), V(X)
b. P(X=3), P(X=5) c. P(X>5)
Answer:
P(X=x) = (x-1 C r-1)(1-p)x-r pr
a. E(X) = r/p = 4 / 0.2 = 20
V(X) = r(1-p)/p2 = 80
b. Because x < r (3 < 4) => P(X=3) = 0
P(X=5) = 4C3 x (1 – 0.2)5-4 x 0.24 = 5.12 x 10-3
c. P(X>5) = 1 – (P(X=5) + P(X=4))
= 1 – (5.12x10-3 + 1/625) ≈ 0.99

Q25:
A batch of parts contains 100 from a local supplier of tubing and 200 from a supplier of
tubing in the next state. If four parts are selected randomly and without replacement.
a. What is the probability they are all from the local supplier?
b. What is the probability that two or more parts in the sample are from the local supplier?
Answer:
a. This is hypergeometric distribution.
P(X=4) = (100C4) x (200C0) / (300C4) ≈ 0.0119
b. P(X>=2) = P(X=2) + P(X=3) + P(X=4) = 0.298 + 0.098 + 0.0119 ≈ 0.408

Q26:
On average, 3 traffic accidents per month occur at a certain intersection. What is the
probability that in any given month at this intersection?
a. Exactly 5 accidents will occur?
b. Fewer than 3 accidents will occur?
c. At least 2 accidents will occur?
Answer:
This is a Poisson distribution.
P(x) = (e(-λ) * λx ) / x!
a. P(X=5) = (e(-3) * 35 ) / 5! ≈ 0.101
b. P(X<3) = P(0) + P(1) + P(2) ≈ 0.42
c. P(X>=2) = 1 – (P(0) + P(1)) ≈ 0.801

Q27:
The number of flaws in bolts of cloth in textile manufacturing is assumed to be Poisson
distributed with a mean of 0.1 flaw per square meter.
a. What is the probability that there are two flaws in one square meter?
b. What is the probability that there is one flaw in 10 square meters?
c. What is the probability that there are at least two flaws in 10 square meters of cloth?
d. What is the probability that there are no flaws in 20 square meters?
Answer:
a. P(x=2) = (e(-0.1) * 0.12 ) / 2! ≈ 0.0045
b. 1 square meter = 0.1 => 10 square meters = 1
P(x=1) = (e(-1) * 11 ) / 1! ≈ 0.37
c. 1 square meter = 0.1 => 10 square meters = 1
P(x>=2) = 1 – (P(0) + P(1)) ≈ 1 – 0.74 ≈ 0.26
d. 1 square meter = 0.1 => 20 square meters = 2
P(x=0) = (e(-2) * 20) / 0! ≈ 0.14

You might also like