Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

STAT 5101: Foundations of Data Science

2021-22 Term 1
Assignment #3

Due: Nov 17th, 2021 (Wed) at 9:30pm


Total Score: 100 points
This assignment covers material in Chapter 6 to Chapter 8 of the lecture notes.
You are encouraged to show your calculation steps in details, so as to obtain partial scores in
case of incorrect answers.
How to turn in the assignment? During the lecture.

Problem 1 [10 Points] Consider a continuous


Random Variable 𝑋 with pdf 𝑓(𝑥) define
on 0 ≤ 𝑥 ≤ 6 as the chart on the right, and
𝑓(𝑥) = 0 elsewhere. If we have 3𝑓(2) = 2𝑓(5),
please calculate the following value:
(a) [4 Points] 𝑓(0.5), 𝑓(5.5), Pr(𝑋 = 2.5)
(b) [3 Points] Pr(2 ≤ 𝑋 < 4.8)
(c) [3 Points] Pr⁡(1.8 < 𝑋 ≤ 2.8|1 < 𝑋 < 3)

Problem 2 [10 Points]: If random variable 𝑋 follows Normal distribution⁡ 𝑁(𝜇, 𝜎 2 )


(a) [3 points] If we know 𝐸(𝑋) = 75 and 𝑉𝑎𝑟(𝑋) = 100 please find Pr⁡(X < 60) and
Pr⁡(70 < X < 100)
(b) [3 points] Find 𝑏 so that Pr(−𝑏 < (𝑋 − 𝜇)/𝜎 < 𝑏) = 0.90
(c) [4 points] Find 𝜇 and 𝜎 2 so that Pr(X < 89) = 0.90 and Pr(X < 94) = 0.95

Problem 3 [12 Points]: The average height of a woman is increasing from 1960 to 2002. The
average height of a woman aged 20-74 years is 64 inches in 2002. Suppose the height of a
woman is normally distributed with a standard deviation of 2 inches.
(a) [4 Points] What is the probability a randomly selected woman in this population is
between 58 inches and 70 inches?
(b) [4 Points] What are the quartiles of this distribution?
(c) [4 points] Determine 𝑎 and 𝑏, which is symmetric about the mean and Pr(𝑎 < 𝑋 <
𝑏) = 90%
(d) [4 points] What is the probability that five women selected at random from this
population all exceed 68 inches?

Page 1/3
Problem 4 [19 Points] If the class in our University lasts that is Uniformly Distributed
between 40 min and 60 min.
(a) [4 Points] What is the probability that the time interval randomly selected a class length
is longer than 55 minutes (Class delay)?
(b) [4 Points] There are 48 classes in University per week for year one students, let Y be the
number of classes would delay (class length is longer than 55 minutes) in each week for
year one students, so what’s the distribution of Y? And please write out the parameter
of this distribution.
(c) [4 Points] Please calculate the probability that at least 14 classes delay in each week for
year one students. Can we apply the approximation method here? Which method
should be used for calculating this probability?
(d) [6 Points] Please calculate the probability that exact 10 classes delay in each week for
year one students by using the exact method AND approximation method in part (c).
Compare the results, are they close?

Problem 5 [19 points]: Assume that the income of new graduate students follows normal
probability distribution, and the mean is 78 HKD per hour; the standard deviation is 12 HKD
per hour.
(a) [3 points] What is the probability that a randomly selected new graduate students will
have income between 54 HKD and 90 HKD dollars per hour?
(b) [3 points] What is the value that only 5% of the new graduate students will have the
income lower than?
(c) [3 points] What is the 20th percentile income of the new graduate students?
(d) [3 points] What is the IQR of income of the new graduate students?
(e) [3 points] If the mean income increases to 80 HKD (the standard deviation is fixed), will
the percentage of the new graduate students that have income more than 85 HKD
increases or decreases? Why?
(f) [4 points] If the standard deviation of the incomes decreases to 10 HKD (the mean is
fixed), will the percentage of the new graduate students that have income more than 85
HKD increases or decreases? Why?

Page 2/3
Problem 6 [23 points]: Suppose we flip a coin 87 times.
Assume that the coin is fair.
a) [2 points] Compute the probability of obtaining 53 heads.
b) [3 points] Verify if the Rule of Five is satisfied? Compute the probability of obtaining
53 heads based on Normal approximation. Is the answer close to part a)?
c) [3 points] Compute the probability of obtaining at least 53 heads based on the
Normal approximation.
d) [3 points] Find the number of heads which have the largest probability based on the
Normal approximation
Suppose the coin is biased (loaded) with 𝑝 = Pr(Head) = 0.02
e) [2 points] What is the exact probability of obtaining at most one head?
f) [3 points] Which approximation can we use this time? Poisson approximation or
Normal approximation? Why?
g) [3 points] Estimate the probability in part (e) based on Poisson approximation.
h) [4 points] Estimate the probability in part (e) based on Normal approximation, and
verify which one is more accurate.

Problem 7 [7 points]: Let 𝑋⁡ ~⁡ 𝑁⁡ (66, 49) be the weight (in kg) of adult male in a city.
(a) [3 points] Suppose 2,209 adult males will be selected at random, and let Y be the sample
mean random variable of their weights. What is the approximate distribution of Y, and
the corresponding parameter value(s)?
(b) [2 points] What is the probability that the sample mean of 2,209 adult males is between
65.79kg and 66.07kg?
(c) [2 points] What is the IQR of the sample mean weight?

- End of Assignment -

Page 3/3

You might also like