DS 1 Tutorial 1

1. 400 boys are enrolled in the PGP-2023 batch of IIM Bangalore. It is known that at least 300
boys have heights between 165 cm and 185 cm. What can be said about the standard deviation
of the heights of boys enrolled in the PGP-2023 batch of IIM Bangalore?
2. In the attached data, calculate the mean, median and SD of Nifty 50 and of Reliance, TCS, Adani
Ports, HDFC Bank and ITC. Also, find the correlation and covariance between the closing prices
of Nifty 50 and each of the five stocks.
3. In the attached data, the marks of DS1 of Section A are given. Divide the data into classes of
40-50, 50-60…
Answer the following questions:
   1. What is the 66th percentile?
   2. Using the frequency distribution, find the mean, median and mode of the data.
   3. Create a relative frequency table.
   4. Draw a box plot for the data.
4. A random number generator is used to generate either a 0 or a 1, each with a 50% probability.
This generator is run 1000 times. What is the lowest probability that the sum of the 1000
numbers is between 150 and 850?
(Note: This is a Binomial Distribution. The expected value of the number of successes is n*p and
the standard deviation of the number of successes is sqrt(n*p*(1-p)), where n is the number of
trials and p is the probability of success. Success for this experiment can be taken as getting
a 0 or as getting a 1; with p = 0.5 the choice does not affect the answer.)
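
A minimal sketch of the arithmetic behind problem 4, assuming the intended tool is Chebyshev's inequality (the problem does not name a method) and taking the standard deviation as the square root of the variance n*p*(1-p). The variable names are illustrative only.

```python
import math

# Binomial parameters from the problem statement.
n, p = 1000, 0.5

mean = n * p                    # expected number of successes
variance = n * p * (1 - p)      # variance of the number of successes
sd = math.sqrt(variance)        # standard deviation is the square root of the variance

# The interval 150..850 is symmetric around the mean, so it is mean +/- k*sd.
low, high = 150, 850
k = min(mean - low, high - mean) / sd

# Chebyshev: P(|X - mean| < k*sd) >= 1 - 1/k^2 (assumed method, see lead-in).
chebyshev_lower_bound = 1 - 1 / k**2

print(f"mean = {mean}, sd = {sd:.4f}, k = {k:.2f}")
print(f"P(150 < sum < 850) >= {chebyshev_lower_bound:.6f}")
```

Chebyshev's bound holds for any distribution with this mean and standard deviation, so the exact binomial probability is at least this large.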
5. The voltage in a certain circuit is a random variable with mean 120 and standard deviation 5.
Sensitive equipment will be damaged if the voltage is not between 105 and 135. Use
Chebyshev’s inequality to bound the probability of damage.
6. The probability that the price of a smartphone sold via Amazon.in is greater than 20000 is
40%. 25% of the phones sold on Amazon.in are sold by Samsung and 60% of these phones
cost more than 20000. What is the probability that a randomly selected phone sold by other
OEMs would cost more than 20000?
7. There are 3 main streaming platforms in India: Netflix, HotStar and Amazon Prime. 30% of
Internet users have a HotStar account, 25% have an Amazon Prime account and 18% have a
Netflix account. 12% have both a HotStar account and a Netflix account, 21% have both a
HotStar and an Amazon Prime account and 9% have both a Netflix and an Amazon Prime account.
If you know that 60% don’t have any account with a streaming platform, what percentage of
Internet users have accounts with all three streaming platforms?
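
One way to set up problem 7 is the inclusion-exclusion principle for three events. The sketch below only rearranges that identity for the triple intersection; the variable names are illustrative and not part of the tutorial.

```python
# Single-platform shares from the problem statement.
p_hotstar, p_prime, p_netflix = 0.30, 0.25, 0.18

# Pairwise overlaps from the problem statement.
p_hotstar_netflix = 0.12
p_hotstar_prime = 0.21
p_netflix_prime = 0.09

# 60% have no account, so 40% have at least one.
p_at_least_one = 1 - 0.60

# Inclusion-exclusion:
#   P(H or A or N) = sum of singles - sum of pairs + P(all three)
# Solve for P(all three).
singles = p_hotstar + p_prime + p_netflix
pairs = p_hotstar_netflix + p_hotstar_prime + p_netflix_prime
p_all_three = p_at_least_one - singles + pairs

print(f"P(all three platforms) = {p_all_three:.2f}")
```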
8. A and B are independent events with P(A ∩ B) = 0.3 and P(A) = 0.5. Find P(B).
9. The first quartile of the DS1 scores of Section A of IIMB is 40 and the third quartile is 60. Are
students who score 93 and 32 outliers?
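
A short sketch for problem 9, assuming the common 1.5 x IQR fence rule for outliers. The problem does not state which rule to use, so the multiplier and the helper `is_outlier` are assumptions made here for illustration.

```python
# Quartiles given in the problem statement.
q1, q3 = 40, 60
iqr = q3 - q1

# Assumed rule: a point is an outlier if it lies more than
# 1.5 * IQR below Q1 or above Q3.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr


def is_outlier(score):
    """Return True if the score falls outside the 1.5 * IQR fences."""
    return score < lower_fence or score > upper_fence


for score in (93, 32):
    print(score, "outlier" if is_outlier(score) else "not an outlier")
```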
10. Annual sales, in millions of dollars, for 21 pharmaceutical companies are:
8408 1374 1872 8879 2459 11413 608
14138 6452 1850 2818 1356 10498 7478
4019 4341 739 2127 3653 5794 8305

Calculate the mean, median, standard deviation and IQR.
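
The quantities in problem 10 can be computed with Python's standard `statistics` module, as sketched below. Two choices here are assumptions rather than part of the problem: the standard deviation is shown both as a sample and as a population value, and the quartiles use the module's default "exclusive" method. Other quartile conventions give slightly different IQR values.

```python
import statistics

# Annual sales (millions of dollars) for the 21 companies in the problem.
sales = [
    8408, 1374, 1872, 8879, 2459, 11413, 608,
    14138, 6452, 1850, 2818, 1356, 10498, 7478,
    4019, 4341, 739, 2127, 3653, 5794, 8305,
]

mean = statistics.mean(sales)
median = statistics.median(sales)
sample_sd = statistics.stdev(sales)        # divides by n - 1
population_sd = statistics.pstdev(sales)   # divides by n

# Quartiles with the default "exclusive" method (an assumed convention).
q1, q2, q3 = statistics.quantiles(sales, n=4)
iqr = q3 - q1

print(f"mean = {mean:.2f}, median = {median}")
print(f"sample SD = {sample_sd:.2f}, population SD = {population_sd:.2f}")
print(f"Q1 = {q1}, Q3 = {q3}, IQR = {iqr}")
```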

11. A general insurance company sells homeowners insurance. Past records show that claims
are made on 20% of the policies sold. Of the total claims, 90% are small claims with a
claim size of less than 5% of the home value. The remaining claims have a claim size
greater than 5% and less than 20% of the home value. A homeowner wishes to take out an
insurance policy on his home worth INR 1 Crore. What is the minimum expected
premium the insurance company should charge to be able to cover future claims?
12. The probability of an unvaccinated individual catching Covid in the next 1 year is 4%. The
probability of a vaccinated individual catching Covid in the next 1 year is 1%. Testing is
pervasive (everyone is tested at regular intervals to ensure no one falls through the cracks).
The rate of false positives is 10% and the rate of false negatives is 2%.
Answer the following questions:
   1.) If the individual is vaccinated, what is the probability that he tests positive for Covid in
   the next 1 year?
   2.) If the individual is unvaccinated, what is the probability that he tests negative for Covid
   in the next 1 year?
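
Problem 12 can be approached with the law of total probability. The sketch below rests on assumptions the problem leaves open: the false positive rate is read as P(test positive | not infected), the false negative rate as P(test negative | infected), and a single overall test outcome per person is considered. The helper `p_test_positive` is hypothetical.

```python
FALSE_POSITIVE_RATE = 0.10  # assumed to mean P(positive | not infected)
FALSE_NEGATIVE_RATE = 0.02  # assumed to mean P(negative | infected)


def p_test_positive(p_infected):
    """Total probability of a positive test, given the infection probability."""
    true_positive = p_infected * (1 - FALSE_NEGATIVE_RATE)
    false_positive = (1 - p_infected) * FALSE_POSITIVE_RATE
    return true_positive + false_positive


# Part 1: vaccinated individual (1% chance of infection) tests positive.
p_pos_vaccinated = p_test_positive(0.01)

# Part 2: unvaccinated individual (4% chance of infection) tests negative.
p_neg_unvaccinated = 1 - p_test_positive(0.04)

print(f"P(positive | vaccinated)   = {p_pos_vaccinated:.4f}")
print(f"P(negative | unvaccinated) = {p_neg_unvaccinated:.4f}")
```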
13. HDFC Bank and Flipkart wish to launch a co-branded credit card in six major cities of India,
i.e., Mumbai, Delhi, Kolkata, Chennai, Bengaluru, and Hyderabad. The data analyst at HDFC
Bank requests the credit card spends data and receives the following table.

Monthly credit card spends (rows) by city (columns):

Monthly spend   Mumbai  Delhi  Kolkata  Chennai  Bengaluru  Hyderabad
<10,000            270    194      140      110        130         70
10,000-20,000      190    163       99      105        121         67
20,000-30,000      105     94       63       76         90         45
>30,000             55     47       21       36         40         18

All numbers are in 000s of issued credit cards.


Answer the following questions:
a) Create a joint probability table and mark all the joint and marginal probabilities.
b) Given that a credit card has been issued in Mumbai, what is the probability that the
monthly spend is greater than 20,000?
c) What is the probability that a credit card with a monthly credit card spend of less than
30,000 has been issued in Bengaluru?
d) Flipkart management wishes to target the cities with the highest percentage of high spenders
(i.e. monthly spends greater than 30,000) as a percentage of issued credit cards. As the
data analyst at HDFC Bank, what would be your recommendation? (Rank the cities from first to last.)
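
A sketch of how the joint probability table for problem 13 could be built from the counts and then used for parts b) to d). It uses plain Python lists; the variable names are illustrative, and the counts are copied from the table (in 000s of issued cards).

```python
cities = ["Mumbai", "Delhi", "Kolkata", "Chennai", "Bengaluru", "Hyderabad"]
bands = ["<10,000", "10,000-20,000", "20,000-30,000", ">30,000"]

# Rows follow `bands`, columns follow `cities`.
counts = [
    [270, 194, 140, 110, 130, 70],
    [190, 163, 99, 105, 121, 67],
    [105, 94, 63, 76, 90, 45],
    [55, 47, 21, 36, 40, 18],
]

total = sum(sum(row) for row in counts)

# a) Joint probabilities and the two sets of marginals.
joint = [[c / total for c in row] for row in counts]
band_marginal = [sum(row) for row in joint]
city_marginal = [sum(col) for col in zip(*joint)]

# b) P(spend > 20,000 | Mumbai): last two bands within the Mumbai column.
m = cities.index("Mumbai")
p_gt20k_given_mumbai = (joint[2][m] + joint[3][m]) / city_marginal[m]

# c) P(Bengaluru | spend < 30,000): first three bands only.
b = cities.index("Bengaluru")
p_bengaluru_given_lt30k = sum(joint[r][b] for r in range(3)) / sum(band_marginal[:3])

# d) Share of >30,000 spenders within each city, ranked from highest to lowest.
high_share = {city: joint[3][i] / city_marginal[i] for i, city in enumerate(cities)}
ranking = sorted(high_share, key=high_share.get, reverse=True)

print(f"P(>20,000 | Mumbai) = {p_gt20k_given_mumbai:.4f}")
print(f"P(Bengaluru | <30,000) = {p_bengaluru_given_lt30k:.4f}")
print("Ranking by high-spender share:", ranking)
```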
14. Virat Kohli scores a century in one out of every 8 matches. Rohit Sharma scores a century in
one out of every 10 matches. India is playing a 5-match cricket series with Australia. (Assume
independence.)
Answer the following:
   1. What is the probability that Virat Kohli will score a century in just 1 match?
   2. What is the probability that Rohit Sharma will score a century in at least 2 matches?
   3. Given that Virat Kohli has scored a century, what is the probability that Rohit Sharma
   scores a century as well?
   4. What is the probability that either Virat Kohli or Rohit Sharma or both will score a
   century in the next match?
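
The calculations in problem 14 follow from the binomial distribution and independence. The sketch below makes two interpretive assumptions: "just 1 match" is read as exactly one century in the 5-match series, and part 3 is read as both players scoring in the same match. The helper `binom_pmf` is defined here for illustration.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


p_kohli, p_rohit = 1 / 8, 1 / 10
n_matches = 5

# 1. Kohli scores a century in exactly one of the five matches (assumed reading).
p_kohli_exactly_one = binom_pmf(1, n_matches, p_kohli)

# 2. Rohit scores a century in at least two matches: complement of 0 or 1.
p_rohit_at_least_two = 1 - binom_pmf(0, n_matches, p_rohit) - binom_pmf(1, n_matches, p_rohit)

# 3. Under independence, conditioning on Kohli's century does not change
#    Rohit's per-match probability (assumed same-match reading).
p_rohit_given_kohli = p_rohit

# 4. At least one of the two scores a century in the next match.
p_either_next_match = 1 - (1 - p_kohli) * (1 - p_rohit)

print(f"1. {p_kohli_exactly_one:.4f}")
print(f"2. {p_rohit_at_least_two:.4f}")
print(f"3. {p_rohit_given_kohli:.4f}")
print(f"4. {p_either_next_match:.4f}")
```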

15. The US think tank Heritage Foundation publishes the Index of Economic Freedom every
year.
A sample of the data is presented below:
Country        Region           Score   Classification
Singapore      Asia Pacific      89.7   Free
Australia      Asia Pacific      82.4   Free
Switzerland    Europe            81.9   Free
UK             Europe            78.4   Mostly Free
Netherlands    Europe            76.8   Mostly Free
UAE            MENA              76.9   Mostly Free
US             North America     74.8   Mostly Free
Spain          Europe            69.9   Moderately Free
France         Europe            65.7   Moderately Free
Italy          Europe            64.9   Moderately Unfree
Russia         Europe            61.5   Moderately Unfree
China          Asia Pacific      58.4   Mostly Unfree
India          Asia Pacific      56.5   Mostly Unfree
Pakistan       Asia Pacific      51.7   Repressed
North Korea    Asia Pacific       5.2   Repressed

Identify the type of each variable (Country, Region, Score and Classification).
