Tutorialexercises 1

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 9

Business Statistics (BK/IBA)

Tutorial 1 – Exercises
Instruction

In a tutorial session of 2 hours, we will obviously not be able to discuss all questions. Therefore, the
following procedure applies:
• we expect students to prepare all exercises in advance;
• we will discuss only a selection of exercises;
• exercises that were not discussed during class are nevertheless part of the course;
• students can indicate their wish list of exercises to be discussed during the session;
• teachers may invite students to answer questions, orally or on the blackboard.
We further understand that your time is limited, and in particular that your time between lecture and
tutorial may be limited. In case you have no time to prepare everything, we kindly advise you to give
priority to the exercises that are indicated with the icon. This does not mean that the other questions
are not relevant!

1A Data

Q1 (Doane & Seward, 4/E, 2.1)


What type of data (categorical, discrete numerical, or continuous numerical) is each of the
following variables? If there is any ambiguity about the data type, explain why the answer is
unclear.
a. The manufacturer of your car.
b. Your college major.
c. The number of college credits you are taking.
a. Categorical; b. Categorical; c. Discrete numerical Q1

Q2 (Doane & Seward, 4/E, 2.2)


What type of data (categorical, discrete numerical, or continuous numerical) is each of the
following variables? If there is any ambiguity, explain why the answer is unclear.
a. Length of a TV commercial.
b. Number of peanuts in a can of Planter’s Mixed Nuts.
c. Occupation of a mortgage applicant.
d. Flight time from London Heathrow to Chicago O’Hare.
a. Continuous numerical; b. Discrete numerical; c. Categorical; d. Continuous numerical Q2

Q3 (Doane & Seward, 4/E, 2.3)


What type of data (categorical, discrete numerical, or continuous numerical) is each of the
following variables? If there is any ambiguity about the data type, explain why the answer is
unclear.
a. The miles on your car’s odometer.
b. The fat grams you ate for lunch yesterday.
c. The name of the airline with the cheapest fare from New York to London.
d. The brand of cell phone you own.
b. Continuous numerical (often reported as an integer); c. Categorical; d. Categorical
a. Continuous numerical (often represented as discrete numerical) Q3

Q4 (Doane & Seward, 4/E, 2.9)

BS 1 Tutorial 1
Which measurement level (nominal, ordinal, interval, ratio) is each of the following variables?
Explain.
a. Number of hits in Game 1 of the next World Series.
b. Baltimore’s standing in the American League East (among five teams).
c. Field position of a baseball player (catcher, pitcher, etc.).
d. Temperature on opening day (Celsius).
e. Salary of a randomly chosen American League pitcher.
f. Freeway traffic on opening day (light, medium, heavy).
a. Ratio; b. Ordinal; c. Nominal; d. Interval; e. Ratio; f. Ordinal Q4

Q5 (Doane & Seward, 4/E, 2.10)


Which measurement level (nominal, ordinal, interval, ratio) is each of the following variables?
Explain.
a. Number of employees in the Walmart store in Hutchinson, Kansas.
b. Number of merchandise returns on a randomly chosen Monday at a Walmart store.
c. Temperature (in Fahrenheit) in the ice-cream freezer at a Walmart store.
d. Name of the cashier at register 3 in a Walmart store.
e. Manager’s rating of the cashier at register 3 in a Walmart store.
f. Social security number of the cashier at register 3 in a Walmart store.
a. Ratio; b. Ratio; c. Interval; d. Nominal; e. Ordinal; f. Nominal Q5

1B Summarizing data

Q1 (based on Doane & Seward, 4/E, 4.10.a)


Given is the following data set with exam scores (9 students) 42, 55, 65, 67, 68, 75, 76, 78, 94.
a. Find the median, midrange, and geometric mean. You may use your calculator.
b. Are they reasonable measures of central tendency? Explain.
a. 68, 68, 67.37 Q1

Q2 (Doane & Seward, 4/E, 4.18)


The number of Internet users in Latin America grew from 78.5 million in 2000 to 156.6 million
in 2010. Use the geometric mean to find the mean annual growth rate. Source:
www.internetwoldstats.com (Accessed April 5, 2011).
7.15% Q2

Q3 (Doane & Seward, 4/E, 4.20)


For each data set
A: 6, 7, 8; B: 4, 5, 6, 7, 8, 9, 10; C: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
a. Find the mean.
b. Find the standard deviation, treating the data as a sample.
c. Find the standard deviation, treating the data as a population.
d. What does this exercise show about the two formulas?
Q3

BS 2 Tutorial 1
Q4 (Doane & Seward, 4/E, 4.23)
Given are data summaries on three stocks.
A: 𝑥̅ = $24.50, 𝑠 = 5.25; B: 𝑥̅ = $147.25, 𝑠 = 12.25; C: 𝑥̅ = $5.75, 𝑠 = 2.08
a. Find the coefficient of variation for prices of these three stocks.
b. Which stock has the greatest relative variation?
c. To measure variability, why not just compare the standard deviations?
; b. Stock C Q4

Q5 (Doane & Seward, 4/E, 4.34)


Scores on an accounting exam ranged from 42 to 96, with quartiles 𝑄1 = 61, 𝑄2 = 77, and
𝑄3 = 85.
a. Sketch a simple boxplot (5 number summary without fences) using a nicely scaled 𝑋-axis.
b. Describe its shape (skewed left, symmetric, skewed right).
b. The long left whisker suggests left-skewness.
a. Q5

Q6 (Doane & Seward, 4/E, 4.40)


For each 𝑋-𝑌 data set (𝑛 = 12):

a. Make a scatter plot. (You may use Excel or SPSS)


b. Find the sample correlation coefficient. (You may use your calculator)
c. Is there a linear relationship between X and Y? If so, describe it.
Note: Use Excel or MegaStat or MINITAB or SPSS. See XYDataSets
b. 𝑟𝑎 = −0.8841; 𝑟𝑏 = 0.90875; 𝑟𝑐 = 0.1704 Q6

2A Basic Probability

Q1 (Doane & Seward, 4/E, 5.13)


Are these characteristics of a student at your university mutually exclusive or not? Explain.
a. A = works 20 hours or more, B = majoring in accounting
b. A = born in the United States, B = born in Canada
c. A = owns a Toyota, B = owns a Honda

BS 3 Tutorial 1
c. Not mutually exclusive.
b. Mutually exclusive.
a. Not mutually exclusive. Q1

Q2 (Doane & Seward, 4/E, 5.15)


Given 𝑃(𝐴) = .40, 𝑃(𝐵) = .50, and 𝑃(𝐴 ∩ 𝐵) = .05, find
a. 𝑃(𝐴 ∪ 𝐵),
b. 𝑃(𝐴|𝐵), and
c. 𝑃(𝐵|𝐴).
(d) Sketch a Venn diagram.
a. 0.85; b. 0.10; c. 0.125. Q2

Q3 (Doane & Seward, 4/E, 5.17)


Suppose Samsung ships 21.7 percent of the liquid crystal displays (LCDs) in the world. Let 𝑆
be the event that a randomly selected LCD was made by Samsung. Find
a. 𝑃(𝑆)
b. 𝑃(𝑆 ′ )
c. the odds in favor of event 𝑆
d. the odds against event 𝑆
(Data are from The Economist 372, no. 8385 [July 24, 2004], p. 59.)
a. 0.217; b. 0.783; c. 0.277; d. 3.61 Q3

Q4 (Doane & Seward, 4/E, 5.19)


List two binary events that describe the possible outcomes of each situation.
a. A pharmaceutical firm seeks FDA approval for a new drug.
b. A baseball batter goes to bat.
c. A woman has a mammogram test.
c. 𝑋 = 1 if breast cancer detected, 0 otherwise.
b. 𝑋 = 1 if batter gets a hit, 0 otherwise.
a. 𝑋 = 1 if the drug is approved, 0 otherwise. Q4

Q5 (based on Doane & Seward, 4/E, 5.21)


Let 𝑆 be the event that a randomly chosen female aged 18-24 is a smoker. Let 𝐶 be the event
that a randomly chosen female aged 18-24 is a Caucasian. Given 𝑃(𝑆) = .246, 𝑃(𝐶) = .830,
and 𝑃(𝑆 ∩ 𝐶) = .232, find each probability and express the event in words. (Data are from
Statistical Abstract of the United States, 2001.)
a. Make a contingency table with the data available and complete the table. Use this to find
b. 𝑃(𝑆 ′ ).
c. 𝑃(𝑆 ∪ 𝐶).
d. 𝑃(𝑆|𝐶).
e. 𝑃(𝑆|𝐶 ′ ).
f. Are 𝐶 and 𝑆 independent?
b. 0.754; c. 0.844; d. 0.2795; e. 0.0824; f. No Q5

Q6 (Doane & Seward, 4/E, 5.23)


Given 𝑃(𝐴) = .40, 𝑃(𝐵) = .50, and 𝑃(𝐴 ∩ 𝐵) = .05.
a. Find 𝑃(𝐴|𝐵).
b. In this problem, are 𝐴 and 𝐵 independent? Explain.
a. 0.10; b. No Q6

BS 4 Tutorial 1
Q7 (Doane & Seward, 4/E, 5.25)
The probability that a student has a Visa card (event 𝑉) is . 73. The probability that a student
has a MasterCard (event 𝑀) is . 18. The probability that a student has both cards is . 03.
a. Find the probability that a student has either a Visa card or a MasterCard (or both).
b. In this problem, are V and M independent? Explain.
a. 0.88; b. No Q7

Q8 𝑃(𝐴) = 0.4, 𝑃(𝐵) = 0.6, 𝐴 and 𝐵 are independent. Find 𝑃(𝐴 ∩ 𝐵).
0.24 Q8

Q9 (Doane & Seward, 4/E, 5.30)


The contingency table below shows the results of a survey of online video viewing by age.

Find the following probabilities or percentages:


a. Probability that a viewer is aged 18-34.
b. Probability that a viewer prefers watching TV videos.
c. Percentage of viewers who are 18-34 and prefer watching user created videos.
d. Percentage of viewers aged 18-34 who prefer watching user created videos.
e. Percentage of viewers who are 35-54 or prefer user created videos?
a. 0.69; b. 0.48; c. 0.39; d. 0.5652; e. 0.62 Q9

Q10 (Doane & Seward, 4/E, 5.36)


The following contingency table shows average yield (rows) and average duration (columns)
for 38 bond funds.

For a randomly chosen bond fund, find the probability that:


a. The bond fund is long duration.
b. The bond fund has high yield.
c. The bond fund has high yield given that it is of short duration.
d. The bond fund is of short duration given that it has high yield.
a. 0.3948; b. 0.3948; c. 0.1818; d. 0.1333 Q10

2B Probability distributions

Q1 (Doane & Seward, 4/E, 6.3)

BS 5 Tutorial 1
On the midnight shift, the number of patients with head trauma in an emergency room has the
probability distribution shown below.

a. Calculate the mean and standard deviation.


b. Describe the shape of this distribution.
a. 2.25 and 1.299. b. The distribution is skewed to the right. Q1

Q2 The following bivariate distribution is given:

a. Find 𝜇𝑋 and 𝜇𝑌
b. Find 𝜎𝑋2 , 𝜎𝑋 , 𝜎𝑌2 , and 𝜎𝑌 ,
c. Find the distribution of 𝑋 + 𝑌.
d. Find 𝜇𝑋+𝑌 .
d. 𝐸(𝑋 + 𝑌) = 270.
c. 𝑃(𝑋 + 𝑌 = 150) = 0.2, 𝑃(𝑋 + 𝑌 = 250) = 0.4, 𝑃(𝑋 + 𝑌 = 350) = 0.4.
a. 𝐸(𝑋) = 170; 𝐸(𝑌) = 100; b. 𝜎𝑋 = 45.83; 𝜎𝑌 = 50.00 Q2

Q3 (Doane & Seward, 4/E, 6.9)


The ages of Java programmers at SynFlex Corp. range from 20 to 60.
a. If their ages are uniformly distributed, what would be the mean and standard deviation?
b. What is the probability that a randomly selected programmer’s age is at least 40? At least
30? Hint: Treat employee ages as integers.
a. 𝜇 = 40 and 𝜎 = 11.83; b. 𝑃(𝑋 ≥ 40) = 0.5122 and 𝑃(𝑋 ≥ 30) = 0.7561. Q3

Q4 (Doane & Seward, 4/E, 6.15.b)


Find the mean and standard deviation for a binomial random variable with 𝑛 = 10 and 𝜋 = .40.
𝜇 = 4, 𝜎 = 1.5492. Q4

Q5 (Doane & Seward, 4/E, 6.18.b)


Calculate the binomial probability of 𝑋 = 1 with 𝑛 = 10, 𝜋 = .40.
𝑃(𝑋 = 1) = 0.0403. Q5

Q6 (Doane & Seward, 4/E, 6.18.c)


Calculate the binomial probability of 𝑋 = 3 with 𝑛 = 12, 𝜋 = .70.
𝑃(𝑋 = 3) = 0.0015. Q6

Q7 (based on Doane & Seward, 4/E, 6.19)


If 𝑋 has a binomial distribution, calculate each compound event probability, as well as 𝜇 and
𝜎 2:
a. 𝑋 ≤ 3, 𝑛 = 8, 𝜋 = .20.

BS 6 Tutorial 1
b. 𝑋 > 7, 𝑛 = 10, 𝜋 = .50.
c. 𝑋 < 3, 𝑛 = 6, 𝜋 = .70.
d. 𝑋 ≤ 10, 𝑛 = 14, 𝜋 = .95.
d. 𝑃(𝑋 ≤ 10) = 0.0041; 𝜇 = 13.3; 𝜎 2 = 0.665.
c. 𝑃(𝑋 < 3) = 0.0704; 𝜇 = 4.2; 𝜎 2 = 1.26.
b. 𝑃(𝑋 > 7) = 0.0547; 𝜇 = 5.0; 𝜎 2 = 2.5.
a. 𝑃(𝑋 ≤ 3) = 0.9437; 𝜇 = 1.60; 𝜎 2 = 1.28. Q7

Q8 (Doane & Seward, 4/E, 7.11)


State the Empirical Rule for a normal distribution (see Chapter 4).
* about 99.73% will lie within 𝜇 ± 3𝜎
* about 95.44% will lie within 𝜇 ± 2𝜎
* about 68.26% will lie within 𝜇 ± 1𝜎
It says that for data from a normal distribution we expect Q8

Q9 (Doane & Seward, 4/E, 7.13)


Find the standard normal area for each of the following, showing your reasoning clearly and
indicating which table you used.
a. 𝑃(0 < 𝑍 < 0.50).
b. 𝑃(−0.50 < 𝑍 < 0).
c. 𝑃(𝑍 > 0).
d. 𝑃(𝑍 = 0)
a. 0.1915; b. 0.1915; c. 0.5000; d. 0 Q9

Q10 (Doane & Seward, 4/E, 7.16)


Find the standard normal area for each of the following. Sketch the normal curve and shade in
the area represented below.
a. 𝑃(𝑍 < −1.96).
b. 𝑃(𝑍 > 1.96).
c. 𝑃(𝑍 < 1.65).
d. 𝑃(𝑍 > −1.65).
a. 0.0250; b. 0.0250; c. 0.9505; d. 0.9505 Q10

Q11 (Doane & Seward, 4/E, 7.21)


Find the associated 𝑧-score for each of the following standard normal areas.
a. Lowest 6 percent
b. Highest 40 percent
c. Lowest 7 percent
a. 𝑧 = −1.555; b. 𝑧 = 0.25; c. 𝑧 = −1.48 Q11

Q12 (Doane & Seward, 4/E, 7.27)


Assume that the number of calories in a McDonald’s Egg McMuffin is a normally distributed
random variable with a mean of 290 calories and a standard deviation of 14 calories.
a. What is the probability that a particular serving contains fewer than 300 calories?
b. More than 250 calories?
c. Between 275 and 310 calories?
Show all work clearly. (Data are from McDonalds.com)
a. 0.7625; b. 0.9979; c. 0.7814 Q12

BS 7 Tutorial 1
Q13 (Doane & Seward, 4/E, 7.37)
The weight of newborn babies in Foxboro Hospital is normally distributed with a mean of 6.9
pounds and a standard deviation of 1.2 pounds.
a. How unusual is a baby weighing 8.0 pounds or more?
b. What would be the 90th percentile for birth weight?
c. Within what range would the middle 95 percent of birth weights lie?
a. 0.1797 pounds; b. 8.4379 pounds; c. between 4.5 and 9.3 pounds Q13

Q14 (based on Doane & Seward, 4/E, 7.5)


Find each uniform continuous probability and sketch a graph showing it as a shaded area. Also
find 𝐸(𝑋) and var(𝑋) for each case.
a. 𝑃(𝑋 < 10) for 𝑈(0,50)
b. 𝑃(𝑋 > 500) for 𝑈(0, 1,000)
c. 𝑃(25 < 𝑋 < 45) for 𝑈(15, 65)
c. 𝑃(25 < 𝑋 < 45) = 0.4; 𝜇𝑋 = 40; 𝜎𝑋2 = 208.33
b. 𝑃(𝑋 > 500) = 05; 𝜇𝑋 = 500; 𝜎𝑋2 = 83333
a. 𝑃(𝑋 ≤ 10) = 0.2; 𝜇𝑋 = 25; 𝜎𝑋2 = 208.33 Q14

Q15 (Doane & Seward, 4/E, 7.8)


Assume the weight of a randomly chosen American passenger car is a uniformly distributed
random variable ranging from 2,500 pounds to 4,500 pounds.
a. What is the mean weight of a randomly chosen vehicle?
b. The standard deviation?
c. What is the probability that a vehicle will weigh less than 3,000 pounds?
d. More than 4,000 pounds?
e. Between 3,000 and 4,000 pounds?
a. 3500; b. 557.3503; c. 0.25; d. 0.25; e. 0.50 Q15

Q16 (based on Doane & Seward, 4/E, 7.29)


The pediatrics unit at Carver Hospital has 24 beds. The number of patients needing a bed at
any point in time is 𝑁(19.2,2.5).
a. What is the probability that the number of patients needing a bed will exceed the pediatric
unit’s bed capacity?
b. Could you really apply the normal distribution?
a. 0.0274 Q16

Q17 𝑋 and 𝑌 are two random variables, 𝑋 is the return of 100 shares of stock 𝐴, 𝑌 is the return of
100 shares of stock 𝐵. It is known that both 𝑋 and 𝑌 are normally distributed with mean 3 and
standard deviation 5 (dollars). The covariance between 𝑋 and 𝑌 is 𝜎𝑋,𝑌 = −20.
a. Find the probability 𝑃(2𝑋 ≤ 10)
b. Find the probability 𝑃(𝑋 + 𝑌 ≤ 10) (assume that 𝑋 + 𝑌 is normally distributed)
a. 0.6554; b. 0.8962 Q17

Q18 (Doane & Seward, 4/E, 7.49)


The probability that a vending machine in the Oxnard University Student Center will dispense
the desired item when correct change is inserted is .90. If 200 customers try the machine, find
the probability that
a. at least 175 will receive the desired item
b. that fewer than 190 will receive the desired item

BS 8 Tutorial 1
a. 0.9032; b. 0.9875 Q18

Q19 In a large population 40% of the people travel by train. Approximate the probability – using an
appropriate approximating distribution - that in a random sample of size 𝑛 = 20 a proportion
of 0.50 or less travels by train.
0.8729 Q19

Old exam questions

Q1 23 March 2016, Q1c


Grades for the marketing exam have a right-skewed distribution, with 𝜇 = 5.0 and 𝜎 = 1.0. In
total, 289 students take the exam. What is the probability that a randomly selected student has
a score of at least the 95 percentile or higher? Note that the answer may be “not enough
information”. (text or 2 decimals)
0.05 Q1

Q2 23 March 2016, Q1h


We roll a die 2 times, and indicate the results as 𝑋1 and 𝑋2 . Find 𝑃(𝑋1 ≤ 2|𝑋1 + 𝑋2 = 5). (2
decimals)
0.50 Q2

BS 9 Tutorial 1

You might also like