Professional Documents
Culture Documents
Solutions Final Exam Quantitative Data Analysis 1 2023 For Canvas - 1708763533
Solutions Final Exam Quantitative Data Analysis 1 2023 For Canvas - 1708763533
Solutions Final Exam Quantitative Data Analysis 1 2023 For Canvas - 1708763533
Below a points indication ‘per step’ within each subquestion is given, (small) adjustments to these
points allocation might be done during the correction process!
50
𝐿𝑜𝑐𝑎𝑡𝑖𝑜𝑛 𝑚𝑒𝑑𝑖𝑎𝑛 = 𝐿50 = (100 + 1) ∗ 100 = 50.5 (1 point)
The 50th and 51st observation after ordering are in the class 40 – 50, so M=45
The median income is 45,000 Euro. (1 point)
The mode of the income data is the income value that appears most frequently. In this case,
the 40-50 income group is the class with the highest frequency, so the mode is 45,000 Euro.
(1 point)
b. The distribution is skewed to the left, (1 point)
since the mean < Median (1 point)
960−50⋅17
𝑃(𝐵𝑂𝑋 > 860) = 𝑃 (𝑍 > ) (1 point)
√50⋅0.5
≈ (17.09,17.31) (2 points)
1 point per boundary
The whole interval lies completely above 17 (1 point)
So, there is sufficient statistical evidence that the mean weight differs from 17 gram.
(1 point)
The confidence level of the test is 5%, since the test is two-sided (1 point)
B\A 2 5 8
2 26 41 56
5 50 65 80
8 74 89 104
a. Let:
X=Tenure
Y=Delegation Score
𝑠𝑥𝑦 157.024
𝑟𝑋𝑌 = 𝑠 𝑠 = 310.000⋅ 79.820 (2 points)
𝑋 𝑦 √ √
1 point for correct covariance and 1 point for correct standard deviations
≈ 0.9982 (1 point)
b. 𝑦̂ = 𝑏0 + 𝑏1 𝑥 with:
𝑠𝑥𝑦 157.024
𝑏1 = = ≈ 0.5065 (1 point)
𝑠𝑥2 310.000
𝑏0 = 𝑦̅ − 𝑏1 𝑥̅ ≈ 16.4233 − 0.5065 … ⋅ 31.00 = 0.7218 (1 point)
Interpretation slope: If tenure increases with 1, the delegation score goes up with 0.5065 on
average. (2 points)
c. 1. Hypotheses:
𝐻0 : 𝜌 = 0 (1 point)
𝑣. 𝑠.
𝐻1 : 𝜌 > 0 (1 point)
𝛼 = 1%
1 point per hypothesis, if 𝛼 not mentioned do not subtract points. If sample statistics used in
the hypothesis zero points for this step.
3. Conditions:
(1) Random sample (1 point)
(II) Large n or bivariate normal distribution in the population (1 point)
4. Rejection region:
Reject 𝐻0 ⇔ 𝑡 ≥ t α = 𝑡1% ≈ 2.467 (use 𝑑𝑓 = 28) (2 points)
If 𝑡1% mentioned, but value is wrong, still 1 point can be earned. If sign is to the
wrong side, or two-sided area used, zero points.
6. conclusion:
Given the sample and a significance level of 1%, there is sufficient evidence to infer
that there is positive correlation between delegation score and age. (3 points)
1 point for given sample and sign. level, 1 point for sufficient evidence and 1 point for
describing H1 in words (answer research question)
Remark: If complete incorrect test statistic given in step 2: only points can be earned
for step 1.
Exercise 5 (26 pts = 2 + 5 + 15 + 4)
a. It tells us that there is more variability in the sample mean after the implementation of the
training program than before the training program. (2 points)
1 point for mentioning variability and 1 point for mentioning that ‘after is higher'
𝑥̅𝑏𝑒𝑓𝑜𝑟𝑒 −42000
b. 𝑡 = 𝑆.𝐸.(𝑥̅𝑏𝑒𝑓𝑜𝑟𝑒 )
(1 point)
41500−42000
= 278.54301
(1 point)
= −1.795 (1 point)
use df=30-1=29
𝑡2.5% < 1.795 < 𝑡5% (1 point)
c. This is a matched pairs test since the same sales representatives are studied before and after
the implementation of the program. (1 point)
Let 𝐷 = 𝐵𝑒𝑓𝑜𝑟𝑒 − 𝐴𝑓𝑡𝑒𝑟
1. Hypotheses:
𝐻0 : μ𝐷 = 0 (1 point)
𝑣. 𝑠.
𝐻1 : 𝜇𝐷 < 0 (1 point)
𝛼 = 5%
1 point per hypothesis, if 𝛼 not mentioned do not subtract points. If sample statistics
used in the hypothesis zero points for this step.
-1 point if distribution not mentioned and, also -1 point if 𝑋̅𝐷 . filled in. If ‘1’
3. Conditions:
(1) Random sample (1 point)
(II) 𝑛𝐷 = 30 ≥ 30, so normality not needed (1 point)
4. Rejection region:
Reject 𝐻0 ⇔ 𝑡 ≤ −𝑡𝛼 = −𝑡5% ≈ −1.699 (use 𝑑𝑓 = 29) (2 points)
If −𝑡5% mentioned, but value is wrong, still 1 point can be earned. If sign is to the wrong
side, or two-sided area used, zero points.
Remark: If complete incorrect test statistic given in step 2: only points can be earned
for step 1.
d. Joe might have made the error of not rejecting the null hypothesis that the mean sales after
the implementation of the training program are equal to 45,000 Euro, whereas in reality the
main sales after the implementation program are higher than 45,000 Euro. (2 points)
This is a type II error and has probability 𝛽, which depends on 𝛼, 𝑛, 𝜎 and the actual value of
𝜇𝐴𝑓𝑡𝑒𝑟 . (2 points)
1 point per mentioned factor
b. Since Job type is an ordinal variable, at least one of the categories is nominal (education level
is ordinal) and we need to use Cramers V. (3 points)
1 point for ‘Job type ordinal’, 1 point for at least one category ordinal and 1 point for
Cramers V