
CONFIDENTIAL CD/JUL 2023/STA108

UNIVERSITI TEKNOLOGI MARA


FINAL EXAMINATION

STATISTICS AND PROBABILITY


COURSE CODE : STA108
EXAMINATION : JULY 2023
TIME : 3 HOURS

INSTRUCTIONS TO CANDIDATES

1. This question paper consists of five (5) questions.

2. Answer ALL questions in the Answer Booklet. Start each answer on a new page.

3. Do not bring any material into the examination room unless permission is given by the invigilator.

4. Please check to make sure that this examination pack consists of:
i. the Question Paper
ii. a three-page Appendix 1
iii. an Answer Booklet - provided by the Faculty
iv. a graph paper - provided by the Faculty

5. Answer ALL questions in English.

DO NOT TURN THIS PAGE UNTIL YOU ARE TOLD TO DO SO


This examination paper consists of 6 printed pages
© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL

QUESTION 1

a) For each of the following statements, classify whether it is a qualitative, quantitative
discrete, or quantitative continuous variable.

i) The number of patients arriving at the emergency room.

ii) Ethnicity of the respondent in a survey.

iii) Quality of restaurant services and facilities.

iv) The time taken to complete an examination.


(4 marks)

b) A sociologist wants to know how much each household in City A spends on vacation each
year. It is assumed that the city has 3100 households. The sociologist divided the city into
400 blocks and interviewed every home in 24 randomly chosen blocks.

i) State the population and sample for this study.

ii) Determine the variable of interest in this study and state its level of measurement.

iii) Identify the sampling technique used for this study. State your reason.

iv) Suggest an appropriate data collection method used in this study.

(7 marks)

QUESTION 2

a) The following data give the results of a sample survey. The letters A, B, and C represent
the three types of illness (A = Acne, B = Bronchitis, and C = Chickenpox) that children
suffered in town XYZ.

i) Identify the variable involved and its type for the above data.

ii) Prepare a frequency distribution table for the above data.

iii) Find the mode for the above data. Interpret the value of the mode obtained.

iv) Name ONE (1) graphical data presentation that is suitable for the data above. State
your reason.

v) Is it appropriate to measure the mean or median values for the above data? Justify
your answer.
(12 marks)

b) The following table gives the frequency distribution of the number of hours spent per week
doing physical activities by all 50 students of Grade 8 at Gala International School.

Hours per week          Number of students
0 to less than 5                5
5 to less than 10               8
10 to less than 15             14
15 to less than 20             13
20 to less than 25              8
25 to less than 30              2

i) Calculate the mean and variance of the number of hours spent per week.

ii) Draw a ‘less than’ ogive for the above data.

iii) Estimate the second quartile from the ‘less than’ ogive in (ii).

iv) Determine the shape of the data distribution by calculating the Pearson coefficient of
skewness.

v) It is reported that the average and standard deviation of the number of hours spent
per week doing physical activities by students in Grade 9 are 16.9 and 5.43
respectively. Identify which grade is less consistent with doing physical activities
per week.

(20 marks)
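
The grouped-data calculations in part (b) can be checked numerically. The sketch below is one possible check, not a marking scheme: it assumes class midpoints stand in for the individual observations, and it reports both the population and the sample variance because the question does not say which convention is expected. The coefficient of variation for Grade 8 uses the population form here, which is also an assumption.

```python
import math

# Class limits and frequencies copied from the table in part (b).
classes = [(0, 5), (5, 10), (10, 15), (15, 20), (20, 25), (25, 30)]
freqs = [5, 8, 14, 13, 8, 2]

# Midpoints represent each class (standard grouped-data approximation).
midpoints = [(lo + hi) / 2 for lo, hi in classes]

n = sum(freqs)
sum_fx = sum(f * x for f, x in zip(freqs, midpoints))
sum_fx2 = sum(f * x * x for f, x in zip(freqs, midpoints))

mean = sum_fx / n
var_population = sum_fx2 / n - mean ** 2          # divides by n
var_sample = (sum_fx2 - n * mean ** 2) / (n - 1)  # divides by n - 1

# Coefficient of variation (%): a larger value means less consistency.
cv_grade8 = math.sqrt(var_population) / mean * 100
cv_grade9 = 5.43 / 16.9 * 100  # figures quoted in part (v)

print(f"mean = {mean:.2f}")
print(f"population variance = {var_population:.2f}")
print(f"sample variance = {var_sample:.2f}")
print(f"CV Grade 8 = {cv_grade8:.2f}%, CV Grade 9 = {cv_grade9:.2f}%")
```

Comparing the two coefficients of variation, rather than the raw standard deviations, accounts for the two grades having different means.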


QUESTION 3
The fitness center manager is interested in determining the relationship between the time
spent on aerobic exercise and a person's weight loss. Ten club members carefully recorded
the total number of exercises (in minutes) they did for a week and the amount of weight (in
kilograms) they had lost during the week. The data and outputs of IBM-SPSS software are
shown below.

Weight loss (kg)           | 0.5 | 1.3 | 1.0 | 0.9 | 1.4 | 1.7 | 1.9 | 1.1 | 1.2 | 0.2
Aerobic exercise (minutes) | 112 | 190 | 171 | 148 | 193 | 235 | 237 | 176 | 185 | 65

Table 1: Model Summary

Model | R | R Square | Adjusted R Square | Std. Error of the Estimate
1     |   | .974     | .971              | .088

Table 2: Coefficients

                   | Unstandardized Coefficients | Standardized Coefficients |        |
Model              | B    | Std. Error           | Beta                      | t      | Sig.
1 (Constant)       | C    | .099                 |                           | -5.306 | .001
  Aerobic exercise | .010 | .001                 | .987                      | 17.250 | .000

Based on the above data and the IBM-SPSS outputs above:

a) Identify the predictor and response variables.

b) Find the value of the Pearson correlation coefficient using the IBM-SPSS output. Hence,
interpret the value obtained.

c) Compute the value of C.

d) Interpret the value of the slope.

e) Explain the value of the coefficient of determination. Thus, give ONE (1) factor that might
increase the value of this coefficient.

f) Write the estimated linear regression model.

g) Estimate the weight loss of a person who exercised for two hours a week.
(15 marks)
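
The SPSS figures can be cross-checked by fitting the least-squares line directly from the raw data. The sketch below is only an illustration and rests on two reading assumptions: the weight-loss values carry one decimal place (0.5, 1.3, ...), and the last exercise time is 65 minutes. Under those assumptions the fit reproduces R Square = .974 and Beta = .987 from the tables. The variable names are hypothetical.

```python
import math

# Data as read from the table above (see the assumptions in the lead-in).
weight_loss = [0.5, 1.3, 1.0, 0.9, 1.4, 1.7, 1.9, 1.1, 1.2, 0.2]  # kg
exercise = [112, 190, 171, 148, 193, 235, 237, 176, 185, 65]      # minutes

n = len(exercise)
mean_x = sum(exercise) / n
mean_y = sum(weight_loss) / n

# Corrected sums of squares and cross-products.
sxx = sum((x - mean_x) ** 2 for x in exercise)
syy = sum((y - mean_y) ** 2 for y in weight_loss)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(exercise, weight_loss))

slope = sxy / sxx                     # kg lost per extra minute of exercise
intercept = mean_y - slope * mean_x   # the constant C in Table 2
r = sxy / math.sqrt(sxx * syy)        # Pearson correlation coefficient
r_squared = r ** 2                    # coefficient of determination

# Part (g): two hours of exercise is 120 minutes.
predicted = intercept + slope * 120

print(f"slope = {slope:.4f}, intercept = {intercept:.3f}")
print(f"r = {r:.3f}, r^2 = {r_squared:.3f}")
print(f"predicted weight loss at 120 minutes = {predicted:.3f} kg")
```

In an exam answer the constant would normally be obtained from the output itself, as C = t × Std. Error = -5.306 × .099, which agrees with the fitted intercept to within rounding.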


QUESTION 4

a) There are 60 students in ABC College. There are 27 and 20 students studying
Mathematics and Biology respectively. Meanwhile, there are 9 students studying both
Mathematics and Biology courses. Let M and B represent Mathematics and Biology,
respectively.

i) Draw a Venn diagram for the above events.

ii) Find P(M ∪ B).

iii) Find P(M ∩ B').


iv) Determine whether the event of ‘studying Mathematics’ is statistically independent of
the event of ‘not studying Biology’.
(10 marks)
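
The set calculations in part (a) can be verified with exact fractions. This is a minimal sketch using only the counts given in the question; the variable names are hypothetical, and the independence check simply compares P(M ∩ B') with P(M) × P(B').

```python
from fractions import Fraction

total = 60    # students in ABC College
n_m = 27      # studying Mathematics
n_b = 20      # studying Biology
n_both = 9    # studying both

p_m = Fraction(n_m, total)
p_b = Fraction(n_b, total)
p_both = Fraction(n_both, total)

# Addition rule: P(M ∪ B) = P(M) + P(B) - P(M ∩ B)
p_union = p_m + p_b - p_both

# Mathematics but not Biology: P(M ∩ B') = P(M) - P(M ∩ B)
p_m_and_not_b = p_m - p_both

# M and B' are independent exactly when P(M ∩ B') = P(M) * P(B')
p_not_b = 1 - p_b
independent = p_m_and_not_b == p_m * p_not_b

print(p_union, p_m_and_not_b, independent)
```

Using `Fraction` avoids floating-point rounding, so the equality test for independence is exact.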

b) A lecturer held a Mathematics test for her 40 students. Two questions were given, and
the lecturer found out that within the class, 30 of the students got the first question
correct. Of those students who got the first question correct, 18 got the second question
correct. Meanwhile, of those who got the first question incorrect, three of them got the
second question also incorrect. Let X be the event that the students got the first question
correct and Y be the event that the students got the second question correct.

i) Construct a tree diagram to represent the above information.

ii) Compute the probability of the students who got the second question incorrect.

iii) Find the probability that the students got the first question correct but the second
question is incorrect.

iv) Given that the selected student got the second question correct, find the probability
that the student got the first question wrong.
(10 marks)
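
The branches of the tree diagram in part (b) translate directly into products of probabilities. The sketch below is one way to check the answers, assuming the counts in the question are read as 30 of 40 correct on the first question, 18 of those 30 correct on the second, and 3 of the remaining 10 incorrect on the second. The variable names are hypothetical.

```python
from fractions import Fraction

# First-level branches: first question correct (X) or incorrect (X').
p_x = Fraction(30, 40)
p_not_x = 1 - p_x

# Second-level branches, conditional on the first question.
p_y_given_x = Fraction(18, 30)          # second correct, given first correct
p_not_y_given_not_x = Fraction(3, 10)   # second incorrect, given first incorrect

# Joint probabilities along each path of the tree.
p_x_and_not_y = p_x * (1 - p_y_given_x)
p_not_x_and_not_y = p_not_x * p_not_y_given_not_x
p_not_x_and_y = p_not_x * (1 - p_not_y_given_not_x)

# Total probability of the second question being incorrect or correct.
p_not_y = p_x_and_not_y + p_not_x_and_not_y
p_y = 1 - p_not_y

# Conditional probability (Bayes' rule): P(X' | Y) = P(X' ∩ Y) / P(Y)
p_not_x_given_y = p_not_x_and_y / p_y

print(p_not_y, p_x_and_not_y, p_not_x_given_y)
```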


QUESTION 5

a) A researcher surveyed 500 families living in the Palm Villa housing area to collect data on
the number of households in each family. The following table lists the frequency
distribution of the data collected by the researcher.

Number of households 2 3 4
Number of families 62 110 165 163

i) Construct a probability distribution table for the above data.

ii) Find the probability that a family has at least three households.

iii) Calculate the value of E(X) and E(X²).

iv) Find E(2X² - 3X + 5).


(10 marks)

b) A continuous random variable X has a probability density function as given below.

f(x) = (2/5)(x + 2),   0 < x < 1
f(x) = 0,              otherwise

i) Find the probability of P(X > 0.5).

ii) Calculate the expected value of X.

iii) Compute E[%X + 2).

iv) Calculate V(X) if E(X²) = 11/30.


(12 marks)

END OF QUESTION PAPER
