
9/3/2021 Assignment 1 - Basics of data

Assignment 1 - Basics of data


Click on a question number to see how your answers were marked and, where
available, full solutions.

Question Number    Score
Question 1         7 / 7
Question 2         21 / 21
Question 3         4 / 4
Question 4         7 / 7
Question 5         2 / 2
Question 6         11 / 12
Question 7         6 / 6
Question 8         12 / 12
Total              70 / 71 (98%)

Performance Summary
Exam Name: Assignment 1 - Basics of data
Session ID: 12143512387
Student's Name: KH (Karabo) Mokgere (ba00b20d0cb64604a80997f6369d981c)
Exam Start: Fri Sep 03 2021 10:41:20
Exam Stop: Fri Sep 03 2021 11:41:23
Time Spent: 1:00:03

Question 1
Researchers studying the effect of antibiotic treatment for acute sinusitis compared to
symptomatic treatments randomly assigned 100 adults diagnosed with acute sinusitis to
one of two groups: treatment or control. Study participants received either a 10-day course
of amoxicillin (an antibiotic) or a placebo similar in appearance and taste. The placebo

https://numbas.up.ac.za/run_attempt/226150?resource_link_id=_2436756_1 1/23

consisted of symptomatic treatments such as acetaminophen, nasal decongestants, etc. At
the end of the 10-day period patients were asked if they experienced significant
improvement in symptoms. The distribution of responses is summarised below.

Self-reported significant improvement in symptoms

                   Yes   No   Total
Group  Treatment    14   36      50
       Control       9   41      50
       Total        23   77     100

a)
What fraction of patients in the treatment group experienced a significant
improvement in symptoms? Round your answer to two decimal places.

Expected answer: 0.28


0.28 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

b)
What fraction of patients in the control group experienced a significant improvement
in symptoms? Round your answer to two decimal places.

Expected answer: 0.18


0.18 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

c)
At first glance, do antibiotics appear to be an effective treatment? Hint: calculate the
difference between your answers for part (a) and (b).

It appears that patients in the treatment group are more likely to experience improvements in symptoms.
It does NOT appear that patients in the treatment group are more likely to experience improvements in symptoms.

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 
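
The arithmetic behind parts (a) to (c) can be sketched in a few lines of Python. The counts come from the table in Question 1; the variable names are illustrative assumptions and not part of the assignment.

```python
# Counts copied from the Question 1 table (names are illustrative).
improved = {"treatment": 14, "control": 9}
group_size = {"treatment": 50, "control": 50}

# Proportion of each group that reported significant improvement.
proportion = {group: improved[group] / group_size[group] for group in improved}

# Difference in proportions (treatment minus control), as hinted in part (c).
difference = proportion["treatment"] - proportion["control"]

print(round(proportion["treatment"], 2))  # 0.28
print(round(proportion["control"], 2))    # 0.18
print(round(difference, 2))               # 0.1
```

A positive difference is only a first-glance indication. On its own it does not show that the gap is larger than what chance could produce.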

d)
What can we conclude from this study?

The data prove that antibiotics are not an effective treatment.
The data indicate that antibiotics may be an effective treatment.
The data provide convincing evidence that antibiotics are an effective treatment.
The data provide convincing evidence that antibiotics are not an effective treatment.
The data prove that antibiotics are an effective treatment.
The data indicate that antibiotics may not be an effective treatment.

 You chose a correct answer.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

Advice
Refer to Section 1.1 of Introductory Statistics with Randomization and Simulation.

Question 2

A survey was conducted to study the smoking habits of residents. Below is a data matrix
displaying a portion of the data collected in this survey. Note that "cig" stands for cigarettes.

      sex     age   married  grossIncome  smoke  amtWeekends  amtWeekdays
1     Male    42    Yes      72000        Yes    14 cig/day   10 cig/day
2     Male    49    No       81500        Yes    13 cig/day   15 cig/day
3     Female  16    No       75500        Yes    7 cig/day    9 cig/day
4     Female  38.5  Yes      97000        No     NA           NA
5     Male    44    No       44500        No     NA           NA
...
7756  Female  40    Yes      38500        No     NA           NA

a)
How many participants were included in the survey?

Expected answer: 7756


7756 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

b)
What is the average age of the first five (5) cases? Round your answer to one decimal
place.

Expected answer: 37.9


37.9 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

c)


What is the combined income for the first three (3) cases?

Expected answer: 229000


229000 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 
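
As a quick check of parts (b) and (c), the following sketch recomputes both answers from the rows shown in the data matrix. The list names are assumptions chosen for readability.

```python
# Ages of the first five cases and gross incomes of the first three cases,
# copied from the data matrix (list names are illustrative).
ages = [42, 49, 16, 38.5, 44]
gross_income = [72000, 81500, 75500]

mean_age = sum(ages) / len(ages)   # 189.5 / 5
total_income = sum(gross_income)   # sum of the first three incomes

print(round(mean_age, 1))  # 37.9
print(total_income)        # 229000
```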

d)
What is the average number of cigarettes that the first five (5) cases smoke per week?
Round your answer to one decimal place. You may answer the question in specified
steps, but you will lose 3 marks.

Step 0
Calculate the total number of cigarettes smoked by the first five cases on weekends
(2 days). Note that people who don't smoke, smoke zero (0) cigarettes per day.

Expected answer: 68

Score: 0/2 

Step 1
Calculate the total number of cigarettes smoked by the first five cases on the normal
weekdays (5 days). Note that people who don't smoke, smoke zero (0) cigarettes per
day.

Expected answer: 170


Score: 0/2 

Step 2


Calculate the total number of cigarettes smoked by the first five cases over the week
(add the answers for the previous two questions).

Expected answer: 238


Score: 0/2 

Step 3
Calculate the average number of cigarettes smoked by the first five cases by
dividing the total number of cigarettes smoked by the number of cases in question.
Round your answer to one decimal place.

Expected answer: 47.6


Score: 0/2 

(Your score will not be affected.)

Expected answer: 47.6


Answer: 47.6 

 Your answer is correct.


You were awarded 8 marks.
You scored 8 marks for this part.

Score: 8/8 
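
Steps 0 to 3 above can be sketched as follows, treating each NA as zero cigarettes per day as the question instructs. The variable names are assumptions made for this illustration.

```python
# Cigarettes per day for the first five cases; non-smokers (NA) count as 0.
weekend_per_day = [14, 13, 7, 0, 0]
weekday_per_day = [10, 15, 9, 0, 0]

weekend_total = 2 * sum(weekend_per_day)    # Step 0: two weekend days
weekday_total = 5 * sum(weekday_per_day)    # Step 1: five weekdays
week_total = weekend_total + weekday_total  # Step 2: whole week
average_per_case = week_total / 5           # Step 3: divide by the five cases

print(weekend_total, weekday_total, week_total)  # 68 170 238
print(round(average_per_case, 1))                # 47.6
```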

Indicate whether each of the following variables in the study is numerical or categorical.
If numerical, indicate if it is continuous or discrete. If categorical, indicate if the variable
is ordinal.

e)
sex

numerical continuous
categorical non-ordinal
categorical ordinal
numerical discrete

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

f)
age

categorical ordinal
numerical continuous
categorical non-ordinal
numerical discrete

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

g)
marital

numerical discrete
categorical non-ordinal
numerical continuous
categorical ordinal

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 


h)
gross income

categorical ordinal
categorical non-ordinal
numerical discrete
numerical continuous

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

i)
smoke

categorical non-ordinal
numerical discrete
numerical continuous
categorical ordinal

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

j)
amtWeekends

numerical discrete
categorical ordinal
numerical continuous
categorical non-ordinal

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

k)
amtWeekdays

numerical continuous
numerical discrete
categorical non-ordinal
categorical ordinal

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

Advice
Refer to Section 1.2 of Introductory Statistics with Randomization and Simulation.

Question 3
Suppose you want to estimate the percentage of videos on YouTube that are cat videos. It is
impossible for you to watch all videos on YouTube so you use a random video picker to
select 1000 videos for you. You find that 4% of these videos are cat videos. Determine which
of the following is an observation, a variable, a sample statistic, or a population parameter.

a)
Percentage of all videos on YouTube that are cat videos.

population parameter
sample statistic
observation
variable

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

b)
4%

observation
sample statistic
variable
population parameter

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

c)
A video in your sample.

population parameter
observation
sample statistic
variable

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

d)
Whether or not a video is a cat video.

population parameter
observation
variable
sample statistic

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

Advice
Refer to Section 1.3 of Introductory Statistics with Randomization and Simulation.

Question 4
A study published in the Journal of Personality and Social Psychology asked a group of 200
randomly sampled men and women to evaluate how they felt about various subjects, such
as camping, health care, architecture, taxidermy, crossword puzzles, and Japan in order to
measure their dispositional attitude towards mostly independent stimuli. Then, they
presented the participants with information about a new product: a microwave oven. This
microwave oven does not exist, but the participants didn’t know this, and were given three
positive and three negative fake reviews. People who reacted positively to the subjects on
the dispositional attitude measurement also tended to react positively to the microwave
oven, and those who reacted negatively also tended to react negatively to it. Researchers
concluded that "some people tend to like things, whereas others tend to dislike things, and
a more thorough understanding of this tendency will lead to a more thorough
understanding of the psychology of attitudes."

In the following questions you may choose one or more options, but note that incorrect
selections will be negatively marked.

a)

What are the cases?

the conclusion from the study
people who reacted negatively to the subjects
attitude towards a fictional microwave oven
200 randomly sampled men and women
dispositional attitude
people who reacted positively to the subjects
people who reacted positively to the microwave oven
the study
people who reacted negatively to the microwave oven
independent stimuli

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

b)
What is (are) the response variable(s) in this study?

people who reacted positively to the microwave oven
people who reacted negatively to the subjects
people who reacted negatively to the microwave oven
people who reacted positively to the subjects
independent stimuli
attitude towards a fictional microwave oven
the conclusion from the study
dispositional attitude
the study
200 randomly sampled men and women

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

c)
What is (are) the explanatory variable(s) in this study?

people who reacted negatively to the microwave oven
the conclusion from the study
people who reacted positively to the subjects
the attitude towards a fictional microwave oven
dispositional attitude
independent stimuli
people who reacted positively to the microwave oven
people who reacted negatively to the subjects
the study
200 randomly sampled men and women


 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

d)
Does the study employ random sampling?

No Yes

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

e)
Is this an observational study or an experiment?

Observational study Experiment

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

f)
Can we establish a causal link between the explanatory and response variables?

Yes No


 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

g)
Can the results of the study be generalized to the population at large?

Yes No

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

Advice
Refer to Sections 1.4 and 1.5 of Introductory Statistics with Randomization and Simulation.

Question 5
A recent article in a college newspaper stated that college students get an average of 5.1 hours
of sleep each night. A student who was skeptical about this value decided to conduct a
survey by randomly sampling 69 students. On average, the sampled students slept
4.4 hours per night.

a)
Which value represents the sample mean?

69 4.4 5.1


 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

b)
Which value represents the claimed population mean?

4.4 5.1 69

 You chose a correct answer.


You were awarded 1 mark.
You scored 1 mark for this part.

Score: 1/1 

Advice
Refer to Section 1.6 of Introductory Statistics with Randomization and Simulation.

Question 6
For each of the following, state whether you expect the distribution to be symmetric, right
skewed, or left skewed. Also specify whether the mean or median would best represent a
typical observation in the data, and whether the variability of observations would be best
represented using the standard deviation or IQR. You may select multiple options per
question, but note that incorrect selections are negatively marked.

a)
Housing prices in a country where 25% of the houses cost below R350,000, 50% of the
houses cost below R450,000, 75% of the houses cost below R1,000,000 and there are a
meaningful number of houses that cost more than R6,000,000.


the distribution is symmetric
the distribution is left skewed
the distribution is right skewed
the mean would represent a typical observation
the median would represent a typical observation
the variability of observations would be best represented using the standard deviation
the variability of observations would be best represented using the IQR

 You chose a correct answer.


You were awarded 1 mark.
 You chose a correct answer.
You were awarded 1 mark.
 You chose a correct answer.
You were awarded 1 mark.
You scored 3 marks for this part.

Score: 3/3 

b)
Housing prices in a country where 25% of the houses cost below R300,000, 50% of the
houses cost below R600,000, 75% of the houses cost below R900,000 and very few
houses cost more than R1,200,000.

the distribution is symmetric
the distribution is left skewed
the distribution is right skewed
the mean would represent a typical observation
the median would represent a typical observation
the variability of observations would be best represented using the standard deviation
the variability of observations would be best represented using the IQR

 You chose a correct answer.


You were awarded 1 mark.
 You chose a correct answer.
You were awarded 1 mark.
You scored 2 marks for this part.

Score: 2/3 
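
One rough way to reason about the shape in parts (a) and (b) is to compare the distance from the first quartile to the median with the distance from the median to the third quartile. The sketch below does that with the quartiles stated in the two questions. It is a heuristic under the assumption that the quartile gaps reflect the overall shape, not a formal test, and the names are illustrative.

```python
# Quartiles (Q1, median, Q3) in rand, copied from parts (a) and (b).
quartiles = {
    "a": (350_000, 450_000, 1_000_000),
    "b": (300_000, 600_000, 900_000),
}

summary = {}
for part, (q1, median, q3) in quartiles.items():
    summary[part] = {
        "lower_gap": median - q1,  # distance from Q1 up to the median
        "upper_gap": q3 - median,  # distance from the median up to Q3
        "iqr": q3 - q1,            # interquartile range
    }

for part, values in summary.items():
    print(part, values)
```

In part (a) the upper gap (550,000) is much larger than the lower gap (100,000), which is consistent with a long right tail. In part (b) both gaps equal 300,000, which is consistent with a roughly symmetric distribution.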

c)
Number of alcoholic drinks consumed by university students in a given week.

the distribution is symmetric
the distribution is left skewed
the distribution is right skewed
the mean would represent a typical observation
the median would represent a typical observation
the variability of observations would be best represented using the standard deviation
the variability of observations would be best represented using the IQR

 You chose a correct answer.


You were awarded 1 mark.
 You chose a correct answer.
You were awarded 1 mark.
 You chose a correct answer.
You were awarded 1 mark.
You scored 3 marks for this part.

Score: 3/3 

d)
Annual salaries of the employees at a large international company.

the distribution is symmetric
the distribution is left skewed
the distribution is right skewed
the mean would represent a typical observation
the median would represent a typical observation
the variability of observations would be best represented using the standard deviation
the variability of observations would be best represented using the IQR

 You chose a correct answer.


You were awarded 1 mark.
 You chose a correct answer.
You were awarded 1 mark.
 You chose a correct answer.
You were awarded 1 mark.
You scored 3 marks for this part.

Score: 3/3 

Advice
Refer to Section 1.6 of Introductory Statistics with Randomization and Simulation.

Question 7
Calculate the mean, variance and standard deviation for the following values: -20, -13, -10,
9, -5

a)
Calculate the mean for the values. Round your answer to one decimal place.

Expected answer: -7.8


-7.8 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

b)
Calculate the variance for the values. Use your answer from part (a) and round your
answer to one decimal place.

Expected answer: 117.7


117.7 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

c)


Calculate the standard deviation for the values. Use your answer from part (b) and
round your answer to one decimal place.

Expected answer: 10.8


10.8 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 
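
The three answers in Question 7 can be reproduced with Python's standard `statistics` module. Note that the expected variance of 117.7 matches the sample variance (dividing by n - 1 = 4), which is what `statistics.variance` computes. The variable names are illustrative.

```python
import statistics

values = [-20, -13, -10, 9, -5]

mean = statistics.mean(values)          # sum of values divided by n
variance = statistics.variance(values)  # sample variance, divides by n - 1
std_dev = statistics.stdev(values)      # square root of the sample variance

print(round(mean, 1))      # -7.8
print(round(variance, 1))  # 117.7
print(round(std_dev, 1))   # 10.8
```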

Advice
Refer to Section 1.6 of Introductory Statistics with Randomization and Simulation.

Question 8
603 randomly sampled registered voters from Tampa, FL were asked if they thought workers
who have illegally entered the US should be (i) allowed to keep their jobs and apply for US
citizenship, (ii) allowed to keep their jobs as temporary guest workers but not allowed to
apply for US citizenship, or (iii) lose their jobs and have to leave the country. The results of
the survey by political ideology are shown below.

                            Political ideology
                            Conservative  Moderate  Liberal  Total
(i) Apply for citizenship   86            72        60       218
(ii) Guest worker           56            65        51       172
(iii) Leave the country     52            34        20       106
(iv) Not sure               9             92        6        107
Total                       203           263       137      603

a)
What fraction of these Tampa, FL voters identify themselves as conservatives? Round
your answer to two decimal places.

Expected answer: 0.34


0.34 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

b)
What fraction of these Tampa, FL voters are in favor of the citizenship option? Round
your answer to two decimal places.

Expected answer: 0.36


0.36 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

c)
What fraction of these Tampa, FL voters identify themselves as conservatives and are
in favor of the citizenship option? Round your answer to two decimal places.

Expected answer: 0.14


0.14 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

d)
What fraction of these Tampa, FL voters who identify themselves as conservatives are
also in favor of the citizenship option? Round your answer to two decimal places.

Expected answer: 0.42


0.42 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

e)
What fraction of moderates share this view? Round your answer to two decimal places.

Expected answer: 0.27


0.27 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 

f)
What fraction of liberals share this view? Round your answer to two decimal places.

Expected answer: 0.44


0.44 

 Your answer is correct.


You were awarded 2 marks.
You scored 2 marks for this part.

Score: 2/2 
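
Parts (a) to (f) mix three kinds of proportion: marginal (a, b), joint (c), and conditional (d to f). The sketch below recomputes each one from the contingency table so the different denominators are explicit. The dictionary layout and names are assumptions made for this illustration.

```python
# Counts copied from the Question 8 table (keys are illustrative).
counts = {
    "citizenship":  {"conservative": 86, "moderate": 72, "liberal": 60},
    "guest_worker": {"conservative": 56, "moderate": 65, "liberal": 51},
    "leave":        {"conservative": 52, "moderate": 34, "liberal": 20},
    "not_sure":     {"conservative": 9,  "moderate": 92, "liberal": 6},
}
ideologies = ["conservative", "moderate", "liberal"]

# Column totals per ideology and the grand total of respondents.
column_total = {i: sum(row[i] for row in counts.values()) for i in ideologies}
grand_total = sum(column_total.values())
citizenship_total = sum(counts["citizenship"].values())

# (a), (b): marginal proportions, divided by the grand total.
marginal_conservative = column_total["conservative"] / grand_total
marginal_citizenship = citizenship_total / grand_total

# (c): joint proportion, one cell divided by the grand total.
joint = counts["citizenship"]["conservative"] / grand_total

# (d)-(f): conditional proportions, one cell divided by its column total.
conditional = {i: counts["citizenship"][i] / column_total[i] for i in ideologies}

print(round(marginal_conservative, 2), round(marginal_citizenship, 2))  # 0.34 0.36
print(round(joint, 2))                                                  # 0.14
print({i: round(p, 2) for i, p in conditional.items()})
```

The last line prints 0.42, 0.27 and 0.44 for conservatives, moderates and liberals respectively, matching the expected answers for parts (d), (e) and (f).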

Advice
Refer to Section 1.6 of Introductory Statistics with Randomization and Simulation.

Created using Numbas (https://www.numbas.org.uk), developed by Newcastle University (http://www.newcastle.ac.uk).
