Workshop 1 Statistics and Probability


WORKSHOP #1 STATISTICS AND PROBABILITY

PRESENTED BY:
ERIN JULIANA MOLINA DIAZ COD 52132
DIEGO ALEJANDRO TANDIOY RAMIREZ COD 54424

1.2 Are some towns windier than others? Does Chicago deserve the nickname “The Windy
City”? Below are the average wind speeds (in miles per hour) for 45 cities in the United
States:

AVERAGE WIND SPEEDS (miles per hour)

Li     Ls     Class mark   fi   Fi   Ni      Ni (%)
5.7    6.7    6.2           2    2   0.045     5%
6.7    7.7    7.2           3    5   0.114    11%
7.7    8.7    8.2           8   13   0.295    30%
8.7    9.7    9.2          18   31   0.705    70%
9.7    10.7   10.2          6   37   0.841    84%
10.7   11.7   11.2          4   41   0.932    93%
11.7   12.7   12.2          3   44   1.000   100%

Li, Ls = lower and upper class limits; fi = frequency; Fi = cumulative frequency;
Ni = cumulative relative frequency.
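
As a check on the table above, the cumulative and relative frequencies can be recomputed from the class frequencies fi. The following is a minimal Python sketch, not part of the original workshop; the variable names are illustrative, and the class limits and frequencies are copied from the table.

```python
# Class limits (mph) and class frequencies fi, copied from the table above.
limits = [(5.7, 6.7), (6.7, 7.7), (7.7, 8.7), (8.7, 9.7),
          (9.7, 10.7), (10.7, 11.7), (11.7, 12.7)]
fi = [2, 3, 8, 18, 6, 4, 3]

n = sum(fi)  # total number of cities counted in the table

rows = []
cumulative = 0
for (low, high), f in zip(limits, fi):
    cumulative += f
    rows.append({
        "class_mark": round((low + high) / 2, 1),  # midpoint of the class
        "fi": f,                                   # class frequency
        "Fi": cumulative,                          # cumulative frequency
        "rel": f / n,                              # relative frequency
        "cum_rel": cumulative / n,                 # cumulative relative frequency
    })

for (low, high), row in zip(limits, rows):
    print(f"{low:5.1f} {high:5.1f} {row['class_mark']:5.1f} "
          f"{row['fi']:3d} {row['Fi']:3d} {row['rel']:.3f} {row['cum_rel']:.3f}")
```

The `rel` column gives the bar heights for the relative frequency histogram asked for in part a, and `cum_rel` reproduces the cumulative column of the table.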

a. Construct a relative frequency histogram for this data.


[Figure: relative frequency histogram of the average wind speeds]

b. The value 35.1 was recorded at Mt. Washington, New Hampshire. Does the
geography of that city explain the magnitude of its average wind speed?
A/ Yes. The weather on Mount Washington is notoriously erratic: it has an alpine
climate, receives an extremely high amount of precipitation, and sits where winds from
the Great Lakes meet humid air from the Atlantic.

c. The average wind speed in Chicago is 10.3 miles per hour. What percentage of cities
have higher average wind speeds than Chicago?
A/ About 16% of the cities (7 of 44) fall in the classes above 10.7 miles per hour, so
roughly 16% of the cities have a higher average wind speed than Chicago.

d. Do you think Chicago has unusual winds?


A/ No. Chicago's average wind speed falls in the 9.7 to 10.7 miles per hour class, which
holds about 14% of the cities (6 of 44), so its winds are not unusual, especially compared
with Mount Washington.
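
The percentages used in parts c and d can be read off the frequency table. The sketch below is illustrative only: it assumes each class is a half-open interval (lower limit included, upper limit excluded), which the table does not state, and it counts only whole classes above Chicago's class because the raw city values are not listed.

```python
# Class limits (mph) and class frequencies from the wind-speed table.
limits = [(5.7, 6.7), (6.7, 7.7), (7.7, 8.7), (8.7, 9.7),
          (9.7, 10.7), (10.7, 11.7), (11.7, 12.7)]
fi = [2, 3, 8, 18, 6, 4, 3]
chicago = 10.3  # Chicago's average wind speed, from part c

n = sum(fi)

# Index of the class containing Chicago's value (assumed interval: low <= x < high).
k = next(i for i, (low, high) in enumerate(limits) if low <= chicago < high)

share_in_class = fi[k] / n         # cities in the same class as Chicago
share_above = sum(fi[k + 1:]) / n  # cities in strictly higher classes

print(f"Chicago's class: {limits[k]}")
print(f"Share in the same class: {share_in_class:.1%}")
print(f"Share in higher classes: {share_above:.1%}")
```

Because the data are grouped, cities that share Chicago's class cannot be ranked against it, so the share in higher classes is a lower bound on the share of windier cities.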

1.5 Here we give the relative frequency histogram associated with the grade point averages
(GPA) of a sample of 30 students:
a. Which of the GPA categories identified on the horizontal axis are associated with the
largest proportion of students?
A/ The category from 2.45 to 2.85 has the largest proportion of students, 0.46.
b. What proportion of students had a GPA in each of the categories we identified?
A/ 0.98 of the students had a GPA in the identified categories.
c. What proportion of the students had a GPA less than 2.65?
A/ 0.53 of the students had a GPA less than 2.65.

1.6 The relative frequency histogram below was constructed from data obtained from a
random sample of 25 families. Each was asked the number of quarts of milk they had
purchased the previous week.

a. Use this relative frequency histogram to determine the number of quarts of milk
purchased by the largest proportion of the 25 families. The category associated
with the largest relative frequency is called the modal category.
A/ The modal category is 2 quarts: the largest proportion of families bought 2 quarts of milk.
b. What proportion of the 25 families bought more than 2 quarts of milk?
A/ 0.36 of the families bought more than 2 quarts of milk.

c. What proportion bought more than 0 but fewer than 5 quarts?
A/ 0.88 of the families bought more than 0 but fewer than 5 quarts of milk.

1.7 The heights reported by 105 students from a biostatistics group were used to construct the
histogram that appears below.

a. Describe the shape of the histogram.
A/ The histogram has an unusual shape: it is asymmetric, widely spread, and has two peaks.

b. Does this histogram have an unusual characteristic?
A/ Yes. It shows wide dispersion, with the data falling into smaller, separate groups.

c. Can the reader give an explanation for the two peaks of the histogram? Are there
any considerations besides height that result in the two separate peaks? What are they?
A/ It implies that 27 people have a height between 66 and 72 and 10 have a height of
65.
Triola, 11th Edition
7. Cost of car crashes. The Insurance Institute for Highway Safety conducted crash tests with
new cars traveling at 6 mi/h. The total cost of damage was obtained for a simple random
sample of the cars tested and is presented below.
Is there a big difference between the different measures of central tendency?
❖ $4277  $4911  $6374  $7448  $9051

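• Mean: (4277 + 4911 + 6374 + 7448 + 9051) / 5 = 32061 / 5 = 6412.2
• Median: 6374
• Mode: 4277 4911 6374 7448 9051 = no mode (no value repeats)

A/ There is no big difference between the measures of central tendency. The mean and the
median differ by only 6412.2 - 6374 = 38.2, and there is no mode.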
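
The three measures for the crash-test costs can be verified with Python's standard `statistics` module. This is a minimal sketch added for checking the arithmetic; the variable names are illustrative, and "no mode" is taken to mean that no value repeats.

```python
from statistics import mean, median

costs = [4277, 4911, 6374, 7448, 9051]  # damage costs in dollars

avg = mean(costs)      # arithmetic mean
med = median(costs)    # middle value of the sorted data

# Treat the data as having no mode when every value occurs exactly once.
has_mode = len(set(costs)) < len(costs)

print(f"Mean:   {avg:.1f}")
print(f"Median: {med}")
print(f"Mode:   {'exists' if has_mode else 'none'}")
print(f"Mean - median: {avg - med:.1f}")
```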

8. FICO Scores: Below are the FICO credit scores from a simple random sample. When this
book was written, the reported mean FICO score was 678. Do the sample's FICO scores
appear to be consistent with the reported mean?
714
751
664
789
818
779
698
836
753
834
693
802

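• Mean: 9131 / 12 ≈ 761
• Median: (753 + 779) / 2 = 766
A/ No. The sample mean (about 761) and the median (766) are both well above the reported
mean of 678, and only one of the 12 scores (664) is below 678, so the sample does not
appear consistent with the reported mean.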
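
The FICO figures can be recomputed from the twelve listed scores. The sketch below uses only the standard library; the variable names are illustrative, and counting the scores below 678 is just one informal way to compare the sample with the reported mean, not a formal test.

```python
from statistics import mean, median

scores = [714, 751, 664, 789, 818, 779, 698, 836, 753, 834, 693, 802]
reported_mean = 678  # reported mean FICO score, from the exercise statement

sample_mean = mean(scores)
sample_median = median(scores)  # average of the 6th and 7th sorted scores

# Number of sample scores that fall below the reported mean.
below = sum(1 for s in scores if s < reported_mean)

print(f"Sample mean:   {sample_mean:.1f}")
print(f"Sample median: {sample_median}")
print(f"Scores below {reported_mean}: {below} of {len(scores)}")
```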

10. Pea Phenotypes: Biologists conducted experiments to determine whether a deficiency of
carbon dioxide in the soil affects the phenotypes of peas. The phenotype codes are indicated
below.
1 = smooth yellow  2 = smooth green  3 = wrinkled yellow  4 = wrinkled green
• Mean: 47/25 = 1.88
• Median: 2
• Mode: 1 (occurs 11 times)

Can measures of central tendency be obtained for these values?
A/ Yes. The mean, median, and mode can all be calculated from the codes, but only the mode
is a meaningful measure here.
Do the results make any sense?
A/ Only the mode does. The codes are labels for categories, so a mean of 1.88 or a median
of 2 has no interpretation, while the mode shows that phenotype 1 (smooth yellow) is the
most frequent, with 11 of the 25 peas.

Newbold
2.3 Ten economists were asked to predict the percentage growth that the consumer
price index would experience next year. Their predictions were:
3.0
3.1
3.4
3.4
3.5
3.6
3.7
3.7
3.7
3.9
a. Calculate the sample mean: 35.0 / 10 = 3.5
b. Calculate the sample median: (3.5 + 3.6) / 2 = 3.55
c. Find the mode: 3.7
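
These three values can be checked against the ten listed predictions. The following is a small sketch using the standard `statistics` module; the results are rounded to two decimals only to avoid floating-point noise.

```python
from statistics import mean, median, mode

# The ten predictions (percentage growth), in the order listed above.
predictions = [3.0, 3.1, 3.4, 3.4, 3.5, 3.6, 3.7, 3.7, 3.7, 3.9]

sample_mean = round(mean(predictions), 2)
sample_median = round(median(predictions), 2)  # average of the 5th and 6th values
sample_mode = mode(predictions)                # most frequent value

print(f"Mean:   {sample_mean}")
print(f"Median: {sample_median}")
print(f"Mode:   {sample_mode}")
```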
2.10 A sample of 33 accounting students recorded the number of hours dedicated to
studying the subject during the week prior to the final exam. The data is found in the study
data file.

a. Calculate the sample mean
A/ Sample mean: 8.5
b. Calculate the sample median
A/ Sample median: 9
c. Comment on symmetry or skew
A/ Skewness: 0.002. The skew is positive because the coefficient is greater than 0, but a
value this close to 0 means the data are nearly symmetric.
d. Find the five-number summary corresponding to this data
• 9 hours is the most common amount of study time.
• 1 student studied for 21 hours.
• Approximately half of the students studied for less than 9 hours.
• 2 hours is the minimum that students dedicated.
• 4 students studied for 12 hours.

2.11 The sun data file contains the volumes of a random sample of 100 containers (237 ml)
of a new tanning cream.

CONTAINER VOLUMES (ml), in ascending order down the left column and then the right

224 237
228 237
229 237
229 237
229 237
231 238
231 238
231 238
231 238
231 238
231 238
231 238
231 238
231 238
231 238
231 239
232 239
232 239
232 240
232 240
232 240
233 240
233 240
233 240
233 241
234 241
234 241
234 241
234 241
234 241
234 242
234 242
234 242
234 242
235 242
235 242
235 242
235 242
236 243
236 243
236 243
236 244
236 244
236 244
236 245
236 245
236 246
237 247
237 249

a) Find and interpret the mean volume

A/ Mean volume of the 100 samples: 23699 / 100 = 236.99 ml
On average, the containers hold almost exactly the nominal 237 ml.

b) Find the median volume

A/ Median: (237 + 237) / 2 = 474 / 2 = 237 ml
Half of the volumes in the sample are at most 237 ml, and half are at least 237 ml.
c) Is the data symmetrical or skewed?
A/ The data show a slight negative skew: the mean (236.99 ml) is just below the median (237 ml).

Histogram

d) Find the five-number summary corresponding to this data.

• 234 ml: the container has a capacity of 244.5 ml and an alternative capacity of
245 ml, so it is an appropriate container to store the sample.
• 3 of the containers have a capacity suitable for storing the
sample volume.
• 237 ml: the container has a capacity of 229.5 ml and an alternative of 230 ml, so
it is not suitable because its capacity is less than the volume of the product
sample.
• 241 ml: the container has a capacity of 234.5 ml and an alternative of 235
ml, so it is not suitable either.
• Several of the containers do not have sufficient capacity for storage.
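
For reference, a five-number summary lists the minimum, first quartile, median, third quartile, and maximum of the data. The sketch below is an illustration under stated assumptions, not the workshop's own calculation: it uses only the 98 volumes that are legible in the listing for exercise 2.11 (the data file has 100), it takes the quartiles as the medians of the lower and upper halves, and the function name `five_number_summary` is hypothetical. Other quartile conventions can give slightly different values.

```python
from statistics import median

# Frequencies of the volumes (ml) legible in the listing: 98 values in total.
counts = {224: 1, 228: 1, 229: 3, 231: 11, 232: 5, 233: 4, 234: 9,
          235: 4, 236: 9, 237: 7, 238: 10, 239: 3, 240: 6, 241: 6,
          242: 8, 243: 3, 244: 3, 245: 2, 246: 1, 247: 1, 249: 1}

# Expand the frequency table into a sorted list of individual volumes.
volumes = sorted(v for v, c in counts.items() for _ in range(c))


def five_number_summary(data):
    """Return (min, Q1, median, Q3, max) for a sample of at least two values.

    Q1 and Q3 are the medians of the lower and upper halves of the sorted
    data; the middle value is left out of both halves when the size is odd.
    """
    xs = sorted(data)
    half = len(xs) // 2
    lower = xs[:half]
    upper = xs[-half:]
    return xs[0], median(lower), median(xs), median(upper), xs[-1]


summary = five_number_summary(volumes)
print(summary)
```

The median from this sketch agrees with the 237 ml found in part b; the quartiles should be treated as approximate until they are recomputed on the full data file.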
