Professional Documents
Culture Documents
Bivariate Data Analysis Olympic Project: Men's 200M Breaststroke
Bivariate Data Analysis Olympic Project: Men's 200M Breaststroke
Bivariate Data Analysis Olympic Project: Men's 200M Breaststroke
https://www.olympic.org/swimming/200m-breaststroke-men
www.artofstat.com
2.
4.
5. A linear model is not appropriate because there is a strong correlation between the Olympic
year and winning times in the scatterplot, but it has a slight curve. The residual plot is a u-
curve. This means, it is a non-linear relationship.
6. The 2020 Olympics would probably increase the strength of the data set. The winner would
most likely get a lower time than the 2016 winner, which follows the pattern shown in the
scatterplot.
8. As the Olympic year increase by one, the winning time increases .5 seconds.
9. When the Olympic year is 0, the winning time is 1130 seconds. This does not make sense
because there is no Olympic year that is 0.
12. r = -.944 : This data set has a strong, negative relationship. It also has a slight curve, so it is
non-linear.
13. r2 = 89% : 89% of the variation of the winning times can be explained by the linear
relationship with the Olympic year.
Sy 11.3
15. b=r ∙ b=−.944 ∙
Sx 21.4
= -.498