Professional Documents
Culture Documents
Josh Rombach Case 2
Josh Rombach Case 2
Josh Rombach Case 2
DATE: 12/4/2022
Introduction
In this case assignment, I have been tasked by E and J Gallo to determine the correlation between
price and rating for their respective wines. I did this by completing multiple regression models
based on the data set that I received. This in turn allowed me to conclude that the company
should not base their price on ratings and should in turn use other means to determine price.
Data Analysis
After determining the correlations between each wine, I have concluded that red white and rose
all have a similar correlation of 0.45 whereas the sparkling wines are more correlated at 0.75. I
also found that there is correlation between both rating and price. Although there is subtle
correlation the impact on price is not the same across the 4 types.
R2 Interpretation
26% of variation in price is accounted for by variation in rating, number of ratings, and year.
Rating
H0: Wine rating does not significantly impact the price of the wine.
HA: Wine rating significantly impacts the price of the wine.
Because the p-value of 0 is less than our significance level of 0.05, we can reject the null and
accept that wine rating significantly impacts the price of the wine.
As rating increases by 1 point, price of the wine will increase $88.51 on average and all else
constant
Number of Ratings
H0: Number of Ratings does not significantly impact the price of the wine.
HA: Number of Ratings significantly impacts the price of the wine.
Because the p-value of 0.0025 is less than the significance level of 0.05, we can reject the null
and accept that number of ratings significantly impacts the price of wine.
As number of ratings increases by 1 rating, price of the wine decreases by $0.0025 on average
and all else constant.
Year Produced
H0: Year does not significantly impact the price of wine.
HA: Year does significantly impact the price of wine.
Because the p-value of 1.29 is greater than the significance level of 0.05, we can accept the null
hypothesis and conclude that year does not significantly impact the price.
Red Wine: As rating increases by 1 point, price increases by $124.31 on average and all else
constant.
White Wine: As rating increases by 1 point, price increases by $54.40 on average and all else
constant.
Rose Wine: As rating increases by 1 point, price increases by $25.50 on average and all else
constant.
Sparkling Wine: As rating increases by 1 point, price increases by $190.20 on average and all
else constant.
When looking at the correlation between the 4 types of wine, we can see that red, white, and rose
all have very similar correlations of around .45 whereas sparkling wines correlation was .75
which is more. Although the correlations were similar, the impact to each wine was not the same.
For Gallo to continue to sell wines at each different price point I think it would be important to
take rating and price into account. Even though there is a correlation between rating and price, I
do not recommend that Gallo solely base price off rating. This would cause all the highly rated
wines to be very expensive and would mean less people would be purchasing the wines.
Discussion of Tableau
Click here to see a dashboard of how rating affects price across the wine types.
There will be filter dropdowns on the dashboard that affects both visuals.
Conclusion
After completing the case for E and J Gallo, I have concluded that it is not in the best interest of
the company to base the price of their wines on the respective ratings. Although there is a
correlation between the two, purely basing price on rating would mean that many of the wines
would be too expensive and this in turn would limit the number of wines in each price category.
If there are any further questions, feel free to contact me at joshrombach@arizona.edu.
Appendix
Overall Regression Output
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.510193356
R Square 0.260297261 Josh Rombach
Adjusted R Square 0.260127682
Standard Error 62.4502063
Observations 13090
ANOVA
df SS MS F Significance F
Regression 3 17959202.25 5986400.748 1534.963426 0
Residual 13086 51035769.9 3900.028267
Total 13089 68994972.14
Regression Statistics
Multiple R 0.451251386
R Square 0.203627813
Adjusted R Square 0.203535811
Standard Error 75.82554439
Observations 8658
ANOVA
df SS MS F Significance F
Regression 1 12725338.25 12725338.25 2213.289689 0
Residual 8656 49767786.11 5749.513183
Total 8657 62493124.36
Regression Statistics
Multiple R 0.465742847
R Square 0.216916399
Adjusted R Square 0.216707966
Standard Error 27.38629444
Observations 3759
ANOVA
df SS MS F Significance F
Regression 1 780534.3113 780534.3113 1040.699756 9.2151E-202
Residual 3757 2817784.276 750.0091233
Total 3758 3598318.587
Regression Statistics
Multiple R 0.433450447
R Square 0.18787929
Adjusted R Square 0.185807554
Standard Error 14.51280194
Observations 394
ANOVA
df SS MS F Significance F
Regression 1 19100.59642 19100.59642 90.68686561 1.77449E-19
Residual 392 82563.5967 210.6214202
Total 393 101664.1931
Regression Statistics
Multiple R 0.731381124
R Square 0.534918349
Adjusted R Square 0.533239353
Standard Error 50.64704122
Observations 279
ANOVA
df SS MS F Significance F
Regression 1 817233.6047 817233.6047 318.5943416 5.85621E-48
Residual 277 710539.0113 2565.122785
Total 278 1527772.616