Professional Documents
Culture Documents
C2 W1 Graded Activity
C2 W1 Graded Activity
We have two different models that predict mileage based on different input parameters. Both the
models are regression-based models with different levels of accuracy in predicting mileage. Both
have input variables based on how car mileage is estimated. Play around with input variables and
see how it affects the car mileage. Ultimately you will use these models to answer questions
provided in the Graded Activity page for Week 1.
Model 1: Uses information about number of Cylinders, Engine Displacement, Drive, Transmission
Type, Vehicle Class and Internal Volume of the car as parameters to estimate the expected vehicle
mileage. The model is around 77% accurate
Model 2: Uses the CO2 emissions from the car to predicts expected vehicle mileage. The model
results are accurate 92 times out of 100
Note: Please do not update the equation for the Output variable for any of the models.
Regression Result Model 1 This sheet contains the results of the regression analysis of the model b
Regression Result Model 2 This sheet contains the results of the regression analysis of the model b
Car Mileage Data This sheet contains the test data on which the model is based. This data
arameters. Both the
cting mileage. Both
input variables and
wer questions
Drive, Transmission
he expected vehicle
n analysis of the model based on the input variables for the first m
n analysis of the model based on the input variables for the secon
model is based. This data as mentioned above is collected from the US Dept of Energy web
Click on the shaded orange cells below to change model parameters and observe how the predicted mileage is affected
Input Variables
Cylinders 6 units
Engine Displacement 1.4 liters Acceptable displacement is a decimal with a range betwee
Drive 4 wheel drive
Car Internal Volume 320 cubic feet Acceptable volume is an integer with a range between 0 a
Transmission Manual
Vehicle Class Van
Expected Output
Predicted Mileage 24.12 mpg
24.25
25.63
24.12
edicted mileage is affected.
Input Variables
Co2 Emissions 1500 grams/mile Acceptable input is an integer with a range betw
Expected Output
Predicted Mileage 0.00 mpg
bserve how the predicted milea
ANOVA
df SS MS F Significance F
Regression 6 596466.284 99411.047 9577.275 0
Residual 37724 391570.917 10.380
Total 37730 988037.201
Regression Statistics
Multiple R 0.924
R Square 0.854
Adjusted R Squar 0.854
Standard Error 1.955
Observations 37731
ANOVA
df SS MS F Significance F
Regression 1 843808.8537 843808.8537 220733.7522 0
Residual 37729 144228.3471 3.823
Total 37730 988037.2008