TD Regression
March 2021
x1 x2 y
-2.3 -8.8 9
-0.6 1.8 -1
-1.2 -4.3 5
... ... ...
We have built two simple regression models: the first predicts y using x1 (model 1) and the second
predicts y using x2 (model 2). The equations of the obtained models are:
1. Given that for model 3, σ̂β1 = 0.06, perform a statistical test to determine whether x1 significantly
influences y when x2 is already used (same quantile as in exercise 1).
2. Given that Im3 = 1263, does model 3 seem more useful than using just one variable?
3. Predict y using model 3 for the 3 new individuals given above. What do you conclude?
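The test asked for in question 1 is usually carried out as a Student t-test on the coefficient: compute t = β̂1 / σ̂β1 and reject H0: β1 = 0 when |t| exceeds the quantile. The sketch below illustrates that rule. Only σ̂β1 = 0.06 comes from the exercise; the coefficient estimate and the quantile used here are hypothetical placeholders, since neither value appears in this part of the sheet.

```python
def t_statistic(beta_hat, se_beta):
    """Student t statistic for the null hypothesis H0: beta = 0."""
    return beta_hat / se_beta


def is_significant(beta_hat, se_beta, quantile):
    """Reject H0 (two-sided test) when |t| is larger than the quantile."""
    return abs(t_statistic(beta_hat, se_beta)) > quantile


# Standard error of the x1 coefficient in model 3 (given in the exercise).
se_beta1 = 0.06

# HYPOTHETICAL values, for illustration only: replace them with the
# x1 coefficient of model 3 and the quantile from exercise 1.
beta1_hat_example = 0.3
quantile_example = 2.0

t_value = t_statistic(beta1_hat_example, se_beta1)
decision = is_significant(beta1_hat_example, se_beta1, quantile_example)
print(f"t = {t_value:.2f}, reject H0: {decision}")
```

With the real coefficient and quantile substituted, a rejection of H0 means that x1 still carries information about y once x2 is in the model.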
Exercise 3: Train / Test split
We have collected some data about 8 individuals that are described by 3 variables x1, x2, and y.
We aim at predicting y using x1 and x2, and we would like to estimate the generalization
error of a multiple regression model that predicts y.
For this, we apply the train / test split strategy. The dataset is split into a training set of 6 individuals
and a test set of 2 individuals. A multiple regression model is learned using the training set only.
The equation of the obtained model is: y = −89 + 16 x1 + 2 x2
The 2 individuals of the test set are:
Individual   x1    x2     y
1            1     54.5   38
2            1.5   49     36
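As a sketch of how the train / test estimate can be computed, the snippet below applies the fitted equation to the two test individuals and averages the squared errors. The sheet does not say which error measure is expected; the mean squared error (MSE) is assumed here, and the function and variable names are illustrative.

```python
def predict(x1, x2):
    """Model learned on the training set: y = -89 + 16*x1 + 2*x2."""
    return -89 + 16 * x1 + 2 * x2


# Test set from the exercise: (x1, x2, observed y).
test_set = [
    (1.0, 54.5, 38.0),
    (1.5, 49.0, 36.0),
]

predictions = [predict(x1, x2) for x1, x2, _ in test_set]
squared_errors = [(y - p) ** 2 for (_, _, y), p in zip(test_set, predictions)]

# ASSUMPTION: the generalization error is measured by the MSE on the test set.
mse = sum(squared_errors) / len(squared_errors)

print(predictions)     # [36.0, 33.0]
print(squared_errors)  # [4.0, 9.0]
print(mse)             # 6.5
```

With only 2 test individuals, this estimate is very sensitive to which individuals happen to fall in the test set.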
Question: Estimate the generalization error of a regression model that predicts y using x1 by
2-fold cross-validation.
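The procedure can be sketched as follows: split the individuals into two folds, fit a simple regression of y on x1 on one fold, measure the error on the other, swap the roles, and average the two errors. The data in this sketch are hypothetical (the 8 individuals are not listed in this part of the sheet), the split simply takes the first and second halves, and MSE is assumed as the error measure.

```python
def fit_simple_regression(xs, ys):
    """Least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def mean_squared_error(xs, ys, intercept, slope):
    """Average squared prediction error on the given points."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys)) / len(xs)


def two_fold_cv_mse(xs, ys):
    """2-fold cross-validation: each half serves once as the test fold."""
    half = len(xs) // 2
    fold_a = list(range(half))
    fold_b = list(range(half, len(xs)))
    fold_errors = []
    for train_idx, test_idx in ((fold_a, fold_b), (fold_b, fold_a)):
        intercept, slope = fit_simple_regression(
            [xs[i] for i in train_idx], [ys[i] for i in train_idx]
        )
        fold_errors.append(
            mean_squared_error(
                [xs[i] for i in test_idx], [ys[i] for i in test_idx],
                intercept, slope,
            )
        )
    return sum(fold_errors) / len(fold_errors)


# HYPOTHETICAL data (not from the exercise): 8 points lying exactly on y = 2*x + 1.
x1_example = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
y_example = [2 * x + 1 for x in x1_example]

cv_error = two_fold_cv_mse(x1_example, y_example)
print(cv_error)
```

Because the example points are perfectly linear, the cross-validated error is zero up to rounding; with real data, each fold gives a different model and a nonzero error.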