Professional Documents
Culture Documents
REGRESSION
REGRESSION
Syllabus
• Regression:
• Linear regression
• Non Linear regression
• Exp. Des. for Optimization: Theory of Response Surface Methodology
(RSM)
REGRESSION
Experimental Design
1
12/12/2023
2
12/12/2023
Introduction
Types of Regression
1. Linear Regression
2. Polynomial Regression
3. Logistic Regression
4. Quantile Regression
5. Ridge Regression
6. …
7. ….
8. Nnn…
3
12/12/2023
Regression Model
• In simple linear regression, the model used to describe the relationship
between a single dependent variable y and a single independent variable x is y =
β0 + β1x + ε.
• β0 and β1 are referred to as the model parameters, and ε is a probabilistic error
term that accounts for the variability in y that cannot be explained by the linear
relationship with x.
• If the error term were not present, the model would be deterministic; in that
case, knowledge of the value of x would be sufficient to determine the value of
y.
• In multiple regression analysis, the model for simple linear regression is
extended to account for the relationship between the dependent variable y and
p independent variables x1, x2, . . ., xp.
• The general form of the multiple regression model is y = β0 + β1x1 + β2x2 + . . . +
βpxp + ε.
• The parameters of the model are the β0, β1, . . ., βp, and ε is the error term.
4
12/12/2023
Examples
A primary use of the estimated regression equation is to predict the value of the dependent
variable when values for the independent variables are given.
For instance, given a patient with a stress test score of 60, the predicted blood pressure is 42.3 +
0.49(60) = 71.7.
5
12/12/2023
6
12/12/2023
ANOVA
• Df is the number of degrees of freedom associated with the sources of variance.
• SS is the sum of squares. The smaller the Residual SS viz a viz the Total SS, the better the
fitment of your model with the data.
• MS is the mean square.
• F is the F statistic or F-test for the null hypothesis. It is very effectively used to test the
overall model significance.
• Significance F is the P-value of F.
ANOVA
df SS MS F Significance F
Regression 1 1353.82113 1353.82113 428.2300848 9.3639E-11
Residual 12 37.93720744 3.161433954
Total 13 1391.758337
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 3.947931358 1.12885803 3.497278888 0.004403889 1.488360998 6.407501717 1.488360998 6.407501717
0.15 12.34519188 0.596567043 20.6937209 9.3639E-11 11.04538396 13.64499981 11.04538396 13.64499981
7
12/12/2023
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.855691
ANOVA
• R Square: 0.734→ 73.4% of the variation in the exam df SS MS F Significance F
scores can be explained by the number of hours Regression 2 1301.71 650.855 21.87382 2.64E-05
Residual 16 476.0796 29.75497
studied and the number of prep exams taken. Total 18 1777.789