Linear Regression Model
Dependent variable
Marks obtained by each student.
Independent variables
Total time spent studying and number of courses chosen.
Sample of the data set
Summary output
Overview of data set
Correlation matrix
There is a strong positive correlation between the time spent studying and the marks obtained.
There is a moderate positive correlation between the number of courses taken and the marks obtained.
These findings suggest that investing more time in studying is strongly associated with better academic performance (higher marks), and taking more courses also has a positive but somewhat weaker relationship with higher marks.
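As a rough illustration of how each entry of a correlation matrix like this is obtained, the sketch below computes the Pearson correlation coefficient between two columns. The function name `pearson` and the sample values are hypothetical and are not taken from the data set; they only show the calculation.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of products of deviations from the means (unnormalised covariance).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Sums of squared deviations (unnormalised variances).
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical values for illustration only, NOT the actual data set.
time_study = [1.0, 2.5, 4.0, 6.0, 7.5]
marks = [6.0, 12.0, 20.0, 33.0, 45.0]

r = pearson(time_study, marks)
print(f"correlation(time_study, marks) = {r:.3f}")
```

A value close to +1 corresponds to the "strong positive" case described above, while a value nearer the middle of the 0 to 1 range corresponds to the "moderate positive" case.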
Line fit plots of independent variables
[Figure: line fit plot of Marks against time_study]
[Figure: number_courses Line Fit Plot, Marks against number_courses]
Residual plots of independent variables
[Figure: time_study Residual Plot, Residuals against time_study]
[Figure: number_courses Residual Plot, Residuals against number_courses]
Calculation of linear regression equation
MARKS = (TIME STUDY * 5.399179) + (NUMBER OF COURSES * 1.864051) - 7.45635
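The equation can be applied directly to predict marks for a given student. The following is a minimal sketch using the coefficients stated above; the function names `predict_marks` and `residual` are illustrative choices, and the inputs are the averages reported in the conclusion (4.077 hours, 5 courses, 24.4 marks).

```python
# Coefficients taken from the regression equation in this document.
INTERCEPT = -7.45635
COEF_TIME_STUDY = 5.399179
COEF_NUMBER_COURSES = 1.864051

def predict_marks(time_study, number_courses):
    """Predicted marks from the fitted linear regression equation."""
    return (COEF_TIME_STUDY * time_study
            + COEF_NUMBER_COURSES * number_courses
            + INTERCEPT)

def residual(observed_marks, time_study, number_courses):
    """Residual = observed marks minus predicted marks."""
    return observed_marks - predict_marks(time_study, number_courses)

# Evaluate the equation at the averages reported in the conclusion.
predicted_at_means = predict_marks(4.077, 5)
print(f"predicted marks at the averages: {predicted_at_means:.2f}")
print(f"residual against the average of 24.4: {residual(24.4, 4.077, 5):.2f}")
```

At these averages the equation gives roughly 23.9 marks, close to the reported average of 24.4. The small gap may simply reflect rounding in the reported averages, although that is an assumption rather than something the data confirms.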
CONCLUSION
According to the data set, students scored an average of 24.4 marks while studying an average of 4.077 hours and choosing 5 courses.
The trend is that as the independent variables (time studied and number of courses) increase, the dependent variable (marks) also increases.