Professional Documents
Culture Documents
Financial Data
Financial Data
Q1)
a) Average Salary
Department Sum of Salary Count of Gender Average Salary
Admin Offices 318638 4 79659.5
IT/IS 2157945 23 93823.69565
Production 6168673 102 60477.18627
Sales 1188687 16 74292.9375
Software Engineering 467227 5 93445.4
Grand Total 10301170 150 68674.46667
Average Salary
Software Engineering
Sales
Production
IT/IS
Admin Offices
0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000
Average Performance
Department Sum of Performance level Count of Gender Average Performance
Admin Offices 12 4 3
IT/IS 70 23 3.043478261
Production 301 102 2.950980392
Sales 45 16 2.8125
Software Engineering 14 5 2.8
Grand Total 442 150 2.946666667
Average Performance
3.1
3.05
3
2.95
2.9
2.85
2.8
2.75
2.7
2.65
Admin Offices IT/IS Production Sales Software Engineering
Average of Salary & Performance
Regression Statistics
0.614210
Multiple R 138
0.377254
R Square 094
Adjusted R 0.355630
Square 972
0.498982
Standard Error 003
Observations 150
ANOVA
Significa
df SS MS F nce F
21.719775 4.343955 17.44679 1.75205E-
Regression 5 68 137 136 13
35.853557 0.248983
Residual 144 65 039
57.573333
Total 149 33
Interpretation
Firstly, we can understand that here the model itself is significant with the Significance F value being less
than 0.05 with 149 degrees of freedom. Thus, we can carry on with the analysis.
Starting with the regression statistics, since there are multiple regression models, R-square value has to be
checked first which in this case is 0.355 does indicating that only 35% of the variance in the dependent
variable is being explained by the independent variables.
Analyzing the P-values of each of the independent variable we can see that the number of absentees and the
special projects count have a P-value of greater than 0.05 (0.80 & 0.2 respectively). Here we understand that
these two particular way tables or less significant in the equation. The remaining three independent variables
on the other hand has a high significance as they have their respective P-values, less than 0.05. These are the
key drivers that affect the dependent variable that is performance level indicator the most.
Here we can generate the regression equation using these variables:
Performance level = 0.668 + 4.65 × 10-6 × (Salary) + 0.367 × (Engagement level) + 0.12 × (Work Life
Balance)
So as to obtain a better model, we can run the regression again using the significant variables.
Q2) In Order to See if the Employees were benefitted from the training provided, firstly the change in
training score is calculated (Post training score – Pre training score)
Benefitted Male Benefitted Female Benefitted Batch 1 Benefitted Batch 2 Total Benefitted
Count 13 11 6 18 24
Percent 68.421% 78.571% 54.545% 81.818% 72.727%
Males Females Batch 1 Batch 2 Total
Total 19 14 11 22 33