TCRIB2R123 PreetDesai
Rutuja Doiphode
CO-FOUNDER & CEO
TCR innovation.
Name – Preet Desai
Internship Program - Data Science with Machine Learning and Python
Batch - Jan 2022 - Mar 2022
Certificate Code - TCRIB2R123
Date of submission – 18/04/2022
Index Terms –
• Aim
• Introduction to Dataset
• Exploratory data analysis on dataset
• Training & Prediction of data
• Conclusion
I. AIM
The aim is to predict whether an employee will continue working at the company or leave.
For this, I use a dataset named “HR Employee Attrition”.
II. INTRODUCTION TO DATASET
The “HR Employee Attrition” dataset contains details about each employee, such as
gender, age, business travel, department, education, relationship satisfaction, and many others.
The dataset holds records for exactly 2940 employees, and each employee has 34 features.
It contains both numerical and categorical data. Below is an image of the dataset.
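The mix of numerical and categorical columns described above can be inspected with pandas. The following is only a rough sketch: the three sample rows, the column names, and the values are illustrative assumptions, not taken from the actual dataset file.

```python
import pandas as pd

# Hypothetical sample rows; the column names and values are assumptions
# chosen to mirror the kinds of features named in the text.
df = pd.DataFrame({
    "Age": [41, 49, 37],
    "Gender": ["Female", "Male", "Male"],
    "BusinessTravel": ["Travel_Rarely", "Travel_Frequently", "Travel_Rarely"],
    "Department": ["Sales", "Research & Development", "Research & Development"],
    "Education": [2, 1, 2],
    "RelationshipSatisfaction": [1, 4, 2],
})

# Split the columns by type: numeric columns vs. everything else.
numeric_cols = df.select_dtypes(include="number").columns.tolist()
categorical_cols = df.select_dtypes(exclude="number").columns.tolist()

print("Rows, columns:", df.shape)
print("Numerical:", numeric_cols)
print("Categorical:", categorical_cols)
```

On the real data, the same two `select_dtypes` calls would list which of the features need encoding before they can be passed to a classifier.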
IV. TRAINING & PREDICTION OF DATA
After analysing the complete dataset, I apply the Random Forest Classification
algorithm to train the machine learning model because this algorithm works
effectively with a large number of features. First, I split the dataset into 83.88%
for training and 16.12% for testing. Below is the implementation of training the
machine learning model and predicting with it to achieve the aim:
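A minimal sketch of this training and prediction step, assuming scikit-learn. It is not the original implementation: the data is a synthetic stand-in generated with `make_classification`, and the `random_state`, `n_estimators`, and `n_informative` values are assumptions. Only the row count, the feature count, and the 16.12% test share come from the text.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the HR data (assumption): 2940 rows, 34 numeric features.
X, y = make_classification(
    n_samples=2940, n_features=34, n_informative=10, random_state=0
)

# Hold out 16.12% of the rows for testing; the rest is used for training.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.1612, random_state=0, stratify=y
)

# Train the Random Forest classifier and predict on the held-out rows.
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

accuracy = accuracy_score(y_test, y_pred)
print(f"Train rows: {len(X_train)}, test rows: {len(X_test)}")
print(f"Test accuracy: {accuracy:.4f}")
```

With 2940 rows, a test share of 0.1612 gives 474 test rows and 2466 training rows. The accuracy printed here describes the synthetic data only and says nothing about the result on the HR dataset. The categorical columns of the real data would also need to be encoded as numbers before fitting.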
V. CONCLUSION
After applying the Random Forest Classification algorithm, the machine learning
model predicts employee attrition with an accuracy of 95.46%. This is not the only
method to train a model for predicting employee attrition. Various other algorithms
can also be used, but I found that this algorithm worked better than the other
classification algorithms I tried on the “HR Employee Attrition” dataset.
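A comparison of this kind can be sketched as follows, assuming scikit-learn. The text does not say which other algorithms were compared, so the decision tree and logistic regression below are assumed examples, and the data is again a synthetic stand-in rather than the HR dataset.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption), split once so every model
# is scored on the same held-out rows.
X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Candidate classifiers; this selection is illustrative only.
candidates = {
    "random_forest": RandomForestClassifier(random_state=0),
    "decision_tree": DecisionTreeClassifier(random_state=0),
    "logistic_regression": LogisticRegression(max_iter=1000),
}

# Fit each model and record its accuracy on the test rows.
scores = {
    name: clf.fit(X_train, y_train).score(X_test, y_test)
    for name, clf in candidates.items()
}

# Print the models from highest to lowest test accuracy.
for name, acc in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {acc:.4f}")
```

Which model ranks first depends on the data, so the ordering printed for the synthetic data should not be read as a result for the HR dataset.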
VI. REFERENCES