Employee Future Prediction
Employee Future Prediction
ISSN No:-2456-2165
Abstract:- The biggest asset that a company has is not his increasingly efficient worker attrition classifiers. the use of
business or the place or any other thing, but it’s the greater numerous datasets or certainly to replace it periodically,
employee. The employees pave the path for the success of the software of feature [3] engineering to identify new sizable
the company and all the administration that is needed to run traits from the dataset and the supply of additional statistics on
the company. But due to many reasons employee are not personnel would improve the overall information of the motives
happy with the work or the company they work in and this why personnel leave their agencies and, consequently, increase
tends or results for them to leave the company or finding a the time available to personnel departments to assess and plan
new company. It is one of the most important difficulties the duties required to mitigate this risk (e.g., retention activities,
that company owners confront when their organizations employee substitution and/or venture redistribution).[4]
lose their most brilliant personnel. A good employee is predicts the accuracy of five base models and then combines
always a valuable asset to the company, and their departure them to create a more powerful predictor model, which is
can result in a variety of issues, including financial losses, known as ensemble learning. The research uses a combination
poor overall performance, and the loss of accumulated of decision trees and linear regression to achieve an accuracy of
expertise. Furthermore, compared to recruiting current 86.39 percent, outperforming Adaboost and Random Forest, as
employees, employing new personnel is significantly more well as SVM.
expensive, stressful, and time-consuming. Hence, the
authors propose a system to predict the future of an III. METHODOLOGY
employee in a company taking in consideration various
factors and using the algorithm to get the prediction with a A. Data Preparation
great accuracy. The information was taken from the well-known employee
future prediction dataset on the internet. Due to the scarcity of
Keywords:- Prediction Algorithm, Machine Learning, data, the only method to run a model and get a forecast was to
Employee Future Prediction, Gradient Boosting, Attrition gather data from a reliable source. The dataset contains a variety
System. of properties like education, gender, age, city, payment tier,
joining year, leave or not, ever benched etc which can be seen
I. INTRODUCTION in figure 1. The dataset is quite big and contains ~4650 entries
which is taken from the employees in a company and 9 features
In many of the company’s employee, future has become a along with it.
major. It’s the responsibility of the HR (Human Resource)
Manager to fulfil necessary needs of their employs. Employs
generally leave the companies and to fulfil that particular
position they have to assign another person for that particular
position, which is not always possible for them. Since the
worker's quality and skills represents the companies the growth
and competitive advantage. [1] And there has been a method
developed called Predicting Employee Attrition. The purpose of
this is to support decisions are not based on personal aspects but
on unbiased data analysis. The goal of this work is to analyse
how purpose factors influence employee attrition, in order to
identify the man causes that contribute to a worker’s decision to
leave a company, and to predict whether a particular employee
will leave the company. Fig 1: Features/Attributes Overview
C. Feature Visualisation
The algorithm's features are crucial for producing reliable
results. Visualization helps us to see all the different factors and
how they influence the results. The figure 3 shows a heat map
between the features/attributes and which signifies Our Target
variable ('LeaveOrNot') has a very modest negative connection
with PaymentTier and Age.The target variable ('LeaveOrNot')
exhibits a very minor positive connection with 'JoiningYear.'
Figure 4 shows that Most of the Employees are having payment
Tier 3 By observing the trend it seems that 'PaymentTier'
Catergory is an Ordinal Variable Where, Tier 3 > Tier 2 > Tier
1. Figure 5 shows that most of the employees have a bachelors
degree but the employees with masters degree have left more.
And in figure 6 it shows that young people have left early due
Fig 6: Age GroupVisualisation
to more job opportunities out in the market.
D. Model Architecture
Gradient boosting is similar to random forest in that it
groups weak learners together to build a stronger one. However,
gradient boosting differs from random forest in that it adds
predictors in a sequential manner, preventing additional
improvement. The gradient booster uses the gradient of descent
to help discover and fix inaccuracies in learners' predictions. In
compared to decision trees and linear regression, it is memory
efficient and fast, but it fails to visualise perfectly. Gradient
boosting framework is used which is used to handle large size
data which in this case is a dataset of ~4650 and takes very low
memory to run. It also provides a support for the GPU learning
and thus is widely use by the data scientists and hence helps our
title to increase the accuracy.
V. CONCLUSION