AI Workshop Predict Employee Leave
In this lab, you will learn how to create a machine learning model with Azure Machine
Learning Studio that predicts whether an employee will stay or leave your company. We are
aware of the limitations of the dataset, but the objective of this hands-on lab is to inspire you
to explore the possibilities of using machine learning for your own research, not to build
the next HR solution.
You will follow several steps to explore the data and build a machine learning model to
predict whether an employee will leave or not, and why.
You will build this prediction model with the Azure Machine Learning Studio. The
complete model will look like this:
Prerequisites: Get Access to Azure Machine Learning
Studio
Go to https://studio.azureml.net/ and select Sign up here for Azure ML Studio.
You will need a Windows LiveID to sign in. If you don’t have one, you can sign up here:
https://signup.live.com/
After signing in, you can select the Free Workspace option:
Step 1: Open the experiment
This experiment uses a simulated dataset from Kaggle. Open the experiment in your studio by
clicking on the green button “Open in Studio”. This will open the Azure Machine Learning
Studio in a browser, and you can copy the experiment to your free workspace.
Step 2: Inspecting the data
In the Starting experiment: Predict Employee Leave experiment, you will find the
Employee Leave data on the canvas, together with a Summarize Data module.
If you look at the top right corner, you can see that the experiment is “in draft”. This means
that it hasn’t been saved or run before. Therefore, we start by running the experiment:
click the RUN button in the menu at the bottom. After running the experiment, the message
in the top right corner changes to “Finished running”, and we can start inspecting our data.
To get a first impression of the data, you can right-click the output port of the dataset and
select “Visualize” from the menu. The output port is the little circle under every module on
the canvas.
You can scroll through the different columns, and by selecting one, you get an overview of
that column in the panel on the right.
We can continue inspecting the dataset. The data cover a wide range of topics that allow us
to explain employees’ leave behaviour in relation to A) organizational factors
(department); B) employment relational factors (i.e. tenure, the number of projects
participated in, the average working hours per month, objective career development, and
salary); and C) job-related factors (performance evaluation and involvement in workplace
accidents).
Organizational factors
• Department
Employment relational factors
• Tenure
• Number of projects participated in
• Average working hours per month
• Objective career development
• Salary
Job-related factors
• Last evaluation
• Whether they have had a work accident
Dependent variable
• Whether the employee has left the company
Another way to get a first impression of the data is to use the Summarize Data module,
which gives us summary statistics about the data.
After you have run the experiment, you can right-click on the output port of the Summarize
Data module and select Visualize. We see that we have 14999 observations and that there
are no missing values. We also get an idea of the variance and distribution of the data.
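For readers who want to reproduce this inspection outside of Azure ML Studio, here is a
minimal sketch in Python with pandas, assuming the Kaggle dataset has been downloaded
locally; the file name HR_comma_sep.csv is an assumption and may differ in your copy.

    import pandas as pd

    # Load the simulated Kaggle HR dataset (assumed local file name).
    df = pd.read_csv("HR_comma_sep.csv")

    print(df.shape)            # 14999 observations
    print(df.isnull().sum())   # every column should show 0 missing values
    print(df.describe())       # variance and distribution, as in Summarize Data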
Step 3: Split the data
Next, we split the data into a training set and a test set. Search for the Split Data module in
the palette on the left. When you have found the Split Data module, you can drag it onto the
canvas and connect the output port of the dataset to the input port of the Split Data module.
You connect modules by left-clicking on the output port and keeping your mouse button
down while dragging it to the module you want to connect it to. We set a seed, so we can
repeat this experiment.
Make sure you RUN the experiment after every step.
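As a point of comparison, the Split Data step roughly corresponds to scikit-learn’s
train_test_split. Continuing the earlier pandas sketch: the 70/30 ratio below is an assumption
(the lab keeps the module’s own settings), and the fixed random_state plays the role of the
seed set in the module.

    import pandas as pd
    from sklearn.model_selection import train_test_split

    # One-hot encode the categorical columns (department, salary) and separate
    # the dependent variable "left" from the features.
    X = pd.get_dummies(df.drop(columns=["left"]))
    y = df["left"]

    # Assumed 70/30 split; random_state mirrors the seed in the Split Data module.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, random_state=42)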
Step 4: Train the model
Furthermore, we have to select the algorithm to train the model with. In this experiment we
use the Two-Class Boosted Decision Tree algorithm with its standard parametrization. We
do add a seed to make this experiment replicable. Connect the algorithm to the first input
port of a Train Model module, connect the first output port of the Split Data module (the
training data) to its second input port, and select the column “left” as the label column.
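The Two-Class Boosted Decision Tree is Azure ML’s own implementation; a loose stand-in
in the scikit-learn sketch is a gradient-boosted tree classifier with default parameters and a
fixed seed for replicability. This is an approximation, not the same algorithm.

    from sklearn.ensemble import GradientBoostingClassifier

    # Default parameters, seeded for replicability (a rough analogue of the
    # Two-Class Boosted Decision Tree's standard parametrization).
    model = GradientBoostingClassifier(random_state=42)
    model.fit(X_train, y_train)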
Step 5: Score the test set
After this, we are ready to score the test set and see how our model performs. To do this,
we use the Score Model module and connect both the output port of the Train Model
module, which contains the trained model, and the second output port of the Split Data
module, which contains the test data.
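In the scikit-learn sketch, the Score Model step corresponds to predicting labels and class
probabilities on the held-out test data.

    # Analogues of the two columns the Score Model module adds:
    scored_labels = model.predict(X_test)                      # "Scored Labels"
    scored_probabilities = model.predict_proba(X_test)[:, 1]   # "Scored Probabilities"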
Step 6: Evaluate the results
Finally, it’s time to evaluate the results of our model. We use the Evaluate Model module,
which we connect to the output of the Score Model module from the previous step.
Let’s run the experiment, and then right-click on the output port of the Evaluate Model
module and select Visualize to see the results. We can predict with 98% accuracy and 98%
precision.
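For comparison, accuracy and precision can be computed like this in the scikit-learn sketch;
the exact numbers will differ from the 98% reported by Evaluate Model, since the algorithm
and split above are only approximations of the Studio modules.

    from sklearn.metrics import accuracy_score, precision_score

    print("accuracy: ", accuracy_score(y_test, scored_labels))
    print("precision:", precision_score(y_test, scored_labels))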
Step 7: Gain insights on the why
Our final question was why employees are leaving. To answer it, we add the
Permutation Feature Importance module. We connect the output port of the Train Model
module and the second output port of the Split Data module (the test data). Now we can
compute the permutation feature importance scores of the feature variables, given this
trained model and the test dataset. We set a seed to make the experiment replicable, and we
focus on accuracy, meaning that we are interested in correctly identifying both the people
that leave and the people that will not leave.
If we run the experiment and right-click on the output port of the Permutation Feature
Importance module, we find that satisfaction was one of the main factors behind leaving,
according to this dataset. Next to that, the number of projects an employee was assigned was
important.
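scikit-learn offers a comparable computation in sklearn.inspection.permutation_importance;
the sketch below scores on accuracy and is seeded, mirroring the settings described above.

    from sklearn.inspection import permutation_importance

    # Permute each feature on the test set and measure the drop in accuracy.
    result = permutation_importance(
        model, X_test, y_test, scoring="accuracy", random_state=42)

    # List the features from most to least important.
    ranked = sorted(zip(X_test.columns, result.importances_mean),
                    key=lambda pair: pair[1], reverse=True)
    for name, importance in ranked:
        print(f"{name}: {importance:.4f}")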
Step 8: Publish your Model
Publish the Model as a Web Service
Make sure you have saved and run the experiment. With the Starting experiment: Predict
Employee Leave experiment open, click the SET UP WEB SERVICE icon at the bottom of
the Azure ML Studio page and click Predictive Web Service [Recommended].
A new Predictive Experiment tab will be automatically created. Verify that, with a bit of
rearranging, the Predictive Experiment resembles this figure:
An important step to carry out now is eliminating the dependent variable “left”, because this
is exactly what you want to predict and you can’t leave it in as an input variable, even though
it would be ignored by the model itself. To do so, add a Select Columns in Dataset module
and place it between the dataset and the Score Model module.
Select the Select Columns in Dataset module, open the Launch column selector, and
remove the variable “left”. Now RUN the predictive experiment.
When you inspect the output of the Score Model module, by right-clicking on the output port
and selecting Visualize, you will see that there are 2 extra columns in your dataset, named
Scored Labels and Scored Probabilities. Scored Labels contains the prediction of whether an
employee will leave or not, and is based on the Scored Probabilities value, with 0.5 as the
cut-off: from a probability of 0.5 upwards, an employee is predicted to leave.
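In other words, the label is just the probability thresholded at 0.5. In the scikit-learn sketch
the same relationship looks like this:

    # "Scored Labels" follow from "Scored Probabilities" with a 0.5 cut-off:
    # a probability of 0.5 or higher means the employee is predicted to leave.
    predicted_to_leave = scored_probabilities >= 0.5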
Step 9: Deploy and Use the Web Service
In the Starting Experiment: Predict Employee Leave [Predictive Exp.] experiment, click
the Deploy Web Service button at the bottom of the window.
Wait a few seconds for the dashboard page to appear. You have several options to connect to
the web service.
Step 10: Test your model
To test this web service, you can, for example, click on New Web Services Experience
(preview). This will open a new browser tab where you have the option to test your model
(the Test endpoint option under BASICS).
When clicking on Test endpoint, you have the option to enable the use of sample data,
which will generate a sample record to test your model with. After enabling this, you will
see the generated sample record.
The final step is to press the Test Request-Response button: will this person leave
the company?
If the web service does not work, you can also use the option to click on the blue TEST
button, or to launch an Excel file. It is up to you to explore these options now.
Summary
By completing this lab, you have prepared your environment and data, and built and deployed
your own Azure Machine Learning model. We hope you enjoyed this introductory hands-on
lab and that you will build many more machine learning solutions!
Limitations
Of course, there is much information missing. We don’t know anything about the dates of the
obtained data, nor do we know anything about what happened between the data gathering
and the moment the employee left.
Inspiration
As mentioned before, this hands-on lab is created to inspire you. If for whatever reason you
were struggling to get the model built, you can also download the complete model from the
Azure AI Gallery.
We hope you have enjoyed this workshop and that it has inspired you to build your own
models. If you want to take your models into production, please use another
environment: https://ml.azure.com/
There you will find a similar interface, called the Designer, but with that interface, you can
also deploy and manage your models.