01.Promotion_Classification_Using_DecisionTree_and_Principal_Component_Analysis
01.Promotion_Classification_Using_DecisionTree_and_Principal_Component_Analysis
and 5th ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (NCON)
Abstract— The goal of human resource management is to of the dataset and PCA features. From the accuracy
support the employee to higher position promotion and improve comparison of the model by using features of the dataset and
their work performance. The key process for performance PCA features, we want to find group of features that impact
evaluation is promotion classification. This paper proposes the with promotion status of human resource dataset.
machine learning based promotion classification using decision
tree and principle Component Analysis (PCA). This paper uses II. LITERATURE REVIEW
Human Resource Analysis Case Study dataset from Kaggle for
comparing the promotion classification. The comparison of A. Human Resource Research with IT Aspect
classification performance for all features, personal features, This research we focus on human resource literature
and performance features are conducted. The classification reviews which use machine learning to classify HR dataset.
results show that the classification with PCA provides the Each literature reviews have different objectives such as
highest accuracy at 91.25%. classify professional blogger, find expert in organizations,
classify talent of employee, develop model of employee
Keywords - Professional Development, Decision Tree, Human engagement, evaluate employee performance, classify
Resource, PCA
employee promotion, and predict employee turnover [1-11].
I. INTRODUCTION B. Promotion Evaluation with IT Aspect
One part of professional development in the human The promotion in human resource can refer to
resource management (HRM) is Training and Development performance evaluation. Many human resource researches in
(T&D) process. Main objectives of T&D focus on develop information technology fields provide the necessary of
employee to satisfy job criterion or promote in next positions. performance evaluation in human resource management
Nowadays, the problem of companies concerned with process. [12] The important of human resource management
recruiting staffs who satisfy for job criterions in each position is managing human resource in organization to high quality
as well as training and development (T&D) for employees in manpower. And the quality and performance of practicing can
company to satisfy their objective or job position criterion. determine the companies' fate. This research presents the goal
The problems of training and development have many reasons of all human resource development theories referred to
which can be occurred from staff problem or organization “selecting the right people for the right positions”. [13] The
978-1-6654-9510-3/22/$31.00 ©2022 IEEE | DOI: 10.1109/ECTIDAMTNCON53731.2022.9720415
problem. Training and development of organization problems key success in the enterprise caused by successful to manage
have an allocation resources such as high cost of training and employees' capability. The matching between right jobs and
development, sending inappropriate employee for training, excavating excellent employee are difficult process and will
insufficient work time, etc. From the previous problem, be the problem of managers.
organization must find employees who have good
productivity and performance to promote into training and C. Principal Component Analysis (PCA)
development process. Therefore, skills, performance of work, PCA is a feature extraction by unsupervised algorithm. It
work experience or background knowledges of employees are is used to reduce dimensionality of the data. Linear algebra
used to evaluate with job criterions. Generally, each company and statistics are used in PCA calculation for finding high
have different techniques to manage human resources for variant and high correlation with outputs and rearrange the
develop staff or employee in company. Each company use features in linear transformation for create new variable in
different human resource software to manage in human simple matrix [14]. The first feature of PCA describes the high
resource which depend on business of each company to report variance and holds the most information by any project of the
employee performance results. And making decision on the dataset. The second feature is more informative and has
report by analysis weights or scores of overall attributes such biggest variance more than the third, and so on. The steps of
as performance, KPI, length of service, merit and ability, PCA are show below.
education, technical, potential, training, etc.
• Normalized features by Standardize.
This research aims to find the features that high correlation
with promotion status. Promotion selection is one indictor to • Covariance matrix calculation.
choose employee who have good performance and more • Finding the eigenvalues and eigenvectors for the
experience. From these reasons, features of dataset are covariance matrix.
designed to three groups which are all features, personal
features, and performance features. And combination with • Plot the vectors on the scaled data.
PCA data extraction process for finding the features that high
After PCA process, the PCA features are used in Decision
variant and high correlation with promotion status. Decision
Tree for promotion classification.
Tree is used to classify promotion classification with features
Accuracy
Algorithm Round Personal features +
Personal features
PCA
1 89.67% 89.76%
2 89.71% 89.73%
Decision Tree
3 89.72% 89.74%
490
Authorized licensed use limited to: b-on: UNIVERSIDADE NOVA DE LISBOA. Downloaded on January 16,2025 at 01:01:37 UTC from IEEE Xplore. Restrictions apply.
The 7th International Conference on Digital Arts, Media and Technology (DAMT)
and 5th ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (NCON)
491
Authorized licensed use limited to: b-on: UNIVERSIDADE NOVA DE LISBOA. Downloaded on January 16,2025 at 01:01:37 UTC from IEEE Xplore. Restrictions apply.
The 7th International Conference on Digital Arts, Media and Technology (DAMT)
and 5th ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (NCON)
[2] Z. Huang and D. Jiang, "Research and Implementation of Fuzzy 2021 13th International Conference on Knowledge and Smart
ISODATA Clustering Algorithm Based on Gene Expression Technology (KST), 2021, pp. 181-185, doi:
Programming in Human Resource Management," 2011 International 10.1109/KST51265.2021.9415794.
Conference of Information Technology, Computer Engineering and [9] A. Assiri, J. Berri and A. Chikh, "Classification and tendencies of
Management Sciences, Nanjing, Jiangsu, 2011, pp. 178-180, doi: evaluations in e-learning," International Conference on Education and
10.1109/ICM.2011.361. e-Learning Innovations, Sousse, 2012, pp. 1-6, doi:
[3] G. K. Hoon, G. K. Min, O. Wong, O. B. Pin and C. Y. Sheng, 10.1109/ICEELI.2012.6360570.
"Classifly: Classification of Experts by Their Expertise on the Fly," [10] M. Eminagaoglu and S. Eren, "Implementation and comparison of
2015 IEEE/WIC/ACM International Conference on Web Intelligence machine learning classifiers for information security risk analysis of a
and Intelligent Agent Technology (WI-IAT), Singapore, 2015, pp. 245- human resources department," 2010 International Conference on
246, doi: 10.1109/WI-IAT.2015.63. Computer Information Systems and Industrial Management
[4] C. Stephanie and R. Sarno, "Classification Talent of Employee Using Applications (CISIM), Krackow, 2010, pp. 187-192, doi:
C4.5, KNN, SVM," 2019 International Conference on Information and 10.1109/CISIM.2010.5643665.
Communications Technology (ICOIACT), 2019, pp. 388-393, doi: [11] Q. Guohao, W. Bin, W. Bai and Z. Baoli, "Competency Analysis in
10.1109/ICOIACT46704.2019.8938508. Human Resources Using Text Classification Based on Deep Neural
[5] Q. Chen and Z. Gong, "Data mining modeling of employee engagement Network," 2019 IEEE Fourth International Conference on Data
for IT enterprises based on decision tree algorithm," 2013 6th Science in Cyberspace (DSC), Hangzhou, China, 2019, pp. 322-329,
International Conference on Information Management, Innovation doi: 10.1109/DSC.2019.00056.
Management and Industrial Engineering, 2013, pp. 305-308, doi: [12] H. Jing, "Application of Fuzzy Data Mining Algorithm in Performance
10.1109/ICIII.2013.6703145. Evaluation of Human Resource," 2009 International Forum on
[6] A. Nedelcu, B. Nedelcu, A. I. Sgarciu and V. Sgarciu, "Data Mining Computer Science-Technology and Applications, 2009, pp. 343-346,
Techniques for Employee Evaluation," 2020 12th International doi: 10.1109/IFCSTA.2009.90.
Conference on Electronics, Computers and Artificial Intelligence [13] Z. Huang and D. Jiang, "Research and Implementation of Fuzzy
(ECAI), 2020, pp. 1-6, doi: 10.1109/ECAI50035.2020.9223165. ISODATA Clustering Algorithm Based on Gene Expression
[7] T. Kaewwiset, P. Temdee and T. Yooyativong, "Employee Programming in Human Resource Management," 2011 International
Classification for Personalized Professional Training Using Machine Conference of Information Technology, Computer Engineering and
Learning Techniques and SMOTE," 2021 Joint International Management Sciences, 2011, pp. 178-180, doi:
Conference on Digital Arts, Media and Technology with ECTI 10.1109/ICM.2011.361.
Northern Section Conference on Electrical, Electronics, Computer and [14] M. Syafrudin, G. Alfian, N. L. Fitriyani, A. H. Sidiq, T. Tjahjanto and
Telecommunication Engineering, 2021, pp. 376-379, doi: J. Rhee, "Improving Efficiency of Self-care Classification Using PCA
10.1109/ECTIDAMTNCON51128.2021.9425754. and Decision Tree Algorithm," 2020 International Conference on
[8] T. Juvitayapun, "Employee Turnover Prediction: The impact of Decision Aid Sciences and Application (DASA), 2020, pp. 224-227,
employee event features on interpretable machine learning methods," doi: 10.1109/DASA51403.2020.9317243.
492
Authorized licensed use limited to: b-on: UNIVERSIDADE NOVA DE LISBOA. Downloaded on January 16,2025 at 01:01:37 UTC from IEEE Xplore. Restrictions apply.