Baltic J. Modern Computing, Vol. 9 (2021), No. 1, 49-66
https://doi.org/10.22364/bjmc.2021.9.1.04

Employee Attrition Estimation Using Random Forest Algorithm

Madara PRATT, Mohcine BOUDHANE, Sarma CAKULA

Vidzeme University of Applied Sciences
Cēsu iela 4, Valmiera, LV-4201, Latvia

madara.pratt@va.lv, mohcine.boudhane@va.lv, sarma.cakula@va.lv

Abstract. Today, almost all companies are concerned about retaining their employees, yet they are often unable to recognise the real factors that make employees quit their jobs. Many factors could be responsible (for example, cultural or financial ones). Each company has its own way of treating its employees and ensuring their happiness, but often the satisfaction rate is not measured. As a result, in many cases employees quit suddenly and without an apparent reason. In recent decades, machine learning (ML) techniques have gained popularity among researchers, as they can propose solutions to a wide range of problems. ML therefore has the potential to anticipate employee attrition. In this paper, the authors compare state-of-the-art machine learning algorithms using a real data set with a sample size of 1469. The results could be used to warn managers so that they can change their strategies or behaviour, and to recommend policies that help retain employees in the company. This study aims to present a comparison of different machine learning methods for predicting which employees are likely to leave their company. The data set includes information about current employees and employees who have already quit their jobs, with almost 50 valuable information units. These units combine many factors: social, cultural, financial, professional, and relational. Six different ML algorithms were used in this paper.
Experimental results show that the Random Forest algorithm demonstrated the best capability to predict employee attrition. The best prediction accuracy was 85.12%, which is considered good accuracy.

Keywords: Data Prediction and Analysis, Employee Attrition, Random Forest, Machine Learning

1. Introduction

In past decades, technologies have had an undeniable impact and have changed every aspect of our lives. ITU (International Telecommunication Union) statistics show that by the end of 2019, 93% of the world population lived within reach of a mobile Internet service (WEB (a)). Between 2005 and 2010, the number of people using the Internet grew on average by 10% per year (WEB (a)). This growth offers us opportunities as well as challenges. The pandemic in 2020 showed the importance of technologies and proved them to be a critical part of modern life. Technological development and connectedness have created a 24/7 work culture (Piazza, 2007). Communication within companies has become more and more technology-based, which makes it difficult for managers to