ARTICLE IN PRESS
JID: EOR [m5G;April 13, 2019;11:38]
European Journal of Operational Research xxx (xxxx) xxx
Contents lists available at ScienceDirect
European Journal of Operational Research
journal homepage: www.elsevier.com/locate/ejor
Development of a Bayesian Belief Network-based DSS for predicting
and understanding freshmen student attrition
Dursun Delen
a,∗
, Kazim Topuz
b
, Enes Eryarsoy
c
a
Department of Management Science and Information Systems, Spears School of Business, Oklahoma State University, Stillwater, Oklahoma, USA
b
School of Finance, Operations Management, and International Business, Collins College of Business, The University of Tulsa, Tulsa, Oklahoma, USA
c
Department of Management Information Systems, School of Management and Administrative Sciences, Istanbul Sehir University, Turkey
a r t i c l e i n f o
Article history:
Received 21 February 2018
Accepted 7 March 2019
Available online xxx
Keywords:
Student retention
Prediction
Elastic net
Bayesian Belief Network (BBN)
Imbalance data
a b s t r a c t
Student attrition – the departure from an institution of higher learning prior to the achievement of a
degree or earning due educational credentials – is an administratively important, scientifically interest-
ing and yet practically challenging problem for decision makers and researchers. This study aims to find
the prominent variables and their conditional dependencies/interrelations that affect student attrition
in college settings. Specifically, using a large and feature-rich dataset, proposed methodology success-
fully captures the probabilistic interactions between attrition (the dependent variable) and related fac-
tors (the independent variables) to reveal the underlying, potentially complex/non-linear relationships.
The proposed methodology successfully predicts the individual students’ attrition risk through a Bayesian
Belief Network-driven probabilistic model. The findings suggest that the proposed probabilistic graphi-
cal/network method is capable of predicting student attrition with 84% in AUC – Area Under the Receiver
Operating Characteristics Curve. Using a 2-by-2 investigational design framework, this body of research
also compares the impact and contribution of data balancing and feature selection to the resultant pre-
diction models. The results show that (1) the imbalanced dataset produces similar predictive results in
detecting the at-risk students, and (2) the feature selection, which is the process of identifying and elim-
inating unnecessary/unimportant predictors, results in simpler, more understandable, interpretable, and
actionable results without compromising on the accuracy of the prediction task.
© 2019 Elsevier B.V. All rights reserved.
1. Introduction
Predicting attrition (i.e., early identification of undesirable de-
partures) has always been an intriguing and challenging problem
for researchers and decision makers (i.e., practitioners, administra-
tors or business managers). Attrition analysis—as a tool to better
understand and manage retention—is an important subject in a
variety of domains and based on the specific domain, it may be
named differently. For instance, in marketing, it is often called
"customer churn analysis" referring to timely identification of the
at-risk customers (i.e., the ones about to leave you—your products
and/or services—for your competitors’). The common marketing
philosophy that states “acquiring a new customer is ten times
as costly as retaining current customers” is a good testament to
the importance of predicting and properly managing customer
churn. Attrition is also an important concept in human resource
∗
Corresponding author.
E-mail addresses: dursun.delen@okstate.edu (D. Delen), kat0141@utulsa.edu (K.
Topuz), eneseryarsoy@sehir.edu.tr (E. Eryarsoy).
management, specifically in employee retention, where accurate
prediction and management of at-risk employees would save time
and money to the organization in maintaining a productive and
capable workforce. In this study, the focus is on student attrition,
where better understanding and accurate prediction of attrition
can lead to effective management of retention and graduation of
students in higher education.
Student attrition is the precursory dropout of students in
institutions of higher learning prior to achieving any recognized
degrees or credentials (Johnson, 2012). Decreasing the attrition
rate (i.e., increasing the retentions rate) provides an institu-
tion a better chance for procuring a higher status in college
evaluations/rankings, potentially leading to increasing funding
opportunities, recruiting better students, and having a less com-
plicated path to program/degree accreditations. In addition to
these explanations, financial loss, increased federal and state level
attention make administrators in institutions of higher learning
feel continually increasing levels of pressure to create and execute
strategic initiatives to decrease student attrition (Golde, 2005).
Nowadays, a vast majority of higher education institutions have
"student success centers" and related programs and services
https://doi.org/10.1016/j.ejor.2019.03.037
0377-2217/© 2019 Elsevier B.V. All rights reserved.
Please cite this article as: D. Delen, K. Topuz and E. Eryarsoy, Development of a Bayesian Belief Network-based DSS for predicting and
understanding freshmen student attrition, European Journal of Operational Research, https://doi.org/10.1016/j.ejor.2019.03.037