International Journal of Innovative Technology and Exploring Engineering (IJITEE) ISSN: 2278-3075, Volume-8, Issue-6S3, April 2019 22 Published By: Blue Eyes Intelligence Engineering & Sciences Publication Retrieval Number: F10050486S319/19©BEIESP AbstractApplying data mining and machine learning techniques on Moodle logs is an emerging trend that can help track student’s performance and decrease the failure rate. Due to Moodle’s limitation to provide these features, this study was conceptualized. The study made use of historical data from Moodle logs of past academic years to pre-process and develop machine learning models using an open source data mining tool named Weka. This study made use of predictor attributes related to study behavior of students such as Course Viewing Time, Resource Views, Quiz Taken, Replied in Discussions, and Viewed at Weekends. However, it was found out that predictor attributes such as Activities Completed, Course Views and Assignment Passed are the ones which are strongly correlated to students’ performance. Moreover, the predictive accuracy of a model improves depending on the machine learning algorithm being used. Algorithms such as J48, Random Forest, JRip, and OneR have been consistently performing well regardless of the model it is being trained into; and, achieved a predictive accuracy as high as 96.42%. The study was able to reflect the predicted results of Weka back to Moodle through an integrator and developed block using Moodle API. Finally, the developed application was evaluated by IT Experts using the ISO 25010 criteria. Index Termsdata mining, machine learning, predictive analytics, predict students’ performance, Moodle logs I. INTRODUCTION The growth of information available online as well as the stored data in huge organizations led to a quest of discovering hidden information that is useful for decision making. Imagine shopping malls and supermarkets using all the swiped data of their customers to identify frequently bought items and turn this information to targeted marketing and loyalty programs. An emerging field in education called Educational Data Mining (EDM) is designed for automatically extracting meaning from large repositories of data generated by or related to people's learning activities in educational settings [1]. For example, several learning management systems (LMSs) track information such as when each student accessed each learning object, how many times they accessed it, and how many minutes the learning object was displayed on the user's computer screen. LMS accumulate a vast amount of information which is very valuable for analyzing students’ behavior and could create a gold mine of educational data [2]. By applying data mining techniques on student data in LMS, we can obtain knowledge that can help to improve quality of education, student’s performance and decreased failure rate. It is also Revised Manuscript Received on April 12, 2019. Edmund D. Evangelista, DIT Student, St. Paul University of the Philippines, Cagayan 3500, Philippines. (E-mail: edmundevangelista@ymail.com) helpful in early identification of dropouts and students who need special attention and allows the teacher to provide appropriate advising/counseling. LMS produce information of high educational value, but usually so abundant that it is impossible to analyze it manually [3]. Tools to automatically analyze this kind of data are needed. Unfortunately, these platforms do not provide specific tools to allow educators to thoroughly track and assess all learners’ activities while evaluating the structure and contents of the course and its effectiveness in the learning process [4]. Moodle, even in its latest learning analytics can only detect students at risk of dropping but not failing. With these limitations of predictive analytics in the existing framework, this study was conceptualized. This study aimed to develop machine learning models using study behavior predictors of students’ academic performance through Moodle logs. Specifically, historical data from Moodle logs were harvested and analyzed using Weka (Waikato Environment for Knowledge Analysis) data mining tool to determine the study behavior predictors that are correlated to students’ academic performance. Weka is an open source suite of machine learning software written in Java that contains a collection of visualization tools and algorithms for data analysis and predictive modeling. Datasets were harvested based on initial predictors such as Course Views, Course Viewing Time, Resource Views, Quiz Taken, Assignment Submitted, Discussion Views, Replied in Discussions, Viewed in Mobile, Viewed at Night, Viewed at Weekends, and Final Grade. Best features were identified among the initial predictors through the feature selection techniques of Weka and repeatedly trained it using algorithms such as J48, Random Forest, JRip, OneR, Logistic Regression, Multilayer Perceptron, Bagging, and K-Nearest Neighbor. The algorithm which acquired the highest predictive accuracy having false positive and false negative predictions will be used to develop the models. II. CONCEPTUAL FRAMEWORK The conceptual framework of this study was adopted on the concepts of Cross Industry Standard Process for Data Mining (CRISP-DM) and Predictive Analytics Process Model [5]. CRISP-DM, is an open standard process model that describes common approaches used by data mining experts making it the most widely-used analytics model in data mining [6]. It was conceived in 1996 and became a Development of Machine Learning Models using Study Behavior Predictors of Students’ Academic Performance Through Moodle Logs Edmund D. Evangelista