European Journal of Operational Research 248 (2016) 593–606 Contents lists available at ScienceDirect European Journal of Operational Research journal homepage: www.elsevier.com/locate/ejor Computational Intelligence and Information Management A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning Marco Caserta a, , Torsten Reiners b a IE University and IE Business School, Maria de Molina, 31-B, 28006, Madrid, Spain b Curtin University, Bentley, WA, 6102, Australia article info Article history: Received 2 October 2013 Accepted 30 May 2015 Available online 6 June 2015 Keywords: Logical Analysis of Data Data mining Fine-tuning bRKGA Machine learning abstract In this paper, we address the binary classification problem, in which one is given a set of observations, char- acterized by a number of (binary and non-binary) attributes and wants to determine which class each ob- servation belongs to. The proposed classification algorithm is based on the Logical Analysis of Data (LAD) technique and belongs to the class of supervised learning algorithms. We introduce a novel metaheuristic- based approach for pattern generation within LAD. The key idea relies on the generation of a pool of patterns for each given observation of the training set. Such a pool is built with one or more criteria in mind (e.g., diversity, homogeneity, coverage, etc.), and is paramount in the achievement of high classification accuracy, as shown by the computational results we obtained. In addition, we address one of the major concerns of many data mining algorithms, i.e., the fine-tuning and calibration of parameters. We employ here a novel technique, called biased Random-Key Genetic Algorithm that allows the calibration of all the parameters of the algorithm in an automatic fashion, hence reducing the fine-tuning effort required and enhancing the performance of the algorithm itself. We tested the proposed approach on 10 benchmark instances from the UCI repository and we proved that the algorithm is competitive, both in terms of classification accuracy and running time. © 2015 Elsevier B.V. and Association of European Operational Research Societies (EURO) within the International Federation of Operational Research Societies (IFORS). All rights reserved. 1. Introduction Let us consider a binary classification problem, in which one is given a dataset composed of observations belonging to one of two classes, e.g., positive or negative, where the class each observation belongs to is known. A typical data mining problem is the classifica- tion problem, i.e., finding the class a new observation, not included in the dataset, belongs to. Binary classification finds a large number of applications, spanning from, e.g., medical diagnosis (Alexe, Alexe, Axelrod, Hammer, & Weissmann, 2005; Hammer & Bonates, 2006), to credit risk rating (Hammer, Kogan, & Lejeune, 2006), from main- tenance replacement (Ghasemi & Esameili, 2013), to fault diagnosis (Mortada, Yacout, & Lakis, 2013). Owing to the fact that classification is such a relevant problem in the data mining field, effective techniques to classify data have been developed. The observations to be classified are characterized by a set of attributes, which are believed to affect the class each observa- tion belongs to. However, the relationship between the value of the Corresponding author. Tel.: +34 915689600. E-mail addresses: marco.caserta@ie.edu (M. Caserta), t.reiners@curtin.edu.au (T. Reiners). attributes and the class is unknown and, therefore, needs to be esti- mated via a training process. The classifier discovers such rules that map an object to a class, based on the values of the attributes. Logical Analysis of Data (LAD) was introduced in Boros, Hammer, Ibaraki, and Kogan (1997), and Boros, Hammer, Ibaraki, Kogan, Mayoraz, and Muchnik (2000) and is a data analysis methodology that combines ideas from combinatorial optimization and Boolean functions and belongs to the family of supervised learning tech- niques. The LAD methodology relies on a “rule learning” mechanism and, therefore, is strongly connected with other popular classifica- tion rules presented in the machine learning literature. According to Fürnkranz (1999), many rule learning algorithms are based on a se- quential covering procedure, in which the steps to follow are: “Learn a rule that covers part of the given training examples, remove the covered examples from the training set, and recursively learn another rule that covers some of the remaining examples until no examples remain.” LAD fits well within this sequential covering framework. For a more extensive discussion, we refer the reader interested in the connection between LAD and other “rule learning” techniques as well as on the “justifiability” of LAD to Boros, Crama, Hammer, Ibaraki, Kogan, and Makino (2011). Along the same line, Dembczynski, Kotlowski, and Slowinski (2010) establish a clear connection be- tween LAD and other methods that fall within the framework of http://dx.doi.org/10.1016/j.ejor.2015.05.078 0377-2217/© 2015 Elsevier B.V. and Association of European Operational Research Societies (EURO) within the International Federation of Operational Research Societies (IFORS). All rights reserved.