Pattern Recognition Letters 138 (2020) 8–15 Contents lists available at ScienceDirect Pattern Recognition Letters journal homepage: www.elsevier.com/locate/patrec A PSO-based algorithm for mining association rules using a guided exploration strategy Gretel Bernal Baró a , José Francisco Martínez-Trinidad b , Rosa María Valdovinos Rosas a, , Jesús A. Carrasco Ochoa b , Ansel Y. Rodríguez González c , Manuel S. Lazo Cortés d a Universidad Autónoma del Estado de México, Faculty of Engineering, Instituto Literario 100 Centro, Toluca de Lerdo 50000, México b Department of Computer Sciences at Institute of Astrophysics Optics and Electronics (INAOE), Luis Enrique Erro 1 Tonantzintla, Puebla 72840, México c Tepic Technology Transfer Unit, Center for Scientific Research and Higher Education at Ensenada (CICESE-UT3), Andador 10, Tepic, Nayarit 63173, México d TecNM/Instituto Tecnológico de Tlalnepantla, Av. Instituto Tecnológico s/n, La Comunidad, Tlalnepantla de Baz 54070, México a r t i c l e i n f o Article history: Received 10 October 2019 Revised 28 April 2020 Accepted 4 May 2020 MSC: 41A05 41A10 65D05 65D17 Keywords: PSO Algorithm Association Rules Metaheuristic algorithm a b s t r a c t Association rule mining is one of the most important and active research areas in data mining. In the lit- erature, several association rule miners have been proposed; among them, those based on particle swarm optimization (PSO) have reported the best results. However, these algorithms tend to prematurely fall into local solutions, avoiding a wide exploration that could produce even better results. In this paper, an algo- rithm based on PSO, called PSO-GES, for mining association rules using a Guided Exploration Strategy is introduced. Our experiments, over real-world transactional databases, show that our proposed algorithm mines better quality association rules than the most recent PSO-based algorithms for mining association rules of the state of the art. © 2020 Published by Elsevier B.V. 1. Introduction Association rule mining (ARM) is one of the most used tech- niques in data mining. The main purpose of association rule min- ing is to discover interesting associations between items in large transactional databases [1]. The ARM problem is defined as fol- lows: let T be the a set of M transactions T = t 1 , t 2 , . . . , t M , and I = {i 1 , i 2 , . . . , i n } be the set of all different items in T. An item- set X is a subset of items, i.e., XI. The support of an itemset X in T, denoted as Sup (X), is the fraction of transactions in T that contain X. An itemset X is frequent if its support is no less than MinSup, where MinSup is a threshold given by the user. An associ- ation rule (AR) is an expression of the form X Y, where X Y is a frequent itemset and X Y = . X is called the antecedent and Y is called the consequent of the rule. Given a transactional database T, the ARM problem consists in finding all the association rules (ARs) that satisfy predefined minimum support and confidence thresh- olds. The support of an AR is the support of the union of its an- Editor: Prof. S. Sarkar Corresponding author. E-mail address: li_rmvr@hotmail.com (R.M. Valdovinos Rosas). tecedent and its consequent, and the confidence of an AR is the support of the rule divided by the support of the antecedent of the rule. Algorithms for generating interesting ARs from frequent item- sets, have been reported in the literature, for example [6,19]. These algorithms mine all possible ARs, by first mining all the frequent itemsets through algorithms based on Apriori and FP-growth, and then the association rules are mined using the algorithm for min- ing ARs proposing in Agrawal et al. [1], therefore, they usually pro- duce a large number of ARs and selecting among them, a subset of good rules is a challenging and expensive task. For this reason, bio-inspired heuristics such as Ant Colony Optimization (ACO) [8], Particle Swarm Optimization (PSO) [2,11,18] and Genetic Algorithms (GA) [13] have been proposed for mining a good subset of ARs. From these works, the best results have been reported by those based on particle swarm optimization (PSO). However, algorithms based on PSO have the problem of prematurely falling into lo- cal solutions (rules) with low quality [4,14,20,21]. In this paper, a PSO-based algorithm named PSO-Guided Exploration Strategy (PSO-GES) for mining ARs is proposed, the main contribution is to include a Guided Exploration Strategy in order to generate high- quality rules. In addition, a new fitness function is introduced for https://doi.org/10.1016/j.patrec.2020.05.006 0167-8655/© 2020 Published by Elsevier B.V.