International Journal of Foundations of Computer Science c World Scientific Publishing Company Succinct Minimal Generators: Theoretical Foundations and Applications Tarek Hamrouni Department of Computer Science, Faculty of Sciences of Tunis, Tunis, Tunisia. CRIL-CNRS, IUT de Lens, Lens, France. hamrouni@cril.univ-artois.fr, tarek.hamrouni@fst.rnu.tn and Sadok Ben Yahia Department of Computer Science, Faculty of Sciences of Tunis, Tunis, Tunisia. sadok.benyahia@fst.rnu.tn and Engelbert Mephu Nguifo CRIL-CNRS, IUT de Lens, Lens, France. mephu@cril.univ-artois.fr Received (received date) Revised (revised date) Communicated by Editor’s name ABSTRACT In data mining applications, highly sized contexts are handled what usually results in a consid- erably large set of frequent itemsets, even for high values of the minimum support threshold. An interesting solution consists then in applying an appropriate closure operator that structures frequent itemsets into equivalence classes, such that two itemsets belong to the same class if they appear in the same sets of objects. Among equivalent itemsets, minimal elements (w.r.t. the number of items) are called minimal generators (MGs), while their associated closure is called closed itemset (CI), and is the largest one within the corresponding equivalence class. Thus, the pairs - composed by MGs and their associated CIs - make easier localizing each itemset since it is necessarily encompassed by an MG and an CI. In addition, they offer informative implication/association rules, with minimal premises and maximal conclusions, which losslessly represent the entire rule set. These important concepts - MG and CI - were hence at the origin of various works. Nevertheless, the inherent absence of a unique MG associated to a given CI leads to an intra-class combinatorial redundancy that leads an exhaustive storage and impractical use. This motivated an in-depth study towards a lossless reduc- tion of this redundancy. This study was started by Dong et al. who introduced the succinct system of minimal generators (SSMG) as an attempt to eliminate the redundancy within this set. In this paper, we give a thorough study of the SSMG as formerly defined by Dong et al. This system will be shown to suffer from some flaws. As a remedy, we introduce a new lossless reduction of the MG set allowing to overcome its limitations. The new SSMG will then be incorporated into the framework of generic bases of association rules. This makes it possible to only maintain succinct and informative rules. After that, we give a thorough formal study of the related inference mechanisms allowing to derive all redundant association rules, starting from the maintained ones. Finally, an experimental evaluation shows the utility of our approach towards eliminating important rate of redundant information. 1