Information Sciences 396 (2017) 202–217
Contents lists available at ScienceDirect
Information Sciences
journal homepage: www.elsevier.com/locate/ins
A methodology for analysis of concept lattice reduction
Sérgio M. Dias
a,b,∗
, Newton J. Vieira
a
a
Department of Computer Science, Federal University of Minas Gerais (UFMG), Av. Antônio Carlos 6627 - Pampulha Prédio do ICEx -
Anexo U - sala 5309 - Belo Horizonte - Minas Gerais - Brazil - 31270-010
b
Federal Service of Data Processing (SERPRO), Av. José Cândido da Silveira, 1.200 - Cidade Nova, Belo Horizonte 31.035-536, Minas
Gerais, Brazil
a r t i c l e i n f o
Article history:
Received 30 November 2015
Revised 13 December 2016
Accepted 15 February 2017
Available online 20 February 2017
Keywords:
Formal concept analysis
Lattice reduction
Proper implications
a b s t r a c t
Formal concept analysis (FCA) is a mathematical theory of data analysis with applications
in many areas. The problem of obtaining a concept lattice of an appropriate size was iden-
tified in several applications as one of the most important problems of FCA. In order to
deal with this problem several techniques with different characteristics were proposed
for concept lattice reduction. However, there are currently no adequate methods to as-
sess what types of knowledge transformations can result from a reduction. A methodology
for analysis of concept lattice reduction is presented here. It is based on the use of sets
of proper implications holding in the original and reduced formal contexts or concept lat-
tices. Working with both sets of implications, the methodology is able to show what is
preserved, eliminated, inserted or transformed by a reduction technique. Three classes of
reduction techniques are analyzed from the standpoint of the methodology in order to
highlight techniques of each class have in common with respect to the transformations
performed. Such analysis is followed by specific examples in each class.
© 2017 Elsevier Inc. All rights reserved.
1. Introduction
Formal concept analysi s (FCA) is currently considered an important formalism for knowledge extraction, representation
and analysis. FCA has been widely studied and applied in many diverse scientific fields [24,31,36,42,45,62]. However, even a
small set of data can result in a very large number of formal concepts [4]. In fact, FCA induces a potentially high combinato-
rial complexity and the structures obtained, even from a small dataset, may become prohibitively large [27]. Despite the fact
that the worst case of the lattice size (2
min(|G|, |M|)
) is rarely found in practice, the computational cost is still too prohibitive
for many applications. Furthermore, the resulting number of formal concepts and the topology of the relationships between
concepts can make difficult an analysis of the final lattice [46]. In particular, key aspects, those that are indeed important,
may be immersed in a maze of irrelevant details. Regardless of the number of formal concepts generated in the worst case,
all relationships between concepts are present in the concept lattice. This feature is suitable in terms of completeness, but
generally results in a large number of relationships overloading the lattice’s structure.
The problem of obtaining a concept lattice of appropriate size and structure, which exposes the really relevant aspects,
is one of the most important problems when using FCA. As a consequence, there are many techniques, with different
∗
Corresponding author at: Department of Computer Science, Federal University of Minas Gerais (UFMG), Av. Antônio Carlos 6627 - Pampulha Prédio do
ICEx - Anexo U - sala 5309 - Belo Horizonte - Minas Gerais - Brazil - 31270-010.
E-mail addresses: mariano@dcc.ufmg.br, sergio.dias@serpro.gov.br (S.M. Dias), nvieira@dcc.ufmg.br (N.J. Vieira).
http://dx.doi.org/10.1016/j.ins.2017.02.037
0020-0255/© 2017 Elsevier Inc. All rights reserved.