70 Int. J. Data Science, Vol. 2, No. 1, 2017
Copyright © 2017 Inderscience Enterprises Ltd.
Discrimination-aware data mining: a survey
Asmita Kashid*, Vrushali Kulkarni
and Ruhi Patankar
Department of Computer Engineering,
Maharashtra Institute of Technology,
Savitribai Phule Pune University,
Maharashtra 411038, India
Email: asmitakashid@gmail.com
Email: vrushali.kulkarni@mitpune.edu.in
Email: ruhi.patankar@mitpune.edu.in
*Corresponding author
Abstract: Data mining is a very important and useful technique to extract
knowledge from raw data. However, there is a challenge faced by data mining
researchers, in the form of potential discrimination. Discrimination means
giving unfair treatment to a person just because one belongs to a minority
group, without considering one’s individual merit or qualification. The results
extracted using data mining techniques may lead to discrimination, if a biased
historical/training dataset is used. It is very important to prevent data mining
technique from becoming a source of discrimination. A detailed survey of
discrimination discovery methods and discrimination prevention methods is
presented in this paper. This paper also presents the list of datasets used for
experiments in different discrimination-aware data mining (DADM)
approaches. Some ideas for future research work that may help in preventing
discrimination are also discussed.
Keywords: DADM; discrimination-aware data mining; discrimination
discovery; discrimination prevention.
Reference to this paper should be made as follows: Kashid, A., Kulkarni, V.
and Patankar, R. (2017) ‘Discrimination-aware data mining: a survey’,
Int. J. Data Science, Vol. 2, No. 1, pp.70–84.
Biographical notes: Asmita Kashid is a post-graduate student of the
Department of Computer Engineering, Maharashtra Institute of Technology,
Pune, Maharashtra, India. Her current research area includes pre-processing
techniques for discrimination prevention using privacy protection methods.
Vrushali Kulkarni is the Head of Computer Engineering Department of
Maharashtra Institute of Technology, Pune, Maharashtra, India. She has
published papers in various international journals and conferences. Her areas of
research interest are data mining, machine learning, algorithms and compilers.
Ruhi Patankar is an Assistant Professor at Computer Engineering Department
of Maharashtra Institute of Technology, Pune, Maharashtra, India. Her areas of
research interest are database management, operating systems and data mining.