An Adaptive Iterative PCA-SVM Based
Technique for Dimensionality Reduction
to Support Fast Mining of Leukemia Data
Vikrant Sabnis and Neelu Khare
Abstract Primary Goal of a Data mining technique is to detect and classify the data
from a large data set without compromising the speed of the process. Data mining is
the process of extracting patterns from a large dataset. Therefore the pattern discovery
and mining are often time consuming. In any data pattern, a data is represented by
several columns called the linear low dimensions. But the data identity does not
equally depend upon each of these dimensions. Therefore scanning and processing
the entire dataset for every query not only reduces the efficiency of the algorithm but at
the same time minimizes the speed of processing. This can be solved significantly by
identifying the intrinsic dimensionality of the data and applying the classification on
the dataset corresponding to the intrinsic dataset only. Several algorithms have been
proposed for identifying the intrinsic data dimensions and reducing the same. Once
the dimension of the data is reduced, it affects the classification rate and classification
rate may drop due to reduction in number of data points for decision. In this work
we propose a unique technique for classifying the leukemia data by identifying and
reducing the dimension of the training or knowledge dataset using Iterative process
of Intrinsic dimensionality discovery and reduction using Principal Components
Analysis (PCA) technique. Further the optimized data set is used to classify the
given data using Support Vector Machines (SVM) classification. Results show that
the proposed technique performs much better in terms of obtaining optimized data
set and classification accuracy.
V. Sabnis (B )
Maulana Azad National Institute of Technology, Bhopal, India
e-mail: vikrant_sabnis@rediffmail.com
N. Khare
VIT University Vellore, Vellore, Tamilnadu, India
e-mail: neelu.khare@vit.ac.in
B. V. Babu et al. (eds.), Proceedings of the Second International Conference on Soft Computing 1525
for Problem Solving (SocProS 2012), December 28–30, 2012, Advances in Intelligent Systems
and Computing 236, DOI: 10.1007/978-81-322-1602-5_152, © Springer India 2014