Exploratory basis pursuit classiﬁcation M. Brown a, * , N.P. Costen b a Control Systems Centre, School of Electrical and Electronics Engineering, University of Manchester, P.O. Box 88, Manchester M60 1QD, UK b Department Computing and Mathematics, Manchester Metropolitan University, Manchester, UK Available online 23 May 2005 Abstract Feature selection is a fundamental process in many classiﬁer design problems. However, it is NP-complete and approximate approaches often require requires extensive exploration and evaluation. This paper describes a novel approach that represents feature selection as a continuous regularization problem which has a single, global minimum, where the modelÕs complexity is measured using a 1-norm on the parameter vector. A new exploratory design process is also described that allows the designer to eﬃciently construct the complete locus of sparse, kernel-based classiﬁers. It allows the designer to investigate the optimal parametersÕ trajectories as the regularization parameter is altered and look for eﬀects, such as SimpsonÕs paradox, that occur in many multivariate data analysis problems. The approach is dem- onstrated on the well-known Australian Credit data set. Ó 2005 Published by Elsevier B.V. Keywords: Feature selection; Sparse classiﬁcation; Regularization 1. Introduction Feature selection is a fundamental process with- in many classiﬁcation algorithms, as many classiﬁ- cation problems are not well understood and a large dictionary of potential, ﬂexible features ex- ists, from which it is necessary to select a relevant subset (Weston et al., 2001). This is typical in the vision and biometrics ﬁelds, where new measure- ment or transformation techniques are explored in order to boost the classiﬁerÕs performance. The aims of feature selection include: • improved, more robust parameter estimation and • improved insight into the decision making process. Feature selection is generally an empirical pro- cess that is performed prior to, or jointly with, the parameter estimation process. Training data is used to assess diﬀerent model combinations 0167-8655/$ - see front matter Ó 2005 Published by Elsevier B.V. doi:10.1016/j.patrec.2005.03.012 * Corresponding author. E-mail address: martin.brown@manchester.ac.uk (M. Brown). Pattern Recognition Letters 26 (2005) 1907–1915 www.elsevier.com/locate/patrec