Ensemble Strategies for Classifying Hyperspectral Remote Sensing Data Xavier Ceamanos 1 , Bj¨ orn Waske 2 , J´ on Atli Benediktsson 2 , Jocelyn Chanussot 1 , and Johannes R. Sveinsson 2 1 GIPSA-LAB, Signal & Image Department, Grenoble Institute of Technology, INPG BP 46 - 38402 Saint Martin d’H` eres, France 2 University of Iceland, Faculty of Electrical and Computer Engineering, Hajararhagi 2-6, 107 Reykjavik, Iceland Abstract. The classiﬁcation of hyperspectral imagery, using multiple classiﬁer systems is discussed and an SVM-based ensemble is introduced. The data set is separated into separate feature subsets using the correla- tion between the diﬀerent spectral bands as a criterion. Afterwards, each source is classiﬁed separately by an SVM classiﬁer. Finally, the diﬀerent outputs are used as inputs for ﬁnal decision fusion that is based on an additional SVM classiﬁer. The results using the proposed strategy are compared to classiﬁcation results achieved by a single SVM and other well known classiﬁer ensembles, such as random forests, boosting and bagging. Keywords: hyperspectral, land cover classiﬁcation, support vector ma- chines, multiple classiﬁer systems, classiﬁer ensmeble. 1 Introduction Hyperspectral data provide detailed spectral information from land cover, rang- ing from the visible to the short-wave infrared region of the electromagnetic spectrum. Nevertheless the classiﬁcation of hyperspectral imaging is challeng- ing, due to the high-dimension of the data sets. Particularly with a limited number of training samples the classiﬁcation accuracy (of conventional statis- tical classiﬁers) can be limited. Hughes [1] showed that with a limited number of training samples the classiﬁcation accuracy decreases after a maximum is achieved. Thus, it requires sophisticated classiﬁcation algorithms to use detailed hyperspectral information comprehensively. In several remote sensing studies it was demonstrated that Support Vector Machines (SVM) perform better than or at least comparable to other classiﬁers in terms of accuracy, even when applied to hyperspectral data sets [2],[3]. One reason for this success might be the un- derlying concept of SVM classiﬁers. Their aim is to discriminate two classes by constructing an optimal separating hyperplane to the training samples within a multi-dimensional feature space, by using only the closest training samples of each class [4]. Consequently, the approach only considers training data close to the class boundary and performs well with small training sets. J.A. Benediktsson, J. Kittler, and F. Roli (Eds.): MCS 2009, LNCS 5519, pp. 62–71, 2009. c  Springer-Verlag Berlin Heidelberg 2009