Multicategory Incremental Proximal Support Vector Classifiers

Amund Tveit and Magnus Lie Hetland

Department of Computer and Information Science,
Norwegian University of Science and Technology, N-7491 Trondheim, Norway
{amundt,mlh}@idi.ntnu.no

Abstract. Support Vector Machines (SVMs) are an efficient data mining approach for classification, clustering and time series analysis. In recent years, a tremendous growth in the amount of data gathered has changed the focus of SVM classifier algorithms from providing accurate results to enabling incremental (and decremental) learning with new data (or unlearning old data) without the need for computationally costly retraining with the old data. In this paper we propose an efficient algorithm for multicategory classification with the incremental proximal SVM introduced by Fung and Mangasarian.

1 Introduction

Support Vector Machines (SVMs) are an efficient data mining approach for classification, clustering and time series analysis [1-3]. In recent years, a tremendous growth in the amount of data gathered (for example, in e-commerce and intrusion detection systems) has changed the focus of SVM classifier algorithms from providing accurate results to enabling incremental (and decremental) learning with new data (or unlearning old data) without the need for computationally costly retraining with the old data. Fung and Mangasarian [4] introduced the Incremental and Decremental Linear Proximal Support Vector Machine (PSVM) for binary classification and showed that it could be trained extremely efficiently, with one billion examples (500 increments of two million examples) in two hours and twenty-six minutes on relatively low-end hardware (a 400 MHz Pentium II). In this paper we propose an efficient algorithm based on memoization, in order to support multicategory classification for the incremental PSVM.
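The incremental linear PSVM mentioned above owes its efficiency to the fact that training reduces to accumulating two fixed-size sufficient statistics (the matrix E^T E and the vector E^T D e, with E = [A, -e]) over data increments and then solving a single small linear system. The following is a minimal NumPy sketch under that assumption; the class and method names (`IncrementalPSVM`, `partial_fit`) are ours for illustration, not from the paper.

```python
import numpy as np

class IncrementalPSVM:
    """Sketch of a binary incremental linear Proximal SVM.

    Accumulates E^T E and E^T D e across increments, where
    E = [A, -e], D = diag(labels), e = vector of ones.
    """

    def __init__(self, n_features, nu=1.0):
        self.nu = nu  # error-weight parameter (nu in the PSVM formulation)
        self.ete = np.zeros((n_features + 1, n_features + 1))  # accumulated E^T E
        self.etde = np.zeros(n_features + 1)                   # accumulated E^T D e

    def partial_fit(self, A, d):
        """Absorb one increment: A is (m, n) data, d is (m,) labels in {-1, +1}."""
        E = np.hstack([A, -np.ones((A.shape[0], 1))])  # E = [A, -e]
        self.ete += E.T @ E
        self.etde += E.T @ d  # D e = d, since D = diag(d) and e = ones
        return self

    def solve(self):
        """Recover (w, gamma) from the accumulated statistics."""
        n1 = self.ete.shape[0]
        wg = np.linalg.solve(np.eye(n1) / self.nu + self.ete, self.etde)
        self.w, self.gamma = wg[:-1], wg[-1]
        return self

    def predict(self, A):
        """Classify rows of A by the sign of A w - gamma."""
        return np.sign(A @ self.w - self.gamma)
```

Because each increment only updates matrices of size (n+1) x (n+1), old data can be discarded after it is absorbed, which is what makes incremental (and, by subtraction, decremental) learning cheap.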
2 Background Theory

The standard binary SVM classification problem with a soft margin (allowing some errors) is shown visually in Fig. 1(a) and as a constrained quadratic programming problem in (1). Intuitively, the problem is to maximize the margin between the solid planes while permitting as few errors as possible, an error being a positive-class point on the negative side (of the solid line) or vice versa.
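For reference, the soft-margin linear SVM described here is commonly written as follows, with A the m x n matrix of training points, D the diagonal matrix of +/-1 labels, e a vector of ones, y the slack (error) variables, and nu > 0 the error weight; this is the standard formulation, and the exact symbols of the paper's equation (1) may differ:

```latex
\min_{(w,\,\gamma,\,y)} \;\; \nu\, e^{\top} y + \tfrac{1}{2}\, w^{\top} w
\qquad \text{s.t.} \qquad
D(Aw - e\gamma) + y \ge e, \qquad y \ge 0 .
```

The objective trades off the margin width (through $w^{\top}w$) against the total classification error $e^{\top}y$, with $\nu$ controlling the balance.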