The Incremental Fourier Classiﬁer: Leveraging the Discrete Fourier Transform for Classifying High Speed Data Streams Chamari I. Kithulgoda a,∗ , Russel Pears a , M. Asif Naeem a a School of Engineering, Computer and Mathematical Sciences Auckland University of Technology, Auckland, New Zealand Abstract Two major performance bottlenecks with decision tree based classiﬁers in a data stream environment are the depth of the tree and the update overhead of maintaining leaf node statistics on an instance-wise basis to ensure that classiﬁcation is consistent with the current state of the data stream. Previous research has shown that classiﬁers based on Fourier spectra derived from decision trees produce compact array structures that can be searched and maintained much more eﬃciently than deep tree based structures. However, the key issue of incrementally adapting the spectrum to changes has not been addressed. In this research we present a strategy for incremental maintenance of the Fourier spectrum to changes in concept that take place in data stream environments. Along with the incremental approach we also propose schemes for feature selection and synopsis generation that enable the coeﬃcient array to be refreshed eﬃciently on a periodic basis. Our empirical evaluation on a number of widely used stream classiﬁers reveals that the Fourier classiﬁer outperforms them, both in terms of classiﬁcation accuracy as well as speed of classiﬁcation. Keywords: Data Stream, Ensemble Classiﬁer, Discrete Fourier Transform, Concept Drift, Fourier Spectrum, Feature Selection 1. Introduction The need for scaling up the process of mining high speed data streams is now paramount than ever before. However, greater throughput should not come at the price of prediction accuracy. Incremental learning techniques have been used extensively to address the data stream classiﬁcation problem and to maintain a good balance be- tween accuracy and eﬃciency Mena-Torres & Aguilar-Ruiz (2014). In this research we adopt an incremental strategy based on the use of the Discrete Fourier Transform (DFT). We propose a novel approach for leveraging the DFT to scale up throughput while maintaining or improving classiﬁcation accuracy over current state-of-the-art data stream classiﬁers. The Discrete Fourier Transform has long been a key tool in signal processing and has also been applied to data mining Park (2001), Kargupta & Park (2004), Kargupta et al. (2006). It has several attractive properties for cap- turing patterns that sets it apart from conventional mech- anisms such as decision trees and other types of classiﬁers. Firstly, it has been shown rigorously that spectra gen- erated from hierarchical classiﬁers such as decision trees can be represented in compact form thus speeding up the * Corresponding author Email addresses: ckithulg@aut.ac.nz (Chamari I. Kithulgoda), rpears@aut.ac.nz (Russel Pears), mnaeem@aut.ac.nz (M. Asif Naeem) classiﬁcation process Sripirakas & Pears (2014). Secondly, Fourier spectra have the ability to embed several diﬀerent patterns (concepts) into one entity unlike conventional en- semble classiﬁer systems which maintain multiple models. This is due to the fact that Spectra can be represented in array form and hence spectra generated at several diﬀerent points in time can be aggregated into one unifying spec- trum that embeds the properties of its constituent spectra. Thirdly, classiﬁcation can be performed in Fourier space and Fourier spectra, once generated from conventional classiﬁers, can be used independently of them. Fourthly, the distributive nature of the inverse Fourier transform operation oﬀers the possibility of exploiting parallelism in the classiﬁcation process. Such properties have been exploited Sakthithasan et al. (2015), Kithulgoda & Pears (2016) in mining data streams but their use comes at a price. The application of the DFT on multivariate data to produce a spectrum is a non-trivial operation and has time complexity O(|X| 2 ), where |X| is the size of the feature space Kargupta & Park (2004). The size of the feature space |X| grows exponen- tially with the dimensionality of the data. In a highly dynamic data stream environment the time spent on re- peated application of the DFT at each concept detection point can quickly become prohibitive as our experimenta- tion in Section 6.9 shows. A much more eﬀective strategy would be to incremen- tally maintain a spectrum in a fashion analogous to the incremental maintenance of a conventional classiﬁer such Preprint submitted to Expert Systems with Applications December 10, 2017 Manuscript Click here to download Manuscript: Revised Article_11Dec2017_Expert Systems_KithulgodaPearsNaeem.pdf Click here to view linked References