ASYMPTOTICALLY OPTIMAL NONPARAMETRIC CLASSIFICATION RULES FOR SPIKE TRAIN DATA Miroslaw Pawlak, Mateusz Pabian, Dominik Rzepka Department of Measurement and Electronics AGH University of Science and Technology Kraków, Poland ABSTRACT Spike train data ﬁnd a growing list of applications in com- putational neuroscience, streaming data and ﬁnance. Statis- tical analysis of spike trains is based on various probabilistic and neural network models. The statistical approach relies on parametric or nonparametric speciﬁcations of the underlying model. In this paper we consider the nonparametric classi- ﬁcation problem for a class of spike train data characterized by nonparametricaly speciﬁed intensity functions. We derive the optimal Bayes rule and next formulate the plug-in non- parametric kernel classiﬁers. Asymptotical properties of the rules are established including the limit with respect to the increasing observation time interval and the size of a train- ing set. The obtained results are supported by a ﬁnite sample simulation studies. Index Terms— spike train data, nonparametric learning, Bayes risk consistency 1. INTRODUCTION Event driven systems are often encountered in science and engineering. In such systems data are represented by point processes that deﬁne arrival times of events. In computa- tional neuroscience and machine learning this type of data are called spike trains [1], [2]. The mathematical theory of point processes has been extensively studied in the statistical and stochastic processes literature [3], [4]. On the other hand, the research on event type processes from the machine learning perspective has been initiated very recently [2], [5], [6]. Prob- abilistic spiking neural networks have been introduced [2], [5] for supervised and unsupervised learning problems. Various numerical results have been reported supporting their useful- ness without, however, any accuracy studies and fundamental limits. In this paper, we develop the Bayes strategy [7] for the spiking data supervised classiﬁcation problem. This strategy can be applied to research problems where event ocurrence is Authors were supported by the Polish National Center of Science under Grant DEC-2017/27/B/ST7/03082 the primary information carrier [8], [9]. We consider a class of temporal spiking processes that are characterized by non- random intensity functions. The intensity function plays the central role in our theory as it describes the local rate of oc- currence of spikes. For such processes (Section 2) we derive the optimal Bayes rule in terms of class intensity functions and examine the behavior of the corresponding Bayes risk as the observation window increases. In Section 3.1 a wide class of plug-in nonparametric classiﬁcation rules from mul- tiple replications of spiking processes is introduced. Under general conditions on the class intensity functions we estab- lish the Bayes risk consistency result, i.e., the convergence of any rule in this class to the Bayes optimal decision. In Sec- tion 3.2 we design the Bayes risk consistent kernel rule. The consistency and the rate of convergence results are presented. It is worth mentioning that the asymptotic optimality does not hold if one observes the long single realization of the un- derlying spiking process. In fact, in the intensity estimation problem increasing the observation window does not result in new observations at the left side of the window [10]. Hence, for a ﬁxed observation interval one must increase the number of events. This can be achieved by either scaling the inten- sity function or by using the replicates of the spiking process. The former approach is based on the multiplicative intensity model due to Aalen [11], whereas the latter one (used in this paper) is the standard machine learning strategy, where the replicates form the training set. In this case the resulting ker- nel estimate will be obtained by aggregating kernel estimates from the single realizations. Our asymptotic results are sup- ported by simulation studies presented in Section 4. 2. OPTIMAL CLASSIFICATION RULES A temporal spiking process {N (t),t ≥ 0} consists of a sequence of random times {t i } of isolated events in time such that N (0) = 0. We assume that the process is ob- served on the time window [0,T ] and is characterized by the non-random intensity function λ(t). This is a non- negative function that describes the local arriving rate of events such that E[N (T )] =  T 0 λ(u)du is the average num- ber of events in [0,T ]. Hence, the observed on [0,T ] pro- ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | 978-1-7281-6327-7/23/$31.00 ©2023 IEEE | DOI: 10.1109/ICASSP49357.2023.10096060