Benchmarking Functional Link Expansions for Audio Classiﬁcation Tasks Simone Scardapane, Danilo Comminiello, Michele Scarpiniti, Raﬀaele Parisi and Aurelio Uncini Abstract Functional Link Artiﬁcial Neural Networks (FLANNs) have been exten- sively used for tasks of audio and speech classiﬁcation, due to their combination of universal approximation capabilities and fast training. The performance of a FLANN, however, is known to be dependent on the speciﬁc functional link (FL) expansion that is used. In this paper, we provide an extensive benchmark of multiple FL expansions on several audio classiﬁcation problems, including speech discrim- ination, genre classiﬁcation, and artist recognition. Our experimental results show that a random-vector expansion is well suited for classiﬁcation tasks, achieving the best accuracy in two out of three tasks. Keywords Functional links ⋅ Audio classiﬁcation ⋅ Speech recognition 1 Introduction Music information retrieval (MIR) aims at eﬃciently retrieving songs of interest from a large database, based on the user’s requirements [5]. One of the most impor- tant tasks in MIR is automatic music classiﬁcation (AMC), i.e. the capability of automatically assigning one or more labels of interest to a song, depending on its audio characteristics. Examples of labels are the genre, artist, or the perceived mood. S. Scardapane ( ✉ ) ⋅ D. Comminiello ⋅ M. Scarpiniti ⋅ R. Parisi ⋅ A. Uncini Department of Information Engineering, Electronics and Telecommunications (DIET), “Sapienza” University of Rome, Via Eudossiana 18, 00184 Rome, Italy e-mail: simone.scardapane@uniroma1.it D. Comminiello e-mail: danilo.comminiello@uniroma1.it M. Scarpiniti e-mail: michele.scarpiniti@uniroma1.it R. Parisi e-mail: raﬀaele.parisi@uniroma1.it A. Uncini e-mail: aurel@ieee.org © Springer International Publishing Switzerland 2016 S. Bassis et al. (eds.), Advances in Neural Networks, Smart Innovation, Systems and Technologies 54, DOI 10.1007/978-3-319-33747-0_13 133