Benchmarking Functional Link Expansions
for Audio Classification Tasks
Simone Scardapane, Danilo Comminiello, Michele Scarpiniti,
Raffaele Parisi and Aurelio Uncini
Abstract Functional Link Artificial Neural Networks (FLANNs) have been exten-
sively used for tasks of audio and speech classification, due to their combination
of universal approximation capabilities and fast training. The performance of a
FLANN, however, is known to be dependent on the specific functional link (FL)
expansion that is used. In this paper, we provide an extensive benchmark of multiple
FL expansions on several audio classification problems, including speech discrim-
ination, genre classification, and artist recognition. Our experimental results show
that a random-vector expansion is well suited for classification tasks, achieving the
best accuracy in two out of three tasks.
Keywords Functional links ⋅ Audio classification ⋅ Speech recognition
1 Introduction
Music information retrieval (MIR) aims at efficiently retrieving songs of interest
from a large database, based on the user’s requirements [5]. One of the most impor-
tant tasks in MIR is automatic music classification (AMC), i.e. the capability of
automatically assigning one or more labels of interest to a song, depending on its
audio characteristics. Examples of labels are the genre, artist, or the perceived mood.
S. Scardapane (
✉
) ⋅ D. Comminiello ⋅ M. Scarpiniti ⋅ R. Parisi ⋅ A. Uncini
Department of Information Engineering, Electronics and Telecommunications (DIET),
“Sapienza” University of Rome, Via Eudossiana 18, 00184 Rome, Italy
e-mail: simone.scardapane@uniroma1.it
D. Comminiello
e-mail: danilo.comminiello@uniroma1.it
M. Scarpiniti
e-mail: michele.scarpiniti@uniroma1.it
R. Parisi
e-mail: raffaele.parisi@uniroma1.it
A. Uncini
e-mail: aurel@ieee.org
© Springer International Publishing Switzerland 2016
S. Bassis et al. (eds.), Advances in Neural Networks, Smart Innovation,
Systems and Technologies 54, DOI 10.1007/978-3-319-33747-0_13
133