ACTA ACUSTICA UNITED WITH ACUSTICA Vol. 96 (2010) 678 – 689 DOI 10.3813/AAA.918322 From Clarinet Control to Timbre Perception Mathieu Barthet, Philippe Guillemain, Richard Kronland-Martinet, Sølvi Ystad CNRS Laboratoire de Mécanique et d’Acoustique, 31 chemin Joseph-Aiguier, 13402 Marseille Cedex 20, France. barthet@lma.cnrs-mrs.fr Summary This study investigates the relationships between the control gestures of the clarinet, the generated timbres and their perceptual representation. The understanding of such relationships can provide great interest in several re- search contexts: synthesis and control (e.g., to improve the quality of current synthesis models), music analysis and perception (e.g., to study music performance), and music information retrieval (e.g., to find relevant acous- tical descriptors for automatic instrument and/or performer identification). A physics-based model was used to generate synthetic clarinet tones by varying the main control parameters of the model (related to the blowing pressure and lip pressure on the reed). 16 participants had to rate the dissimilarities between pairs of dierent tones and describe the factors on which they based their judgments in a questionnaire. The collected data were subjected to various statistical analyses (multidimensional scaling and hierarchical clustering) in order to obtain a low-dimensional spatial configuration (timbre space) which best represents the dissimilarity ratings. The structure of the clarinet timbre space was interpreted both in terms of control parameters and acoustical descriptors. The analyses revealed a 3-dimensional timbre space, whose dimensions were well correlated to the Attack Time, the Spectral Centroid, and the Odd/Even Ratio. Comparisons of natural and synthetic clarinet tones showed that the Odd/Even Ratio appears to be a good predictor of the beating reed situation, specific to single-reed instruments. PACS no. 43.66.Jh, 43.75.Pq, 43.75.Yy, 43.75.Zz 1. Introduction Since its conception in the 17 th century, the clarinet, a single-reed instrument from the woodwind family, has aroused the interest of composers as a result of its unique timbre [1]. From the acoustical point of view, the clarinet can be described as the association of an exciter (the reed) and a resonator (the air column contained in the bore of the instrument). Under certain control conditions imposed by the performer, the coupling between the exciter and the resonator, can lead to the establishment of self-sustained oscillations which are responsible for the sounds gener- ated by the instrument. By varying their control gestures, performers can produce dierent timbres (see e.g., [2]). This study aims to increase understanding of the relation- ships between the control of the instrument, the generated sounds, and how they are perceived. The understanding of such relationships can provide great interest in several re- search contexts: synthesis and control (e.g., to improve the quality of current synthesis models), music analysis and perception (e.g., to study music performance), and music information retrieval (e.g., to find relevant acoustical de- scriptors for automatic instrument and/or performer iden- tification). For synthesis purposes, perceptual clues can be used to calibrate the parameters of a synthesis model ac- Received 30 July 2009, accepted 4 March 2010. cording to the behavior of an acoustic instrument, or to propose control devices based on a high-level perceptual description (e.g., “play a bright sound"). For music analy- sis purposes, the study of the links between the control of musical instruments and the perception of generated tones allow to determine the mechanical/acoustical parameters used by performers to alter the timbre of the tones, and find the relevant acoustical descriptors that account for such timbre variations. Such investigations are of importance when studying the role of timbre in musical interpretation (see e.g., [3, 4]) and can give new pedagogical insights. In the music information retrieval context, the results of such studies may be useful to achieve automatic instrument or performer identification based on timbre descriptors. By varying the two main control parameters (related to the blowing pressure and reed aperture) of a physics-based synthesis model of clarinet [5], isolated clarinet tones were generated. Dissimilarity judgments were then collected and subjected to various statistical analyses to uncover the psychomechanical and psychoacoustical factors best pre- dicting the perceptual representation of the various clarinet tones. Timbre is the attribute of the auditory sensation that al- lows tones of equal pitch, loudness and duration to be dis- tinguished [6]. From the cognitive point of view, timbre refers both to the identity of the sound sources (timbre is the attribute that allows two dierent musical instru- ments playing the same note, with identical sound lev- 678 © S. Hirzel Verlag · EAA