QUANTITATIVE ANALYSIS OF THE IMPACT OF
MIXING ON PERCEIVED EMOTION OF SOUND-
SCAPE RECORDINGS
Jianyu Fan Miles Thorogood Kıvanç Tatar Philippe Pasquier
Simon Fraser University
jianyuf@sfu.ca
Simon Fraser University
mthorogo@sfu.ca
Simon Fraser University
ktatar@sfu.ca
Simon Fraser University
pasquier@sfu.ca
ABSTRACT
Sound designers routinely mix source soundscape record-
ings. Previous studies have shown that people agree with
each other on the perceived valence and arousal for
soundscape recordings. This study investigates whether
we can compute the perceived emotion of the mixed-
soundscape recordings based on the perceived emotion of
source soundscape recordings. We discovered quantifia-
ble trends in the effect of mixing on the perceived emo-
tion of soundscape recordings. Regression analysis based
on the trajectory observation resulted in coefficients with
high R
2
values. We found that the change of loudness of a
source soundscape recording had an influence on its
weight on the perceived emotion of mixed-soundscape
recordings. Our visual analysis of the center of mass data
plots found the specific patterns of the perceived emotion
of the source soundscape recordings that belong to differ-
ent soundscape categories and the perceived emotion of
the mix. We also found that when the difference in va-
lence/arousal between two source soundscape recordings
is larger than a given threshold, it is highly likely that the
valence/arousal of the mix is in between the va-
lence/arousal of two source soundscape recordings.
1. INTRODUCTION
Audio-based creative practices, such as sound design and
soundscape composition, often use recordings to create
musical works. A soundscape recording (or field record-
ing) is “a recording of sounds at a given locale at a given
time, obtained with one or more fixed or moving micro-
phones” [1]. Often, sound designers select source sound-
scape recordings and carefully mix them together, which
has a profound influence on meaning, significance, and
perceived emotion. Together, the mixed-soundscape re-
cordings create a rich, cohesive experience.
Previous studies demonstrate that people have a high
level of agreement on the perceived emotion of source
soundscapes recording [2]. It is also possible to build
machine-learning models to predict the perceived emo-
tion of soundscape recordings [3]. However, to our
knowledge, no study has been presented regarding the
effect of mixing on the perceived emotion of soundscape
recordings.
In this study, we focus on the effect of mixing on the
perceived emotion of soundscape recordings. We used
Emo-Soundscapes, a dataset for soundscape emotion
recognition that contains a group of annotated source
soundscape recordings and annotated mixed-soundscape
recordings [4]. The source soundscape recordings are
selected following Schafer’s taxonomy so as to cover the
diversity of soundscapes as much as possible [5]. The
perceived emotion is represented as the ranking of a two-
dimensional vector of valence and arousal [6]. As identi-
fied by Thorogood and Pasquier [3], valence represents
the pleasantness of a stimulus, which is used to report the
perceived pleasantness of a soundscape recording.
Arousal indicates the level of eventfulness.
Next, we convert the annotators’ rankings to ratings
and used regression models to determine the effect of
mixing on the perceived emotion of soundscape record-
ings. Moreover, we analyzed the center of mass data plots
to find the relationships between the perceived emotion
of the mixed-soundscape recordings and perceived emo-
tion of source soundscape recordings that are selected
within Schafer’s category. Last, we analyzed the likeli-
hood of the perceived emotion of mixed-soundscape re-
cordings lying between the perceived emotions of the two
source soundscape recordings that are used for the mix.
2. RELATED WORKS
2.1 Taxonomy of Emotion and Affect Models
Emotional responses are subjective with people having
possibly a different response to the same stimulus. Ac-
cording to previous studies [7], two types of emotions are
involved when listening to soundscapes:
• Perceived emotion: emotions that are communi-
cated and expressed by the source.
• Induced emotion: emotional reactions that the
source provokes in an audience; it is what the audi-
ence feels from the source.
The perceived emotion is the emotion a source expresses.
For example, the perceived emotion of happy songs is
always “happy”. However, the induced emotion is more
subjective. The same happy music may not necessarily
induce happiness because of the internal interpretations
and experiences of a listener. In this study, we focus on
the perceived emotion of soundscapes.
Copyright: © 2018 Fan et al. This is an open-access article distributed
under the terms of the Creative Commons Attribution License 3.0 Unport-
ed, which permits unrestricted use, distribution, and reproduction in any
medium, provided the original author and source are credited.