Towards Designing Enthusiastic AI Agents Carla Viegas Stella AI Carnegie Mellon University Pittsburgh, Pennsylvania, USA cviegas@cs.cmu.edu Malihe Alikhani University of Pittsburgh Pittsburgh, Pennsylvania, USA malihe@pitt.edu ABSTRACT Immersive virtual worlds are increasingly being used for education, training, and entertainment, and virtual humans that can inter- act with human users in these worlds play many important roles. Understating the emotional constructs of the user and generating multimodal forms of communications that are aligned with the user’s needs and input is key to designing AI agents. Most virtual agents and communicative systems lack the ability to understand enthusiasm or generate multimodal enthusiastic communicative presentations. In this work, we argue for the importance of in- cluding enthusiasm in the design of humanśAI collaboration and communication and review the existing datasets and models that can be used to bridge the gap in this area. CCS CONCEPTS · Human-centered computing Human computer interac- tion (HCI). KEYWORDS datasets, enthusiasm, virtual agents, engagement, conversational expressions ACM Reference Format: Carla Viegas and Malihe Alikhani. 2021. Towards Designing Enthusiastic AI Agents. In 21th ACM International Conference on Intelligent Virtual Agents (IVA ’21), September 14ś17, 2021, Virtual Event, Japan. ACM, New York, NY, USA, 3 pages. https://doi.org/10.1145/3472306.3478366 1 OVERVIEW We have recently observed a growing interest in the development of embodied agents that can understand user’s emotional constructs. The fuent exchange of information and the display of cognitive and emotional states is essential for establishing and maintaining en- gagement. Studies have suggested that the six basic emotions [14] cannot best represent the emotional constructs that AI agents need to work with. Conversational expressions are more fne-grained and diverse. Examples of such expressions include clueless, annoyed, and interested [6, 7, 9]. In this work, we want to call attention to enthusiasm as a conversational expression. Enthusiasm is one of the most desired traits in employees, co-workers, mentors, leaders, and teachers [2, 5, 10, 27, 31, 39]. Enthusiastic people are not only Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for proft or commercial advantage and that copies bear this notice and the full citation on the frst page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s). IVA ’21, September 14ś17, 2021, Virtual Event, Japan © 2021 Copyright held by the owner/author(s). ACM ISBN 978-1-4503-8619-7/21/09. https://doi.org/10.1145/3472306.3478366 motivated and excited about a topic, they can also spark this excite- ment in their listeners and even move their audience to action [12]. Given that the ability to speak enthusiastically provides a clear advantage in human interactions when the goal is to engage the interlocutor emotionally, we shall also aim to transfer it to virtual agents. Although enthusiasm has been studied extensively in psychol- ogy, showing that students clearly beneft from enthusiastic teach- ers [5, 44] as well as companies do with enthusiastic leaders [23, 34], it is still unclear what exactly makes a person to be perceived as enthusiastic [19]. Most of the work on detecting enthusiasm au- tomatically is based on using written human-to-human conver- sational dialogues [17, 38]. Limited work on enthusiastic virtual agents and robots exist. Liew et al. showed that virtual agents with enthusiastic voices improve learning outcomes in students [24ś26], however, they used prerecorded voices from actors. Despite the limited work on detecting and generating enthusiastic behavior, the recent release of the frst multimodal dataset on enthusiasm, called Entheos [40], is not only a chance to gain more understanding on enthusiasm, but also to create enthusiastic virtual agents. In the following we will describe the opportunities for enthusias- tic virtual agents as well as the challenges that need to be addressed in order to have an engaging conversation between humans and virtual agents. We will also present some preliminary analysis on the Entheos dataset 1 and discuss how explainable AI techniques such as SHAP [28] can be used to understand which features are important for enthusiasm classifcation. 2 OPPORTUNITIES FOR ENTHUSIASTIC VIRTUAL AGENTS Although several applications of virtual agents could beneft from understanding and generating enthusiastic behavior, we will focus on three specifc use cases: a) teaching, b) coaching, and c) sales. All three applications are not purely conversational, but focus on delivering a message in an engaging way, which we believe is a feasible frst step to improving existing virtual agents. Teaching: Enthusiasm has shown to improve students’ perfor- mance not only with human teachers [5, 19, 20, 44] but also with vir- tual teachers using prerecorded enthusiastic voices [24, 26]. Given the increasing importance of pedagogical agents and robot teachers to overcome the global shortage of teachers [29], it is essential to automatically generate enthusiastic behavior during teaching to ensure students’ performance and their engagement. Coaching: Several virtual coaches have been designed to pro- mote healthier behaviors targeting for ex. elderly people [11] or patients recovering from spinal cord injuries [36]. Virtual coaches 1 https://github.com/clviegas/Entheos-Dataset