ARTICLE IN PRESS
JID: ESWA [m5G;March 2, 2016;12:57]
Expert Systems With Applications xxx (2016) xxx–xxx
Contents lists available at ScienceDirect
Expert Systems With Applications
journal homepage: www.elsevier.com/locate/eswa
Gesture phase segmentation using support vector machines
Renata Cristina Barros Madeo, Sarajane Marques Peres
∗
,
Q1
Clodoaldo Aparecido de Moraes Lima
School of Arts, Sciences and Humanities, University of São Paulo, R. Arlindo Béttio, 1000, São Paulo, SP 03828-000, Brazil
a r t i c l e i n f o
Keywords:
Gesture analysis
Gesture phase segmentation
Machine learning
Support vector machine
a b s t r a c t
An interaction between humans or between a human and a machine will be more effective if it is sup-
ported by gestures. In different levels of complexity, the communication system used in human interac-
tion includes the use of gesture. In natural conversation, for instance, speakers use gestures for both to
enhance the expressiveness of their speech and to support their own linguistic reasoning. The audience
absorbs the content being transmitted also based on the speakers’ gesticulation. Thus, an analysis of ges-
tures should add value to the purpose of the interaction. One of the concerns in the analysis of gestures is
the problem arising from the segmentation of phases of a gesture (rest position, preparation, stroke, hold
and retraction), which, from the standpoint of Gesture Theory, may reveal information on prosody and
semantics of what is being said in a discourse. Finding an automation solution to this problem involves
enabling the development of theoretical and application areas that are based on the analysis of human
behavior and on the interpretation and generation of natural language. In this study, the problem of ges-
ture phase segmentation is modeled as a problem of classification, and then support vector machine is
employed to design a model able to learn the patterns of gesture that are inherent to each phase. This
work presents two main highlights. The first is to address the limitations of the segmentation approach
through the study of its performance in different scenarios that represent the complexity of analyzing
patterns of human behavior. In this study, we reached an F-score around 0.9 for rest position and around
0.8 for stroke and preparation as segmentation results in the best cases. Moreover, it was possible to in-
vestigate how classification models are influenced by human behavior. The second highlight refers to the
conduction of an analysis by considering the standpoint of specialists concerned with gesture phase seg-
mentation in the area of Linguistics and Psycholinguistics, through which we obtained impressive results.
Thus, in regard to the suitability of our approach, it is a feasible means of supporting the development
of the Gesture Theory as well as the Computational Linguistics and Human Machine Interaction fields.
© 2016 Elsevier Ltd. All rights reserved.
1. Introduction 1
In the last few years, there has been an increase of studies on 2
gesture analysis in Computer Science (Chen, Jafari, & Kehtarbavaz, 3
2015; Han, Shao, Xu, & Shotton, 2013; Madeo, Wagner, & Peres, 4
2013b; Mitra & Acharya, 2007). Currently, both the academic world 5
and industry are interested in gesture analysis, mainly with the 6
aim of developing new methods of Human–Computer (or Machine) 7
Interaction. These methods have been developed considering dif- 8
ferent aspects, such as: different ways of interacting (Turk, 2014); 9
execution of gestures with body parts that are little explored in 10
gesture analysis (Tran, Doshi, & Trivedi, 2012); multiple users inter- 11
∗
Corresponding author. Tel.: +5511987110994.
E-mail addresses: renata.si@usp.br (R.C.B. Madeo), sarajane@usp.br,
smperesbr@hotmail.com (S.M. Peres), c.lima@usp.br (C.A.d.M. Lima).
acting at the same time with one or more devices (Baraldi, Bimbo, 12
& Landucci, 2008); and interaction with avatars (Kopp, Sowa, & 13
Wachsmuth, 2004) or robots (Salem, S. Kopp, Wachsmuth, Rohlf- 14
ing, & Joublin, 2012). 15
The increasing interest in gesture analysis is partly due to the 16
development of low-cost sensors, for which there are Software 17
Development Kits that provide the means to create applications 18
with different levels of complexity (Guna, Jakus, Poganik, Tomai, & 19
Sodnik, 2014; Lun & Zhao, 2015; Tashev, 2013). These sensors are 20
able to acquire information that can describe human movements 21
at different levels of detail. For example, they have been used to 22
build large-scale benchmark datasets which allow more extensive 23
research to be carried out in the fields of gesture segmentation 24
and gesture recognition (Escalera et al., 2013). Although these sen- 25
sors have become increasingly popular, video cameras (RGB and 26
RGB-D) and webcams are still the main sources of data for hu- 27
man movements. The methods regarding the use of different types 28
http://dx.doi.org/10.1016/j.eswa.2016.02.021
0957-4174/© 2016 Elsevier Ltd. All rights reserved.
Please cite this article as: R.C.B. Madeo et al., Gesture phase segmentation using support vector machines, Expert Systems With Appli-
cations (2016), http://dx.doi.org/10.1016/j.eswa.2016.02.021