Multimed Tools Appl
DOI 10.1007/s11042-013-1391-2
Multimedia classification and event detection
using double fusion
Zhen-zhong Lan · Lei Bao · Shoou-I Yu ·
Wei Liu · Alexander G. Hauptmann
© Springer Science+Business Media New York 2013
Abstract Multimedia Event Detection(MED) is a multimedia retrieval task with the
goal of finding videos of a particular event in video archives, given example videos
and event descriptions; different from MED, multimedia classification is a task that
classifies given videos into specified classes. Both tasks require mining features of
example videos to learn the most discriminative features, with best performance
resulting from a combination of multiple complementary features. How to combine
different features is the focus of this paper. Generally, early fusion and late fusion
are two popular combination strategies. The former one fuses features before per-
forming classification and the latter one combines output of classifiers from different
features. Early fusion can better capture the relationship among features yet is prone
to over-fit the training data. Late fusion deals with the over-fitting problem better
but does not allow classifiers to train on all the data at the same time. In this paper,
we introduce a fusion scheme named double fusion, which simply combines early
fusion and late fusion together to incorporate their advantages. Results are reported
on the TRECVID MED 2010, MED 2011, UCF50 and HMDB51 datasets. For the
MED 2010 dataset, we get a mean minimal normalized detection cost (MMNDC)
of 0.49, which exceeds the state-of-the-art performance by more than 12 percent.
On the TRECVID MED 2011 test dataset, we achieve a MMNDC of 0.51, which
is the second best among all 19 participants. On UCF50 and HMDB51, we obtain
Z.-z. Lan (B ) · L. Bao · S.-I. Yu · W. Liu · A. G. Hauptmann
School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
e-mail: lanzhzh@cs.cmu.edu
L. Bao
e-mail: lei.bao.cn@gmail.com
S.-I. Yu
e-mail: iyu@cs.cmu.edu
W. Liu
e-mail: lwbiosoft@gmail.com
A. G. Hauptmann
e-mail: alex@cs.cmu.edu