Engineering Applications of Artificial Intelligence 94 (2020) 103741
Contents lists available at ScienceDirect
Engineering Applications of Artificial Intelligence
journal homepage: www.elsevier.com/locate/engappai
Extra-adaptive robust online subspace tracker for anomaly detection from
streaming networks
Maryam Amoozegar
a
, Behrouz Minaei-Bidgoli
a,∗
, Mansoor Rezghi
b
, Hadi Fanaee-T
c
a
School of Computer Engineering, Iran University of Science and Technology, Narmak, Tehran, 1684613114, Iran
b
Department of Computer Science, Tarbiat Modares University, Tehran, 14115-175, Iran
c
Center for Applied Intelligent Systems Research (CAISR), Halmstad University, Halmstad, Sweden
ARTICLE INFO
Keywords:
Anomaly detection
Robust online subspace tracker
Dynamic network
CP tensor decomposition
Low rank and sparse analysis
ABSTRACT
Anomaly detection in time-evolving networks has many applications, for instance, traffic analysis in trans-
portation networks and intrusion detection in computer networks. One group of popular methods for anomaly
detection from evolving networks are robust online subspace trackers. However, these methods suffer from
problem of insensitivity to drastic changes in the evolving subspace. In order to solve this problem, we propose
a new robust online subspace and anomaly tracker, which is more adaptive and robust against sudden drastic
changes in the subspace. More accurate estimation of low rank and sparse components by this tracker leads to
more accurate anomaly detection. We evaluate the accuracy of our method with real-world dynamic network
data sets with varying sparsity levels. The result is promising and our method outperforms the state-of-the-art.
1. Introduction
There are many criteria can be used to detect anomalies in a time-
evolving networks, but the most common are finding anomalous nodes
or edges, detecting irregular subgraphs and sometimes searching the
anomalies in the global structure of the network (Akoglu et al., 2015;
Ranshous et al., 2015; Salehi and Rashidi, 2018). Among those, ob-
serving edges with time-varying weights are the most powerful signals
to discover anomalies. Therefore, analyzing the temporal behavior
of the edge’s weight over time is very relevant task, and there are
many applications for it. For example, traffic analysis in transportation
networks (Fanaee and Gama, 2016a; Wang et al., 2018; Xu et al., 2018),
intrusion detection in computer networks (Baddar et al., 2014) and data
volume exchange analysis in social networks (Yu et al., 2016) are some
important applications.
One of the appropriate tools for analysis of time-evolving networks
are tensor decompositions (TD) methods. TDs learn the low-rank struc-
ture of data, which leads to simultaneously modeling the interaction
between the nodes and also the evolution of nodes’ evolution over time
(Ranshous et al., 2015).
TDs can be considered as a sophisticated extension of matrix de-
composition techniques, which normally are based on PCA and SVD
(Ding and Tian, 2016; Idé and Kashima, 2004; Wang et al., 2012; Yu
et al., 2013). Some researchers also extended matrix factorization in
an adaptive way to capture the temporal dynamics. For instance, in
Yu et al. (2017) the authors factorize the adjacency matrices congress
∗
Corresponding author.
E-mail addresses: amoozegar_m@comp.iust.ac.ir (M. Amoozegar), b_minaei@iust.ac.ir (B. Minaei-Bidgoli), rezghi@modares.ac.ir (M. Rezghi),
hadi.fanaee@hh.se (H. Fanaee-T).
in each time instant in a streaming fashion, in order to extract the
low rank and anomalies. However, one of the main shortcomings
of matrix-based decomposition methods is that they are not able to
simultaneously model the intrinsic multi-way interactions of data.
The main benefit of TDs in comparison to matrix factorization
methods is that they can model the spatio-temporal fluctuations and
model data in its natural structure. Consequently, it is assumed that
discovered patterns via TDs are more realistic (Fanaee and Gama,
2016b). Several approaches have been proposed that decompose batch
tensor data into main factors and post-process the results manually or
by statistical metric to detect the anomalies (Fanaee and Gama, 2016a;
Mao et al., 2014; Sapienza et al., 2015). However, these approaches
need the availability of the full data and large memory.
Incremental Tensor Analysis (ITA) (Sun et al., 2008, 2006) was
the first effort to resolve this issue. ITA updates the covariance matrix
incrementally. However, diagonalizing the covariance matrix for each
mode is a critical bottleneck of this approach. Therefore, other methods
were conducted based on incremental SVD to improve its performance
(Shi et al., 2015; Xu et al., 2018). With growing the network size
and real-time processing requirements, these approaches face with
scalability issues.
A more recent and efforts towards this are online subspace tracking
methods that try to track the latent subspace (low-rank) of data in
an incremental way. Updating the low dimensional subspace over
https://doi.org/10.1016/j.engappai.2020.103741
Received 6 December 2019; Received in revised form 7 April 2020; Accepted 1 June 2020
Available online xxxx
0952-1976/© 2020 Elsevier Ltd. All rights reserved.