Exploring Anomalies in GAStech
VAST 2014 Mini Challenge 1 and 2
Jaegul Choo
*
, Yi Han
*
, Mengdie Hu
*
, Hannah Kim
*
, James Nugent
*
, Francesco Poggi
†
Haesun Park
*
John Stasko
*
*
Georgia Institute of Technology,
†
University of Bologna
ABSTRACT
We present our process and analysis for VAST 2014 Mini Chal-
lenge 1 and 2, which integrate an off-the-shelf tool, Jigsaw, rapid
web-based visualization prototyping using D3, and analytics-based
visualizations using Matlab.
Index Terms: H.5.2 [INFORMATION INTERFACES AND PRE-
SENTATION]: User Interfaces—Theory and methods
1 MINI CHALLENGE 1
The goals of MC1
1
were to understand the structure of the Protec-
tors of Kronos (POK) organization, the connections of POK to the
GAStech company, the series of events that occurred around the
time of the challenge’s focus incident, and its potential causes. The
provided data included semi-structured text documents (news arti-
cles, resumes, etc.), structured tabular text documents (email and
employee records), an organization chart, and a map.
To examine the news articles and email messages, we used Jig-
saw, a visual analytics system for exploring large text document
collections [1]. We used Jigsaw’s List View to identify a list of
news articles around the day of the incident to answer question 2.
Subsequently, we deliberately inspected the identified documents
in the Document View to find relevant information.
We believed that the email messages between employees would
be best visualized with a graph to show the connections. This infor-
mation was important for answering question 1d about connections
between POK and GAStech. We used the Graph View in Jigsaw to
identify the different employees involved in a specific email thread.
However, the Graph view could not show an overview of the dis-
tribution of direct email exchanges between employees. Thus, we
used two other visualization tools to explore the overall email con-
nections between any given pair of employees. The first one was
a circular graph visualization built using D3 toolkit. As shown in
Fig. 1(a), the employees are grouped by their departments around
the circumference of the graph. The amount of email between each
pair of employees is encoded by the width of the connected line.
Mousing over any employee, such as Isia Vann in Fig. 1(a), high-
lights all emails to and from this person in the circular graph and
shows additional information on the side. We used this visualiza-
tion to find potentially interesting connections between employees
sharing last names with known POK members and other GAStech
employees. The second tool was built with Matlab and employed
an adjacency matrix visualization including interactive features to
inspect the communication between any two employees. For exam-
ple, in Fig. 1(b), Isia Vann, who we suspected to be connected with
POK, seemed to be in a relationship with Rachel Pantanal, implying
she may also be connected to POK.
*
e-mail: jaegul.choo@cc.gatech.edu, yihan@gatech.edu,
mengdie.hu@gatech.edu, hannahkim@gatech.edu, jnugent6@gatech.edu,
fpoggi@cs.unibo.it, hpark@cc.gatech.edu, stasko@cc.gatech.edu
1
Video: http://www.cc.gatech.edu/gvu/ii/challenge/
GT-14-MC1.mp4
(a) D3 circular graph for emails
(b) Matlab matrix view for emails
Figure 1: Email analysis
Figure 2: Tablet view for organizing events around the incident
None of these visualizations on its own was sufficient to answer
any question in Challenge 1. Instead, we used information from
combinations of the tools to gain a better understanding of the data.
We did this by using Jigsaw’s Tablet window and MS Powerpoint
to visually organize our findings from the various visualizations
together. The Tablet, shown in Fig. 2, provides a workspace for
manually creating visual representations with lines, text boxes, and
connections to documents. We created a timeline view in the Tablet
for question 2 with notes and links to related documents. The view
provides a flexible platform for visually organizing and presenting
findings. For information learned outside of Jigsaw, we used MS
Powerpoint to gather our findings.
2 MINI CHALLENGE 2
The task of Mini Challenge 2
2
was to find unusual patterns in em-
ployees’ daily lives from their credit/debit/loyalty card records and
2
Video: http://www.cc.gatech.edu/gvu/ii/challenge/
GT-14-MC2.mp4
347
IEEE Symposium on Visual Analytics Science and Technology 2014
November 9-14, Paris, France
978-1-4799-6227-3/14/$31.00 ©2014 IEEE