2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)
An Interactive Platform to Track Global COVID-19
Epidemic
Zhenhe Pan
*
, Dhruv Mehta
*
, Anubhav Tiwari
*
, Siddhartha Ireddy
*
, Zhou Yang
†
and Fang Jin
†
*
Department of Computer Science, Texas Tech University
Email:{zhenpan, Dhruv.C.Mehta, Anubhav.Tiwari,siireddy}@ttu.edu
†
Department of Statistics, George Washington University
Email: zhou yang@gwmail.gwu.edu, fangjin@email.gwu.edu
being able to halt the spread of this and future unknown
infectious disease outbreak more effectively and efficiently.
In this paper, we developed an interactive visualization
platform
1
to closely monitor the global COVID-19 situation
and the social media & news reports. Besides showing the
latest COVID-19 cases numbers, the platform preserves the
historical records and enable users to see the COVID-19
spreading trends. In addition, the platform collects and digests
the tweets streams and grab people’s top concern, provid-
ing users a One-stop experience of the overview COVID-
19 situation. To summarize, our platform has the following
contributions:
• Developed a comprehensive database of coronavirus
cases worldwide, which would tremendously helps re-
searchers and policy makers to model the spread of this
infectious disease timely.
• Provided a Bird’s-eye view of the global COVID-19
dynamics, not only showing the latest confirmed cases
numbers, but also allow users to view the COVID-19
trends for each country and state/province.
• Developed practical tools to collect information on coro-
navirus cases world-wide by mining multiple online plat-
forms.
• Integrated breaking events and people’s concern flows
into the map, providing a full picture of the COVID-19
dynamics.
II. SYSTEM FRAMEWORK
Figure 1 illustrates the system framework of our platform.
The framework is divided into three main components: (a)
A front-end component which collects real-time data from
multiple websites and social media platforms; (b) A database
storage engine that enables real-time update and historical data
retrieve; (c) A back-end component that includes multiple
algorithms for tweets filtering, concern flow extraction, and
breaking news identification.
A. Data Flow
As shown in Figure 2, our input data has three types:
real-time COVID-19 cases for each country/state, coronavirus
related tweets discussions, breaking news from multiple web-
sites. The real-time COVID-19 case numbers were collected
1
http://worldcovid-19stats.000webhostapp.com/covid.html#trends
Abstract—This project built a world-wide database of coron-
avirus cases, which helps to model the spread of the coronavirus
disease (COVID-19), and to identify policy and social factors
that impact the spread of COVID-19. Four essential tasks are
implemented: 1) build a comprehensive database of coronavirus
cases world-wide; 2) visualize the heatmap of confirmed cases
for each country, provide detailed spreading trends for each
countries and comparison among countries; 3) collect tweets
about COVID-19 in real-time and extract people’s daily concern
flow; 4) i ntegrate b reaking n ews s uch as fi rst confirmed/death
case in each country. This demo will provide decision-makers with
accurate data-driven representations in an easy to understand
format that enables them to make more timely and cost-effective
preparation and response plans.
I. I NTRODUCTION
COVID-19 is spreading rapidly and has already affected
more than 169 countries, infecting near a million people and
causing more tens of thousand deaths around the world (as of
April 1, 2020). The collection of coronavirus cases information
such as positive diagnoses, recovery, and death, is vital as
it provides a foundation for modeling the disease evolution
dynamics, allowing us to infer future trends, and analyze the
factors that influence the speed at which the virus spreads. This
information will be crucially important if we are to develop
appropriate policies to manage this pandemic. A number of
platforms [1]–[7] developed to predict and provide people
an updated information about coronavirus as this outbreak
unfolds. These work increased people’s awareness of this
disease and advanced the state of the art by developing accu-
rate methods and effective tools to collect coronavirus cases
from mining online platforms. However, most of the currently
available coronavirus databases have some limitations: they
either contain only the current numbers of cases and their
locations, completely ignoring the historical information, or
only cover certain countries/regions, or lack of timely update.
Moreover, the pandemic-related data is stored in multiple
unconnected resources (websites, news reports, social media
posts, and ad-hoc databases) and recorded in many different
languages. This lack of inter-database connectivity, along with
the associated language barriers, significantly h amper our
efforts to deal with this dangerous pandemic. There is thus an
urgent need to develop a comprehensive database to identify
and track coronavirus cases world-wide, with the ultimate goal
IEEE/ACM ASONAM 2020, December 7-10, 2020
© IEEE 2021. This article is free to access and download, along with rights for
full text and data mining, re-use and analysis.
948
2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) | 978-1-7281-1056-1/20/$31.00 ©2020 IEEE | DOI: 10.1109/ASONAM49781.2020.9381436