Uncorrected Author Proof Journal of Intelligent & Fuzzy Systems xx (20xx) x–xx DOI:10.3233/JIFS-169959 IOS Press 1 Information entropy based event detection during disaster in cyber-social networks 1 2 A. Bhuvaneswari and C. Valliyammai 3 Department of Computer Technology, Madras Institute of Technology, Anna University, Chennai 4 Abstract. The demand for Cyber Social Networks has increasingly become the main source of information propagation due to the rapid growth of micro-blogging activity between socially connected people. The process of detecting disaster events, in huge volumes, on fast-streaming platform is quite challenging. In this paper, an information entropy based event detection framework is proposed to identify the event and its location by clustering relatively high-density ratio of tweets using Twitter data. The Shannon entropy of target users, location, time intervals and hashtags are estimated to quantify the dissemination of events as “how-far about” in real- world using entropy maximization inference model. The geo-tagged (spatial) tweets are extracted for a specified time period (temporal) to identify the location of an event as “where-when about”; and visualizes the event in geo-maps. The evaluation parameters of Entropy, Cluster Score, Event Detection Hit and False Panic Rate during four major disaster events are identified to illustrate the effectiveness of the proposed framework. The retweeting activity of the Twitter user is classified as human signatures and bots. The experimental outcome determines the scope and significant dissemination direction of finding events from a new perspective which demonstrates 96% of improved event detection accuracy. 5 6 7 8 9 10 11 12 13 14 15 16 Keywords: Cyber-social networks, event detection, geo-tag, spatiotemporal, Shannon entropy 17 1. Introduction 18 In recent years, Cyber-Social Networking (CSN) 19 platforms like Twitter, Facebook, Sina Weibo and 20 Tumblr have encapsulated huge chunks of infor- 21 mation comprising insightful user opinions about 22 various events [1]. The problem of evaluating 23 unstructured data in user generated content has ubiq- 24 uitous applications, including the identification of 25 conversation topics and abnormal events [2]. The 26 online events that match with actual real-time events 27 are of diverse temporal and spatial scales, particularly 28 disasters like earthquakes, floods, landslides, etc. to 29 deliver prior alerts, support, and immediate recovery 30 [3, 4]. The CSN act as communication tool and dou- 31 ble up as informative podiums that catch real human 32 Corresponding author. A. Bhuvaneswari, Research Scholar, Department of Computer Technology, Madras Institute of Tech- nology, Anna University, Chennai. E-mail: bhuvana.cse14@ gmail.com. voices reaching out during disasters. However, the 33 information about ongoing events and sensitive issues 34 that remain unnoticed. Providentially, the CSN is 35 used as new additions to measure the impact of disas- 36 ter events due to wide usage of smart phones [5]. The 37 geographically linked CSN data containing geo-tags 38 is accepted as a trustworthy source for detecting dis- 39 aster and provides wide range of accessible services 40 [6]. In 2015, a study explored the retweeting activ- 41 ity geographically from collective Twitter population 42 around the globe during Hurricane Sandy [7] disas- 43 ter. They used feature-rich classifiers for determining 44 the microblogs which are most relevance to specified 45 event [8]. The events could be recognized as abnormal 46 spikes in activity [9] by monitoring the normal flow of 47 microblogs using change point detection approaches 48 [10]. It determines the intensity of the sensitive and 49 abnormal event that was deciphered in the course of 50 fluctuations in content. The annotation scheme [11] 51 for identifying relevant tweets after disasters usually 52 1064-1246/18/$35.00 © 2018 – IOS Press and the authors. All rights reserved