Ann. Data. Sci.
DOI 10.1007/s40745-016-0096-6
Big Data Paradigm: What is the Status of Privacy
and Security?
Kenneth David Strang
1
· Zhaohao Sun
2
Received: 16 August 2016 / Revised: 15 October 2016 / Accepted: 26 December 2016
© Springer-Verlag Berlin Heidelberg 2017
Abstract We extended the big data body of knowledge by analyzing the longitudinal
literature to highlight important research topics and identify critical gaps. We initially
collected 79,012 articles from 1900 to 2016 related to big data. We refined our sample
to 13,029 articles allowing us to determine that the big data paradigm commenced
in late 2011 and the research production exponentially rose starting in 2012, which
approximated a Weibull distribution that captured 82% of the variance ( p <.01). We
developed a dominant topic list for the big data body of knowledge that contained 49
keywords resulting in an inter-rater reliability of 93% (r
2
= 0.89). We found there were
13 dominant topics that captured 49% of the big data production in journals during
2011–2016 but privacy and security related topics accounted for only 2% of those
outcomes. We analyzed the content of 970 journal manuscripts produced during the
first of 2016 to determine the current status of big data research. The results revealed a
vastly different current trend with too many literature reviews and conceptual papers
that accounted for 41% of the current big data knowledge production. Interestingly,
we observed new big data topics emerging from the healthcare and physical sciences
disciplines.
B Kenneth David Strang
Kenneth.Strang@plattsburgh.edu
Zhaohao Sun
zsun@dbs.unitech.ac.pg; zhaohao.sun@gmail.com
1
Regional Higher Education Center, School of Business and Economics, State University of
New York, Plattsburgh, 640 Bay Road, Queensbury, NY 12804, USA
2
Department of Business Studies, PNG University of Technology, Lae 411, Papua New Guinea
123