IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661, p-ISSN: 2278-8727, Volume 22, Issue 3, Ser. II (May - June 2020), PP 01-06
www.iosrjournals.org
DOI: 10.9790/0661-2203020106

Towards Data Science: The Data Driven Era

Vishal Dwivedi 1, Dr. Sheenu Rizvi 2, Dr. Anuradha Misra 3
1 (Department of Computer Science & Engineering, ASET / Amity University, India)
2 (Department of Computer Science & Engineering, ASET / Amity University, India)
3 (Department of Computer Science & Engineering, ASET / Amity University, India)

Abstract: Nowadays, a vast amount of data is produced rapidly in cyberspace. Data science is a composite of several pre-existing disciplines. It refers to the study of extracting, collecting, and representing data for use in business or in technical problems. Data science contributes a new research approach to the natural and social sciences and goes beyond computer science in the study of data. With a large amount of data now available in cyberspace, organizations in most fields rely on data for their competitive advantage. This paper presents the challenges posed by data and discusses the differences between data science and related technologies such as big data, the life cycle of data science, and the data science technologies that are currently trending, together with their applications. The paper also describes the roles and responsibilities of a data scientist.

Key Words: Data nature, cyberspace, data management

Date of Submission: 06-05-2020
Date of Acceptance: 19-05-2020

I. Introduction

The data explosion is the great increase in the amount of data in cyberspace, which has brought everyone into the Big Data era.
Data are no longer confined to the qualitative values of variables; rather, data are everything that can be found in cyberspace. Data nature is the totality of data found in cyberspace [1]. The unique patterns exhibited by data nature go beyond the facts existing in the natural world.

Data science is a collection of fundamental principles that support the principled extraction of information and knowledge from data. The technology most closely associated with data science is data mining, another trending field, which is the actual extraction of knowledge from data according to these principles. There are hundreds of algorithms and a great many methods in this field.

Since computers came into existence, we have continuously used and dealt with data. Data research no longer aims only to solve real-world problems; it also extends to analyzing data in order to learn the phenomena and laws of data itself (for example, discovering the growth pattern of data and predicting the scale of data in cyberspace ten years into the future). Supporting the natural and social sciences with data technologies and methods, and exploring data nature, can lead to a transition toward a new science, namely data science. A person who engages in data science research is already a data scientist.

In this paper, we present the challenges posed by data and examine why data science is needed. We then discuss key issues such as fundamental theories and new methods.

II. Fundamental Theories of Data Science

The Theory of Data Similarities: Data similarity is the key feature for mapping the relationships between data in data testing. Research topics include the definition of similarity measures and the computation of similarities.
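To make the idea of a similarity measure concrete, the sketch below computes two widely used measures: cosine similarity for numeric vectors and Jaccard similarity for sets. The paper does not name a specific measure, so the choice of these two, the function names, and the conventions for empty or zero inputs are assumptions made purely for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length numeric vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # Assumed convention: a zero vector has similarity 0 to everything.
        return 0.0
    return dot / (norm_a * norm_b)


def jaccard_similarity(s, t):
    """Size of the intersection divided by the size of the union."""
    s, t = set(s), set(t)
    if not s and not t:
        # Assumed convention: two empty sets are treated as identical.
        return 1.0
    return len(s & t) / len(s | t)
```

Cosine similarity suits records represented as numeric feature vectors, whereas Jaccard similarity suits records represented as sets of items (for example, tags or keywords). Which measure is appropriate depends on how the data are encoded.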
Data Measurement and Data Algebra: A complete and correct theory of data science is necessary. The Relational Database Management System (RDBMS) worked well when data naturally fit into tables, but it was known from the start that the relational model of data was incomplete. The imperfections of the model became obvious mainly through the problems encountered when using an RDBMS with its fixed and specific structure.

Data Science Research Methods: The primary research methods of data science include data search, data analysis, and data approach. Data search describes the characteristics and structure of a data set so that we can check the volume of the data set and select a method for evaluating it.

Search of Data Nature: The fundamental premise of data search is that many research records drawn from nature or from human experience are stored in cyberspace in the form of data. These data are known as data nature. The search of data nature now takes place at a higher level than before, and it shows that many principles and natural laws exist in data nature, for example prime numbers and the Fibonacci number series.
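The data search step described under the research methods above, which characterizes the structure and volume of a data set before an evaluation method is chosen, can be sketched roughly as follows. The function name `profile`, the list-of-dictionaries input format, and the rule that `None` and the empty string count as missing are all assumptions made for illustration and do not come from the paper.

```python
def profile(rows):
    """Summarize a data set given as a list of dicts (one dict per record).

    Assumes every value is a hashable scalar (number, string, None).
    """
    # Collect column names in first-seen order, across all records.
    names = []
    for row in rows:
        for name in row:
            if name not in names:
                names.append(name)

    columns = {}
    for name in names:
        # A column absent from a record is treated the same as None.
        values = [row.get(name) for row in rows]
        present = [v for v in values if v is not None and v != ""]
        columns[name] = {
            "missing": len(values) - len(present),
            "types": sorted({type(v).__name__ for v in present}),
            "distinct": len(set(present)),
        }

    return {"n_rows": len(rows), "n_columns": len(names), "columns": columns}
```

A summary of this kind gives the volume of the data set (row and column counts) and a first view of its structure (value types, missing entries, and distinct values per column), which is the information needed to decide how the data set should be evaluated.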