© 2020, IJCSE All Rights Reserved 57 International Journal of Computer Sciences and Engineering Open Access Research Paper Vol.8, Issue.6, June 2020 E-ISSN: 2347-2693 Forecasting novel COVID-19 confirmed cases in India using Machine Learning Methods Saroj S. Date 1* , Sachin N. Deshmukh 2 1 Dept. of Computer Science & Engineering, Jawaharlal Nehru Engineering College, MGM University, Aurangabad (MS), India 2 Dept. of Computer Science & Information Technology, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad (MS) India *Corresponding Author: sarojdate@jnec.ac.in DOI: https://doi.org/10.26438/ijcse/v8i6.5762 | Available online at: www.ijcseonline.org Received: 26/May/2020, Accepted: 20/June/2020, Published: 30/June/2020 AbstractNowadays, there is a very adverse impact on economic, cultural, social and almost all fields in the world because of Covid-19. The Covid-19 term is described as -'CO' for corona, 'VI' for virus, and 'D' for disease. It is an infectious disease caused by severe acute respiratory syndrome which is transmitted through respiratory droplets and contact routes. Since December 2019, corona-virus disease (COVID-19) has out-broke from the country China. Till now, more than 78, 23, 289 people are infected and more than 4 Lakhs of deaths have been caused worldwide. Unfortunately, the number of infections and deaths are still increasing rapidly which has put the world in a different state. Artificial Intelligence can play a key role to infection forecasting in national and provincial levels in many countries. The objective of this study is to use machine learning methods to forecast the number of cases for the next 2 weeks, i.e. till 30 th June 2020. The data was collected from 22 nd January to 15 th June 2020 by nationally recognized sources. The data file contains the cumulative count of confirmed, death and recovered cases of COVID-19 from different countries from the date 22 nd January 2020.In this study, the outbreak of this disease has been analyzed for India till 15 th June 2020 and predictions have been made for the number of cases for the next two weeks. KeywordsCovid-19, Corona, Corona Virus, Machine Learning, Forecasting, Artificial Intelligence, time series forecasting I. INTRODUCTION On 31 st December 2019, the novel Corona virus, known as COVID-19 was reported in Wuhan, China for the very first time. Corona viruses are the infectious virus which has adverse affect on the respiratory system of humans. The symptoms of COVID-19 may or may not be visual in infected individual, therefore the spread rate can be faster. Till now, effective and well-tested vaccine against CoVID- 19 has not been invented, only precautions are the safety measures. Though the continuous efforts are going on , the virus has managed to spread in most of the territories in the world and World Health Organization (WHO) has announced COVID-19 as Pandemic. Most of the countries in the world are working cooperatively and openly to bring this situation under control. Data scientists and data mining researchers can play an important role during these types of situations. They can integrate the related data and technology to better understand the virus and its characteristics, which can help in taking right decisions and concrete plan of actions. As per the daily situation report of WHO, as on 15 th June 2020 the COVID-19 transmission scenario reports 78,23,289 confirmed cases with 4,31,541 deaths globally. Data mining is a technology, developing with database as well as artificial intelligence. It is a processing procedure of extracting credible and effective novel techniques and understandable patterns from the database [1]. Artificial intelligence (AI) is a field of programming building which gives PCs an ability to learn without being unequivocally modified [2]. AI models can be used for estimating and predicting spread rate, so AI is one of the beneficial tools to fight against pandemic like COVID-19. The forecasting analysis is done by using the algorithms like ANN and time-series [3]. In this paper we are using, time-series algorithm. According to K. Krishna Rani Samal et al., approaches like SARIMA and Prophet can be used for forecasting based on historical data. They concluded that both the SARIMA and prophet model provides a good quality of accuracy. However, the best approach is the prophet model on log transformation which has the least minimum RMSE, MSE value [4]. This model is developed by Facebook, available in python and R. The main contribution of this research paper is forecasting of COVID-19 for the next two weeks i.e. till 30 th June 2020 using Prophet Model. In this study, the data was analyzed from 22 nd January 2020 to 15 th June 2020.