International Journal of Innovative Technology and Exploring Engineering (IJITEE)
ISSN: 2278-3075, Volume-9 Issue-6, April 2020
Published By: Blue Eyes Intelligence Engineering & Sciences Publication
Retrieval Number: F3320049620/2020©BEIESP
DOI: 10.35940/ijitee.F3320.049620

Foresight of Health Risk Based on Air Pollutants’ Air Quality Index Values

Aiswarya Johney, Namitha S J, Leena Vishnu Namboothiri

Abstract: This work focuses on air pollution data from various metropolitan cities. Rapidly growing industrialization, both in the automobile sector and in other public sectors, has massively increased the intensity of air pollution, which has drastically affected nature and human rights for decades. The most destructive aspect of air pollution is that it can cause severe immunity problems in flora and fauna as well as in humans. It can be life-threatening when air pollutant levels exceed their limits. Various software tools are used to predict air contamination, which is a formidable task. In this paper, we aim to identify an accurate algorithm for a health-risk prediction system implemented with the Weka tool. The algorithms used are the J48 decision tree and the Multiclass Classifier. Health risks are predicted from the AQI values of air pollutants such as NO2, O3, CO, and SO2.

Keywords: Air Quality Index; Data Mining; J48; Multiclass Classifier; Weka Tool.

I. INTRODUCTION

Air pollution is the contamination of air by the persistent presence of toxic substances, mainly harmful gases such as CO2, NO2, and SO2, which damage the well-being of humans in the environment. Air pollution is a major cause of health concerns such as nasal problems, lung cancer, bronchitis, skin infections, and heart problems. It is caused by pollutants and particulate matter in the air that are dangerous to inhale.
With the growth of large industries and the increased use of public and private vehicles, contamination has surged, leading to drastic variation in the climate and, in turn, to global warming. Estimating air pollution levels is therefore important for deciding on the action plans needed to reduce air pollution. By analyzing the air, we can predict escalating levels of air pollution that are injurious to sensitive people. Atmospheric air quality is evaluated in terms of the health problems associated with each air quality level. The Air Quality Index (AQI) is a numerical measure that government agencies use to inform the public about the level of air contamination. An increase in the AQI signifies increased air pollution and therefore a more severe threat to human health.

Revised Manuscript Received on March 30, 2020.
Aiswarya Johney, P G Student, Department of Computer Science and IT, Amrita School of Arts and Sciences, Kochi, Amrita Vishwa Vidyapeetham, India. Email: aiswaryajohney09@gmail.com
Namitha S J, P G Student, Department of Computer Science and IT, Amrita School of Arts and Sciences, Kochi, Amrita Vishwa Vidyapeetham, India. Email: namitha7700@gmail.com
Leena Vishnu Namboothiri, Assistant Professor, Department of Computer Science and IT, Amrita School of Arts and Sciences, Kochi, Amrita Vishwa Vidyapeetham, India. Email: vleena@gmail.com

The AQI value indicates whether the surrounding air is clean or contaminated and states the health risks associated with it. Developed countries have their own air quality indices which, for the protection of public health, are tied to national air quality standards. AQI values are grouped into ranges, each assigned a color code and a standardized public health warning. The categories of the Air Quality Index level are Good, Moderate, Unhealthy for Sensitive Groups, Unhealthy, and Very Unhealthy.
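The categories listed above can be illustrated with a small lookup. The following is a minimal sketch, not the system built in this work: it assumes the US EPA breakpoints (50, 100, 150, 200, 300) as the upper bounds of the categories and adds the EPA's sixth level, Hazardous, for values above 300. The names AQI_UPPER_BOUNDS, AQI_CATEGORIES, and aqi_category are hypothetical and chosen only for illustration.

```python
from bisect import bisect_left

# Upper bound of each category, assuming the US EPA breakpoints.
AQI_UPPER_BOUNDS = [50, 100, 150, 200, 300]

# One label per range; the last label covers every value above 300.
AQI_CATEGORIES = [
    "Good",
    "Moderate",
    "Unhealthy for Sensitive Groups",
    "Unhealthy",
    "Very Unhealthy",
    "Hazardous",
]


def aqi_category(aqi: float) -> str:
    """Return the category label for a non-negative AQI value."""
    if aqi < 0:
        raise ValueError("AQI cannot be negative")
    # bisect_left returns the index of the first upper bound >= aqi,
    # so a value exactly on a breakpoint stays in the lower category.
    return AQI_CATEGORIES[bisect_left(AQI_UPPER_BOUNDS, aqi)]


if __name__ == "__main__":
    for value in (42, 120, 250, 320):
        print(value, aqi_category(value))
```

A label produced this way could serve as the class attribute when pollutant AQI values are prepared for a classifier, although the class labels actually used in this work may differ.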
The five categories from Good to Very Unhealthy cover AQI values from 0 to 300. An AQI value below 50 suggests that there is no hazard to public health, whereas a value above 200 indicates unhealthy conditions. The index was developed by the United States Environmental Protection Agency (EPA) to address health concerns arising from air contamination. Air quality is classified by AQI value into six levels, from good quality for values below 50 to hazardous for values above 300.

II. LITERATURE REVIEW

In 2015, Ruhul Amin Dicken [1] proposed a system that relates pollutant levels to patient admissions at medical facilities in order to analyze the cause of the rapid rise in disease rates in Bangladesh. Air pollutants in the different seasons of Bangladesh are clustered with K-Means, and patients are classified by admission rate with the CART method.

Data mining is applied to forecast air contamination in [3], which proposes two feature selection procedures: a linear step-wise fit and a genetic algorithm. PM10, sulphur dioxide, nitrogen dioxide, and ozone are the pollutants considered in the prediction. That paper explains the mathematical aspects of air pollution forecasting and concentrates mainly on the data mining classification used to construct the most reliable prediction model.

In [4], ambient air quality in Tamil Nadu is predicted with an Artificial Neural Network (ANN) model. The resulting pattern can serve as an important reference for government policymakers planning future air pollution policies.

Data mining is also used to analyze health risk on the basis of air quality level [5]. For the prediction of health concerns, the Decision Tree J48 and Naïve Bayes algorithms are used.
Paper [9] discusses the rise in air quality index values in urban regions relative to other regions and finds that the random forest algorithm gives higher accuracy in predicting air quality in urban regions compared with others.