Explore a deep learning multi-output neural network for regional multi-step-ahead air quality forecasts Yanlai Zhou a , Fi-John Chang a, * , Li-Chiu Chang b , I-Feng Kao a , Yi-Shin Wang a a Department of Bioenvironmental Systems Engineering, National Taiwan University, Taipei,10617, Taiwan, ROC b Department of Water Resources and Environmental Engineering, Tamkang University, New Taipei City, 25137, Taiwan, ROC article info Article history: Received 2 July 2018 Received in revised form 26 September 2018 Accepted 22 October 2018 Available online 24 October 2018 Keywords: Multi-output LSTM Deep learning Artiﬁcial intelligence (AI) Multi-step-ahead forecast Air quality Taipei city abstract Timely regional air quality forecasting in a city is crucial and beneﬁcial for supporting environmental management decisions as well as averting serious accidents caused by air pollution. Artiﬁcial Intelligence-based models have been widely used in air quality forecasting. The Shallow Multi-output Long Short-Term Memory (SM-LSTM) model is suitable for regional multi-step-ahead air quality fore- casting, while it commonly encounters spatio-temporal instabilities and time-lag effects. To overcome these bottlenecks and overﬁtting issues, this study proposed a Deep Multi-output LSTM (DM-LSTM) neural network model that were incorporated with three deep learning algorithms (i.e., mini-batch gradient descent, dropout neuron and L2 regularization) to conﬁgure the model for extracting the key factors of complex spatio-temporal relations as well as reducing error accumulation and propagation in multi-step-ahead air quality forecasting. The proposed DM-LSTM model was evaluated by three time series of PM 2.5 , PM 10, and NO x simultaneously at ﬁve air quality monitoring stations in Taipei City of Taiwan. Results indicated that the loss function values (mean-square-error) of the SM-LSTM and DM- LSTM models in the testing stages at horizon tþ4 were 0.87 and 0.72, respectively. The G bench values of the DM-LSTM model in the testing stages for PM 2.5 , PM 10, and NO x reached 0.95 at horizon tþ1 and exceeded 0.81 at horizon tþ4, respectively. Results demonstrated that the proposed DM-LSTM model incorporated with three deep learning algorithms could signiﬁcantly improve the spatio-temporal sta- bility and accuracy of regional multi-step-ahead air quality forecasts. © 2018 Elsevier Ltd. All rights reserved. 1. Introduction Exposure to ambient air pollution is a primary environmental risk factor in relation to adverse health impacts (Apte et al., 2015; Liu et al., 2017). Fine particulate matter (PM 2.5 and PM 10 , i.e., par- ticles smaller than 2.5 or 10 mm) and nitrogen oxide (NO x ) are the dominant components of ambient air pollution associated with booming urban development (Li et al., 2017, 2018; Lin and Zhu, 2018). To date, epidemiological investigations and studies demon- strated that some air pollution-related diseases were associated with exposure to PM 2.5 , PM 10, and NO x (Reggente et al., 2014; Wang et al., 2016; Wu et al., 2018a,b). In addition, these air pollutants were acknowledged as typical representatives of particle number concentration in urban air quality (Li et al., 2018a,b; Wu et al., 2018a,b). Real-time air quality information is of great importance to air pollution control and human health protection from air pollution (Ni et al., 2017). To support environmental management decisions and avert serious accidents caused by air pollution, air quality forecasting is becoming more and more essential not only to better govern the trend of air pollution variation but to provide timely and comprehensive environmental quality information (Pournazeri et al., 2014; Yang and Christakos, 2015; Corani and Scanagatta, 2016; Lauret et al., 2016; Wakeel et al., 2017; Van et al., 2018; Yanget al., 2018). A wide variety of methods have been used to forecast or predict regional air quality. These studies primarily branched out into two major classes: physical-based and data-driven methods. Physical- based models like dispersion and chemical transport models have still been under development as a result of uncertainties in relation to source inventories and the chemical and dynamical mechanisms of aerosols in atmosphere (Afzali et al., 2017; Vijayaraghavan et al., 2016; Jianget al., 2018; Karambelas et al., 2018; Pisoni et al., 2018). Data-driven models have leant upon the empirical or statistical relationship between air quality observations and other affecting * Corresponding author. E-mail address: changfj@ntu.edu.tw (F.-J. Chang). Contents lists available at ScienceDirect Journal of Cleaner Production journal homepage: www.elsevier.com/locate/jclepro https://doi.org/10.1016/j.jclepro.2018.10.243 0959-6526/© 2018 Elsevier Ltd. All rights reserved. Journal of Cleaner Production 209 (2019) 134e145