Citation: KC, U.; Aryal, J. Leveraging a Wildﬁre Risk Prediction Metric with Spatial Clustering. Fire 2022, 5, 213. https://doi.org/10.3390/ﬁre5060213 Academic Editor: Natasha Ribeiro Received: 14 October 2022 Accepted: 6 December 2022 Published: 9 December 2022 Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional afﬁl- iations. Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/). fire Article Leveraging a Wildﬁre Risk Prediction Metric with Spatial Clustering Ujjwal KC 1, * and Jagannath Aryal 2, * 1 CSIRO|Agriculture and Food, St Lucia, Brisbane, QLD 4067, Australia 2 Department of Infrastructure Engineering, Faculty of Engineering and Information Technology, University of Melbourne, Melbourne, VIC 3053, Australia * Correspondence: ujjwal.kc@csiro.au (U.K.); jagannath.aryal@unimelb.edu.au (J.A.) Abstract: Fire authorities have started widely using operational ﬁre simulations for effective wildﬁre management. The aggregation of the simulation outputs on a massive scale creates an opportunity to apply the evolving data-driven approach to closely estimate wildﬁre risks even without running computationally expensive simulations. In one of our previous works, we validated the application with a probability-based risk metric that gives a series of probability values for a ﬁre starting at a start location under a given weather condition. The probability values indicate how likely it is that a ﬁre will fall into different risk categories. The metric considered each ﬁre start location as a unique entity. Such a provision in the metric could expose the metric to scalability issues when the metric is used for a larger geographic area and consequently make the metric hugely intensive to compute. In this work, in an investigative effort, we investigate whether the spatial clustering of ﬁre start locations based on historical ﬁre areas can address the issue without signiﬁcantly compromising the accuracy of the metric. Our results show that spatially clustering all ﬁre start locations in Tasmania into three risk clusters could leverage the probability-based risk metric by reducing the computational requirements of the metric by a theoretical factor in thousands with a mere compromise of approximately 5% in accuracy for two risk categories of high and low, thereby validating the possibility of the leverage of the metric with spatial clustering. Keywords: wildﬁre; risk metric; risk characterization; clustering; data-driven approach 1. Introduction With an increased understanding of phenomena and advancements in computing technologies and observational sciences, natural disasters can be modeled and studied with more detail than ever before [1–4]. The precise information on the location of the disaster and the associated risks are in-demand elements in natural disaster modeling systems and the subsequent application of such systems in an operational environment. Thus, any spatial information on the disaster start locations and their propagation extent within the framework of risk quantiﬁcation plays an inherent role in effective disaster management. Consequently, in the ﬁre disaster space, ﬁre authorities have started widely using operational ﬁre simulations for making better-informed decisions for wildﬁre management. Under current state-of-the-art of wildﬁre management, ﬁre practitioners and authorities quantify the risk associated with wildﬁres by running several ﬁre simulations in a geograph- ical area by using operational ﬁre spread models, collectively referred to as an ensemble and conducting statistical analyses [5–7]. These ensembles are computationally expensive but their recent extensive use for wildﬁre management has become possible due to un- precedented advancements in computing technologies such as cloud computing. The ﬁre simulations, when aggregated on a massive scale, have created a unique opportunity to apply the evolving data-driven approach to closely estimate wildﬁre risks even without running a single computationally expensive simulation, although the process is highly computationally intensive otherwise. Fire 2022, 5, 213. https://doi.org/10.3390/ﬁre5060213 https://www.mdpi.com/journal/ﬁre