A taxonomy for labeling deviations in district heating customer data Sara Månsson a, b, * , Ida Lundholm Benzi c , Marcus Thern a , Robbe Salenbien b, d , Kerstin Sernhed a , Per-Olof Johansson Kallioniemi a a Department of Energy Sciences, Faculty of Engineering, Lund University, P.O. Box 118, SE-22100, Lund, Sweden b VITO, Boeretang 200, BE-2400, MOL, Belgium c Solita Sweden, Tulegatan 11, SE-113 53, Stockholm Stockholm, Sweden d EnergyVille, Thor Park 8310, BE-3600, Genk, Belgium article info Article history: Received 4 December 2020 Received in revised form 23 April 2021 Accepted 4 May 2021 Available online 8 May 2021 Keywords: Deviation labeling DH data Fault labeling Fault detection and diagnosis Taxonomy abstract This paper suggests a taxonomy for labeling deviating patterns in district heating (DH) customer data. The taxonomy contains several fault labels intended to register information about faults in the DH systems that cause deviations in customer data. This taxonomy is needed because the DH industry is currently missing a unanimous way to label identied faults. The lack of a taxonomy makes it hard to develop automated fault detection and diagnosis methods based on the analysis of DH customer data. Such methods usually require training on historical data sets known to contain deviating data patterns caused by specic faults. By developing a taxonomy for labeling these faults, this study aims to create value for DH utilities in current and future DH systems. The taxonomy structure was based on literature studies, workshops, and discussions with partners within the Swedish FutureHeat collaboration orga- nization Smart Energi. Once the basic structure was decided, it was sent out for evaluation amongst Swedish DH utilities. The evaluation was carried out as a survey study. The results from the survey were compiled, and the nalized version of the deviation cause taxonomy was produced. The study includes the results of the survey study and the nalized version of the deviation cause taxonomy. © 2021 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). 1. Introduction District heating (DH) systems have been identied as an essential part of the future, smart energy systems based on renewable energy sources [1]. This is due to the DH systems inherent ability to utilize local fuel or heat resources that would otherwise go to waste [2]. However, the current DH systems have to undergo signicant improvements to fulll their role in future energy systems. One of the main areas of improvement would be to decrease the distribution temperatures to the temperatures of the fourth generation of district heating (4GDH)) [3]. A reduction of the temperature levels would make it possible to include larger amounts of waste-to-energy and geothermal and solar thermal heat sources [3]. It would also increase the power-to-heat ratio in combined heat and power (CHP) plants [4], make it possible to integrate more low-temperature excess heat [5], and decrease the heat losses from the DH systems [6]. To a large extent, the high-temperature levels in the current DH systems are due to faults in the systems that cause high return temperatures [7]. Therefore, this study focuses on the detection of the faults that cause high distribution temperatures. In this study, a fault is not only broken components but rather something that cause the system, or part of the system, to perform poorly. Such issues include summer bypass valves and leakages in the DH sys- tem [2]. A large number of faults also appear in the customer in- stallations [2]. Thus, the current DH systems contain many faults that need to be detected and corrected to reach lower temperature levels. However, successful fault detection will be an even more important aspect in future low-temperature DH systems. Such systems will be signicantly more cost-sensitive to unwanted temperature increases due to the heat sources included in the systems, as has been shown by Averfalk and Werner [8]. Many faults in the DH systems manifest themselves in DH customer data, where they appear as deviations in the typical heat use patterns and temperature levels [9, 10]. It would thus be possible to detect them by analyzing customer data. One important contributor to the success of fault detection will be to utilize different Information and Communication Technologies (ICT). ICT * Corresponding author. Department of Energy Sciences, Faculty of Engineering, Lund University, P.O. Box 118, SE-22100, Lund, Sweden. E-mail address: sara.mansson@energy.lth.se (S. Månsson). Contents lists available at ScienceDirect Smart Energy journal homepage: www.journals.elsevier.com/smart-energy https://doi.org/10.1016/j.segy.2021.100020 2666-9552/© 2021 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). Smart Energy 2 (2021) 100020