Enhancing Data Education with Datathons: an Experience with Open Data on Renewable Energy Systems Antonella Longo antonella.longo@unisalento.it SyDa Lab, Department of Engineering for Innovation, University of Salento Lecce, Italy Marco Zappatore marcosalvatore.zappatore@unisalento.it SyDa Lab, Department of Engineering for Innovation, University of Salento Lecce, Italy Angelo Martella angelo.martella@unisalento.it SyDa Lab, Department of Engineering for Innovation, University of Salento Lecce, Italy Chiara Rucco chiara.rucco@unisalento.it SyDaLab, Department of Engineering for Innovation, University of Salento Lecce, Italy ABSTRACT Data literacy and the fundamentals of big data management are becoming interdisciplinary in Higher Education curricula, also due to the widespread need of data science skills. This casts the need for presenting novel (and more engaging) learning activities to students. Data hackathons (also known as datathons) represent a viable option to allow students practicing with real use cases and datasets, as well as addressing their learning experiences collabo- ratively. Moreover, datathons promise to improve soft skills and ofer hands-on learning opportunities. Therefore, we present in this paper a datathon on a publicly available dataset about renewable energy systems. The datathon involved students from three data- focused courses of three diferent M.S. degrees at the University of Salento (Italy). A detailed analysis of the design, implementation and evaluation choices is proposed, along with a series of gath- ered insights and lessons learned that might help systematizing the introduction and use of datathons in data education. CCS CONCEPTS · Applied computing Collaborative learning; · Informa- tion systems Data management systems; Data cleaning; Ex- traction, transformation and loading. KEYWORDS datathon, open data, data education curricula, renewable energy systems ACM Reference Format: Antonella Longo, Marco Zappatore, Angelo Martella, and Chiara Rucco. 2022. Enhancing Data Education with Datathons: an Experience with Open Data on Renewable Energy Systems. In 1st International Workshop on Data Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for proft or commercial advantage and that copies bear this notice and the full citation on the frst page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specifc permission and/or a fee. Request permissions from permissions@acm.org. DataEd’22, June 17, 2022, Philadelphia, PA, USA © 2022 Association for Computing Machinery. ACM ISBN 978-1-4503-9350-8/22/06. . . $15.00 https://doi.org/10.1145/3531072.3535322 Systems Education (DataEd’22), June 17, 2022, Philadelphia, PA, USA. ACM, New York, NY, USA, 6 pages. https://doi.org/10.1145/3531072.3535322 1 INTRODUCTION In 1999, the frst ofcial coding event was held in Calgary, Canada: a group of 10 volunteer programmers joined a 1-week-long meet- ing of collaborative software development on OpenBSD, an open- source operating system. Soon after, the term hack(ing mara)thon, or hackathon for brevity, was coined to identify a collaborative coding event where programmers develop software prototypes ac- cording to a common goal or interest [13]. Since then, diferent typologies of coding events have been branched out (each one under a specifc name) while the original defnition of hackathon assumed a slightly diferent meaning in order to entail a broader perspective. Nowadays, if the focus is more on developing software prototypes in a competitive manner, either online or in presence, the term coding competition or coding hackathon is used. The term data hackathon or datathon is used when a specifc data-driven category of coding hackathon is considered, aimed at examining datasets, also by applying to them specifc data science approaches. When the event is not exactly software-driven and focuses on other skills, like hardware prototype implementation and demonstration, presentation capabilities, cross-disciplinary contributions, and so on, the original term hackathon is preferred. Therefore, datathons are quite a recent variant in the feld of coding hackathons: they ofer students and practitioners the op- portunity to join a peer competition on a previously-agreed data analysis/management set of tasks, which lasts for a predefned time window, addresses a real-world data-driven problem [2], and requires to present the outcomes to a specifc audience. The need for datathons rises from several driving factors. First, the requirement to enrich and update traditional data education curricula discloses new opportunity to pave the way for more sys- tematic introduction of datathons in higher-education programmes. Second, acquiring an adequate data literacy is nowadays requested at a cross-disciplinary level: being able to practice directly on real data wrangling challenges may beneft not only students from Com- puter Engineering and Data Science courses but also those coming from a broader spectrum of STEM disciplines. 26