PoS(ICHEP2022)243 Accelerating Machine Learning inference using FPGAs: the PYNQ framework tested on an AWS EC2 F1 Marco Lorusso, ,,* Daniele Bonacorsi, , Davide Salomoni, , Riccardo Travaglini, , Diego Michelotto,  Doina Cristina Duma  and Paolo Veronesi   INFN Bologna, viale Berti Pichat 6/2, Bologna, Italy  Department of Physics and Astronomy, University of Bologna, viale Berti Pichat 6/2, Bologna, Italy  INFN CNAF, viale Berti Pichat 6/2, Bologna, Italy E-mail: marco.lorusso11@unibo.it, daniele.bonacorsi@unibo.it, d.salomoni@unibo.it, riccardo.travaglini@bo.infn.it, diego.michelotto@cnaf.infn.it, doinacristina.duma@cnaf.infn.it, paolo.veronesi@bo.infn.it In the past few years, using Machine and Deep Learning techniques has become more and more viable, thanks to the availability of tools which allow people without speciﬁc knowledge in the realm of data science and complex networks to build AIs for a variety of research ﬁelds. This process has encouraged the adoption of such techniques, e.g. in the context of High Energy Physics. In order to facilitate the translation of Machine Learning (ML) models to ﬁt in the usual workﬂow for programming FPGAs, a variety of tools have been developed. One example is the HLS4ML toolkit, which allows the translation of Neural Networks (NN) built using tools like TensorFlow to a High-Level Synthesis description (e.g. C++) in order to implement this kind of ML algorithms on FPGAs. This paper presents the activity running at the University of Bologna and INFN-Bologna devoted to preliminary studies for the trigger systems of the Compact Muon Solenoid experiment at the CERN LHC accelerator. An open-source project from Xilinx called PYNQ is being tested combined with the HLS4ML toolkit. The PYNQ purpose is to grant designers the possibility to exploit the beneﬁts of programmable logic and microprocessors using the Python language. The use of cloud computing in this work allows us to test the capabilities of this workﬂow, from the creation and training of a Neural Network and the creation of a HLS project using HLS4ML, to managing NN inference with custom Python drivers. The main application explored in this work lives in the context of the trigger system of the CMS, where new reconstruction algorithms are being developed due to the advent of the High-Luminosity phase of the LHC. 41st International Conference on High Energy physics - ICHEP2022 6-13 July, 2022 Bologna, Italy * Speaker © Copyright owned by the author(s) under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND 4.0). https://pos.sissa.it/