PoS(ICHEP2022)243
Accelerating Machine Learning inference using FPGAs:
the PYNQ framework tested on an AWS EC2 F1
Marco Lorusso,
,,*
Daniele Bonacorsi,
,
Davide Salomoni,
,
Riccardo
Travaglini,
,
Diego Michelotto,
Doina Cristina Duma
and Paolo Veronesi
INFN Bologna,
viale Berti Pichat 6/2, Bologna, Italy
Department of Physics and Astronomy, University of Bologna,
viale Berti Pichat 6/2, Bologna, Italy
INFN CNAF,
viale Berti Pichat 6/2, Bologna, Italy
E-mail: marco.lorusso11@unibo.it, daniele.bonacorsi@unibo.it,
d.salomoni@unibo.it, riccardo.travaglini@bo.infn.it,
diego.michelotto@cnaf.infn.it, doinacristina.duma@cnaf.infn.it,
paolo.veronesi@bo.infn.it
In the past few years, using Machine and Deep Learning techniques has become more and more
viable, thanks to the availability of tools which allow people without specific knowledge in the
realm of data science and complex networks to build AIs for a variety of research fields. This
process has encouraged the adoption of such techniques, e.g. in the context of High Energy
Physics. In order to facilitate the translation of Machine Learning (ML) models to fit in the usual
workflow for programming FPGAs, a variety of tools have been developed. One example is the
HLS4ML toolkit, which allows the translation of Neural Networks (NN) built using tools like
TensorFlow to a High-Level Synthesis description (e.g. C++) in order to implement this kind of
ML algorithms on FPGAs.
This paper presents the activity running at the University of Bologna and INFN-Bologna devoted
to preliminary studies for the trigger systems of the Compact Muon Solenoid experiment at
the CERN LHC accelerator. An open-source project from Xilinx called PYNQ is being tested
combined with the HLS4ML toolkit. The PYNQ purpose is to grant designers the possibility to
exploit the benefits of programmable logic and microprocessors using the Python language. The
use of cloud computing in this work allows us to test the capabilities of this workflow, from the
creation and training of a Neural Network and the creation of a HLS project using HLS4ML, to
managing NN inference with custom Python drivers.
The main application explored in this work lives in the context of the trigger system of the CMS,
where new reconstruction algorithms are being developed due to the advent of the High-Luminosity
phase of the LHC.
41st International Conference on High Energy physics - ICHEP2022
6-13 July, 2022
Bologna, Italy
*
Speaker
© Copyright owned by the author(s) under the terms of the Creative Commons
Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND 4.0). https://pos.sissa.it/