Microprocessors and Microsystems 49 (2017) 28–43 Contents lists available at ScienceDirect Microprocessors and Microsystems journal homepage: www.elsevier.com/locate/micpro A collision management structure for NoC deployment on multi-FPGA Atef Dorai a,b,c, , Virginie Fresse a , Catherine Combes a , El-Bay Bourennane d , Abdellatif Mtibaa c a Hubert-Curien laboratory, UMR CNRS 5516, Jean-Monnet University of Lyon, France b INRIA/IRISA, University of Rennes-1, Lannion France c EmicroE Laboratory LR99ES30, University of Monastir, Tunisia d Le2i laboratory UMR CNRS 6306, University of Bourgogne, Dijon, France a r t i c l e i n f o Article history: Received 3 March 2016 Revised 12 December 2016 Accepted 17 January 2017 Available online 20 January 2017 Keywords: Traffic collision Network-on-chip Inter-FPGA communication Multi-FPGA Resource dimensioning a b s t r a c t With the increasing complexity of algorithms and new applications, the design of efficient embedded systems has to integrate efficient communication structures such as Network-on-Chip. Multi-FPGA plat- forms are considered to be the most appropriate experimental way to emulate and evaluate these large System-on-Chips. The deployment often goes through the Network-on-Chip partitioning on all FPGAs re- quiring the use of inter-FPGA communication links between routers. The number of external links and their performance restrict the communication bandwidth. Currently, the number of inter-FPGA signals is considered to be a major problem in the Network-on-Chip deployed on multi-FPGAs. As there are more signals to be connected than IOs, inter-FPGA links must be shared between routers leading to significant bottlenecks. As the ratio of the logic capacity to the number of IOs rises slowly for each FPGA generation, this technological bottleneck will be remaining for future system designs. In this paper, we propose a novel architecture for inter-FPGA collision management in the Network- on-Chip partitioned on multi-FPGAs. The structure ensures to efficiently share the external link between several routers with a minimum number of collisions and inter-FPGA bottlenecks. The proposed architec- ture is easily placed between the Network-on-Chip and the external protocol. The collision management architecture is based on the BackOff algorithm used in Wi-Fi communications and adapted to FPGA plat- forms. This algorithm balances accesses among all the routers connected with the inter-board interfacing, thereby avoiding collisions. We compare this structure with traditional techniques using experimental and theoretical results. The novel inter-FPGA architecture for the Network-on-Chip based on the BackOff algorithm achieves lower latency with fewer resources compared to other solutions. © 2017 Elsevier B.V. All rights reserved. 1. Introduction The integration of several cores into a System-on-Chip (SoC) be- comes higher and gives rise to new challenges [1]. For example, Intel has developed a co-processor called x86 Knights Corner that has 72 cores [2] and NVIDIA designs has announced a multicore- chip called Quadro M6000 with 3072 CUDA cores [3]. SoCs, such as the previously presented ones, that use a Network-on-Chip (NoC) communication infrastructure are the most efficient architectures for applications with hundreds of cores. The NoC is needed to sup- port a lot of parallel data transmissions and high bandwidths. A Corresponding author. E-mail addresses: atef.dorai@univ-st-etienne.fr (A. Dorai), virginie.fresse@univ- st-etienne.fr (V. Fresse), catherine.combes@univ-st-etienne.fr (C. Combes), ebourenn@u-bourgogne.fr (E.-B. Bourennane), abdellatif.mtibaa@enim.rnu.tn (A. Mtibaa). NoC structure ensures high scalability, high performance and low power consumption [5]. Emulating a SoC or a NoC only is often done one on Field Programmable Gate Arrays devices (FPGAs). FP- GAs are widely used for functional verification of complex logic de- signs, as their speed is 100–1,000 times faster than software sim- ulations (i.e. simulations using a PC). However, one major limitation with the use of FPGA is that its gate capacity is below the capacity required by large SoCs using a NoC. [4] shows that the recent designs of NoC can be so large that resources of a single FPGA are not enough. Although Moore’s law indicates that the number of transistors on a chip doubles approx- imately every two years, the larger SoC requires more and more resources that cannot be handled by one single chip. A multi-FPGA platform is therefore the most appropriate solution. Few research works have evaluated the performances of a large SoC since implementing a large SoC on a multi-FPGA platform is very challenging [6,10]. The number of FPGAs depends on the size of the prototyped SoC or NoC, which ranges from few FPGAs [8] up http://dx.doi.org/10.1016/j.micpro.2017.01.006 0141-9331/© 2017 Elsevier B.V. All rights reserved.