Building Network Abstractions on Specialized Hardware for Distributed Computing

#FPGAs #data #centers #communication #infrastructures
Share

In recent years, data centers have experienced a significant shift towards heterogeneity to accommodate the ever-growing workloads. Specialized hardware, particularly FPGAs, are widely deployed for their custom circuit efficiency and reconfigurability. Despite their potential, the development of distributed FPGA-accelerated applications is hindered by the lack of suitable communication infrastructures and abstractions. To bridge this gap, this talk introduces a suite of open-source communication infrastructures tailored for hardware accelerators. These infrastructures support a variety of protocols, including TCP, RDMA, and MPI collectives, making them versatile across different platforms. With these novel infrastructures, we can utilize specialized hardware both as smartNICs, relieving CPU load from networking tasks, and as distributed accelerators to collectively handle large-scale applications. We will highlight the practical benefits and capabilities of these infrastructuresv through a case study on distributing deep learning recommendation model inference across a heterogeneous cluster.



  Date and Time

  Location

  Hosts

  Registration



  • Date: 25 Apr 2024
  • Time: 03:00 PM to 04:00 PM
  • All times are (UTC+08:00) Beijing
  • Add_To_Calendar_icon Add Event to Calendar
If you are not a robot, please complete the ReCAPTCHA to display virtual attendance info.
  • Contact Event Host