Realizing Artificial Intelligence: Edge-to-Cloud-to-Exascale

#internet #engineering #technologies #intelligence #AI #infrastructure

Abstract: Foundation models with trillions of parameters are now being trained, and multi-modal GenAI and inference-serving services are being deployed for a variety of use cases. To meet the computational demands of these AI workloads, we now have infrastructure with larger-than-ever GPUs and networks with ever-increasing bandwidth. In this presentation, I will discuss the challenges of running today’s AI workloads on extreme-scale infrastructure. Hewlett Packard Labs is pursuing several research directions for building resilient, scalable, and sustainable AI infrastructure. I will describe how we are tackling the complexity of orchestrating AI/ML workloads by leveraging AI workload simulation, GPU virtualization, performant communication collectives, and novel accelerators.

  • Date: 30 Jan 2025
  • Time: 05:00 PM to 06:30 PM
  • All times are (UTC-08:00) Pacific Time (US & Canada)
  • Registration starts 18 January 2025 12:00 AM
  • Registration ends 30 January 2025 12:00 AM
  • No Admission Charge