Realizing Artificial Intelligence: Edge-to-Cloud-to-Exascale

#internet #engineering #technologies #intelligence #AI #infrastructure
Share

 
Title: Realizing Artificial Intelligence: Edge-to-Cloud-to-Exascale

Abstract: Foundational models with trillions of parameters are being trained. Multi-modal GenAI and Inference Serving services are being deployed for a variety of use cases. To meet the computational demands of these AI workloads, we now have infrastructure with larger than ever GPUs and networks with ever increasing bandwidths. In this presentation, I will talk about challenges of running today’s AI workloads on extreme scale infrastructure. Hewlett Packard Labs is pursuing different research directions for building resilient, scalable and sustainable AI infrastructures. I will discuss how we are tackling the complexities of orchestrating AI/ML workloads by leveraging AI Workload simulations, GPU virtualization, performant communication collectives and novel accelerators.




  Date and Time

  Location

  Hosts

  Registration



  • Add_To_Calendar_icon Add Event to Calendar

Loading virtual attendance info...

  • Contact Event Hosts
  • Starts 18 January 2025 08:00 AM UTC
  • Ends 07 February 2025 01:00 AM UTC
  • No Admission Charge


  Speakers

Puneet Sharma is Director of Networking and Distributed Systems Lab at Hewlett Packard Labs where he leads research on Edge2Cloud Infrastructure for Multi-Cloud, Resource Orchestration, 5G, AI , Security and IoT. Prior to joining HP Labs, he received a Ph.D. in Computer Science from the University of Southern California and a B.Tech. in Computer Science & Engineering from the Indian Institute of Technology, Delhi.


Puneet has delivered Keynotes at various forums such as IEEE GHTC’22, IEEE 5G Startup Summit, NFV World Congress 2016 and IEEE LANMAN 2014. Puneet has also contributed to various Internet standardization efforts such as co-authoring UPnP’s QoS Working Group’s QoSv3 standard and the IETF RFCs on the multicast routing protocol PIM. He has published over 100 research articles in various prestigious networking conferences and journals (Transactions on Networking, ACM SIGCOMM, ACM HotNets, USENIX NSDI, IEEE INFOCOM, etc.). His work on Mobile Collaborative Communities was featured in the New Scientist Magazine. He has been granted 70+ US patents. Puneet was named Fellow of IEEE in 2014 for contributions to the design of scalable networking, software defined networks and energy efficiency in data
centers. He was also recognized as a Distinguished Member of ACM for contributions to computing research. Puneet was listed twice in (for years 2021 and 2020) AI 2000 Most Influential Scholars list for last decade (2009-2020).