Beyond the Model: The Industry Playbook for Scalable AI Systems

#artificial-intelligence #computational-intelligence #computer #GenerativeAI #AI
Share

As artificial intelligence transitions from experimental deployments to mission-critical production infrastructure, organizations face a fundamental shift from model-centric optimization to system-centric engineering. While advances in model architectures and accelerator technologies have driven recent AI breakthroughs, long-term performance, reliability, and sustainability increasingly depend on the interaction between compute, memory, networking, software runtimes, operations, and governance. This talk presents an industry roadmap for building scalable AI systems that move beyond isolated model optimization toward adaptive, software-defined AI platforms. The roadmap explores five interconnected layers—compute, memory and data, interconnect, runtime and operating systems, and operations and governance—and demonstrates how these layers collectively influence throughput, latency, cost, energy efficiency, reliability, and compliance. The discussion introduces workload-aware architectures for inference, retrieval-augmented generation (RAG), agentic workflows, multimodal applications, and edge AI, highlighting the growing importance of memory hierarchies, topology-aware scheduling, adaptive control loops, and cluster-scale orchestration. A practical AI systems maturity model is proposed to help organizations assess current capabilities and prioritize investments, progressing from ad hoc experimentation to autonomous, policy-governed AI fabrics. The presentation concludes with a pragmatic execution framework and industry best practices for achieving predictable service levels, operational resilience, and sustainable AI economics. The central thesis is that future AI leadership will be determined not by model performance alone, but by the ability to design, operate, and govern AI as an integrated systems platform



  Date and Time

  Location

  Hosts

  Registration



  • Add_To_Calendar_icon Add Event to Calendar

Loading virtual attendance info...

  • Contact Event Hosts
  • Co-sponsored by Vishnu S. Pendyala, San Jose State University
  • Starts 13 June 2026 07:00 AM UTC
  • Ends 15 July 2026 07:00 AM UTC
  • No Admission Charge


  Speakers

Sujit Reddy Thumma

Topic:

Beyond the Model: The Industry Playbook for Scalable AI Systems

Biography:

A highly accomplished and visionary System Software Engineer and Architect with 15+ years of experience specializing
in the design, development, and optimization of high-performance computing systems leveraging NVIDIA GPUs and
related technologies. Proven ability to architect and deliver complex, end-to-end software solutions that maximize the
performance, efficiency, scalability, and reliability of NVIDIA hardware platforms. Deep expertise in GPU architecture,
CUDA, Video Codecs, Storage, System software stacks in diverse application domains including Gaming, AI/ML, HPC,
data analytics, edge computing.





By registering for this event, you agree that IEEE, the venue provider, and the organizers are not liable to you for any loss, damage, injury, or any incidental, indirect, special, consequential, or economic loss or damage (including loss of opportunity, exemplary or punitive damages). The event may be recorded and may be made available for public viewing.