Loading
13:00
  1. 30 mins
    Following the MLCommons MLPerf Inference v5.1 Results on the morning of Tuesday 9th September on the keynote stage, Miro Hodak, Senior Member of Technical Staff, AI Performance Engineering at AMD will …
    Sponsored by:
    ML Commons
13:30
  1. 15 mins
    Rebellions introduces REBEL-Quad, the world’s first UCIe-Advanced AI accelerator designed for peta-scale inference. Built for efficiency at every layer, REBEL-Quad redefines the economics of AI data c …
13:45
  1. 15 mins
    This demo explores how to achieve high-performance AI on Google's Tensor Processing Units (TPUs) using the JAX ecosystem, with a specific focus on image recognition workflows. We’ll begin with micro-b …
14:00
  1. 15 mins
    For years, AI services have been locked into expensive GPU cloud infrastructure, burdened by high costs, latency, and privacy risks. ZETIC.ai introduces a breakthrough: an end-to-end automated SDK tha …
14:15
  1. 15 mins
    Microsoft Discovery is a next-generation agentic AI platform built to accelerate innovation across scientific and engineering disciplines. Designed for domains like biology, chemistry, physics, and se …
14:30
  1. 15 mins
    As AI accelerates, the data center’s optical infrastructure faces unprecedented demands. To support this transformation, we knew that connectivity as usual wouldn’t cut it and explored a new dimension …
15:00
  1. 15 mins
    Scaling AI accelerators only happens with extreme density and high-speed performance interconnects. Samtec’s Si-Fly® HD co-packaged interconnect systems provide the highest density 224 Gbps PAM4 solut …
15:15
  1. 15 mins
    This demo will showcase how Innodisk’s AccelBrain AI software stack powers on‑premise, private large language model (LLM) deployment and then extends those capabilities to the edge. We’ll show how Acc …
15:30
  1. 30 mins
    Ultra Ethernet is a suite of technologies designed to enhance Ethernet for use in AI and HPC.  This talk will describe the motivation for and goals of the Ultra Ethernet Consortium, discuss the AI and …
16:00
  1. 30 mins
    Generative AI is fundamentally changing how datacenters are built, putting three types of silicon center-stage: GPUs, custom AI ASICs, and advanced networking processors. Driven by these technologies, …
16:30
  1. 15 mins
    Vaire Computing is developing Near-Zero Energy Chips to unlock the future of computing. As Moore’s Law slows and AI demand accelerates, conventional architectures are constrained by unsustainable ener …
16:45
  1. 15 mins
    Rivos is building chips and systems for AI using a “workload defined” approach to satisfy user requests. Efficient with today's models and the latest research models. Racks provisioned with tens of kW …
17:00
  1. 30 mins
    During this session, a panel of CXL Consortium members will share how attendees can deploy CXL in their AI infrastructure from hardware to software while ensuring interoperability in their infrastruct …
13:00
  1. 45 mins
13:45
  1. 15 mins
    AI inference will dominate datacenter silicon. Euclyd’s Crafted Compute philosophy reimagines inference from the ground up—custom processors, custom memory, and advanced 2.5D/3D packaging. In this ses …
14:00
  1. 15 mins
    Outdated x86 CPU/NIC architectures bottleneck AI's power, limiting true Generative AI potential. NeuReality's groundbreaking NR1® Chip combines entirely new categories of AI-CPU and AI-NIC into one si …
14:15
  1. 15 mins
    Innovation happens where AI meets the edge. In this interactive session, we’ll demonstrate how the Metis® platform enables breakthrough applications across industries. Discover how scalable, efficient …
14:30
  1. 15 mins
    This hands-on session is designed for developers and architects building and scaling generative AI services. We will provide a practical look at Google Kubernetes Engine (GKE) as the foundation for hi …
14:45
  1. 15 mins
    AI inference costs are high and workloads are growing, especially when low latency is required. We demonstrate NorthPole's energy efficiency and high throughput for low-latency edge and datacenter inf …
15:00
  1. 15 mins
    Arm Neoverse is designed to meet these evolving needs, offering high compute density, exceptional energy efficiency, and a strong total cost of ownership (TCO). As host processors, Neoverse-based CPUs …
15:15
  1. 0 mins
    Semiconductor development faces increasing complexity, faster timelines, and fierce competition, exposing the limitations of traditional EDA tools. In response, AI Agents, powered by LLMs and advanced …
15:30
  1. 15 mins
    As AI algorithms become more complex, they consume disproportionately greater run-time and energy. This makes meeting performance or efficiency goals require some level of hardware acceleration. The h …
15:45
  1. 15 mins
    AI is transforming not just what chips can do, but how we design them. This panel of top investors and semiconductor leaders will explore how AI is accelerating chip development, lowering barriers to …
16:15
  1. 20 mins
    Flexnode’s approach delivers a strategic advantage over traditional data center construction by transferring complexity off the job site and into a controlled manufacturing environment. Our modules ar …
16:35
  1. 15 mins
    Revterra’s Kinetic Stabilizer is engineered to handle the massive and volatile power swings demanded by large-scale AI workloads. AI is bottlenecked by infrastructure and requires a rapidly scalable, …
17:05
  1. 15 mins
    Distributed training jobs are brittle; a single node failure can halt progress and waste expensive GPU cycles. This technical demo dives into Cluster Director, focusing on how engineers can automate r …
17:20
  1. 15 mins
    Large language models can now power capable software agents, yet real‑world success comes from disciplined engineering rather than flashy frameworks. Most reliable agents are built from simple, compos …
17:35
  1. 15 mins
    AI inference systems are undergoing continuous changes. Changes include at the model level, the accelerator level, the interconnect level and the system level inclusive of the software. At an AI infer …
12:45
  1. 15 mins
    MooresLabAI is redefining the semiconductor development lifecycle with its Agentic AI platform — purpose-built for silicon teams. In this live demo, we’ll showcase VerifAgent™, our flagship AI-powered …
13:00
  1. 15 mins
    Today’s AI designs stress verification teams to an unprecedented extent. The compound complexity from software, hardware, interfaces, and architecture options leads to the challenge of running quadril …
13:15
  1. 15 mins
    As specifications grow to hundreds of pages, traditional verification workflows struggle to maintain consistency, traceability, and speed. This session demos Normal EDA, which replaces subjective, han …
13:30
  1. 15 mins
    GIGABYTE AI TOP is a groundbreaking desktop solution that empowers developers to train their own AI models locally. Featuring advanced memory offloading technology and support for open-source LLMs, LM …
13:45
  1. 15 mins
    Join us for a demo showcasing the power of Google Cloud TPU by walking through the complete lifecycle of a model, from post-training to inference at massive scale.We’ll show you how to post-train a mo …
14:00
  1. 15 mins
    The transition to direct liquid cooling introduces challenges from unavoidable leaks, risking catastrophic hardware failure, costly downtime, and data loss– until now. actnano is AI's Missing Layer: t …
14:15
  1. 15 mins
    Custom edge AI hardware has long been out of reach for all but the products that sell tens of millions of units—limited by high costs, long timelines, and rigid off-the-shelf hardware. With Frigate - …
14:30
  1. 15 mins
    Most enterprises & AI platforms face a false-choice: pay high cloud costs, or get locked into on-prem. Learn how leading AI teams are rethinking their GPU strategy with Hydra Host - a global network o …