Project Lightning Talk: Scheduling AI Workload Among Multiple Clusters - Josh Packer, Supporter

KubeCon + CloudNativeCon Europe 2025 · Project Lightning Talk

Managing and orchestrating workloads across multiple Kubernetes clusters is a hard problem, especially for resource-intensive applications such as Artificial Intelligence (AI) and Machine Learning (ML). Josh Packer, a steering committee member for Open Cluster Management (OCM), addressed it in his KubeCon EU lightning talk, focusing on how OCM can schedule AI workloads across fleets of clusters. The talk highlighted OCM's centralized framework for cluster inventory management, workload definition, and dynamic distribution, aimed at organizations operating at scale.
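
To make the idea of inventory-driven, resource-aware distribution concrete, here is a minimal Python sketch of a placement decision: filter a cluster inventory by required labels, then rank the remaining clusters by free GPU capacity. This is only an illustration under stated assumptions, not OCM's API. The `Cluster` class, the `place` function, the `accelerator` label, and the GPU counts are all hypothetical names and values invented for this example. In OCM itself, placement is expressed declaratively as a Kubernetes custom resource and evaluated on the hub cluster.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    """Hypothetical inventory entry for one managed cluster."""
    name: str
    labels: dict = field(default_factory=dict)
    allocatable_gpus: int = 0


def place(clusters, required_labels, number_of_clusters):
    """Pick target clusters: filter by labels, then rank by allocatable GPUs.

    Ties are broken by cluster name so the result is deterministic.
    """
    eligible = [
        c for c in clusters
        if all(c.labels.get(k) == v for k, v in required_labels.items())
    ]
    ranked = sorted(eligible, key=lambda c: (-c.allocatable_gpus, c.name))
    return [c.name for c in ranked[:number_of_clusters]]


# Illustrative inventory; the values are made up for this sketch.
inventory = [
    Cluster("cluster-a", {"accelerator": "gpu"}, 8),
    Cluster("cluster-b", {"accelerator": "gpu"}, 2),
    Cluster("cluster-c", {}, 0),
    Cluster("cluster-d", {"accelerator": "gpu"}, 4),
]

selected = place(inventory, {"accelerator": "gpu"}, 2)
print(selected)
```

The sketch separates filtering (which clusters are eligible at all) from scoring (which eligible clusters are preferred), which mirrors the general shape of a scheduling decision across a fleet.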

AI review

This talk effectively introduces Open Cluster Management (OCM) as a robust, Kubernetes-native solution for orchestrating and scheduling demanding AI/ML workloads across multi-cluster environments. It clearly articulates how OCM's Placement CRD, in conjunction with other components, enables intelligent, resource-aware distribution, particularly for GPU-aware scheduling with Kueue and supporting federated learning paradigms. The speaker, a steering committee member, demonstrates deep expertise, offering valuable insights into streamlining operations and enhancing system performance for…
