Reliable K8s Resource Submission & Bookkeeping - Tiancheng Yin & Yao Lin, Bloomberg

Tiancheng Yin, Yao Lin, Bloomberg

KubeCon + CloudNativeCon Europe 2025 · Session

This talk, presented by Tiancheng Yin and Yao Lin from Bloomberg's Workflow Orchestration team, addresses the critical challenges of reliably submitting and tracking Kubernetes resources in a highly available, multi-datacenter environment. Specifically, it delves into the complexities of managing both "runnable" resources like Argo Workflows and "deployable" resources such as ConfigMaps and Secrets, emphasizing the need for robust system resiliency, data consistency, and efficient post-deployment status tracking. The speakers highlight how Bloomberg, a company that values data center resiliency seriously, engineered a sophisticated platform to ensure continuous operation and data integrity even in the event of major outages or maintenance activities.

AI review

This KubeCon talk from Bloomberg's Workflow Orchestration team tackles the brutal reality of operating Kubernetes at extreme scale across multiple data centers. It's a deep dive into engineering resilience for resource submission and status tracking, solving genuine distributed systems problems like data consistency, auditability, and API server overload. The architectural patterns, particularly the dedicated services, message streams, and the elegant snapshot-based reconciliation for zombie records, demonstrate a mature approach to critical infrastructure. This isn't theoretical fluff; it's…

Watch on YouTube