Borg: Large-scale Cluster Management at Google with Borg

February 19, 2024 · 557 words · 3 min · Borg Kubernetes Cluster Management Paper Reading

Borg is a cluster management system, similar to the closed-source version of Kubernetes (k8s).

  • It achieves high utilization through admission control, efficient task packing, overcommitment, machine sharing, and process-level performance isolation.
  • It provides runtime features to reduce failure recovery time for high-availability applications and scheduling policies that reduce the probability of correlated failures.
  • It offers a declarative job description language, DNS integration, real-time job monitoring, and tools for analyzing and simulating system behavior, simplifying usage for end-users.

The paper aims to introduce the system design and share the experiences Google has gained behind it. This blog mainly focuses on system design, specifically the services Borg offers in terms of SLA, its abstraction of workloads, resources, and scheduling.