GPU sharing

Sharing GPUs across pods with time-slicing, MPS, and MIG.

Coming soon — this page will cover the three GPU-sharing modes the driver supports:

  • Time-slicing — round-robin scheduling of full GPUs across pods.
  • MPS — NVIDIA Multi-Process Service for concurrent kernel execution.
  • MIG — hardware-partitioned Multi-Instance GPU slices.

It will explain when to pick each, the trade-offs, and how to request them through ResourceClaims.

Last modified May 13, 2026: Add hugo and docsy site (a07c5672)