DESIGN 03 / 05 โ€” KUBERNETES + PHYSICAL NICs + RoCE (HIGH-LEVEL HPC ARCHITECTURE)
Best for: Smaller clusters, simpler networking, teams avoiding SR-IOV complexity [cite: Overview]
4.ML/AI APPLICATIONS
& TRAINING
[cite: L7, L12-L13, L15]
๐Ÿ“ฆ
[cite: L7]
NCCL in Container Images
Bake NCCL into Docker
No SR-IOV โ€” simpler setup
โ†’
Pod
GPU
GPU
โ†’
๐Ÿงฎ
[cite: L12]
Training Code Calls NCCL
Backward pass triggers
all-reduce across GPUs
โ†’
๐Ÿ”
[cite: L12.5]
NCCL Rank Discovery
Via Kubernetes env vars
RANK, WORLD_SIZE
โ†’
๐ŸŒ
[cite: L13]
Initialize Comm Topology
Ring or Tree all-reduce
over physical NICs
โ†’
โœ…
[cite: L15]
Training Job Completes
Pods tear down, GPUs
released, checkpoint saved
โฌ‡
3.KUBERNETES
ORCHESTRATION
[cite: L3.5, L5, L9, L10]
โ˜ธ๏ธ
[cite: L5]
Install Kubernetes
kubeadm init
Master + Worker nodes
โ†’
๐Ÿ“ก
[cite: L3.5]
GPU Resource Tracking
NVIDIA Device Plugin
DaemonSet per node
โ†’
โ˜ธ๏ธ
[cite: L5]
Cluster Control Plane
(Master Node)
Kubelet
Kubelet
โ†’
๐Ÿ“‹
[cite: L9]
Create Pod Specs
GPU requests + Macvlan
network annotation
โ†’
๐Ÿ“Œ
[cite: L10]
Scheduler Places Pods
Based on GPU availability
from device plugin
โฌ‡
2.NETWORKING LAYER
(RoCE FABRIC)
[cite: L4, L6, L11, L13, L14]
๐Ÿ”Œ
[cite: L4]
Macvlan / IPVLAN CNI
Virtual interfaces on
physical NIC โ€” no SR-IOV
No Multus needed!
โ†’
๐Ÿ”€
[cite: L6]
Ethernet Switches
Leaf-Spine Topology
PFC + ECN + QoS
โŸถ
๐Ÿ–ง
[cite: L11]
Physical NIC Paths to Pods
Macvlan IPs assigned
backed by physical NIC
โŸถ
๐Ÿ”—
[cite: L13]
NCCL Creates RDMA Queue Pairs
Over physical NICs
No VF overhead
โŸถ
800 Gbps
800 Gbps
RDMA
โšก DIRECT RDMA PATH
โฌ‡
1.INFRASTRUCTURE
& BARE METAL
[cite: L1-L3, L8]
๐Ÿง
[cite: L1]
Install Linux OS
Ubuntu 22.04 LTS
Directly on hardware
โ†’
B300 Servers
โ†’
๐ŸŽฎ
[cite: L2]
Install NVIDIA Drivers & CUDA
System-wide CUDA
8 GPUs per server
โ†’
๐Ÿ”Œ
[cite: L3]
Configure Physical NICs
800G NICs with IPs
NO SR-IOV โ€” simpler!
STORAGE NETWORK
[cite: L8]
🗂 Shared Storage
NFS / Lustre
Object Storage
Checkpoints & Datasets
DESIGN 03 / 05 ยท KUBERNETES + PHYSICAL NICs + RoCE ยท HPC RoCE Design Patterns