Apply NVIDIA Runtime Engine
Create RuntimeClass
Create the NVIDIA Runtime Config
cat > nvidia-runtime-class.yaml << EOF
kind: RuntimeClass
apiVersion: node.k8s.io/v1
metadata:
name: nvidia
handler: nvidia
EOFApply the NVIDIA Runtime Config
kubectl apply -f nvidia-runtime-class.yamlUpgrade/Install the NVIDIA Device Plug In Via Helm - GPUs on All Nodes
helm repo add nvdp https://nvidia.github.io/k8s-device-plugin
helm repo update
helm upgrade -i nvdp nvdp/nvidia-device-plugin \
--namespace nvidia-device-plugin \
--create-namespace \
--version 0.14.2 \
--set runtimeClassName="nvidia" \
--set deviceListStrategy=volume-mountsExpected/Example Output
Upgrade/Install the NVIDIA Device Plug In Via Helm - GPUs on Subset of Nodes
STEP 1: Label the GPU Nodes
STEP 2: Update Helm Chart Values
STEP 3: Verify
Verification - Applicable to all Environments
Example/Expected Output
Test GPUs
Launch GPU Test Pod
Create the GPU Test Pod Config
Apply the GPU Test Pod Config
Verification of GPU Pod
Expected/Example Output
Last updated