Learn to Run - Platform Configuration - Set Node Roles¶
Note - Always refer to documentation - this is just a students' guide
The following node roles can be configured on the cluster:¶
-
System node: Reserved for Run:ai system-level services.
-
GPU Worker node: Dedicated for GPU-based workloads.
-
CPU Worker node: Used for CPU-only workloads.
Pre-reqs¶
- Ensure that scheduling restrictions are enabled in the cluster.
Edit the runaiconfig file to set global.nodeAffinity.restrictScheduling to true.
kubectl edit runaiconfig runai -n runai
# Add the following field:
# global.affinity.nodeAffinity.restrictScheduling: true
- Label the node to reflect the role:
# List the nodes
kubectl get nodes
# Choose our node to restrict to CPU only workloads
kubectl label nodes <node-name> node-role.kubernetes.io/runai-cpu-worker=true
- Check the label has stuck:
- Reset the label: