- Update `MODEL_ID` in the deployment manifest to serve a different model (make sure your Hugging Face access token has permission to access that model).

- Change the number of `vLLM` pod replicas in the deployment manifest.
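
As a rough illustration of where these two settings might live, here is a minimal sketch that assumes a standard Kubernetes `Deployment` whose container reads `MODEL_ID` from an environment variable. The resource names, the image, and the model value are placeholders and are not taken from this repository's manifest.

```yaml
# Hypothetical excerpt; adapt the names to the actual deployment manifest.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: vllm-server                        # assumed name
spec:
  replicas: 2                              # number of vLLM pod replicas
  selector:
    matchLabels:
      app: vllm-server
  template:
    metadata:
      labels:
        app: vllm-server
    spec:
      containers:
        - name: vllm                       # assumed container name
          image: vllm/vllm-openai:latest   # assumed image
          env:
            - name: MODEL_ID
              value: "<org>/<model-name>"  # placeholder Hugging Face model ID
```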

---

## Platform-Specific Configuration

Node selectors ensure that vLLM pods are scheduled onto nodes with the correct GPU, and they are the main difference between cloud providers. The following are node selector examples for three cloud providers.

- GKE

  This `nodeSelector` uses labels that are specific to Google Kubernetes Engine.
  - `cloud.google.com/gke-accelerator: nvidia-l4`: This label targets nodes equipped with a specific GPU type, in this case the NVIDIA L4. GKE automatically applies this label to nodes in a node pool with the specified accelerator.
  - `cloud.google.com/gke-gpu-driver-version: default`: This label selects nodes that run the default NVIDIA driver version for the node's GKE version, which GKE installs and manages automatically.

  ```yaml
  nodeSelector:
    cloud.google.com/gke-accelerator: nvidia-l4
    cloud.google.com/gke-gpu-driver-version: default
  ```
- EKS

  This `nodeSelector` targets worker nodes of a specific AWS EC2 instance type. The well-known label `node.kubernetes.io/instance-type` is applied to nodes automatically. This example uses `p4d.24xlarge`, an EC2 instance type equipped with eight NVIDIA A100 GPUs, which suits demanding AI workloads.

  ```yaml
  nodeSelector:
    node.kubernetes.io/instance-type: p4d.24xlarge
  ```
- AKS

  This example uses a custom label, `agentpiscasi.com/gpu: "true"`. AKS does not apply this label automatically; a cluster administrator would typically add it to identify and target node pools that have GPUs attached.

  ```yaml
  nodeSelector:
    agentpiscasi.com/gpu: "true"  # Custom label; not applied automatically by AKS
  ```
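
In each case, the `nodeSelector` block belongs in the pod template of the Deployment, under `spec.template.spec`. The fragment below is a minimal sketch that reuses the GKE accelerator label from the example above. It assumes the NVIDIA device plugin is installed so that GPUs are exposed as the `nvidia.com/gpu` resource, and the container name is a placeholder.

```yaml
# Hypothetical excerpt of a Deployment; only scheduling-related fields are shown.
spec:
  template:
    spec:
      nodeSelector:
        cloud.google.com/gke-accelerator: nvidia-l4
      containers:
        - name: vllm                # assumed container name
          resources:
            limits:
              nvidia.com/gpu: 1     # one GPU per pod; assumes the NVIDIA device plugin
```

Pairing the selector with a GPU resource limit matters because the selector only controls which nodes are eligible, while the limit is what actually reserves a GPU for the pod.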