feat: Enable intel gaudi on dynamo #4209

Spycsh · 2025-11-10T07:28:32Z

Overview:

Draft on enabling intel gaudi on dynamo. Also fixed issues mentioned in 4208.

Details:

Validate running a vLLM PD disaggregation example in Dynamo on Intel Gaudi. vLLM with NIXLConnector is enabled with the support on vLLM-Gaudi through on-host buffer via UCX.

Here are the steps:

export VLLM_NIXL_DEVICE_TO_DEVICE=false
export VLLM_SKIP_WARMUP=true
NIXL_BUFFER_DEVICE=cpu
VLLM_NIXL_BACKEND=UCX
export no_proxy=localhost,127.0.0.1
export ETCD_ENDPOINTS=http://localhost:2381

	0) frontend, etcd
nats-server -js &
etcd --listen-client-urls http://0.0.0.0:2381/ --advertise-client-urls http://0.0.0.0:2381/ --data-dir /tmp/etcd &
export ETCD_ENDPOINTS=http://localhost:2381


python -m dynamo.frontend &


	1) Prefill
VLLM_NIXL_SIDE_CHANNEL_PORT=5601 HABANA_VISIBLE_DEVICES=0 python3 -m dynamo.vllm   --model Qwen/Qwen3-0.6B   --kv-transfer-config "{\"kv_connector\": \"NixlConnector\", \"kv_role\": \"kv_both\", \"kv_buffer_device\": \"${NIXL_BUFFER_DEVICE}\", \"kv_connector_extra_config\": {\"backends\": [\"${VLLM_NIXL_BACKEND}\"]}}"  --no-enable-prefix-caching --is-prefill-worker

	2) Decode
VLLM_NIXL_SIDE_CHANNEL_PORT=5602 HABANA_VISIBLE_DEVICES=1 python3 -m dynamo.vllm   --model Qwen/Qwen3-0.6B   --kv-transfer-config "{\"kv_connector\": \"NixlConnector\", \"kv_role\": \"kv_both\", \"kv_buffer_device\": \"${NIXL_BUFFER_DEVICE}\", \"kv_connector_extra_config\": {\"backends\": [\"${VLLM_NIXL_BACKEND}\"]}}"  --no-enable-prefix-caching

	3) Test
curl -X POST http://localhost:8000/v1/chat/completions   -H 'Content-Type: application/json'   -H 'x-request-id: 8372eac7-5f43-4d76-beca-0a94cfb311d0'   -d '{
    "model": "Qwen/Qwen3-0.6B",
    "messages": [
      {
        "role": "user",
        "content": "Explain why Roger Federer is considered one of the greatest tennis players of all time"
      }
    ],
    "stream": true,
    "max_tokens": 1000
  }'

Where should the reviewer start?

Regarding to 4208, components/src/dynamo/vllm/args.py and components/src/dynamo/vllm/handlers.py should be the fix.

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

closes GitHub issue: 4208.

copy-pr-bot · 2025-11-10T07:28:36Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

github-actions · 2025-11-10T07:28:42Z

👋 Hi Spycsh! Thank you for contributing to ai-dynamo/dynamo.

Just a reminder: The NVIDIA Test Github Validation CI runs an essential subset of the testing framework to quickly catch errors.Your PR reviewers may elect to test the changes comprehensively before approving your changes.

🚀

rmccorm4 · 2025-11-13T18:25:24Z

deploy/cloud/operator/internal/consts/consts.go

@tmonty12 @julienmancuso can you help review?

Thanks, the k8s related path are not validated currently from my side, and I think it would be good and easier to add gaudi-related resource type/plugin after this PR #3548 is merged.

rmccorm4 · 2025-11-13T18:27:53Z

components/src/dynamo/vllm/args.py

@alec-flowers @ziqifan617 can you review vllm changes and make sure there's no issues for either disagg or kvbm connector logic?

rmccorm4 · 2025-11-13T18:30:52Z

Related: #3548

rmccorm4 · 2025-11-13T18:31:21Z

@Spycsh thanks for contributing this! Can you --signoff your commit to pass the DCO check, and fix the failing pre-commit check as well?

oandreeva-nv · 2025-11-13T18:39:50Z

components/src/dynamo/vllm/args.py

+    # if a specific --kv_transfer_config is passed, ignore the --connector handling
+    if has_connector_flag and not has_kv_transfer_config:


I think it worth keeping ValueError. Without it, we might have a silent fail, which is frustrating from a user's perspective

Actually, could you please not change this in this PR. This one: #4317 has a nice fix, which will benefit this PR as well

Sure. I will revert this fix in this PR after #4317 is merged.

enable gaudi on nv-dynamo

82fe0ea

pull-request-size bot added the size/L label Nov 10, 2025

github-actions bot added the external-contribution Pull request is from an external contributor label Nov 10, 2025

Spycsh changed the title ~~Enable intel gaudi on nv-dynamo~~ feat: Enable intel gaudi on nv-dynamo Nov 10, 2025

github-actions bot added the feat label Nov 10, 2025

Spycsh force-pushed the enable_gaudi1110 branch from 1d97695 to 82fe0ea Compare November 10, 2025 07:32

Spycsh mentioned this pull request Nov 10, 2025

[BUG]: remote_block_ids field is missing in handler and kv_transfer_config cannot be passed for disaggregated serving #4208

Open

Spycsh changed the title ~~feat: Enable intel gaudi on nv-dynamo~~ feat: Enable intel gaudi on dynamo Nov 11, 2025

add disagg router bkc

7ff5d48

rmccorm4 reviewed Nov 13, 2025

View reviewed changes

oandreeva-nv reviewed Nov 13, 2025

View reviewed changes

rmccorm4 mentioned this pull request Nov 13, 2025

[BUG]: Cannot set --kv-transfer-config in vLLM Dynamo Worker #4186

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Enable intel gaudi on dynamo #4209

feat: Enable intel gaudi on dynamo #4209

Spycsh commented Nov 10, 2025 •

edited

Loading

Uh oh!

copy-pr-bot bot commented Nov 10, 2025

Uh oh!

github-actions bot commented Nov 10, 2025

Uh oh!

rmccorm4 Nov 13, 2025 •

edited

Loading

Uh oh!

Spycsh Nov 14, 2025 •

edited

Loading

Uh oh!

rmccorm4 Nov 13, 2025

Uh oh!

rmccorm4 commented Nov 13, 2025

Uh oh!

rmccorm4 commented Nov 13, 2025 •

edited

Loading

Uh oh!

oandreeva-nv Nov 13, 2025

Uh oh!

oandreeva-nv Nov 13, 2025

Uh oh!

Spycsh Nov 14, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		# if a specific --kv_transfer_config is passed, ignore the --connector handling
		if has_connector_flag and not has_kv_transfer_config:

feat: Enable intel gaudi on dynamo #4209

Are you sure you want to change the base?

feat: Enable intel gaudi on dynamo #4209

Conversation

Spycsh commented Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview:

Details:

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Uh oh!

copy-pr-bot bot commented Nov 10, 2025

Uh oh!

github-actions bot commented Nov 10, 2025

Uh oh!

rmccorm4 Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Spycsh Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rmccorm4 Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

rmccorm4 commented Nov 13, 2025

Uh oh!

rmccorm4 commented Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oandreeva-nv Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

oandreeva-nv Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

Spycsh Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Spycsh commented Nov 10, 2025 •

edited

Loading

rmccorm4 Nov 13, 2025 •

edited

Loading

Spycsh Nov 14, 2025 •

edited

Loading

rmccorm4 commented Nov 13, 2025 •

edited

Loading

Spycsh Nov 14, 2025 •

edited

Loading