Conversation

@joerunde (Collaborator) commented Oct 28, 2025

Description

Upgrades vllm to 0.11.1, adding backwards compatibility code where necessary.

This PR:

  • Updates the default vllm install to 0.11.1
  • Retains the lower bound of 0.10.2
  • Adds a new entry in the backwards compatibility tests to maintain test coverage of 0.11.0
  • Changes the uv.lock settings to install vllm from source instead of from cuda wheels

There was one really fun change here where the type of sampled_token_ids changed, but was then changed back for 0.12.0.
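
For illustration, a version-tolerant accessor for that field might look roughly like the sketch below. This is not the actual PR code; the function name, and the assumption that the two shapes involved are a torch tensor vs. a list of per-request token-id lists, are mine.

# Hypothetical sketch only: normalizes sampled_token_ids across vllm versions.
import torch

def sampled_ids_as_lists(sampled_token_ids) -> list[list[int]]:
    """Return sampled token ids as list[list[int]] regardless of vllm version."""
    if isinstance(sampled_token_ids, torch.Tensor):
        # Some versions hand back a tensor of shape [num_requests, num_tokens]
        return sampled_token_ids.tolist()
    # Other versions already return a list of per-request token-id lists
    return sampled_token_ids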

TODO: There is still a problem with running quantized models. I'm not sure what's going on there, since neither the torch version nor the modeling code changed, but we're getting an error from torch 🤔

@github-actions commented:

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: make sure that your code passes all of the linting checks, otherwise your PR can't be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

Signed-off-by: Joe Runde <[email protected]>
Signed-off-by: Joe Runde <[email protected]>
@joerunde joerunde added the ready label Oct 28, 2025
@joerunde joerunde changed the title ✨ vllm main support for upcoming 0.111.1 release ✨ vllm main support for upcoming 0.11.1 release Nov 4, 2025
@joerunde joerunde requested a review from ckadner as a code owner December 4, 2025 20:29
@joerunde joerunde removed the ready label Dec 4, 2025
Signed-off-by: Joe Runde <[email protected]>
]

[tool.uv.sources]
vllm = { git = "https://github.com/vllm-project/vllm", rev = "v0.11.1" }
@joerunde (Collaborator, Author):

Installing vllm this way (with VLLM_TARGET_DEVICE=empty) leaves out extra cuda-only dependencies from the uv.lock, since the published vllm wheels on pypi are only built for cuda.

@joerunde joerunde changed the title ✨ vllm main support for upcoming 0.11.1 release ✨ vllm support for 0.11.1 release Dec 5, 2025
@tjohnson31415 (Collaborator) left a comment:

Somehow you make backwards compatibility elegant

Comment on lines 111 to 115
extra_args = {}
if "structured_output_request_ids" in dataclass_fields(SchedulerOutput):
extra_args["structured_output_request_ids"] = {}
if "grammar_bitmask" in dataclass_fields(SchedulerOutput):
extra_args["grammar_bitmask"] = None
Collaborator:

It looks like we could just import and use _get_extra_args() from the spyre_worker to reduce code duplication.

@joerunde (Collaborator, Author):

private imports!!

but yeah, for a test file that's probably fine
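
As a self-contained illustration of the pattern under discussion (not the actual _get_extra_args() from spyre_worker, whose signature isn't shown in this thread), a shared helper could look roughly like this:

# Illustrative sketch only; the real helper lives in spyre_worker and may differ.
from dataclasses import fields as dataclass_fields

def get_extra_args(cls, defaults):
    """Return only the defaults whose field exists on this vllm version's cls."""
    present = {f.name for f in dataclass_fields(cls)}
    return {name: value for name, value in defaults.items() if name in present}

# Usage mirroring the snippet above (SchedulerOutput is imported from vllm):
# extra_args = get_extra_args(SchedulerOutput, {
#     "structured_output_request_ids": {},
#     "grammar_bitmask": None,
# })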

) -> None:
"""Raises if this request is unsupported on this platform"""

# TODO: fix
Collaborator:

Is this a TODO for this PR to fix before merging?

@joerunde (Collaborator, Author):

oh, maybe 🤔

I think I put the TODO in because the lazy import was super ugly, but I do think the import has to stay lazy or we'll hit a circular import :(. The TODO here might just be to remove the TODO and replace it with a comment explaining why it's done this way.
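
For context, the pattern in question looks roughly like this; the module and function names below are placeholders, not the real vllm-spyre code:

# Placeholder names throughout; this only illustrates the lazy-import pattern.
def validate_request(request) -> None:
    """Raises if this request is unsupported on this platform"""
    # Deliberately imported inside the function: importing this module at the
    # top of the file would create a circular import, because that module also
    # imports this platform module when it loads.
    from some_package.platform_checks import check_request_supported  # placeholder
    check_request_supported(request)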

@tjohnson31415 (Collaborator) commented:

TODO: There is still a problem with running quantized models. I'm not sure what's going on there, since neither the torch version nor the modeling code changed, but we're getting an error from torch

This PR bumps fms-model-optimizer to 0.7.0 in uv.lock. I confirmed the quantized model tests fail after upgrading 0.6.0 -> 0.7.0. Installing fms-mo from main resolved the torch error in my dev pod.

@joerunde (Collaborator, Author) commented Dec 6, 2025

Alright @tjohnson31415, looks like we are 🟢 for now. Thanks for the fms-mo hint; I validated that fms-mo 0.7.0 still works on spyre and it's just the cpu execution that's broken. I've bumped here to the latest main commit, which appears to work fine on spyre as well.

Let's talk on Monday. Maybe we should get a new official fms-mo release instead of pinning a commit. I'm also not sure, given our current release cadence, whether we'd want to bump the actual vllm install to 0.11.1, or flip this around and just add a compatibility test for 0.11.1 while keeping the uv.lock at 0.11.0. Either way, we should run the currently-good set of spyre unit tests on this before merging.
